BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001312
         (1102 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225432064|ref|XP_002273922.1| PREDICTED: nuclear export mediator factor Nemf-like [Vitis vinifera]
          Length = 1110

 Score = 1615 bits (4182), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 810/1068 (75%), Positives = 900/1068 (84%), Gaps = 38/1068 (3%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTTAY RDK  TPSGFTLKLRKHIRTRRLEDVRQLGYDR++LFQFGLG NAHYV
Sbjct: 61   ESGVRLHTTAYVRDKSMTPSGFTLKLRKHIRTRRLEDVRQLGYDRVVLFQFGLGANAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSEF V+TLLRSHRDDDKGVAIMSRHRYP EICRVFERT  +KL AA
Sbjct: 121  ILELYAQGNILLTDSEFMVMTLLRSHRDDDKGVAIMSRHRYPVEICRVFERTATTKLQAA 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            LTS KE ++NE  + +E GN VS+A +E  G  KG KS + SKN+N    DGARAKQ TL
Sbjct: 181  LTSPKESESNEAVEASEGGNKVSDAPREKQGNNKGVKSSEPSKNTN----DGARAKQATL 236

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            KTVLGEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +V KFE+WL+DVIS
Sbjct: 237  KTVLGEALGYGPALSEHIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVTKFENWLEDVIS 296

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFRSREFVKFETFDAALD 359
            GD VPEGYILMQNK  GKD PP++    +Q IYDEFCP+LLNQF+SREFVKFETFDAALD
Sbjct: 297  GDQVPEGYILMQNKIFGKDCPPSQPDRGSQVIYDEFCPILLNQFKSREFVKFETFDAALD 356

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            EFYSKIESQR+EQQ KAKE +A  KL KI +DQENRVHTLK+EVD  +KMAELIEYNLED
Sbjct: 357  EFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENRVHTLKKEVDHCIKMAELIEYNLED 416

Query: 420  VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
            VDAAILAVRVALAN M+WEDLARMVKEE+K+GNPVAGLIDKLYLERNCM+LLLSNNLDEM
Sbjct: 417  VDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEM 476

Query: 480  DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
            DD+EKTLPV+KVEVDLALSAHANARRWYE KK+QE+KQEKT+ AH KAFKAAEKKTRLQ+
Sbjct: 477  DDDEKTLPVDKVEVDLALSAHANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQL 536

Query: 540  LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
             QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADL
Sbjct: 537  SQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADL 596

Query: 600  HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            HGASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGE
Sbjct: 597  HGASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGE 656

Query: 660  YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGH 719
            YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG  DFE++  
Sbjct: 657  YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGAQDFEENES 716

Query: 720  HKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSK 779
             K NSD ESEK++TDEK  AES                      + P E++ + NG DS+
Sbjct: 717  LKGNSDSESEKEETDEKRTAES----------------------KIPLEERNMLNGNDSE 754

Query: 780  -IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVR 838
             I DI+    + V PQLEDLIDRAL LGS + S  K+ +ET+Q DL EE  H +R ATVR
Sbjct: 755  HIADISGGHVSSVNPQLEDLIDRALELGSNTASGKKYALETSQVDL-EEHNHEDRKATVR 813

Query: 839  DKPYISKAERRKLKKGQGSSVVDP---KVEREKERGKDASSQPESIVRKTKIEGGKISRG 895
            +KPYISKAERRKLKKGQ +S  D      + E E    ++SQP+  V+ ++  GGKISRG
Sbjct: 814  EKPYISKAERRKLKKGQKTSTSDAGGDHGQEEIEENNVSTSQPDKDVKNSQPAGGKISRG 873

Query: 896  QKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDA 955
            QKGKLKKMKEKY DQDEEER+IRMALLASAG+  K D + +NENA T K  KP   P +A
Sbjct: 874  QKGKLKKMKEKYADQDEEERSIRMALLASAGRAHKIDKEKENENADTGKGMKPVNGPEEA 933

Query: 956  PKVCYKCKKAGHLSKDCKEHPDDSSH----GVEDNPCVGLDETA-EMDKVAMEEEDIHEI 1010
            PK+CYKCKK GHLS+DC EHPD + H    GVED   V LD +A EMD+VAMEE+DIHEI
Sbjct: 934  PKICYKCKKVGHLSRDCPEHPDGTIHSHSNGVEDRR-VDLDNSATEMDRVAMEEDDIHEI 992

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
            GEEEKG+LNDVDYLTGNPLP+DILLY +PVCGPYSA+Q+YKYRVKIIP
Sbjct: 993  GEEEKGKLNDVDYLTGNPLPNDILLYAVPVCGPYSALQTYKYRVKIIP 1040


>gi|255556494|ref|XP_002519281.1| conserved hypothetical protein [Ricinus communis]
 gi|223541596|gb|EEF43145.1| conserved hypothetical protein [Ricinus communis]
          Length = 1092

 Score = 1534 bits (3971), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 785/1072 (73%), Positives = 869/1072 (81%), Gaps = 65/1072 (6%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVTESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRI+LFQFGLG NAHYV
Sbjct: 61   ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIVLFQFGLGANAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDS+FTVLTLLRSHRDDDKG AIMSRHRYPTEICRVFER TA KL  +
Sbjct: 121  ILELYAQGNILLTDSDFTVLTLLRSHRDDDKGFAIMSRHRYPTEICRVFERITAEKLQES 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNA-SKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            LTS KEP+ +EP  VN+  NN+S    KE  G   G KS D SK+++    DG RAKQ T
Sbjct: 181  LTSFKEPEISEP--VNDGENNMSEKLKKEKQGKSTGTKSSDPSKSAS----DGNRAKQTT 234

Query: 240  LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
            LK VLGEALGYGPALSEH+ILD GLVPN K S+ N+L+DNAIQVLV AVAK EDWLQD+I
Sbjct: 235  LKNVLGEALGYGPALSEHMILDAGLVPNTKFSKSNRLDDNAIQVLVQAVAKLEDWLQDII 294

Query: 300  SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
            SGD +PEGYILMQNK++GK+HP +ES  + +IYDEFCP+LLNQF+ RE+VKF+TFDAALD
Sbjct: 295  SGDKIPEGYILMQNKNVGKNHPSSES--AFKIYDEFCPILLNQFKMREYVKFDTFDAALD 352

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            EFYSKIESQRAEQQ K KE++A  KLNKI +DQENRV TL++EVD  V+ AELIEYNLED
Sbjct: 353  EFYSKIESQRAEQQQKTKENSAIQKLNKIRLDQENRVLTLRKEVDLCVRKAELIEYNLED 412

Query: 420  VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
            VDAAILAVRVALA  MSWEDL RMVKEE+K GNPVA LIDKL+LERNCM+LLLSNNLD+M
Sbjct: 413  VDAAILAVRVALAKGMSWEDLTRMVKEEKKLGNPVASLIDKLHLERNCMTLLLSNNLDDM 472

Query: 480  DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
            DD+EKTLPV+KVE+DLALSAHANARRWYE+KKKQESKQ KT+TAH KAFKAAE+KTRLQ+
Sbjct: 473  DDDEKTLPVDKVEIDLALSAHANARRWYEMKKKQESKQGKTVTAHEKAFKAAERKTRLQL 532

Query: 540  LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
             QEK+VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+L
Sbjct: 533  SQEKSVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAEL 592

Query: 600  HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            HGASSTVIKNHRPEQPVPPLTLNQAGC+TVC SQAWDSK+VTSAWWVYPHQVSKTAPTGE
Sbjct: 593  HGASSTVIKNHRPEQPVPPLTLNQAGCYTVCQSQAWDSKIVTSAWWVYPHQVSKTAPTGE 652

Query: 660  YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGH 719
            YLTVGSFMIRGKKNFL PHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM+DFE+SG 
Sbjct: 653  YLTVGSFMIRGKKNFLSPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMNDFEESGP 712

Query: 720  HKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI-DS 778
              E SD ESEK++  ++ ++ES +            +A  VDS  F  +  T + GI + 
Sbjct: 713  PLEISDSESEKEEIGKEVMSESKTT----------ADAEVVDSINF-LQQGTAAGGISND 761

Query: 779  KIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVR 838
             I DI  N  A  TPQLEDLIDRALGLG A++S   +G+E ++ DLS+E+          
Sbjct: 762  DISDIVGNDVASATPQLEDLIDRALGLGPATVSQKNYGVEISKIDLSKEEI--------- 812

Query: 839  DKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDA-SSQPESIVRKTKIEGGKISRGQK 897
                     RR  K              E+ +  DA  SQ E   +  K   GKISRGQK
Sbjct: 813  ---------RRNXK--------------EESKENDAFVSQREKSSQSNKAGSGKISRGQK 849

Query: 898  GKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNE-----NASTHKEKKPAISP 952
             KLKKMKEKY DQDEEER+IRMALLASAG  +K  GD QNE     NAS  K K P    
Sbjct: 850  SKLKKMKEKYADQDEEERSIRMALLASAGNTRKKGGDSQNESVATDNASADKGKTPVTGS 909

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSH-----GVEDNPCVGLDETA-EMDKVAMEEED 1006
             DAPKVCYKCKK GHLS+DC E+PDDSSH     G  +   V L  T  E D+VAMEE+D
Sbjct: 910  EDAPKVCYKCKKPGHLSRDCPENPDDSSHNHANGGPAEESHVDLGRTTLEADRVAMEEDD 969

Query: 1007 IHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
            IHEIGEE+KG+LND DYLTGNPL SDILLY +PVCGPYSAVQSYKYRVKI+P
Sbjct: 970  IHEIGEEDKGKLNDTDYLTGNPLASDILLYAVPVCGPYSAVQSYKYRVKIVP 1021


>gi|449485009|ref|XP_004157045.1| PREDICTED: nuclear export mediator factor NEMF homolog [Cucumis
            sativus]
          Length = 1090

 Score = 1519 bits (3933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 779/1093 (71%), Positives = 890/1093 (81%), Gaps = 28/1093 (2%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCL+RLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLKRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG +AHYV
Sbjct: 61   ESGVRLHTTEYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGASAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSEFTVLTLLRSHRDD+KGVAIMSRHRYPTEI RVFE+TTA+KL  A
Sbjct: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDNKGVAIMSRHRYPTEISRVFEKTTAAKLQEA 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            LT S     +    V  +GNN ++  K+    QK  K+      S+K   DG+R+KQ TL
Sbjct: 181  LTLS-----DNIVNVTGNGNNETDPLKQQADNQKVSKT----SVSSKAQGDGSRSKQSTL 231

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K VLGEALGYG ALSEHIIL+ GL+PNMKL   NKL+DN++  L+ AVA FEDWL+DVI 
Sbjct: 232  KAVLGEALGYGTALSEHIILNAGLIPNMKLCNDNKLDDNSLDCLMQAVANFEDWLEDVIF 291

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            G  +PEGYILMQ K + K+   +E+ ++ +IYDEFCP+LLNQF SR++ KFETFDAALDE
Sbjct: 292  GTRIPEGYILMQKKDVKKEE--SEAATANEIYDEFCPILLNQFMSRKYTKFETFDAALDE 349

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            FYSKIESQR+EQQ KAKE +A HKLNKI MDQ NRV  LKQEVD SVKMAELIEYNLEDV
Sbjct: 350  FYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLEDV 409

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
            DA ILAVRVALA  MSWEDLARMVKEE+K+GNPVAGLIDKL LERNCM+LLLSNNLDEMD
Sbjct: 410  DAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLSNNLDEMD 469

Query: 481  DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
            D+EKT PV+KVEVD++LSAHANARRWYELKKKQESKQEKTITAH KAFKAAE+KTRLQ+ 
Sbjct: 470  DDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLS 529

Query: 541  QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
            QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+LH
Sbjct: 530  QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAELH 589

Query: 601  GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
            GASSTVIKNH+PEQ VPPLTLNQAGC+TVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 590  GASSTVIKNHKPEQLVPPLTLNQAGCYTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 649

Query: 661  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
            LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE+G++  E++   
Sbjct: 650  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEDGVNGVEENEPL 709

Query: 721  KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
             E SDIE EK +++E     S +  NS  PA S    +  +S E P ED    NG++   
Sbjct: 710  NEESDIEYEKRESEEV----SNTSANSFIPAISEPEGT--ESLEIPIEDIMTLNGVNKDT 763

Query: 781  FDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDK 840
                RN  + VTPQLEDLID+AL LGSA+ SS  + +ET++ +  +E    ++ AT R+K
Sbjct: 764  QPDVRNNVSLVTPQLEDLIDKALELGSATASSKSYILETSKVNSVDEPCLDDKNATGREK 823

Query: 841  PYISKAERRKLKKGQGSSVVDPKVEREKERGK---DASSQPESIVRKTKIEGGKISRGQK 897
            PYISKAERRKLKKGQ SS  D  +++E E+ +   D+S+  ++ V   K+   KISRGQ+
Sbjct: 824  PYISKAERRKLKKGQNSSSTDGSIKQESEQPRDIDDSSNLLQNKVNNPKLGSVKISRGQR 883

Query: 898  GKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPK 957
            GKLKKMKEKY DQDEEER+IRMALLAS+GK  KN+G  QN    T + KKP     +A K
Sbjct: 884  GKLKKMKEKYADQDEEERSIRMALLASSGKSPKNEGG-QNVKEITSEVKKPDGGAEEASK 942

Query: 958  VCYKCKKAGHLSKDCKEHPDDSSH----GV-EDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
            +CYKCKK GHLS+DC EHPD+ SH    GV + +  V LD  AE+DK+ MEE+DIHEIGE
Sbjct: 943  ICYKCKKPGHLSRDCPEHPDNLSHNHSNGVTQYDHHVVLDNDAELDKITMEEDDIHEIGE 1002

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG--IQIF 1070
            EE+ +LNDVDYLTGNPL +DILLY +PVCGPY+AVQSYKY VKI+PG  KKGKG    +F
Sbjct: 1003 EEREKLNDVDYLTGNPLATDILLYAVPVCGPYNAVQSYKYHVKIVPGPLKKGKGKLASVF 1062

Query: 1071 YSLLLLMLSLTPV 1083
             +  + +  + P+
Sbjct: 1063 ITNTIFIDKIEPL 1075


>gi|449441522|ref|XP_004138531.1| PREDICTED: nuclear export mediator factor Nemf-like [Cucumis sativus]
          Length = 1119

 Score = 1519 bits (3932), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 776/1085 (71%), Positives = 882/1085 (81%), Gaps = 26/1085 (2%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCL+RLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLKRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG +AHYV
Sbjct: 61   ESGVRLHTTEYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGASAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSEFTVLTLLRSHRDD+KGVAIMSRHRYPTEI RVFE+TTA+KL  A
Sbjct: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDNKGVAIMSRHRYPTEISRVFEKTTAAKLQEA 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            LT S     +    V  +GNN ++  K+    QK  K+      S+K   DG+R+KQ TL
Sbjct: 181  LTLS-----DNIVNVTGNGNNETDPLKQQADNQKVSKT----SVSSKAQGDGSRSKQSTL 231

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K VLGEALGYG ALSEHIIL+ GL+PNMKL   NKL+DN++  L+ AVA FEDWL+DVI 
Sbjct: 232  KAVLGEALGYGTALSEHIILNAGLIPNMKLCNDNKLDDNSLDCLMQAVANFEDWLEDVIF 291

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            G  +PEGYILMQ K + K+   +E+ ++ +IYDEFCP+LLNQF SR++ KFETFDAALDE
Sbjct: 292  GTRIPEGYILMQKKDVKKEE--SEAATANEIYDEFCPILLNQFMSRKYTKFETFDAALDE 349

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            FYSKIESQR+EQQ KAKE +A HKLNKI MDQ NRV  LKQEVD SVKMAELIEYNLEDV
Sbjct: 350  FYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLEDV 409

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
            DA ILAVRVALA  MSWEDLARMVKEE+K+GNPVAGLIDKL LERNCM+LLLSNNLDEMD
Sbjct: 410  DAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLSNNLDEMD 469

Query: 481  DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
            D+EKT PV+KVEVD++LSAHANARRWYELKKKQESKQEKTITAH KAFKAAE+KTRLQ+ 
Sbjct: 470  DDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLS 529

Query: 541  QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
            QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+LH
Sbjct: 530  QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAELH 589

Query: 601  GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
            GASSTVIKNH+PEQ VPPLTLNQAGC+TVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 590  GASSTVIKNHKPEQLVPPLTLNQAGCYTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 649

Query: 661  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
            LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE+G++  E++   
Sbjct: 650  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEDGVNGVEENEPL 709

Query: 721  KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
             E SDIE EK +++E     S +  NS  PA S    +  +S E P ED    NG++   
Sbjct: 710  NEESDIEYEKRESEEV----SNTSANSFIPAISGPEGT--ESLEIPIEDIMTLNGVNKDT 763

Query: 781  FDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDK 840
                RN  + VTPQLEDLID+AL LGSA+ SS  + +ET++ +  +E    ++ AT R+K
Sbjct: 764  QPDVRNNVSLVTPQLEDLIDKALELGSATASSKSYILETSKVNSVDEPCLDDKNATGREK 823

Query: 841  PYISKAERRKLKKGQGSSVVDPKVEREKERGK---DASSQPESIVRKTKIEGGKISRGQK 897
            PYISKAERRKLKKGQ SS  D  +++E E+ +   D+S+  ++ V   K+   KISRGQ+
Sbjct: 824  PYISKAERRKLKKGQNSSSTDGSIKQESEQPRDIDDSSNLLQNKVNNPKLGSVKISRGQR 883

Query: 898  GKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPK 957
            GKLKKMKEKY DQDEEER+IRMALLAS+GK  KN+G  QN    T + KKP     +A K
Sbjct: 884  GKLKKMKEKYADQDEEERSIRMALLASSGKSPKNEGG-QNVKEITSEVKKPDGGAEEASK 942

Query: 958  VCYKCKKAGHLSKDCKEHPDDSSHGVEDNPC-----VGLDETAEMDKVAMEEEDIHEIGE 1012
            +CYKCKK GHLS+DC EHPD+ SH   +        V LD  AE+DK+ MEE+DIHEIGE
Sbjct: 943  ICYKCKKPGHLSRDCPEHPDNLSHNHSNGVTQYDHHVVLDNDAELDKITMEEDDIHEIGE 1002

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
            EE+ +LNDVDYLTGNPL +DILLY +PVCGPY+AVQSYKY VKI+PG  KKGK  +   +
Sbjct: 1003 EEREKLNDVDYLTGNPLATDILLYAVPVCGPYNAVQSYKYHVKIVPGPLKKGKAAKTALN 1062

Query: 1073 LLLLM 1077
            L   M
Sbjct: 1063 LFTHM 1067


>gi|357448763|ref|XP_003594657.1| Serologically defined colon cancer antigen-like protein [Medicago
            truncatula]
 gi|355483705|gb|AES64908.1| Serologically defined colon cancer antigen-like protein [Medicago
            truncatula]
          Length = 1146

 Score = 1497 bits (3876), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 779/1117 (69%), Positives = 881/1117 (78%), Gaps = 56/1117 (5%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDL+PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLTPKTYVFKLMNSSGMTESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG RLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRI+LFQFGLG NA+YV
Sbjct: 61   ESGARLHTTVYMRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIVLFQFGLGENANYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGN++LTDS FTVLTLLRSHRDDDKG+AIMSRHRYP E CRVFERTT +KL  A
Sbjct: 121  ILELYAQGNVILTDSSFTVLTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTTAKLQTA 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            LTSSKE D +E  K N +G +VSN  KE  G +K GKS+                   TL
Sbjct: 181  LTSSKEDDNDEAVKANGNGTDVSNVEKEKQGSKKSGKSY------------------ATL 222

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K +LGEALGYGPALSEH+ILD GL+PN K+S+    +D  +Q LV AVAKFEDW+QD+IS
Sbjct: 223  KIILGEALGYGPALSEHMILDAGLIPNEKVSKDKVWDDATVQALVQAVAKFEDWMQDIIS 282

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            G+IVPEGYILMQNK LGKD   ++  S  QIYDEFCP+LLNQF+SR+  KFETFD ALDE
Sbjct: 283  GEIVPEGYILMQNKVLGKDSSVSQPESLKQIYDEFCPILLNQFKSRDHTKFETFDLALDE 342

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ----------ENRVHTLKQEVDRSVKMA 410
            FYSKIESQR+EQQH AKE++A  KLNKI  DQ          ENRVHTL++E D  +KMA
Sbjct: 343  FYSKIESQRSEQQHTAKENSALQKLNKIRNDQVGTHVQTSTIENRVHTLRKEADNCIKMA 402

Query: 411  ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
            ELIEYNLEDVDAAILAVRV+LA  MSW+DLARMVKEE+KAGNPVAGLIDKL+LERNCM+L
Sbjct: 403  ELIEYNLEDVDAAILAVRVSLAKGMSWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMTL 462

Query: 471  LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
            LLSNNLDEMDD+EKTLP +KVEVDLALSAHANARRWYELKKKQESKQEKTITAH KAFKA
Sbjct: 463  LLSNNLDEMDDDEKTLPADKVEVDLALSAHANARRWYELKKKQESKQEKTITAHEKAFKA 522

Query: 531  AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
            AE+KTRLQ+ QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK
Sbjct: 523  AERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 582

Query: 591  GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
            GD+YVHA+LHGASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQ
Sbjct: 583  GDLYVHAELHGASSTVIKNHKPMQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQ 642

Query: 651  VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
            VSKTAPTGEYLTVGSFMIRGKKN+LPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 
Sbjct: 643  VSKTAPTGEYLTVGSFMIRGKKNYLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEET 702

Query: 711  MDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPN--SAHPAPSH-------------T 755
            +DD  ++G  +E SD ESEK+  D +  A+S    N  +  P PS               
Sbjct: 703  IDDNVETGPVEEQSDSESEKNVADGETAADSERNGNLSADSPIPSEDLLADTSQTSLAAI 762

Query: 756  NASNVDSHEFPAEDKTISNGIDS-KIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTK 814
            NA    S +F A+D +  N +DS K+ D + N  A V+PQLE+++DRALGLGS + S+  
Sbjct: 763  NAKTTVSDDFSAKDPSTKNMLDSEKLSDFSGNGLASVSPQLEEILDRALGLGSVAKSNKS 822

Query: 815  HGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKK--GQGSSVVDPKVEREKERGK 872
            +  E TQ DLS E+ +      VRDKPYISKAERRKLK     G +       ++K + K
Sbjct: 823  YEAENTQLDLSSENHNESSKPAVRDKPYISKAERRKLKNEPKHGEAHPSDGNGKDKSKLK 882

Query: 873  DASSQPESI-VRKTKIEGG-KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK-VQ 929
            D S    +      K  GG KISRGQKGKLKKMKEKY DQDEEER+IRM+LLAS+GK ++
Sbjct: 883  DISGDLHAKDAENLKTGGGKKISRGQKGKLKKMKEKYADQDEEERSIRMSLLASSGKPIK 942

Query: 930  KNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHG-----VE 984
            K +  P  E  ++ K KK    P+DAPK+CYKCKK GHLS+DCKE P+D  H       E
Sbjct: 943  KEETLPVIE--TSDKGKKSDSGPIDAPKICYKCKKVGHLSRDCKEQPNDLLHSHATSEAE 1000

Query: 985  DNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPY 1044
            +NP +     +  D+VAMEE+DI+EIGEEEK +LNDVDYLTGNPLP+DILLY +PVCGPY
Sbjct: 1001 ENPNMNASNLSLEDRVAMEEDDINEIGEEEKEKLNDVDYLTGNPLPNDILLYAVPVCGPY 1060

Query: 1045 SAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            +AVQSYKYRVKIIPG  KKGK  +   +L   M   T
Sbjct: 1061 NAVQSYKYRVKIIPGPVKKGKAAKTAMNLFSHMSEAT 1097


>gi|356529076|ref|XP_003533123.1| PREDICTED: nuclear export mediator factor Nemf-like [Glycine max]
          Length = 1131

 Score = 1480 bits (3832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 778/1114 (69%), Positives = 885/1114 (79%), Gaps = 64/1114 (5%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVR+NTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGV+ESGESEKVLLLM
Sbjct: 1    MVKVRLNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVSESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG NA+YV
Sbjct: 61   ESGVRLHTTLYLRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGENANYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDS FTV+TLLRSHRDDDKG+AIMSRHRYP E CRVFERTT  KL  +
Sbjct: 121  ILELYAQGNILLTDSTFTVMTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTIEKLRTS 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            L SSKE D ++  K + +G+N SN +KE  G  KGGKS                    TL
Sbjct: 181  LVSSKEDDNDDAVKADGNGSNASNVAKEKQGTHKGGKS------------------SATL 222

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K VLGEALGYGPALSEHI+LD GL+P+ K+ +    +D  +Q LV AV +FEDW+QDVIS
Sbjct: 223  KIVLGEALGYGPALSEHILLDAGLIPSTKVPKDRTWDDATVQALVQAVVRFEDWMQDVIS 282

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            G++VPEGYILMQNK++GKD   ++ GS +Q+YDEFCP+LLNQF+SR++ KFETFDAALDE
Sbjct: 283  GELVPEGYILMQNKNMGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDE 342

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            FYSKIESQR+EQQ KAKE++A  KLN+I  DQENRVH L++E D  VKMAELIEYNLEDV
Sbjct: 343  FYSKIESQRSEQQQKAKENSASQKLNRIRQDQENRVHALRKEADHCVKMAELIEYNLEDV 402

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
            DAAILAVRVALA  M+W+DLARMVKEE+KAGNPVAGLIDKL+L+RNCM+LLLSNNLDEMD
Sbjct: 403  DAAILAVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLDRNCMTLLLSNNLDEMD 462

Query: 481  DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
            D+EKTLPV+KVEVDLALSAHANARRWYE KKKQESKQ KT+TAH KAFKAAE+KTRLQ+ 
Sbjct: 463  DDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQGKTVTAHEKAFKAAERKTRLQLN 522

Query: 541  QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
            QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLH
Sbjct: 523  QEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLH 582

Query: 601  GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
            GASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 583  GASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 642

Query: 661  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
            LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE  DD+E++G  
Sbjct: 643  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEAADDYEETGPL 702

Query: 721  KENSDIESEKDDTDEKPVAE-----SLS------VPNSAHPAPSHTNASNVD-----SHE 764
            ++ SD ESEKD TD +P  +     +LS      +P      PS T+ +  D     S +
Sbjct: 703  EDKSDSESEKDVTDIEPATDLERNGNLSADSHKPLPEDFPADPSQTSLATTDAETAISQD 762

Query: 765  FPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDL 824
            FPA++ +  N +D +I              LE+L+D+AL LG  + SS K+GIE +Q DL
Sbjct: 763  FPAKETSTLNMVDREILS-----------DLEELLDQALELGPVAKSSKKYGIEKSQIDL 811

Query: 825  SEEDKHVERTAT-VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERG--KDASSQ-PES 880
              E +H E+T T VR+KPYISKAERRKLKK Q     D  VE  K+    KD S+  P  
Sbjct: 812  DTE-QHFEQTKTAVREKPYISKAERRKLKKEQKPGEEDSNVEHGKDESKLKDISANLPVK 870

Query: 881  IVRKTKIEGG-KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNEN 939
              +  K  GG KISRGQKGKLKK+KEKY DQDEEER+IRM LLAS+GK    + +  +EN
Sbjct: 871  EDQNLKKGGGQKISRGQKGKLKKIKEKYADQDEEERSIRMTLLASSGKSITKE-ETSSEN 929

Query: 940  ASTHKEKKPAIS-------PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHG-----VEDNP 987
             +  K KKP          P DAPK+CYKCKKAGHLS+DCK+ PDD  H       E+NP
Sbjct: 930  DALDKGKKPGSGPSDAPKIPSDAPKICYKCKKAGHLSRDCKDQPDDLLHRNAVGEAEENP 989

Query: 988  CVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAV 1047
                 +T++ D+VAMEE+DI+EIGEEEK +LNDVDYLTGNPLP+DILLY +PVCGPYSAV
Sbjct: 990  KTTAIDTSQADRVAMEEDDINEIGEEEKEKLNDVDYLTGNPLPNDILLYAVPVCGPYSAV 1049

Query: 1048 QSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            QSYKYRVKIIPG  KKGK  +   +L   M   T
Sbjct: 1050 QSYKYRVKIIPGPTKKGKAAKTATNLFSHMSEAT 1083


>gi|297795761|ref|XP_002865765.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
 gi|297311600|gb|EFH42024.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
          Length = 1080

 Score = 1465 bits (3793), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 749/1089 (68%), Positives = 853/1089 (78%), Gaps = 66/1089 (6%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61   ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121  ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181  LT--SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            LT  S K+ +A + ++            KE  GG+KGGKS           ND   AKQ 
Sbjct: 181  LTAFSLKDHEAKQIER------------KEQNGGKKGGKS-----------NDSTGAKQY 217

Query: 239  TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            TLK +LG+ALGYGP LSEHIILD GL+P  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218  TLKNILGDALGYGPQLSEHIILDAGLIPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299  ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            I+G  VPEGYILMQ + L  D  P+ESG   ++YDEFC +LLNQF+SR + KFETFDAAL
Sbjct: 278  INGQKVPEGYILMQKQILAND-TPSESGGVKKMYDEFCSILLNQFKSRVYEKFETFDAAL 336

Query: 359  DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
            DEFYSKIESQR+EQQ KAKED+A  KLNKI  DQENRV  LK+EV+  V MAELIEYNLE
Sbjct: 337  DEFYSKIESQRSEQQQKAKEDSASQKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNLE 396

Query: 419  DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            DVDAAILAVRVALA  M W+DLARMVKEE+K GNPVAGLIDKLYLE+NCM+LLL NNLDE
Sbjct: 397  DVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGLIDKLYLEKNCMTLLLCNNLDE 456

Query: 479  MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            MDD+EKTLPVEKVEVDL+LSAH NARRWYE+KKKQE+KQEKT++AH KAF+AAEKKTR Q
Sbjct: 457  MDDDEKTLPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTVSAHEKAFRAAEKKTRHQ 516

Query: 539  ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            + QEK VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+
Sbjct: 517  LSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAE 576

Query: 599  LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
            LHGASSTVIKNH+PEQ VPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQV+KTAPTG
Sbjct: 577  LHGASSTVIKNHKPEQNVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVTKTAPTG 636

Query: 659  EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
            EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG+HLNERRVRGEEEGM+D     
Sbjct: 637  EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGAHLNERRVRGEEEGMNDVVMET 696

Query: 719  HH-KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGID 777
            H   E+SD+ESE +      V E++S         S T  S  D+  F           D
Sbjct: 697  HAPDEHSDVESENE-----AVNEAVSASGEVDLEESSTILSQ-DTSSF-----------D 739

Query: 778  SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATV 837
                 IA       T QLEDL+DR LGLG+A+++  K  IET++ ++ E+    E+ A V
Sbjct: 740  MNSSGIAEENVESATSQLEDLLDRTLGLGAATVAGKKDTIETSKDEMEEKMTQEEKKAVV 799

Query: 838  RDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE--RGKDAS--SQPESIVRKTKIEGGKI 892
            RDKPY+SKAERRKLK GQ G++ VD    +EK+  + KD S  SQ    +   K  G K+
Sbjct: 800  RDKPYMSKAERRKLKMGQSGNTAVDGNTGQEKQQRKEKDVSSLSQANKSIPDNKPAGEKV 859

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
            SRGQ+GKLKKMKEKY DQDE+ER IRMALLAS+GK QK D + QN   +   EKKP+   
Sbjct: 860  SRGQRGKLKKMKEKYADQDEDERKIRMALLASSGKPQKTDVESQNAKTAVTVEKKPSEET 919

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
             DA K+CY+CKK GHL++DC        HG          ET+EMDKV MEE+DI+E+G+
Sbjct: 920  EDAVKICYRCKKVGHLARDC--------HG---------KETSEMDKVVMEEDDINEVGD 962

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
            EEK +L DVDYLTGNPLP+DILLY +PVCGPY+A+QSYKYRVK IPG+ KKGK  +   +
Sbjct: 963  EEKEKLIDVDYLTGNPLPTDILLYAVPVCGPYNALQSYKYRVKAIPGSMKKGKAAKTAMN 1022

Query: 1073 LLLLMLSLT 1081
            L   M   T
Sbjct: 1023 LFTHMTEAT 1031


>gi|15240582|ref|NP_199804.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
 gi|8777424|dbj|BAA97014.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008489|gb|AED95872.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
          Length = 1080

 Score = 1458 bits (3774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 742/1085 (68%), Positives = 848/1085 (78%), Gaps = 66/1085 (6%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61   ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121  ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181  LTSS--KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            LT+   K+ DA + +             KE  GG+KGGKS           ND   AKQ 
Sbjct: 181  LTAFVLKDHDAKQIE------------PKEQNGGKKGGKS-----------NDSTGAKQY 217

Query: 239  TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            TLK +LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218  TLKNILGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299  ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            I+G  VPEGYILMQ + L  D   +ESG   ++YDEFC +LLNQF+SR + KFETFDAAL
Sbjct: 278  INGQKVPEGYILMQKQILAND-TTSESGGVKKMYDEFCSILLNQFKSRVYEKFETFDAAL 336

Query: 359  DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
            DEFYSKIESQR+EQQ KAKED+A  KLNKI  DQENRV  LK+EV+  V MAELIEYNLE
Sbjct: 337  DEFYSKIESQRSEQQQKAKEDSASLKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNLE 396

Query: 419  DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            DVDAAILAVRVALA  M W+DLARMVKEE+K GNPVAG+ID+LYLE+NCM+LLL NNLDE
Sbjct: 397  DVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGVIDRLYLEKNCMTLLLCNNLDE 456

Query: 479  MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            MDD+EKT+PVEKVEVDL+LSAH NARRWYE+KKKQE+KQEKT++AH KAF+AAEKKTR Q
Sbjct: 457  MDDDEKTVPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTVSAHEKAFRAAEKKTRHQ 516

Query: 539  ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            + QEK VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+
Sbjct: 517  LSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAE 576

Query: 599  LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
            LHGASSTVIKNH+PEQ VPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQV+KTAPTG
Sbjct: 577  LHGASSTVIKNHKPEQNVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVTKTAPTG 636

Query: 659  EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
            EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG+HLNERRVRGEEEGM+D     
Sbjct: 637  EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGAHLNERRVRGEEEGMNDVVMET 696

Query: 719  HH-KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGID 777
            H   E+SD ESE +  +E   A                 +  VD  E        ++ +D
Sbjct: 697  HAPDEHSDTESENEAVNEVVSA-----------------SGEVDLQESSTALSQDTSSLD 739

Query: 778  SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATV 837
                 I     A  T QLEDL+DR LGLG+A+++  K  IET++ D+ E+ K  E+ A V
Sbjct: 740  MSSSGITEENVASATSQLEDLLDRTLGLGAATVAGKKDTIETSKDDMEEKMKQEEKNAVV 799

Query: 838  RDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE--RGKDAS--SQPESIVRKTKIEGGKI 892
            RDKPY+SKAERRKLK GQ G++  D    +EK+  + KD S  SQ    +   K  G K+
Sbjct: 800  RDKPYMSKAERRKLKMGQSGNTAADGNTGQEKQQRKEKDVSSLSQATKSIPDNKPAGEKV 859

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
            SRGQ+GKLKKMKEKY DQDE+ER IRMALLAS+GK QK D + QN   +  + KKP+   
Sbjct: 860  SRGQRGKLKKMKEKYADQDEDERKIRMALLASSGKPQKTDVESQNAKTAVTEVKKPSEET 919

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
             DA K+CY+CKK GHL++DC        HG          ET++MDKV MEE+DIHE+G+
Sbjct: 920  DDAVKICYRCKKVGHLARDC--------HG---------KETSDMDKVVMEEDDIHEVGD 962

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
            EEK +L DVDYLTGNPLP+DILLY +PVCGPY+A+QSYKYRVK IPG+ KKGK  +   +
Sbjct: 963  EEKEKLIDVDYLTGNPLPTDILLYAVPVCGPYNALQSYKYRVKAIPGSMKKGKAAKTAMN 1022

Query: 1073 LLLLM 1077
            L   M
Sbjct: 1023 LFTHM 1027


>gi|356558107|ref|XP_003547349.1| PREDICTED: nuclear export mediator factor NEMF homolog [Glycine max]
          Length = 1119

 Score = 1452 bits (3760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 777/1086 (71%), Positives = 872/1086 (80%), Gaps = 68/1086 (6%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGV+ESGESEKVLLLM
Sbjct: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVSESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG NA+YV
Sbjct: 61   ESGVRLHTTLYMRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGENANYV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDS FTV+TLLRSHRDDDKG+AIMSRHRYP E CRVFERTT  KL  +
Sbjct: 121  ILELYAQGNILLTDSTFTVMTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTIEKLRTS 180

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            L SSKE DA+E  K N +G+N SN +KE    +KGGKS                    TL
Sbjct: 181  LVSSKEDDADEAVKANGNGSNASNVAKEKQETRKGGKS------------------SATL 222

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K VLGEALGYGPALSEHIILD GL+P+ K+ +    +D  +Q LV AV KFEDW+QDVIS
Sbjct: 223  KIVLGEALGYGPALSEHIILDAGLIPSTKVPKDRTWDDATVQALVQAVVKFEDWMQDVIS 282

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            G+IVPEGYILMQNK+LGKD   ++ GS +Q+YDEFCP+LLNQF+SR++ KFETFDAALDE
Sbjct: 283  GEIVPEGYILMQNKNLGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDE 342

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            FYSKIESQRAEQQ K+KE++A  KLNKI  DQENRVH L++E D  VKMAELIEYNLEDV
Sbjct: 343  FYSKIESQRAEQQQKSKENSAAQKLNKIRQDQENRVHVLRKEADHCVKMAELIEYNLEDV 402

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
            DAAILAVRVALA  M+W+DLARMVKEE+KAGNPVAGLIDKL+LERNCM+LLLSNNLDEMD
Sbjct: 403  DAAILAVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMNLLLSNNLDEMD 462

Query: 481  DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
            D+EKTLPV+KVEVDLALSAHANARRWYE KKKQESKQEKT+TAH KAFKAAE+KTRLQ+ 
Sbjct: 463  DDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQEKTVTAHEKAFKAAERKTRLQLN 522

Query: 541  QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
            QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE+IVKRYMSKGD+YVHADLH
Sbjct: 523  QEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNELIVKRYMSKGDLYVHADLH 582

Query: 601  GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
            GASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 583  GASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 642

Query: 661  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
            LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE  DD+E++G  
Sbjct: 643  LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEAADDYEETGPL 702

Query: 721  KENSDIESEKDDTDEKPVAESLSVPN----SAHPAP------------SHTNASNVDSHE 764
            +  SD E EKD TD K   +S    N    S  P P            +  NA    S +
Sbjct: 703  EGKSDSEFEKDVTDIKSATDSERNDNLSADSHKPLPEDFPADASQTSLATINAETAISQD 762

Query: 765  FPAEDKTISNGIDSKIF-DIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFD 823
            FPA++ +  N +D +I  D++ N  A VTPQLE+L+D+ L LG  + S+ K+GIE +Q D
Sbjct: 763  FPAKETSTLNVVDREILSDVSGNGLASVTPQLEELLDQVLELGPIAKSNKKYGIEKSQID 822

Query: 824  LSEEDKHVERTAT-VRDKPYISKAERRKLKKGQGSSVVDPKVEREK--ERGKDASSQPES 880
            L  E +++E++ T VRDKPYISKAERRKLKK Q     D  VE  K   + KD S+  ++
Sbjct: 823  LDTE-QYLEQSKTAVRDKPYISKAERRKLKKEQKHGEEDLNVEHGKYESKLKDISANLQA 881

Query: 881  IVRKTKIEGG--KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNE 938
               +   +GG  KISRGQKGKLKK+KEKY DQDEEER+IRMALLAS+GK  K + +  +E
Sbjct: 882  KEDQNLKKGGGQKISRGQKGKLKKIKEKYADQDEEERSIRMALLASSGKSIKKE-ETSSE 940

Query: 939  NASTHKEKKPAISPVDAPKV-------CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGL 991
            N +  + KKP   P DAPKV       CYKCKKAGHLS+DCKE PD              
Sbjct: 941  NDTLDQGKKPGSGPSDAPKVPSDAPKICYKCKKAGHLSRDCKEQPD-------------- 986

Query: 992  DETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYK 1051
                  D+VAMEE+DI+EIGEEEK +LNDVDYLTGNPLP+DILLY +PVCGPYSAVQSYK
Sbjct: 987  -----ADRVAMEEDDINEIGEEEKEKLNDVDYLTGNPLPNDILLYAVPVCGPYSAVQSYK 1041

Query: 1052 YRVKII 1057
            YRVKII
Sbjct: 1042 YRVKII 1047


>gi|115489110|ref|NP_001067042.1| Os12g0564600 [Oryza sativa Japonica Group]
 gi|108862839|gb|ABA98970.2| zinc knuckle family protein, putative, expressed [Oryza sativa
            Japonica Group]
 gi|113649549|dbj|BAF30061.1| Os12g0564600 [Oryza sativa Japonica Group]
          Length = 1159

 Score = 1340 bits (3467), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 716/1114 (64%), Positives = 854/1114 (76%), Gaps = 48/1114 (4%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK RM TADVAAEVKCLRRLIGMR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1    MVKARMTTADVAAEVKCLRRLIGMRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK  TPSGFTLKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+V
Sbjct: 61   ESGVRLHTTQYVRDKSTTPSGFTLKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   
Sbjct: 121  ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDT 180

Query: 181  L---------TSSKEP---DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
            L         +S   P   DA EP     DG  V++ S+E      G KS   +K S+ N
Sbjct: 181  LMMNAVDDKESSQVTPGSIDAQEPSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSN 239

Query: 229  S--NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
            +  ++ A + + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 240  AKASNNAPSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 299

Query: 285  VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
            V +++KFEDWL DV+SG  +PEGYILMQNK   K +  P E  S++Q IYDE+CP+LLNQ
Sbjct: 300  VESISKFEDWLVDVMSGQRIPEGYILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQ 359

Query: 343  FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            F+SREF +FETFDAALDEFYSKIESQR  QQ K+KED+A  +LNKI +DQENRVHTL++E
Sbjct: 360  FKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKE 419

Query: 403  VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
            VD S+KMAELIEYNLEDVDAAI+AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL 
Sbjct: 420  VDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLS 479

Query: 463  LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
             ERNC++LLLSNNLD+MD+EEKT PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+T
Sbjct: 480  FERNCITLLLSNNLDDMDEEEKTAPVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVT 539

Query: 523  AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
            AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+
Sbjct: 540  AHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNEL 599

Query: 583  IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
            IVKRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG FTVCHS+AWDSK+VTS
Sbjct: 600  IVKRYMSKGDLYVHAELHGASSTIIKNHKPDNPIPPLTLNQAGSFTVCHSKAWDSKIVTS 659

Query: 643  AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
            AWWVYP+QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNER
Sbjct: 660  AWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNER 719

Query: 703  RVRGE-EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSH 754
            RVRGE EE + D E              +SD E+ K+  D++   ++++V    +P PS+
Sbjct: 720  RVRGEDEEALPDVESQKLESNAELDGELDSDSETGKEKHDDESSLDNINVKKIDNPIPSN 779

Query: 755  TN--ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISS 812
                  N DS E  +E +T+ N   S      +     V+ QLEDL+D+ LGLG   +  
Sbjct: 780  APYVKDNADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQLEDLLDKNLGLGPTKVLG 837

Query: 813  TKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKGQ--GSSVVD-PKVEREK 868
                + +    ++++ D    +  +VRDKPYISKA+RRKLKKGQ  G S  D P  E  K
Sbjct: 838  RSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKGQNVGDSTSDSPNGEAAK 897

Query: 869  ERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKV 928
               K  +SQ E      K    K+SRGQKGKLKK+KEKYG+QDEEER IRMALLAS+G+ 
Sbjct: 898  ---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQDEEEREIRMALLASSGRA 954

Query: 929  QKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEH-----PDDSSHGV 983
             + D   ++ + +T  + KP+    D  K+CYKCKK+GHLS+DC E      P D + G 
Sbjct: 955  SQKDKPSEDVDGATAAQSKPSTGEDDRSKICYKCKKSGHLSRDCPESTSEVDPADVNVGR 1014

Query: 984  EDNPCVGLDETA--EMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVC 1041
              +   G+D ++      V M+E+DIHE+G+EEK +L D+DYLTGNPLPSDILLY +PVC
Sbjct: 1015 AKD---GMDRSSAPAGSSVTMDEDDIHELGDEEKEKLIDLDYLTGNPLPSDILLYAVPVC 1071

Query: 1042 GPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
             PY+A+Q+YKYRVKI PGTAKKGK  +   SL L
Sbjct: 1072 APYNALQAYKYRVKITPGTAKKGKAAKTAMSLFL 1105


>gi|242085896|ref|XP_002443373.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
 gi|241944066|gb|EES17211.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
          Length = 1158

 Score = 1335 bits (3455), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 694/1113 (62%), Positives = 844/1113 (75%), Gaps = 47/1113 (4%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK RM T DVAAEVKCLRRLIGMR +NVYD++PKTY+FKLMNSSG+TESGESE+VLLLM
Sbjct: 1    MVKARMTTTDVAAEVKCLRRLIGMRLANVYDITPKTYLFKLMNSSGITESGESERVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVR HTT Y RDK  TPSGFTLKLRKHIR +RLEDVR LGYDRIILFQFGLG NAH++
Sbjct: 61   ESGVRFHTTQYVRDKSTTPSGFTLKLRKHIRNKRLEDVRMLGYDRIILFQFGLGSNAHFI 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSE+TV+TLLRSHRDD+KG+AIMSRHRYP E+CRVF RT  +KL   
Sbjct: 121  ILELYAQGNILLTDSEYTVMTLLRSHRDDNKGLAIMSRHRYPVEVCRVFVRTDFAKLKDM 180

Query: 181  LT-----------SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKN-SNKN 228
            LT           +S   DA EP +   D   ++  S+++L  ++   +    ++ SN  
Sbjct: 181  LTMPDKADDKEEITSGSTDAQEPSQSTNDEVLITEISEKSLSRKEKKAAAKAKQSGSNAK 240

Query: 229  SNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVL 286
            +N+G ++ + TLKT+LGEAL YGPAL+EHIILD GLVP+ K+ +   + ++D+ +Q L+ 
Sbjct: 241  ANNGVQSNKATLKTILGEALAYGPALAEHIILDAGLVPSTKVGKDPESTVDDSTVQALME 300

Query: 287  AVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH--PPTESGSSTQIYDEFCPLLLNQFR 344
            ++ +FEDWL D+ISG  +PEGYILMQNK   K +  P  E+ ++ +IYDE+CP+LLNQF+
Sbjct: 301  SITRFEDWLVDIISGQRIPEGYILMQNKLTAKKNLTPSEEASTNHKIYDEYCPILLNQFK 360

Query: 345  SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
            SRE+ +F TFDAALDEFYSKIESQ+  QQ KAKE++A  +LNKI +DQENRVHTL++EVD
Sbjct: 361  SREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENRVHTLRKEVD 420

Query: 405  RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
              VKMAELIEYNLEDVDAAILAVRV+LAN MSWE L RM+KEERKAGNPVAGLIDKL  E
Sbjct: 421  HCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVAGLIDKLNFE 480

Query: 465  RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
            RNC++LLLSNNLD+MD++EKT PVEKVEVD+ALSAHANARRWYE+KKKQESKQEKTITAH
Sbjct: 481  RNCITLLLSNNLDDMDEDEKTAPVEKVEVDIALSAHANARRWYEMKKKQESKQEKTITAH 540

Query: 525  SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
             KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IV
Sbjct: 541  EKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIV 600

Query: 585  KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            KRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAGCFTVCHS+AWDSK+VTSAW
Sbjct: 601  KRYMSKGDLYVHAELHGASSTIIKNHKPDTPIPPLTLNQAGCFTVCHSKAWDSKIVTSAW 660

Query: 645  WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
            WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRV
Sbjct: 661  WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRV 720

Query: 705  RGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHE 764
            RGE+E + + E     K+++    E+  +DE    E+       H   S  N    +S E
Sbjct: 721  RGEDEALQEMEAESRKKQSNPESDEEIGSDEGANKET-------HEDESSGNIGTANSPE 773

Query: 765  FP--AEDKTISNGID---------SKIFDIARNVA------APVTPQLEDLIDRALGLGS 807
             P    ++++ NG             + D   +++      A V+ QL+DL+D+ L LG 
Sbjct: 774  LPEIQAEESLDNGSSISKEETIQAEDLLDNGSSISKEETIEASVSSQLDDLLDKTLRLGP 833

Query: 808  ASISSTKHGIETTQFDLSEEDKHVE-RTATVRDKPYISKAERRKLKKGQGSSVVDPKVER 866
            A +S     + +    L+E+D  +E +  T+RDKPYISKAERRKLKKGQ +       + 
Sbjct: 834  AKVSGKSSLLTSVPSSLAEDDDDLELKRPTIRDKPYISKAERRKLKKGQVNGETATDSQN 893

Query: 867  EKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
             ++  +   SQ E     T+    K+SRGQKGKLKK+KEKY +QDEEER IRMALL S+G
Sbjct: 894  GEKLSQPGYSQQEKGKGSTQAANAKVSRGQKGKLKKIKEKYAEQDEEEREIRMALL-SSG 952

Query: 927  KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD--DSSHG-- 982
            K  + D   Q+E  S  KE KP+    D+ K+CYKCKKAGHLS+DC E     D + G  
Sbjct: 953  KALRKDKPSQDEETSV-KESKPSAGEDDSSKICYKCKKAGHLSRDCPESTSEVDRNDGSI 1011

Query: 983  VEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCG 1042
             +    +G + +       M+E+D+ EIG+EEK +L D+DYLTGNPLPSDILLY +PVC 
Sbjct: 1012 SKSRDVMGTNTSPAGGNSPMDEDDVQEIGDEEKEKLIDLDYLTGNPLPSDILLYAVPVCA 1071

Query: 1043 PYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            PY+A+Q+YKYRVKI PGTAKKGK  +   SL L
Sbjct: 1072 PYNALQTYKYRVKITPGTAKKGKAAKTAMSLFL 1104


>gi|125579741|gb|EAZ20887.1| hypothetical protein OsJ_36526 [Oryza sativa Japonica Group]
          Length = 1176

 Score = 1327 bits (3434), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 715/1131 (63%), Positives = 854/1131 (75%), Gaps = 65/1131 (5%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK RM TADVA+EVKCLRRLIGMR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1    MVKARMTTADVASEVKCLRRLIGMRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK  TPSGFTLKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+V
Sbjct: 61   ESGVRLHTTQYVRDKSTTPSGFTLKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFV 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   
Sbjct: 121  ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDT 180

Query: 181  L---------TSSKEP---DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
            L         +S   P   DA EP     DG  V++ S+E      G KS   +K S+ N
Sbjct: 181  LMMNAVDDKESSQVTPGSIDAQEPSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSN 239

Query: 229  S--NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
            +  ++ A + + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 240  AKASNNAPSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 299

Query: 285  VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
            V +++KFEDWL DV+SG  +PEGYILMQNK   K +  P E  S++Q IYDE+CP+LLNQ
Sbjct: 300  VESISKFEDWLVDVMSGQRIPEGYILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQ 359

Query: 343  FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            F+SREF +FETFDAALDEFYSKIESQR  QQ K+KED+A  +LNKI +DQENRVHTL++E
Sbjct: 360  FKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKE 419

Query: 403  VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
            VD S+KMAELIEYNLEDVDAAI+AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL 
Sbjct: 420  VDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLS 479

Query: 463  LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
             ERNC++LLLSNNLD+MD+EEKT PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+T
Sbjct: 480  FERNCITLLLSNNLDDMDEEEKTAPVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVT 539

Query: 523  AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
            AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+
Sbjct: 540  AHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNEL 599

Query: 583  IVKRYMSKGDV-----------------YVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
            IVKRYMSKGD+                 YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG
Sbjct: 600  IVKRYMSKGDLSLRFSRKLLVYFASLDSYVHAELHGASSTIIKNHKPDNPIPPLTLNQAG 659

Query: 626  CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
             FTVCHS+AWDSK+VTSAWWVYP+QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG
Sbjct: 660  SFTVCHSKAWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFG 719

Query: 686  LLFRLDESSLGSHLNERRVRGE-EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKP 737
            +LFRLDESSL SHLNERRVRGE EE + D E              +SD E+ K+  D++ 
Sbjct: 720  ILFRLDESSLASHLNERRVRGEDEEALPDVESQKLESNAELDGELDSDSETGKEKHDDES 779

Query: 738  VAESLSVPNSAHPAPSHTN--ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQL 795
              ++++V    +P PS+      N DS E  +E +T+ N   S      +     V+ QL
Sbjct: 780  SLDNINVKKIDNPIPSNAPYVKDNADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQL 837

Query: 796  EDLIDRALGLGSASISSTKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKG 854
            EDL+D+ LGLG   +      + +    ++++ D    +  +VRDKPYISKA+RRKLKKG
Sbjct: 838  EDLLDKNLGLGPTKVLGRSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKG 897

Query: 855  Q--GSSVVD-PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQD 911
            Q  G S  D P  E  K   K  +SQ E      K    K+SRGQKGKLKK+KEKYG+QD
Sbjct: 898  QNVGDSTSDSPNGEAAK---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQD 954

Query: 912  EEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKD 971
            EEER IRMALLAS+G+  + D   ++ + +T  + KP+    D  K+CYKCKK+GHLS+D
Sbjct: 955  EEEREIRMALLASSGRASQKDKPSEDVDGATAAQSKPSTGEDDRSKICYKCKKSGHLSRD 1014

Query: 972  CKEH-----PDDSSHGVEDNPCVGLDETA--EMDKVAMEEEDIHEIGEEEKGRLNDVDYL 1024
            C E      P D + G   +   G+D ++      V M+E+DIHE+G+EEK +L D+DYL
Sbjct: 1015 CPESTSEVDPADVNVGRAKD---GMDRSSAPAGSSVTMDEDDIHELGDEEKEKLIDLDYL 1071

Query: 1025 TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            TGNPLPSDILLY +PVC PY+A+Q+YKYRVKI PGTAKKGK  +   SL L
Sbjct: 1072 TGNPLPSDILLYAVPVCAPYNALQAYKYRVKITPGTAKKGKAAKTAMSLFL 1122


>gi|357161759|ref|XP_003579195.1| PREDICTED: nuclear export mediator factor Nemf-like [Brachypodium
            distachyon]
          Length = 1163

 Score = 1311 bits (3393), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 681/1099 (61%), Positives = 839/1099 (76%), Gaps = 48/1099 (4%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK RM TADVAAEVKCLRRLIGMR SNVYD++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1    MVKARMTTADVAAEVKCLRRLIGMRLSNVYDITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLHTT Y RDK  TPSGFTLKLRKH+R++RLEDVR LGYDR+ILFQFGLG NAH++
Sbjct: 61   ESGVRLHTTQYVRDKSTTPSGFTLKLRKHVRSKRLEDVRMLGYDRMILFQFGLGSNAHFI 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNI+LTDSE+TV+TLLRSHRDD+KG+AIMSRHRYP E CR FERT  +KL   
Sbjct: 121  ILELYAQGNIILTDSEYTVMTLLRSHRDDNKGLAIMSRHRYPVEACRTFERTDFTKLKDT 180

Query: 181  L-------------TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK-NSN 226
            L              +    D++EP +   DG  V++  +E     +   +  + +  SN
Sbjct: 181  LKLSNTVDGEDSSQVTPNSADSHEPSESVNDGVPVTDKLEEPSNRTEKKSAVKIKQPGSN 240

Query: 227  KNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
              +++G ++ + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 241  AKASNGTQSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 300

Query: 285  VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
            V +V +FEDWL D+ISG  +PEGYILMQNK   K +  P+E  S+ Q IYDE+CP+LL Q
Sbjct: 301  VESVTRFEDWLVDIISGQRIPEGYILMQNKMSAKKNITPSEVSSTNQKIYDEYCPILLKQ 360

Query: 343  FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            F++RE+ +FETFDAALDEFYSKIESQR  QQ KAKED+A  +LNKI +DQENRVHTL++E
Sbjct: 361  FKAREYDEFETFDAALDEFYSKIESQRVNQQQKAKEDSAVQRLNKIKLDQENRVHTLRKE 420

Query: 403  VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
             D  +KMAELIEYNLEDVDAAI+AVRV+LAN MSWE LARM+KEER+AGNPVAGLIDKL 
Sbjct: 421  ADHCIKMAELIEYNLEDVDAAIVAVRVSLANGMSWEALARMIKEERRAGNPVAGLIDKLS 480

Query: 463  LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
             E NC++LLLSNNLD+MD++EKT PVEKVEVDL+LSAHANARRWYE+KKKQE+KQEKTIT
Sbjct: 481  FENNCITLLLSNNLDDMDEDEKTAPVEKVEVDLSLSAHANARRWYEMKKKQETKQEKTIT 540

Query: 523  AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
            AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL++SGRDAQQNE+
Sbjct: 541  AHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIVSGRDAQQNEL 600

Query: 583  IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
            +VKRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAGCFTVCHS+AWDSK+VTS
Sbjct: 601  VVKRYMSKGDLYVHAELHGASSTIIKNHKPDSPIPPLTLNQAGCFTVCHSKAWDSKIVTS 660

Query: 643  AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
            AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDES L SHLNER
Sbjct: 661  AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESCLASHLNER 720

Query: 703  RVRGEEEGMDDFEDSGHHKEN---------SDIESEKDDTDEKPVAESLSVPNSAHPAPS 753
            R+RGE+E + + E     + N         +D E+ K   + +   +  SV  +   +PS
Sbjct: 721  RIRGEDEALPEIEVEPWKRHNISELDDKLANDNETSKGIHENESSRDYTSVQQNYDASPS 780

Query: 754  H--TNASNVDSHEFPAEDKTI-SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASI 810
            +  +N     S E  +E +T+ +NG+ S   +  R+ +  V+ QLEDL+D+ LGLG A +
Sbjct: 781  NQPSNMGTASSSEQLSEAQTVENNGVASTFNEETRDDS--VSSQLEDLLDKNLGLGPAKV 838

Query: 811  SSTKHGIETTQFDLSEEDKHVERTATV-RDKPYISKAERRKLKKGQGS--SVVDPKVERE 867
            S     + ++   L E+   ++   T+ R+KPY+SKAERRKLKKGQ S  S  DP  +  
Sbjct: 839  SGKSSLLISSHSSLPEDTDDLDVKKTIQREKPYVSKAERRKLKKGQNSCESTSDP--QNG 896

Query: 868  KERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK 927
            +   K  +SQ E     TK    K SRGQKGKLKK+KEKY +QD+EER IRMALLAS+GK
Sbjct: 897  EAVKKPGNSQQEKGKDNTKTANPKTSRGQKGKLKKIKEKYAEQDDEEREIRMALLASSGK 956

Query: 928  VQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
              +     Q+   +  K+ K +   VD+ K+CYKCK++GHLS+DC   P+ +S  V  + 
Sbjct: 957  ASQKGKPSQDGEDTNAKQAKSSTGEVDSVKICYKCKRSGHLSRDC---PESTSVVVPTDV 1013

Query: 988  CVG-----LDETAEM---DKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIP 1039
             VG      D++A       + M+E+DIHE+G+EEK +L D+DYLTG PLPSDILLY +P
Sbjct: 1014 NVGRSRDVTDKSASAPVDGSIDMDEDDIHELGDEEKEKLIDLDYLTGIPLPSDILLYAVP 1073

Query: 1040 VCGPYSAVQSYKYRVKIIP 1058
            VC PY+A+Q+YKYRVKI P
Sbjct: 1074 VCAPYNALQTYKYRVKITP 1092


>gi|125537046|gb|EAY83534.1| hypothetical protein OsI_38746 [Oryza sativa Indica Group]
          Length = 1153

 Score = 1285 bits (3325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 695/1108 (62%), Positives = 833/1108 (75%), Gaps = 65/1108 (5%)

Query: 24   MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
            MR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLMESGVRLHTT Y RDK  TPSGFT
Sbjct: 1    MRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLMESGVRLHTTQYVRDKSTTPSGFT 60

Query: 84   LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
            LKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+VILELYAQGNILLTDSE+TVLTLL
Sbjct: 61   LKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFVILELYAQGNILLTDSEYTVLTLL 120

Query: 144  RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL---------TSSKEP---DANE 191
            RSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   L         +S   P   DA E
Sbjct: 121  RSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDTLMMNAVDDKESSQVTPGSIDAQE 180

Query: 192  PDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS--NDGARAKQPTLKTVLGEALG 249
            P     DG  V++ S+E      G KS   +K S+ N+  ++ A + + TLKT+LGEAL 
Sbjct: 181  PSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSNAKASNNAPSNKSTLKTLLGEALA 239

Query: 250  YGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEG 307
            YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ LV +++KFEDWL DV+SG  +PEG
Sbjct: 240  YGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSLVESISKFEDWLVDVMSGQRIPEG 299

Query: 308  YILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
            YILMQNK   K +  P E  S++Q IYDE+CP+LLNQF+SREF +FETFDAALDEFYSKI
Sbjct: 300  YILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQFKSREFNEFETFDAALDEFYSKI 359

Query: 366  ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
            ESQR  QQ K+KED+A  +LNKI +DQENRVHTL++EVD S+KMAELIEYNLEDVDAAI+
Sbjct: 360  ESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKEVDHSIKMAELIEYNLEDVDAAIV 419

Query: 426  AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT 485
            AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL  ERNC++LLLSNNLD+MD+EEKT
Sbjct: 420  AVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLSFERNCITLLLSNNLDDMDEEEKT 479

Query: 486  LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
             PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+TAH KAFKAAEKKTRLQ+ QEKTV
Sbjct: 480  APVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTV 539

Query: 546  ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV------------ 593
            A I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IVKRYMSKGD+            
Sbjct: 540  AAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIVKRYMSKGDLSLRFSRKLLVYF 599

Query: 594  -----YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                 YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG FTVCHS+AWDSK+VTSAWWVYP
Sbjct: 600  ASLDSYVHAELHGASSTIIKNHKPDNPIPPLTLNQAGSFTVCHSKAWDSKIVTSAWWVYP 659

Query: 649  HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE- 707
            +QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRVRGE 
Sbjct: 660  YQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRVRGED 719

Query: 708  EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN--AS 758
            EE + D E              +SD E+ K+  D++   ++++V    +P PS+      
Sbjct: 720  EEALPDVESQKLESNAELDGELDSDSETGKEKHDDESSLDNINVKKIDNPIPSNAPYVKD 779

Query: 759  NVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIE 818
            N DS E  +E +T+ N   S      +     V+ QLEDL+D+ LGLG   +      + 
Sbjct: 780  NADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQLEDLLDKNLGLGPTKVLGRSSLLS 837

Query: 819  TTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKGQ--GSSVVD-PKVEREKERGKDA 874
            +    ++++ D    +  +VRDKPYISKA+RRKLKKGQ  G S  D P  E  K   K  
Sbjct: 838  SNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKGQNVGDSTSDSPNGEAAK---KPV 894

Query: 875  SSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGD 934
            +SQ E      K    K+SRGQKGKLKK+KEKYG+QDEEER IRMALLAS+G+  + D  
Sbjct: 895  NSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQDEEEREIRMALLASSGRASQKDKP 954

Query: 935  PQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEH-----PDDSSHGVEDNPCV 989
             ++ + +T  + KP+    D  K+CYKCKK+GHLS+DC E      P D + G   +   
Sbjct: 955  SEDVDGATAAQSKPSTGEDDRSKICYKCKKSGHLSRDCPESTSEVDPADVNVGRAKD--- 1011

Query: 990  GLDETA--EMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAV 1047
            G+D ++      V M+E+DIHE+G+EEK +L D+DYLTGNPLPSDILLY +PVC PY+A+
Sbjct: 1012 GMDRSSAPAGSSVTMDEDDIHELGDEEKEKLIDLDYLTGNPLPSDILLYAVPVCAPYNAL 1071

Query: 1048 QSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            Q+YKYRVKI PGTAKKGK  +   SL L
Sbjct: 1072 QAYKYRVKITPGTAKKGKAAKTAMSLFL 1099


>gi|296083204|emb|CBI22840.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score = 1261 bits (3262), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 612/781 (78%), Positives = 670/781 (85%), Gaps = 39/781 (4%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK  TPSGFTLKLRKHIRTRRLEDVRQLGYDR++LFQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSMTPSGFTLKLRKHIRTRRLEDVRQLGYDRVVLFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSEF V+TLLRSHRDDDKGVAIMSRHRYP EICRVFERT  +KL AA
Sbjct: 121 ILELYAQGNILLTDSEFMVMTLLRSHRDDDKGVAIMSRHRYPVEICRVFERTATTKLQAA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LTS KE ++NE  +    GNN            KG KS + SKN+N    DGARAKQ TL
Sbjct: 181 LTSPKESESNEAKQ----GNN------------KGVKSSEPSKNTN----DGARAKQATL 220

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           KTVLGEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +V KFE+WL+DVIS
Sbjct: 221 KTVLGEALGYGPALSEHIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVTKFENWLEDVIS 280

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           GD VPEGYILMQNK  GKD PP++    +QIYDEFCP+LLNQF+SREFVKFETFDAALDE
Sbjct: 281 GDQVPEGYILMQNKIFGKDCPPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAALDE 340

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQR+EQQ KAKE +A  KL KI +DQENRVHTLK+EVD  +KMAELIEYNLEDV
Sbjct: 341 FYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENRVHTLKKEVDHCIKMAELIEYNLEDV 400

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DAAILAVRVALAN M+WEDLARMVKEE+K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMD
Sbjct: 401 DAAILAVRVALANGMNWEDLARMVKEEKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMD 460

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKTLPV+KVEVDLALSAHANARRWYE KK+QE+KQEKT+ AH KAFKAAEKKTRLQ+ 
Sbjct: 461 DDEKTLPVDKVEVDLALSAHANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLS 520

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLH
Sbjct: 521 QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLH 580

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 581 GASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 640

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG  DFE++   
Sbjct: 641 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGAQDFEENESL 700

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
           K NSD                    +SAH   + +N  +++  E P E++ + NG D K 
Sbjct: 701 KGNSD-------------------SDSAHNELTTSNVGSINLPEVPLEERNMLNGNDKKP 741

Query: 781 F 781
           +
Sbjct: 742 Y 742


>gi|168034467|ref|XP_001769734.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679083|gb|EDQ65535.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1100

 Score = 1099 bits (2842), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 599/1122 (53%), Positives = 752/1122 (67%), Gaps = 111/1122 (9%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK+RMNTADVAAEV+CLRRLIG RC+NVYDL+PKTY+ KL  SSGVTESGESE+ LLL+
Sbjct: 1    MVKLRMNTADVAAEVRCLRRLIGFRCANVYDLTPKTYVIKLSRSSGVTESGESERSLLLL 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVR HTT +ARDK  TPSGFTLKLRKHIRTRRLEDVRQLG DR+I  QFG+G   H++
Sbjct: 61   ESGVRFHTTEFARDKSTTPSGFTLKLRKHIRTRRLEDVRQLGIDRVIDLQFGMGEGTHHI 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGNILLTD ++ VLTLLR+H+D+DKG+ +M++H YP   CR+F R +  KL AA
Sbjct: 121  ILELYAQGNILLTDGDYNVLTLLRTHKDEDKGLVMMAKHEYPVNACRLFNRFSLEKLEAA 180

Query: 181  LTSSK-EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            +   K + DA+E     E     S   KE+ G                           T
Sbjct: 181  MRDQKTQADADEYIDAKEVKVKTSWGKKEDTG--------------------------RT 214

Query: 240  LKTVLGEALGYGPALSEHIILDTGLVPNMKLS----EVNKLEDNAIQVLVLAVAKFEDWL 295
            LK+VLG  LGYGPAL EHI+LD+GL   MK+S     V  +    +  L+ A+++FEDWL
Sbjct: 215  LKSVLGGCLGYGPALCEHIVLDSGLQSGMKVSLGPDGVLSISKENLGDLMGAISRFEDWL 274

Query: 296  QDVISGDIVPEGYILMQNKHLGKDHPPTESG--SSTQIYDEFCPLLLNQFRSREFVKFET 353
              V++GD +PEG++ MQ K++ KD    +       ++YDEF PL L QF  R  ++ ET
Sbjct: 275  DSVVNGDRIPEGFVYMQKKNIKKDKVLLDDQLQEEEKVYDEFSPLHLKQFDDRTVMRMET 334

Query: 354  FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
            +DAALDEF+SKIE QRAEQQ KA+ED+AF KL+KI  DQ  RV  LKQEVD++V+MAELI
Sbjct: 335  YDAALDEFFSKIEGQRAEQQRKAQEDSAFSKLDKIRADQTQRVEVLKQEVDQTVRMAELI 394

Query: 414  EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
            EYNLEDVD AILAVR  +A+ M W+DLARM+KEE+KAGNPVAGLI  L LE+N ++LLLS
Sbjct: 395  EYNLEDVDNAILAVRSTVASGMDWKDLARMIKEEKKAGNPVAGLIHSLQLEKNQITLLLS 454

Query: 474  NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
            NNLD+MDD+EKT PV KV+VD+ LSAHANARRW+E KKK   KQ+KT  AH KAFKAAEK
Sbjct: 455  NNLDDMDDDEKTQPVSKVDVDIGLSAHANARRWFEQKKKHAVKQDKTKAAHEKAFKAAEK 514

Query: 534  KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
            KT  Q+ Q K+VA ISHMRKVHWFEKFNWF+SSENYL+ISGRDAQQNE++VKRYM KGD+
Sbjct: 515  KTLQQLAQAKSVAAISHMRKVHWFEKFNWFVSSENYLIISGRDAQQNELVVKRYMRKGDL 574

Query: 594  YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
            YVHADLHGASSTVI+NH P  P+PPLT+NQAG FTVC SQAWDSK+VTSAWWV  HQVSK
Sbjct: 575  YVHADLHGASSTVIQNHNPLYPIPPLTINQAGVFTVCRSQAWDSKIVTSAWWVEAHQVSK 634

Query: 654  TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
            TAPTGEYLTVGSFM+RGKKNFLPP+PL+MGFG+LFRLD+SS+ +HLNERRVRGE E  D 
Sbjct: 635  TAPTGEYLTVGSFMVRGKKNFLPPNPLVMGFGVLFRLDDSSIAAHLNERRVRGEVEDDDT 694

Query: 714  FEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS 773
                     N+D+ S+  D       E L             N   V   + P  +  I 
Sbjct: 695  LT---LVTSNNDVYSKTPDA-----IEELDGVGEEEEQDIEFNEDEVADSKCPDVEVEIG 746

Query: 774  NGIDSKI-FDIARNVAAPVTPQLEDLIDRALGLGSA---SISSTKHGIET--TQFDLSEE 827
            N +D K+   I    ++     L+ L+DRAL L +    + +++K+G++T   Q   +E 
Sbjct: 747  N-LDEKVDAGIEGEGSSDDASGLDALLDRALELRAGPKRTDTNSKYGLDTLPAQVSDTEY 805

Query: 828  DKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDA---------SSQP 878
            D  V + A+ R+KPYISKAERRK KKG        KVE+  E+   A         +SQ 
Sbjct: 806  DLPVAK-ASQREKPYISKAERRKAKKG-------GKVEKGSEKDASAETVDGEEEKTSQE 857

Query: 879  ESIVRKTKI-----------EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK 927
            E++  K+ I            G K+ RG+KGKLKK+K KY +QDE+ER +RM+LLA +  
Sbjct: 858  ENLKTKSAIFKDDKMSESSPLGEKVGRGRKGKLKKIKAKYAEQDEDERELRMSLLAVSFN 917

Query: 928  VQKNDGDPQNENASTHKEKK----PAISPVDA----PKVCYKCKKAGHLSKDCKEHPDDS 979
                   P   +   +  +     P IS  DA     KVCYKCKK GHL++DC       
Sbjct: 918  F------PSMIHVKYYFIQDCWSLPYISG-DAIALGSKVCYKCKKVGHLARDCT------ 964

Query: 980  SHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIP 1039
                            +++ + + EE++ E+G++E+ +L ++D LTG P  +D+LLY +P
Sbjct: 965  --------------VTDVEPLLLAEENVQELGDDERDKLTELDSLTGCPTATDVLLYAVP 1010

Query: 1040 VCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            VC PY A+Q YKYRVK+ PG  KKGK  +    +   M  +T
Sbjct: 1011 VCAPYQALQGYKYRVKLTPGNGKKGKVAKFAVDIFSHMQEIT 1052


>gi|302768961|ref|XP_002967900.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
 gi|300164638|gb|EFJ31247.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
          Length = 1083

 Score = 1066 bits (2756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1098 (52%), Positives = 735/1098 (66%), Gaps = 98/1098 (8%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK R+N ADVAAEVKCLR LIGMRC+NVYDL+PKTY+ KL  SSG+T SGE E+ L+L+
Sbjct: 1    MVKGRLNVADVAAEVKCLRCLIGMRCANVYDLTPKTYVIKLAKSSGLTSSGEGERALVLL 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLH T ++RDK  TPSGFTLKLRKHIRTRRLE+V+QLG DR++ FQFG G  AH++
Sbjct: 61   ESGVRLHMTEFSRDKSVTPSGFTLKLRKHIRTRRLENVQQLGVDRVVDFQFGTGELAHHI 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHR-------DDDKGVAIMSRHRYPTEICRVFERTT 173
            ILELYAQGN+LLTD+++ VLTLLRSHR       DD KG+A+M+RHRYP E CR F+RTT
Sbjct: 121  ILELYAQGNVLLTDADYNVLTLLRSHRQACRFFLDDYKGIAMMARHRYPVENCRTFQRTT 180

Query: 174  ASKLHAALT-SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
               L  A +   K+ +  E  +  +D           L  +K  + F             
Sbjct: 181  MQDLIRAFSPDEKKAEQQEAQQTPQDAR---------LQKKKDDEGF------------- 218

Query: 233  ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNK---LEDNAIQVLVLAVA 289
                  TLK++L ++  YGPA+ EH+ILD GL PNMK+ + +    + +  +  L+ A+ 
Sbjct: 219  ------TLKSILLDSFSYGPAVFEHVILDAGLQPNMKVCDASNRSMVSEKDLHSLLEAIK 272

Query: 290  KFEDWLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +FEDWL+ V +GD  PEGYI    NK   K +  +   +  +++DEF PLLL Q   RE+
Sbjct: 273  RFEDWLESVTTGDFTPEGYITFHPNKTAKKKNAES---AEEKMFDEFSPLLLKQSAHREY 329

Query: 349  VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
            VKF+TFDAALDEF+SKIE QR +QQ K +ED+A+ KL KI  DQ +RV +LK+EVD++V 
Sbjct: 330  VKFDTFDAALDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRADQRSRVESLKREVDQAVH 389

Query: 409  MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
             AELIEYNL DVD AI AVR ALAN M W+DL RM+KEERKAGNPVAGLI  L LE+N +
Sbjct: 390  TAELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHI 449

Query: 469  SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            +LLLSNNLD+MDD++KT P +KVEVDL+LSAHANAR+W+++KKKQ  KQEKT+ AH KAF
Sbjct: 450  TLLLSNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQALKQEKTVAAHEKAF 509

Query: 529  KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
            KAAE+KT+ Q+ Q K VA ISH+RKVHWFEKFNWFISSENYL+ISGRDAQQNE IVKRYM
Sbjct: 510  KAAERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISGRDAQQNEQIVKRYM 569

Query: 589  SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
             KGD+YVHADLHGASST+IKNH P QPV PLT+NQAGCFTVC SQAWDSK++TSAWWVY 
Sbjct: 570  KKGDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQAWDSKIITSAWWVYD 629

Query: 649  HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE- 707
            HQVSKTAPTGEYLTVGSFMIRGKKNFLPP+PL+MGFGL FRLDESS+ +H NERR+R E 
Sbjct: 630  HQVSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESSIPAHFNERRIRAEG 689

Query: 708  --EEGMDDFEDSGHHKENSDIESEKDDTDE-KPVAESLSVPNSAHPAPSHTNASNVDSHE 764
              EE   + +D     +++ +E  +D   E K   +  S    A    +    S     E
Sbjct: 690  DNEEPEAEIQDD-EEIDDASVEDSQDKVHERKESGDGGSTIEKASVTEAEEARSEEAESE 748

Query: 765  FPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS---ASISSTKHGIETTQ 821
                 +T +  +D +        A      ++ L+D+AL L S   + + + K+G+   Q
Sbjct: 749  EARAPETENAAMDEQ-----EEQAPQSDSDIDSLLDKALELKSVLPSQVDTNKYGLGEVQ 803

Query: 822  FDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI 881
             +   +D   E T   R+KPYISKAERRKLKKG  +  V    E EK+  ++ SS     
Sbjct: 804  TEDQVDDADQE-TKVAREKPYISKAERRKLKKGGNTQEV--AQENEKDGIEEGSS----- 855

Query: 882  VRKTKIEGGKISRGQKGKLK------KMKEKYGDQDEEERNIRMALLASAGKVQKNDGDP 935
                   G K S G   +++      K  +KY +QD+EER +RM+LL S  K Q      
Sbjct: 856  -------GAKPSEGSNKQVRGKKGKLKKLKKYAEQDDEERELRMSLL-SVTKEQPEKPSV 907

Query: 936  QNENASTH----KEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGL 991
            +NE +S       E          P +CY CKK+GH++ +C +     S           
Sbjct: 908  KNEGSSCTLIFVLEFASVTDAAKKPVICYTCKKSGHVASECPDSKQTES----------- 956

Query: 992  DETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYK 1051
             E A ++     EE+I ++ EEE+ +L ++D LTG PLP+DILLY +PVCGPYSA+QSYK
Sbjct: 957  -EIAAINA----EENIVDLDEEEREKLTELDALTGRPLPNDILLYAVPVCGPYSALQSYK 1011

Query: 1052 YRVKIIPGTAKKGKGIQI 1069
            Y VKI PG +KKGKG ++
Sbjct: 1012 YHVKITPGPSKKGKGAKM 1029


>gi|224101503|ref|XP_002312307.1| predicted protein [Populus trichocarpa]
 gi|222852127|gb|EEE89674.1| predicted protein [Populus trichocarpa]
          Length = 796

 Score = 1064 bits (2751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 556/760 (73%), Positives = 619/760 (81%), Gaps = 29/760 (3%)

Query: 331  IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHM 390
            IYDEFCPLLLNQFR RE VKF+ FDAALDEFYSKIESQ++E Q K KE +A  KLNKI +
Sbjct: 1    IYDEFCPLLLNQFRMREHVKFDAFDAALDEFYSKIESQKSEHQQKTKEGSAIQKLNKIRL 60

Query: 391  DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            DQENRV  L++EVD SVKMAELIEYNLEDV++AILAVRVALA  M WEDLARMVK+E+KA
Sbjct: 61   DQENRVEMLRKEVDHSVKMAELIEYNLEDVNSAILAVRVALAKGMGWEDLARMVKDEKKA 120

Query: 451  GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
            GNPVAGLIDKL+ E+NCM+LLLSNNLDEMDD+EKT PV+KVEVDLALSAHANARRWYELK
Sbjct: 121  GNPVAGLIDKLHFEKNCMTLLLSNNLDEMDDDEKTFPVDKVEVDLALSAHANARRWYELK 180

Query: 511  KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            KKQESKQEKT+TAH KAFKAAEKKTRLQ+ QEK+VA ISHMRKVHWFEKFNWFISSENYL
Sbjct: 181  KKQESKQEKTVTAHEKAFKAAEKKTRLQLSQEKSVATISHMRKVHWFEKFNWFISSENYL 240

Query: 571  VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
            VISGRDAQQNEMIVKRY+SKGD+YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC
Sbjct: 241  VISGRDAQQNEMIVKRYVSKGDLYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 300

Query: 631  HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            HSQAWDSK+VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL
Sbjct: 301  HSQAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 360

Query: 691  DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKP-VAESLSVPNSAH 749
            DESSLGSHLNERRVRGEE+G++D E+S   KE SD ESE+++   K  V ES        
Sbjct: 361  DESSLGSHLNERRVRGEEDGVNDVEESQPLKEISDSESEEEEVAGKELVLES-------- 412

Query: 750  PAPSHTN---ASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTPQLEDLIDRALGL 805
               SH+N    SN   HE   ++ ++ NG++   + D+  N  APVTPQLEDLIDRALGL
Sbjct: 413  --ESHSNDLTVSNTILHESSVQETSL-NGVNIENLSDVVGNDVAPVTPQLEDLIDRALGL 469

Query: 806  GSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVE 865
            G  ++SS  +G+E  Q D++EE  H E     RDKPYISKAERRKLKKGQ SS  D +VE
Sbjct: 470  GPTAVSSKNYGVEPLQVDMTEE--HHEEA---RDKPYISKAERRKLKKGQRSSATDAEVE 524

Query: 866  REKERGKD---ASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            REKE  KD   +  QPE  V+  K  GGKI RGQ+ KLKKMKEKY +QDEEER+IRMALL
Sbjct: 525  REKEELKDNVVSVDQPEKHVQNNKQGGGKIIRGQRSKLKKMKEKYANQDEEERSIRMALL 584

Query: 923  ASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDS--- 979
            ASAG  +KNDG+ QN N +T K K       DA KVCYKCKKAGHLS+DC EHPDDS   
Sbjct: 585  ASAGNTRKNDGEIQNGNEATDKGKISITGTEDALKVCYKCKKAGHLSRDCPEHPDDSLNS 644

Query: 980  -SHGVEDNPCVGL-DETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYV 1037
             + G  D   V L D T+E+D+VAMEEEDIHEIGEEEK RLND+DYLTGNPLP DIL Y 
Sbjct: 645  RADGAVDKSHVSLVDSTSEVDRVAMEEEDIHEIGEEEKERLNDLDYLTGNPLPIDILSYA 704

Query: 1038 IPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLM 1077
            +PVCGPYSAVQSYKYRVK+IPGT KKGK  +   +L   M
Sbjct: 705  VPVCGPYSAVQSYKYRVKVIPGTVKKGKAARTAMNLFSHM 744


>gi|302761200|ref|XP_002964022.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
 gi|300167751|gb|EFJ34355.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
          Length = 1052

 Score = 1063 bits (2749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 575/1097 (52%), Positives = 732/1097 (66%), Gaps = 136/1097 (12%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            MVK R+N ADVAAEVKCLR LIGMRC+NVYDL+PKTY+ KL  SSG+T SGE E+ L+L+
Sbjct: 1    MVKGRLNVADVAAEVKCLRCLIGMRCANVYDLTPKTYVIKLAKSSGLTSSGEGERALVLL 60

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESGVRLH T ++RDK  TPSGFTLKLRKHIRTRRLE+V+QLG DR++ FQFG G  AH++
Sbjct: 61   ESGVRLHMTEFSRDKSVTPSGFTLKLRKHIRTRRLENVQQLGVDRVVDFQFGTGELAHHI 120

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELYAQGN+LLTD+++ VLTLLRS       +A+M+RHRYP E CR F+RTT   L  A
Sbjct: 121  ILELYAQGNVLLTDADYNVLTLLRS-------IAMMARHRYPVENCRTFQRTTMQDLIRA 173

Query: 181  LT-SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
             +   K+ +  E  +  +D           L  +K  + F                   T
Sbjct: 174  FSPDEKKAEQQEAQQTPQDAR---------LQKKKDDEGF-------------------T 205

Query: 240  LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNK---LEDNAIQVLVLAVAKFEDWLQ 296
            LK++L ++  YGPA+ EH+ILD GL PNMK+ + +    + +  +  L+ A+ +FEDWL+
Sbjct: 206  LKSILLDSFSYGPAVFEHVILDAGLQPNMKVCDASNRSMVSEKDLHSLLEAIKRFEDWLE 265

Query: 297  DVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
             V +GD  PEGYI    NK   K +  +   +  +++DEF PLLL Q   RE++KF+TFD
Sbjct: 266  SVTTGDFTPEGYITFHPNKTAKKKNAES---AEEKMFDEFSPLLLKQSAHREYIKFDTFD 322

Query: 356  AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            AALDEF+SKIE QR +QQ K +ED+AF KL KI  DQ +RV +LK+EVD++V  AELIEY
Sbjct: 323  AALDEFFSKIEGQRLDQQRKTQEDSAFSKLEKIRADQRSRVESLKREVDQAVHTAELIEY 382

Query: 416  NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
            NL DVD AI AVR ALAN M W+DL RM+KEERKAGNPVAGLI  L LE+N ++LLLSNN
Sbjct: 383  NLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITLLLSNN 442

Query: 476  LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
            LD+MDD++KT P +KVEVDL+LSAHANAR+W+++KKKQ  KQEKT+ AH KAFKAAE+KT
Sbjct: 443  LDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQALKQEKTVAAHEKAFKAAERKT 502

Query: 536  RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
            + Q+ Q K VA ISH+RKVHWFEKFNWFISSENYL+ISGRDAQQNE IVKRYM KGD+YV
Sbjct: 503  QQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISGRDAQQNEQIVKRYMKKGDLYV 562

Query: 596  HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
            HADLHGASST+IKNH P QPV PLT+NQAGCFTVC SQAWDSK++TSAWWVY HQVSKTA
Sbjct: 563  HADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQAWDSKIITSAWWVYDHQVSKTA 622

Query: 656  PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE---EEGMD 712
            PTGEYLTVGSFMIRGKKNFLPP+PL+MGFGL FRLDESS+ +H NERR+R E   EE   
Sbjct: 623  PTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESSIPAHFNERRIRAEGDNEEPEA 682

Query: 713  DFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTI 772
            + +D     +++ +E  +D   E+  +E+ ++      AP   + S++DS          
Sbjct: 683  EIQDD-EEIDDASVEDSQDKVHERKESENAAMDEQEEQAPQ--SDSDIDS---------- 729

Query: 773  SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS---ASISSTKHGIETTQFDLSEEDK 829
                                     L+D+AL L S   + + + K+G+   Q +   +D 
Sbjct: 730  -------------------------LLDKALELKSVLPSQVDTNKYGLGEVQTEDQVDDA 764

Query: 830  HVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEG 889
              E T   R+KPYISKAERRKLKKG  +  V    E EK+  ++ SS            G
Sbjct: 765  DQE-TKVAREKPYISKAERRKLKKGGNTQEV--AQENEKDGIEEGSS------------G 809

Query: 890  GKISRGQKGKLK------KMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTH 943
             K S G   +++      K  +KY +QD+EER +RM+LL+SAG+ +K     Q E  S  
Sbjct: 810  AKPSEGSNKQVRGKKGKLKKLKKYAEQDDEERELRMSLLSSAGR-EKPSAKEQPEKPSVK 868

Query: 944  KEKKPAI--------SPVDA---PKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLD 992
             E             S  DA   P +CY CKK+GH++ +C +     S            
Sbjct: 869  NEGSSCTLIFVLEFASFTDAAKKPVICYTCKKSGHVASECPDSKQTES------------ 916

Query: 993  ETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKY 1052
            E A ++     EE+I ++ EEE+ +L ++D LTG PLP+DILLY +PVCGPYSA+QSYKY
Sbjct: 917  EIAAINA----EENIVDLDEEEREKLTELDALTGRPLPNDILLYAVPVCGPYSALQSYKY 972

Query: 1053 RVKIIPGTAKKGKGIQI 1069
             VKI PG +KKGKG ++
Sbjct: 973  HVKITPGPSKKGKGAKM 989


>gi|297736754|emb|CBI25955.3| unnamed protein product [Vitis vinifera]
          Length = 712

 Score =  993 bits (2568), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/745 (68%), Positives = 568/745 (76%), Gaps = 93/745 (12%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           RMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FK MNSSGVTESG SEKVLLLM+SGV
Sbjct: 6   RMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKFMNSSGVTESGGSEKVLLLMKSGV 65

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           RLHTTAY R    TPSGFTLKLRKHI TRRLEDVRQLGYDR+ILFQFGLG NAHYVILEL
Sbjct: 66  RLHTTAYVR---MTPSGFTLKLRKHICTRRLEDVRQLGYDRVILFQFGLGANAHYVILEL 122

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
            AQGNILLTDSEF V+TLL SHRDDDKGVAI+SRH YP EICRVFE TT +KL AALTS 
Sbjct: 123 CAQGNILLTDSEFMVMTLLGSHRDDDKGVAIISRHWYPVEICRVFECTTTTKLQAALTSP 182

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           KE ++NE  +                G +KG KS + SKN+N    DGARAKQ TLKTVL
Sbjct: 183 KESESNEAKQ----------------GNRKGAKSSEPSKNTN----DGARAKQATLKTVL 222

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
           GEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD V
Sbjct: 223 GEALGYGPALSEHIILDAGLIPNTKVTKDSKFDFDTIQRLAQSVAKFENWLEDVILGDQV 282

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
           PEGYILMQNK  GKD  P++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSK
Sbjct: 283 PEGYILMQNKIFGKDCRPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSK 342

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           IE QR+EQQ KAKE              ENRVHTLK+E DR +KMAELIEYNLEDVDAAI
Sbjct: 343 IEGQRSEQQQKAKE--------------ENRVHTLKKEDDRCIKMAELIEYNLEDVDAAI 388

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           LAVRVALAN M+WEDLARMVKE++K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMDD+EK
Sbjct: 389 LAVRVALANGMNWEDLARMVKEKKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMDDDEK 448

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
           TL V+KVEVDLALSAHANAR+WYE KK+QE+K+EKTI AH K  K  +++         +
Sbjct: 449 TLHVDKVEVDLALSAHANARQWYEQKKRQENKREKTIIAHEKLLKLLKRRL------ASS 502

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY-------MSKGDVYVHA 597
             +   +   +WFEKFNWFISS+NY VISGRDAQ NEMIVKRY       M     + +A
Sbjct: 503 FHSYWPLVLFNWFEKFNWFISSKNYFVISGRDAQLNEMIVKRYIELRRKKMRPNSTHYYA 562

Query: 598 -------------------------------------------DLHGASSTVIKNHRPEQ 614
                                                      D HGASSTVIKNH+PE 
Sbjct: 563 TKKELCKDFEFPTYCNTVISILVKVFLKLIGFSYLSNARYIHADPHGASSTVIKNHKPEH 622

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           PVPPLTLNQAGCFTVCHSQ WDSK+VTSAWWVYPHQVSKTAPTGEYLTVGSFMI GKKNF
Sbjct: 623 PVPPLTLNQAGCFTVCHSQVWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIHGKKNF 682

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHL 699
           LPPHPL+MGFGLLF LDE +   H+
Sbjct: 683 LPPHPLMMGFGLLFCLDERAPWDHI 707


>gi|414878087|tpg|DAA55218.1| TPA: hypothetical protein ZEAMMB73_985047 [Zea mays]
          Length = 608

 Score =  862 bits (2227), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/608 (71%), Positives = 502/608 (82%), Gaps = 15/608 (2%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM T DVAAEVKCLRRLIGMR +NVYD++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKARMTTTDVAAEVKCLRRLIGMRLANVYDITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HTT Y RDK  TPSGFTLKLRKHIR +RLEDVR LGYDRIILFQFGLG NAH++
Sbjct: 61  ESGVRFHTTQYVRDKSTTPSGFTLKLRKHIRNKRLEDVRMLGYDRIILFQFGLGSNAHFI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVF RT  +KL   
Sbjct: 121 ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFGRTDFAKLKDM 180

Query: 181 LT-----------SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK-NSNKN 228
           LT           +S   DA E  +   D   V+  S+++L  ++   +    +  SN  
Sbjct: 181 LTKPDKADDKEEITSGSTDAQETSQSTNDEVLVTEISEKSLSKKEKKAAAKAKQFGSNAK 240

Query: 229 SNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVL 286
            N+GA++ + TLKT+LGEAL YGPAL+EHIILD GLVP+ K+ +   + + D+ +Q L+ 
Sbjct: 241 VNNGAQSNKATLKTILGEALAYGPALAEHIILDAGLVPSTKVGKDPESTINDSTVQSLME 300

Query: 287 AVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGS-STQIYDEFCPLLLNQFRS 345
           ++ +FEDWL D+ISG  +PEGYILMQNK   K+  P E  S + +IYDE+CP+LLNQF+S
Sbjct: 301 SITRFEDWLVDIISGQRIPEGYILMQNKMTAKNITPLEEASINHKIYDEYCPVLLNQFKS 360

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           RE+ +F TFDAALDEFYSKIESQ+  QQ KAKE++A  +LNKI +DQENRVHTL++EVD 
Sbjct: 361 REYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENRVHTLRKEVDH 420

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
            VKMAELIEYNLEDVDAAILAVRV+LAN MSWE L RM+KEERKAGNPVAGLIDKL  ER
Sbjct: 421 CVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVAGLIDKLNFER 480

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
           NC++LLLSNNLD+MD++EKT PVEKVEVD+ALSAHANARRWYE+KKKQESKQEKTITAH 
Sbjct: 481 NCITLLLSNNLDDMDEDEKTAPVEKVEVDIALSAHANARRWYEMKKKQESKQEKTITAHD 540

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IVK
Sbjct: 541 KAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIVK 600

Query: 586 RYMSKGDV 593
           RYMSKGD+
Sbjct: 601 RYMSKGDL 608


>gi|384249421|gb|EIE22903.1| hypothetical protein COCSUDRAFT_16391 [Coccomyxa subellipsoidea
           C-169]
          Length = 1029

 Score =  657 bits (1695), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/723 (47%), Positives = 458/723 (63%), Gaps = 78/723 (10%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGE-SEKVLL 58
           MVK RM+TADV  EV CLR  ++GMR +NVYD + KTYI KL      ++SGE  EK LL
Sbjct: 1   MVKQRMSTADVVGEVACLRHSVLGMRVANVYDANAKTYIIKL------SKSGEEGEKALL 54

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESGVR HTT Y +DK +TPS FTLKLRKH+RTRRL+DVRQLG DR++ F FG G   +
Sbjct: 55  VLESGVRFHTTRYLKDKADTPSNFTLKLRKHLRTRRLDDVRQLGVDRVVDFSFGTGEACY 114

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++ILELYAQGN++L D+ +++LTLLRSHRDDDKG+AIM+RH YP    R+    T ++L 
Sbjct: 115 HLILELYAQGNVILADANYSILTLLRSHRDDDKGLAIMARHAYPVHAIRLRSALTQAQLD 174

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           AAL S+ +                                                  + 
Sbjct: 175 AALASADD--------------------------------------------------KQ 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL+  L   + YGPALSEH  L  GL P  K  + + L +     L+  V  +E WL   
Sbjct: 185 TLRGALASVVPYGPALSEHCTLLAGLRPTRK-PKADPLCEEERTALLGGVRHWEAWLDAC 243

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +    PEG+I ++    G +     +     +YD F PL+L Q   +E ++F T++AAL
Sbjct: 244 ETA--APEGFISLKRPADGSE--AASASGDCLVYDSFDPLILQQNSGQEVLRFPTYNAAL 299

Query: 359 DEFYSK-----------IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           DEFY+K           +E Q+AEQ     E AA  KL++I +DQ  R   L +E   + 
Sbjct: 300 DEFYAKARPAPLCLTMSVEGQKAEQARLQAEQAALSKLDRIRIDQTGRAEALDREAKEAE 359

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
             A+LIE N E VD AI AVRVALA  +SW +L R++++E  AGN VAGL+  L+L+RN 
Sbjct: 360 AKAQLIEANAEAVDQAINAVRVALAQGLSWAELERLIRDEAAAGNQVAGLVHALHLDRNA 419

Query: 468 MSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
           ++LL SN   E +DE  T +P   VEVDL L+A  NAR W+  +K + +KQ KT+ A+ +
Sbjct: 420 VTLLDSNA--ESNDETGTDVPTALVEVDLDLNAQQNARAWHSDRKARSAKQAKTLDANKR 477

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           A   A+KK ++Q+ + K VA +  +RK  WFEKFNWF++SENYLV+SGRDAQQNE++VKR
Sbjct: 478 ALVEADKKVQVQLSKVKAVAAVQQLRKPAWFEKFNWFVTSENYLVVSGRDAQQNELLVKR 537

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+ K D+YVHA+LHGAS+TV++NH P +P   L ++QAG   VC SQAWD+K+VTSAWWV
Sbjct: 538 YLRKDDLYVHAELHGASTTVVRNHNPSRPGMAL-VSQAGTACVCRSQAWDAKIVTSAWWV 596

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
           + HQVSK+AP+GEYL  GSFMIRG+KNFLPPHPLIMG   LF+LDES +  HL ER  + 
Sbjct: 597 HAHQVSKSAPSGEYLPTGSFMIRGRKNFLPPHPLIMGLTFLFKLDESCIAGHLGERAPKS 656

Query: 707 EEE 709
            E+
Sbjct: 657 AED 659



 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 905  EKYGDQDEEERNIRMALLASAG-KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCK 963
            EKY  QD+E+R + +  LA AG +    +   + E     K +K A +  D   V  +  
Sbjct: 826  EKYAHQDDEDRQLALQFLAPAGGRFPAWEKKDKKEKREARKARKKAGATGDGNAVADRLP 885

Query: 964  KAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDY 1023
             AG L+          + G    P +            + EE++  + EE+K +L ++D 
Sbjct: 886  TAGELA----------AAGARLGPRIA---------AILAEENVELVPEEDKDKLQELDS 926

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            LTG P P D+LLY IP+C PYSA+QSYK +VK+ PGT +KG+  +    LL
Sbjct: 927  LTGQPRPDDVLLYAIPMCAPYSAIQSYKLKVKLTPGTQRKGRAGRQAIELL 977


>gi|189239405|ref|XP_001813943.1| PREDICTED: similar to CG11847 CG11847-PA [Tribolium castaneum]
 gi|270010510|gb|EFA06958.1| hypothetical protein TcasGA2_TC009916 [Tribolium castaneum]
          Length = 972

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 413/1094 (37%), Positives = 583/1094 (53%), Gaps = 169/1094 (15%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++ +GMR +NVYD+  KTY+ +L  S         EK ++L+E
Sbjct: 1    MKTRFNTFDIICTVTELQKCVGMRVNNVYDIDNKTYLIRLQRSE--------EKAVILLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R H T +   K   PSGF++KLRKH++ +RLE + QLG DRI+ FQFG G  A++VI
Sbjct: 53   SGNRFHETGFEWPKNVAPSGFSMKLRKHLKNKRLESLAQLGTDRIVDFQFGSGEAAYHVI 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD EFT+L +LR H + D+    + R +YP +  R     T  +L   L
Sbjct: 113  LELYDKGNIILTDFEFTILNVLRPHTEGDR-FKFVVREKYPQDRARQSSLITRDELVQLL 171

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             ++K  D                                                   LK
Sbjct: 172  KAAKNGDQ--------------------------------------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             VL   L YGP L EH++L  G   + K+ +   +E +  +VL  A+ + E+   +    
Sbjct: 182  KVLVPNLEYGPPLIEHVLLKQGFSNSTKIGKTFNIESDVDKVLC-ALEEAENLFSEAKKA 240

Query: 302  DIVPEGYILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
                +GYI+ + +   +  D+P  E   S Q   EF P+L  Q +S    +F +F++A+D
Sbjct: 241  GF--KGYIIQKKEERVVSADNPEKEYYYSNQ---EFHPVLYEQHKSSISKEFPSFNSAVD 295

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
            EF+S +ESQ+ E +   +E  A  KL  +  D   R+  L+  QE+D+  + AELI  N 
Sbjct: 296  EFFSSLESQKLELKALQQEREALKKLENVKKDHSQRLLALEKTQEIDK--QKAELITRNQ 353

Query: 418  EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
            E VD AILAV+ ALA ++SW DLA ++KE    G+ +A  I +L LE N +SL L++   
Sbjct: 354  ELVDKAILAVQTALATQISWSDLADLIKEAASQGDEIAQRIKELKLETNHISLYLTDPYA 413

Query: 475  ---NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
               +  + +D +  +P   V+VDL LSA AN RR+Y+ K+    KQ+KTI + SKAFK+A
Sbjct: 414  EDDSESDDEDNDDKIPPMVVDVDLDLSAFANGRRYYDQKRNAAKKQQKTIESQSKAFKSA 473

Query: 532  EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
            EKKT+  +   +T+ NI+  RKV+WFEKF WFISSENYLVI+GRD QQNE+IVKRYM   
Sbjct: 474  EKKTKQTLKDVQTITNINKARKVYWFEKFFWFISSENYLVIAGRDQQQNELIVKRYMKST 533

Query: 592  DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
            DVYVHAD+HGASS VIKN    Q VPP TLN+AG   +C+S AWD+K+VT+A+WV+  QV
Sbjct: 534  DVYVHADVHGASSVVIKNPSG-QAVPPKTLNEAGTMAICYSVAWDAKVVTNAYWVWGEQV 592

Query: 652  SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
            SKTAPTGEYL+ GSFMIRGKKNFLP   LI+G   LF+L+ES +  H +ERRV     G 
Sbjct: 593  SKTAPTGEYLSTGSFMIRGKKNFLPLSHLILGLSFLFKLEESCIEKHKDERRVIA--PGE 650

Query: 712  DDFEDSGHHKENSDIESEK-DDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDK 770
            +DF ++   +   ++E E  D++DE+                   N   V S    A DK
Sbjct: 651  EDFVETVESENKDEVEVEVLDESDEE-------------------NKEEVKS----AADK 687

Query: 771  TISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKH 830
             I N  +S   +   +   P T            +       TK  I T     ++E   
Sbjct: 688  EIENEENSSSSEDEESSKFPDT-----------QIKIQHFEGTKINILTEPVIRNDETDE 736

Query: 831  VERTATVRD-KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEG 889
             E    + D KP + K  +R  +    S    PK + + ER ++ ++      ++TK   
Sbjct: 737  NETVVYLGDNKPVVVKPNQRS-RNTSESKTKQPKNDAKNERKEETNN------KQTK--- 786

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPA 949
                RGQK KLKK+KEKY DQDEEER +RM +L SAG         +++    +K    A
Sbjct: 787  ----RGQKSKLKKIKEKYKDQDEEERKLRMEILQSAG------SQKESKKNKKNKNSNKA 836

Query: 950  ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVED-NPCVGLDETAEMDKVAMEEEDIH 1008
              P + PK+     K   L     E  DD   G ED  P V     AE+D          
Sbjct: 837  KKP-EQPKII----KERILPVQKSEMIDDG--GAEDEEPVV----QAELDM--------- 876

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
                        ++ LTG P   D LL+ +PV  PY+A+ +YK+++KI PGT+++GK  +
Sbjct: 877  ------------INSLTGVPFADDELLFAVPVVAPYNALTNYKFKIKITPGTSRRGKAAR 924

Query: 1069 IFYSLLLLMLSLTP 1082
               ++ L   S+TP
Sbjct: 925  TAVNMFLKDRSITP 938


>gi|428183447|gb|EKX52305.1| hypothetical protein GUITHDRAFT_65529, partial [Guillardia theta
           CCMP2712]
          Length = 703

 Score =  580 bits (1495), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 314/716 (43%), Positives = 441/716 (61%), Gaps = 79/716 (11%)

Query: 21  LIGMRCSNVYDLSPKTYIFK------------LMNSSGVTES---GESEKVLLLMESGVR 65
           L+G R +N+YDL  KTY+ K             + S  +TE       EK L+L+ESG+R
Sbjct: 1   LLGARLANIYDLDAKTYLLKTNKVRHALAGGAWLLSPWMTERFPLQSGEKCLVLLESGIR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            HTT + RDK N PSGFTLKLRKHIR +R+E+V+QLG DR+++F FG    A ++ILEL+
Sbjct: 61  FHTTEFMRDKSNMPSGFTLKLRKHIRMKRIEEVKQLGVDRVVIFTFGAADEAFHLILELF 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSK 185
           A GNI+L D ++T+L LLR++ D+                       T +K+    T   
Sbjct: 121 AGGNIILVDHQYTILALLRTYTDE----------------------ATNTKVAVKETYQL 158

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
           + + NE  K++ D           L  +  GK              GA+ ++ T++ VL 
Sbjct: 159 DSNQNENRKISVD-----------LLMEAFGK--------------GAKNEKATMRDVLI 193

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK-FEDWLQDVISGDIV 304
           + L YGPAL EH +L T L   MK+SE+    D+ +   +  V K  +D + ++  G  +
Sbjct: 194 KELDYGPALVEHALLGTSLDGKMKVSEMEITRDSPVVSTLFGVFKEVDDMIANLTDGGKM 253

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
            EG ++   K  G+D P          YD+F P++L Q+  ++   F++FD A+D ++S 
Sbjct: 254 IEGVLV--RKGAGEDSP----------YDDFGPVVLRQYAGKKLDMFDSFDKAMDAYFSI 301

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
            E ++ EQQ   ++ AA  K+ ++    E  +  L++E   +   A LIE NL DVD AI
Sbjct: 302 AEDKKLEQQKVQQKKAAVSKVERVKRAHEASIQALQEEEAENYHRATLIEANLSDVDNAI 361

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           L +   L+  M W  L ++VKEE + GNP+A +I  L L+ N ++LLL+  LD M++EE+
Sbjct: 362 LVINSMLSQGMDWASLKKLVKEEGRKGNPIAQMIHGLKLDSNQITLLLTFGLDAMEEEEQ 421

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
           TLPV  V+VDL ++A+ NA+ +Y  KKK   K EKT+ A  KA K AE+K +  + +  T
Sbjct: 422 TLPVVAVDVDLGMNAYQNAQSYYSSKKKVALKAEKTMQAAGKAIKGAERKAKEDLKKADT 481

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
            A+I  +RK HWFEKF WFISSEN+LV+ GRDAQQNE++VKR+M KGD+Y+HAD+HGA++
Sbjct: 482 KASIQQIRKTHWFEKFIWFISSENFLVLCGRDAQQNELLVKRHMEKGDIYLHADIHGAAT 541

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            +IKNH  +  VPPLTL QAG   VC SQAWD+KMVTSA+WV+P QVSK+APTGEYL+ G
Sbjct: 542 HIIKNHTKD-AVPPLTLAQAGLSCVCRSQAWDAKMVTSAYWVHPEQVSKSAPTGEYLSTG 600

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG---EEEGMDDFEDS 717
           SFMIRGKKN+LPP+ LIMGFGLLFR+DES L  H+ ER++RG   +EE M    DS
Sbjct: 601 SFMIRGKKNYLPPNSLIMGFGLLFRIDESCLAHHVGERKIRGLGEQEEEMGKAGDS 656


>gi|340713692|ref|XP_003395373.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
            terrestris]
          Length = 971

 Score =  580 bits (1494), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 387/1083 (35%), Positives = 578/1083 (53%), Gaps = 162/1083 (14%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R N+ D+A  +  L++ IGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1    MKTRFNSYDIACTICELQKFIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R+HTTA+   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A++VI
Sbjct: 53   SGNRIHTTAFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGIDRMIDLQFGSGEAAYHVI 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD E T+L +LR H + DK +    + +YP              +  A 
Sbjct: 113  LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKEKYP--------------MDRAH 157

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             ++  P  N                +++L   K G+S                     LK
Sbjct: 158  QNTMPPIEN---------------IQQHLQNAKAGES---------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             +L   L +G +L +H++L  G     K+ +   + ++  + L+LA+ ++ + + D    
Sbjct: 182  KLLNPLLEFGSSLIDHVLLKHGFTLGCKIGKDFNVAEHMPK-LILAL-EYANEMMDFARK 239

Query: 302  DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
            + V +GYI+ +     K+  PT  G    IY   EF P L  Q+    + +F++FD A+D
Sbjct: 240  N-VSKGYIIQK-----KESKPTTDGKENFIYTNIEFHPFLFEQYADYPYKEFDSFDVAVD 293

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
            E++S +E Q+ + +   +E  A  KL  +  D + R+  L+  QE+D+  + AELI  N 
Sbjct: 294  EYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDK--QKAELISRNQ 351

Query: 418  EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
              VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352  ALVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478  EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            + D+E +  P+  +++DLA +A  NA ++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412  DSDEESELKPM-LIDIDLAHTAFGNATKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538  QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
             + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471  TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIYVHA 530

Query: 598  DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
            DL GASS VIKN   +  VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531  DLTGASSVVIKNPGNDS-VPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658  GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
            GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H NERRVR         +D 
Sbjct: 590  GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKNERRVRV-------IDDE 642

Query: 718  GHH-----KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTI 772
              H     +E+ +IE   D  +++               P + N  N    E   +    
Sbjct: 643  SEHTDSLIEEDREIELIGDSEEDE--------------QPENKNNLNPIQEESKIDMIME 688

Query: 773  SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVE 832
             N ++  + D   N+A    P  +  ID        S S  K  ++  Q  +  + K ++
Sbjct: 689  ENNVNQDVSDEENNLAQ--FPDTQIRID-------VSGSKVKLHVDNNQSTVIPQ-KDLD 738

Query: 833  RTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKI 892
                  DKP I  A    ++K        P  +  KER +    + + +V K        
Sbjct: 739  VIYLGDDKPVIINA--VNMQKRSEIKQKPPLKKDNKERIETEPKKNDQVVLK-------- 788

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
             RGQKG+LKKMKEKY DQDEE+R + M +L SAG  ++N    +N++ S  K+       
Sbjct: 789  -RGQKGRLKKMKEKYKDQDEEDRRLSMQVLQSAGAAKENKRKNKNKDPSGPKQ------- 840

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
                    + KK G       ++     +  E++P  G     E+D              
Sbjct: 841  --------QTKKKGMARPVAPQNIQIVENIEEEDPGPG----PEVDM------------- 875

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
                    +D LTG P+  D LL+ +PV  PY+ V +YK++VK+ PGT K+GK  +   +
Sbjct: 876  --------LDQLTGKPVSEDELLFAVPVIAPYNTVLNYKFKVKLTPGTGKRGKAAKTAMT 927

Query: 1073 LLL 1075
            + +
Sbjct: 928  VFM 930


>gi|350409527|ref|XP_003488770.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
            impatiens]
          Length = 971

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 383/1083 (35%), Positives = 584/1083 (53%), Gaps = 162/1083 (14%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R N+ D+A  +  L++ IGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1    MKTRFNSYDIACTICELQKFIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R+HTTA+   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A++VI
Sbjct: 53   SGNRIHTTAFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHVI 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD E T+L +LR H + DK +    + +YP              +  A 
Sbjct: 113  LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKEKYP--------------MDRAH 157

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             ++  P  N                +++L   K G+S                     LK
Sbjct: 158  QNTMPPIEN---------------IQQHLQSAKAGES---------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             +L   + +G ++ +H++L  G     K+ +   + ++  + L+LA+ ++ + + D    
Sbjct: 182  KLLNPLVEFGASVIDHVLLKHGFTLGCKIGKDFNVAEHMPK-LILAL-EYANEMMDFARK 239

Query: 302  DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
            + V +GYI+ +     K+  PT  G    IY   EF P L  Q+ +  + +F++FD A+D
Sbjct: 240  N-VSKGYIIQK-----KESKPTADGKEDFIYTNIEFHPFLFEQYTNYPYKEFDSFDVAVD 293

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
            E++S +E Q+ + +   +E  A  KL  +  D + R+  L+  QE+D+  + AELI  N 
Sbjct: 294  EYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDK--QKAELISRNQ 351

Query: 418  EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
              VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352  TLVDNAILAIQSALANQMAWPDIKVLLKEAESRGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478  EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            + D+E +  P+  +++DLA +A  NA ++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412  DSDEESELKPM-LIDIDLAHTAFGNATKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538  QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
             + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471  TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIYVHA 530

Query: 598  DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
            DL GASS VIKN   +  VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531  DLTGASSVVIKNPGSDS-VPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658  GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
            GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H +ERRVR         +D 
Sbjct: 590  GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKDERRVRI-------IDDE 642

Query: 718  GHHKENSDIESEKD-----DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTI 772
              H  +S IE +++     D++E   +E+    N+ +P    +    +            
Sbjct: 643  SEHT-DSLIEEDREIELIGDSEEDEQSEN---KNNLNPIQEESKVDIIMEE--------- 689

Query: 773  SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVE 832
             N ++  + D   N+     P  +  ID        S S  K  ++  Q  +  + K ++
Sbjct: 690  -NNVNQDVSDEENNLVQ--FPDTQIRID-------VSGSKVKLHVDNNQLTVMPQ-KDLD 738

Query: 833  RTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKI 892
                  DKP I  A        Q SS +  K+  +K+  +    +P+      K +   +
Sbjct: 739  VIYLGDDKPVIINAVNM-----QKSSEIKQKLPLKKDNKEKIEIEPK------KNDQVVL 787

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
             RGQKG+LKKMKEKY DQDEE+R + M +L SAG  ++N    +N++ S  K+       
Sbjct: 788  KRGQKGRLKKMKEKYKDQDEEDRRLSMQVLQSAGAAKENKRKNKNKDPSGPKQ------- 840

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
                    + KK G       ++     +  E++P  G     E+D              
Sbjct: 841  --------QTKKKGMAKPVAPQNIQIVENIEEEDPGPG----PEVDM------------- 875

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
                    +D LTG P+  D LL+ +PV  PY+ V +YK++VK+ PGT K+GK  +   +
Sbjct: 876  --------LDQLTGKPVSEDELLFAVPVIAPYNTVLNYKFKVKLTPGTGKRGKAAKTAMT 927

Query: 1073 LLL 1075
            + +
Sbjct: 928  VFM 930


>gi|356640194|ref|NP_001239258.1| serologically defined colon cancer antigen 1 [Gallus gallus]
          Length = 1071

 Score =  575 bits (1483), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 411/1140 (36%), Positives = 587/1140 (51%), Gaps = 189/1140 (16%)

Query: 2    VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R +T D+ A V  LR  L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1    MKSRFSTVDIRALVAELRLSLLGMRVNNVYDVDSKTYLIRLQKPDC--------KATLLL 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG+R+HTT +   K   PSGF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++
Sbjct: 53   ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHL 112

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +              +A
Sbjct: 113  IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVRERYPVD--------------SA 158

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
               +  P      ++      +SNA K   G Q                          L
Sbjct: 159  KAPTPLPTLERLTEI------ISNAPK---GEQ--------------------------L 183

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            K VL   L YG  L EH +++ G    +K+ +  + ++N I+ ++ A+ K E ++   ++
Sbjct: 184  KRVLNPHLPYGATLIEHCLIEAGFSGYVKIDQHMESKEN-IEKVLSALEKAEGYM--TLT 240

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
             D   +GYI+ Q K       P +       Y+EF P L +Q     +++F++F+ A DE
Sbjct: 241  EDFNGKGYII-QKKEKKPSLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKAADE 299

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ--EVDRSVKMAELIEYNLE 418
            FYSK+E Q+ + +   +E  A  KL  +  D E R+  L+Q  EVD+ +K  ELIE NLE
Sbjct: 300  FYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDK-IK-GELIEMNLE 357

Query: 419  DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---- 474
             V  AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    
Sbjct: 358  IVSRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLRNPYVL 417

Query: 475  ---------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
                                          ++   +K  P   V+VDL+LSA+ANA+++Y
Sbjct: 418  SEEEEEGEDADLEKEETEEPKGKKKKNKSKQLKKPQKNKP-SLVDVDLSLSAYANAKKYY 476

Query: 508  ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
            + K+    K +KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSE
Sbjct: 477  DHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSE 536

Query: 568  NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            NYLVI+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 537  NYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 595

Query: 628  TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             +C+S AWD+++VTSAWWV  +QVSKTAPTGEYLT GSFMIRGKKNFL P  L+MGF  L
Sbjct: 596  ALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFLQPSYLMMGFSFL 655

Query: 688  FRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH--KENSDIESEKDDTDEKPVAESLSVP 745
            F++DES +  H  ER+++ ++E ++    S      E  ++    D + E+  AE     
Sbjct: 656  FKVDESCVWRHREERKIKVQDEDLETVSSSASELVAEEVELLEGGDSSSEEDKAE----- 710

Query: 746  NSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARN-VAAPVTPQLEDLIDRALG 804
               H AP    A             T  N  D  + D+ ++ V+ P  P  E + D   G
Sbjct: 711  --CHEAPEDVEA-------------TPENNGDENVADLDQDRVSTPPVP--EGVSDEDDG 753

Query: 805  LGSASISSTKHGIET-------TQFDLS--EEDKHVERTATVRDKPYISKAE---RRKLK 852
                     K  ++        T  DLS  +  + +++T    ++P +S ++   RR L 
Sbjct: 754  ESEVEQPEPKSEVKEEEVNYPDTTIDLSHLQSQRSLQKTIPKEEEPNLSDSKSQGRRHLS 813

Query: 853  -----------KGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLK 901
                       +   S  +DP  ER+K+   +    P     K       I RGQK K+K
Sbjct: 814  AKERREMKKKKQQSDSENLDPPEERQKD--TETQRPPPPNTNKGVPAPQPIKRGQKSKMK 871

Query: 902  KMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYK 961
            KMKEKY DQDEE+R + M LL SAG    N  +   +      +++ A       K  ++
Sbjct: 872  KMKEKYKDQDEEDRELIMKLLGSAG---SNKEEKGKKGKKGKTKEEQAKKQQQKSKAVHR 928

Query: 962  CKKAG----------HLSKDC--KEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
                G          H S+D   +E  D+     +D P V  D TA +D           
Sbjct: 929  SAGGGKEMMPGVVVLHESEDLAPEEQQDEKDEQDQDQPGVE-DGTALLDS---------- 977

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
                          LTG P   DILL+ +P+C PY+A+ +YKY+VK+ PGT KKGK  +I
Sbjct: 978  --------------LTGQPHAEDILLFAVPICAPYTAMTNYKYKVKLTPGTQKKGKAAKI 1023


>gi|452822547|gb|EME29565.1| RNA-binding protein [Galdieria sulphuraria]
          Length = 1067

 Score =  573 bits (1477), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 394/1157 (34%), Positives = 587/1157 (50%), Gaps = 216/1157 (18%)

Query: 1    MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLM-----------NSSGVT 48
            M + R +  D+ AEVK LRR  IG R  N+YD++P TY+ K+              S V 
Sbjct: 1    MPRNRFSLLDLQAEVKYLRRRFIGARVVNIYDVTPTTYLLKISVPSRNQISVEETISVVE 60

Query: 49   ESGES--EKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
            ES  S  EK  +L+ESG+R+H T + RDK N PSGF++KLRKHIR+R+++++R LG DR+
Sbjct: 61   ESSNSNWEKTFVLIESGIRIHETRFYRDKANIPSGFSVKLRKHIRSRKIQEIRTLGADRV 120

Query: 107  I-------LFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD----DDKGVAI 155
            +       +F+         +I+E Y+ GNI+LTD E+T+L+ LRS++       + V I
Sbjct: 121  VELVFSSRVFEGSTIERPCRLIVEFYSSGNIVLTDEEYTILSALRSYKGPFGVTKEPVHI 180

Query: 156  MSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKG 215
             +R++YP  + R                                +N+S +    L   K 
Sbjct: 181  FTRNKYPVHLLR--------------------------------SNISLSKNSVLALLKN 208

Query: 216  GKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN- 274
            G   D+ +N                   L   L  GP + EH ++ +G  P  K+ E+  
Sbjct: 209  GSQTDIVRN------------------FLSTRLYCGPQVIEHALVASGFEPKTKIKELFL 250

Query: 275  KLEDNAIQV------LVLAVAKFEDWLQD---VISGDIVPEGYILMQNKHLGKDHPPTE- 324
              EDN   V       + ++  FE  L D         +  GY+  +     KD   T+ 
Sbjct: 251  NAEDNEEGVSHKTLSFLQSLESFESSLCDNDSTCESLSLERGYLFYR-----KDAHTTDV 305

Query: 325  --SGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
              S S   +Y++F P LL    +   ++F TF+ A+D +++ +E +RA+     +E    
Sbjct: 306  SMSNSERLLYEDFSPFLLCHLSNTSHIEFPTFNEAVDIYFANLEKERAQIVASKQESVVS 365

Query: 383  HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
             K++ +  D E R+  L++  + + K+AE IE N ++VD AI  VR  +AN ++W++L +
Sbjct: 366  KKVDSLRKDLERRIDELERAKEENFKIAEAIELNADEVDKAIWVVRAMIANGVAWDELDK 425

Query: 443  MVKEERKAGNPVAGLIDKLYLERNCMSLLL------------------SNNLDEMDDEEK 484
            M++EE++ GNPVA  I  L+L+RN ++L+L                  S ++   DD ++
Sbjct: 426  MLEEEKEKGNPVAETIHSLHLDRNEITLMLPIDPILEDEFVNENFQYQSEDITYYDDTDE 485

Query: 485  T----------------LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            T                 PV   +VDL+LSA ANA R++E +K+ + K+EKT+ A  +A 
Sbjct: 486  TEEHFQTERMVAELNASKPVVLADVDLSLSAFANAARYFESRKRAQEKKEKTMEATKRAL 545

Query: 529  KAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
              AEKK   Q+ +      K    I  +RK  WFEKF+WFISSEN+LVI+G+DAQQNE +
Sbjct: 546  NVAEKKASKQMERSQQRSLKPAVAIREIRKPAWFEKFDWFISSENFLVIAGKDAQQNEQV 605

Query: 584  VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
            VKRYM   DVYVHAD+HGASS V+KN   ++PVP  TL +AG F +CHS AW SK+V+SA
Sbjct: 606  VKRYMKTFDVYVHADIHGASSVVVKNRFRDKPVPLQTLIEAGAFAMCHSSAWSSKIVSSA 665

Query: 644  WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
            WWV+  QVSKTAP+GEYLT GSFMIRGKKN+LPP  L+MG+G+LF++D S    H NER+
Sbjct: 666  WWVHASQVSKTAPSGEYLTTGSFMIRGKKNYLPPSQLVMGYGILFKMDPSCTRDHENERQ 725

Query: 704  VRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSH 763
             R   E ++     GH K N D    + D D        + P SA      T  ++   H
Sbjct: 726  RRPLNEAVE-----GHLKTNEDCAENEPDFDNLE-----TFPTSA------TGNADQFYH 769

Query: 764  EFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFD 823
            E   ++  +++  D     +  N+             + L L S  + +TK   E  QF 
Sbjct: 770  ENNLQEADVAHLFDKYHESLPDNL-------------KTLQLDSTGMLATKED-ELDQFR 815

Query: 824  LSEEDKHV---ERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES 880
             SEE+  +    RT   RD                 S+ V    + + E  K+  + P  
Sbjct: 816  -SEENLELIKYSRTKKARDH----------------STQVGHTKQAQPETFKEKKTSPVD 858

Query: 881  IVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENA 940
            ++    +   K+ RG++ K+K+ K+KY +Q  EERN+ MALL S+   Q          +
Sbjct: 859  LIENVDV--SKLPRGKRSKMKRAKKKYAEQTLEERNLAMALLGSSKSEQV--------TS 908

Query: 941  STHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKV 1000
            ST++E       VD  K     K  G+  ++   + +DS +  E                
Sbjct: 909  STNEEHGREEISVDINK---GLKGKGNHMEEVSNYTEDSKNADE---------------- 949

Query: 1001 AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGT 1060
              EE +  E   +E   +N     TG PL SDI+ + +PVC P+ AV  YKYRVK+IPG+
Sbjct: 950  --EENNSTENFTDETSVVN---LFTGQPLESDIIEFALPVCAPFLAVSRYKYRVKLIPGS 1004

Query: 1061 AKKGKGIQIFYSLLLLM 1077
             KKGK  ++  SL+L M
Sbjct: 1005 MKKGKAAKVANSLMLKM 1021


>gi|307209071|gb|EFN86238.1| Serologically defined colon cancer antigen 1-like protein
            [Harpegnathos saltator]
          Length = 989

 Score =  571 bits (1471), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 398/1095 (36%), Positives = 582/1095 (53%), Gaps = 168/1095 (15%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1    MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDHRTYLIRFQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53   SGNRIHTTGFEWPKNIAPSGFSMKMRKHLKNKRLESLMQVGIDRIIDLQFGSGEAAYHII 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD E  +L +LR H + DK +    R +YP                   
Sbjct: 113  LELYDRGNIILTDHEMVILYILRPHTEGDK-IRFAVREKYPL------------------ 153

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                       D+ + +     +   E+L   K G+S                     LK
Sbjct: 154  -----------DRAHNEAMPPIDEIHEHLQKAKTGES---------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
             VL   L +G A+ +H++L        K+S + N  ED  +  L+LA+    + + +   
Sbjct: 182  KVLNPILEFGSAVIDHVLLKATFALGCKISKDFNITED--MPKLILALEDANNIMDNAKK 239

Query: 301  GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
                 +GYI+ +     K+  PT+ G    I+   EF PLL  Q++ + + +F++FDA +
Sbjct: 240  S--ASKGYIIQK-----KEARPTQDGKEEFIFANIEFHPLLFEQYKDQPYKEFDSFDATV 292

Query: 359  DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYN 416
            DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QEVD+  + AELI  N
Sbjct: 293  DEYFSTMEGQKLDLKALQQEREALKKLENVRKDHDQRLITLEKTQEVDK--QKAELISRN 350

Query: 417  LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
               VD AILA++ ALAN+MSW D+  ++KE +   +PVA  I +L LE N +SLLL +  
Sbjct: 351  QTLVDNAILAIQSALANQMSWPDIQVLLKEAQARSDPVASAIKQLKLETNHISLLLHDPY 410

Query: 477  DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
            +E D+E +  P+  ++VDLA +A  NAR++Y  K+    KQ+KTI +H KA K+AEKKT+
Sbjct: 411  EESDEESELKPM-IIDVDLAHTAFGNARKYYSQKRSAAKKQQKTIESHGKALKSAEKKTK 469

Query: 537  LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
              + + +T+ +I  +RKV+WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVH
Sbjct: 470  QTLKEVQTIHSIIKLRKVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRAGDLYVH 529

Query: 597  ADLHGASSTVIKNHRPEQP----VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
            ADL GASS VIKN     P    VPP +L +AG   + +S AWD+K+V +AWWV+  QVS
Sbjct: 530  ADLTGASSVVIKN-----PTGGFVPPKSLAEAGTMAIAYSVAWDAKVVANAWWVHHDQVS 584

Query: 653  KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
            K+APTGEYLT GSFMIRGKKN+LPP  LIMG G++FRL+E+S+  H +ER+V+       
Sbjct: 585  KSAPTGEYLTTGSFMIRGKKNYLPPSQLIMGLGIMFRLEENSIERHKDERKVKA------ 638

Query: 713  DFEDSGHHKENSD--IESEKD-----DTDEKPVAESLSVPNSAHP----APSHTNASNVD 761
                 G   EN D  IE +K+     D+DE    E  +  N  H      P      N D
Sbjct: 639  ----VGEESENVDSVIEDDKEIELEGDSDEDENLEDKNALNPIHEEDHLEPESCATDNKD 694

Query: 762  SHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQ 821
            +      +K   N  + +  +       P T    DL          S    K  ++  Q
Sbjct: 695  A------NKDEGNDEEEEEEEDDTKCQFPDTQIKLDL----------SGPKVKLHVDNNQ 738

Query: 822  FDLSEEDKHVERTATV-RDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES 880
              ++ +    E    +  DKP I     ++ K+ +    V PK  +EK    D +     
Sbjct: 739  PLIATQKDAEENVVYLGDDKPVIVNLPIKE-KRAKTKQKVQPKEPKEKIEKSDKTE---- 793

Query: 881  IVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENA 940
             +   KIE   + RGQKGKLKKMKEKY DQDEE+R + M +L SAG             A
Sbjct: 794  -IDNKKIEQPVLKRGQKGKLKKMKEKYKDQDEEDRRLSMLVLQSAG-------------A 839

Query: 941  STHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKV 1000
            +   +KK     + +PK+  K K    ++      P  S+H +++               
Sbjct: 840  AKEDKKKNRSKDLSSPKLQGKKKPNVRMNV-----PAPSAHIIDN--------------- 879

Query: 1001 AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGT 1060
              +EED     E     ++ ++ LTG P P D LL+ +PV  PYS + +YK++VK+ PG 
Sbjct: 880  -ADEEDTGPTPE-----VDMLEQLTGKPFPEDELLFAVPVVAPYSTLLNYKFKVKLTPGI 933

Query: 1061 AKKGKGIQIFYSLLL 1075
             K+GK  +   ++ L
Sbjct: 934  GKRGKAAKTAIAVFL 948


>gi|410898599|ref|XP_003962785.1| PREDICTED: nuclear export mediator factor Nemf-like [Takifugu
            rubripes]
          Length = 1029

 Score =  569 bits (1466), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 407/1123 (36%), Positives = 578/1123 (51%), Gaps = 197/1123 (17%)

Query: 2    VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R  T D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1    MKTRFTTVDIKAVIAEINSNYMGMRVNNVYDIDTKTYLIRLQKPDS--------KAILLI 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG R+H+T +   K   PSGF +K RKH++TRRL  V+QLG DRI+  QFG    A+++
Sbjct: 53   ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTQVKQLGNDRIVDIQFGSDEAAYHL 112

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            I+ELY +GN++L D E+T+L LLR    +   V I  R RYP E  R  E   + +    
Sbjct: 113  IVELYDRGNVILADHEYTILNLLRFRTAEVDDVKIAVRERYPVESARPPEPLISLQRLTE 172

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            L S+                            Q+G +                      +
Sbjct: 173  LLSA---------------------------AQQGDQ----------------------I 183

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVLVLA---VAKFEDW 294
            K VL   L YG  L EH +++ GL  + K+   + V ++    ++ L +A   +AK E++
Sbjct: 184  KRVLNPHLSYGATLIEHSLIEVGLPGSAKVDSQASVAQVASKILEALTVAEAYMAKTENF 243

Query: 295  LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKF 351
                       +GYI+ +++      P    G  ++    YDEF P L  Q     +++F
Sbjct: 244  ---------TGKGYIIQKSEK----KPSVTPGKPSEELLTYDEFHPFLFAQHSKSPYLEF 290

Query: 352  ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKM 409
            ++FD A+DEF+SK+ESQ+ + +    E  A  KL  +  D E R+  L   QE+DR +K 
Sbjct: 291  DSFDKAVDEFFSKMESQKIDMKALQLEKHAMKKLENVKKDHEQRLEALHQAQEIDR-IK- 348

Query: 410  AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
             ELIE NL  V+ A+  V  ALAN++ W ++  +VKE + AG+PVA  I +L L+ N ++
Sbjct: 349  GELIEMNLAIVERALQVVCSALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHIT 408

Query: 470  LLLSNNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYEL 509
            LLL N     DDE++   VE+                    V+VDL+LSA+ANA+++Y+ 
Sbjct: 409  LLLKNPYVSEDDEQEDDVVEETGRKNKNKKSKKFQKNKPMLVDVDLSLSAYANAKKYYDN 468

Query: 510  KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            K+  + K+ KTI A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS+ENY
Sbjct: 469  KRSAKRKEFKTIEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAENY 528

Query: 570  LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
            LVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN   + PVPP TL +AG   V
Sbjct: 529  LVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGD-PVPPRTLTEAGTMAV 587

Query: 630  CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
            C+S AW++K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKN+LPP  LIMGFG LF+
Sbjct: 588  CYSAAWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFK 647

Query: 690  LDESSLGSHLNERRVRGEEEGMDDFEDSGHHK--------------ENSDIESEKDDTDE 735
            +DE S+  H  ER+V+  EE   D E++                  ++SD +  +D+ D+
Sbjct: 648  VDEHSVFRHRGERKVKTVEE---DAEEAASKTAELLNEEGEELMGDDSSDGDEGEDEHDD 704

Query: 736  KPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQL 795
              V E    P           +  +    FP  D +IS                      
Sbjct: 705  SEVKEVTPGPEDDEDDTRDEESEEIS---FP--DTSIS---------------------- 737

Query: 796  EDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQ 855
                   L     + S+ K G +       E D  V +  T +        +RR+ KK Q
Sbjct: 738  -------LSHLQPNSSAQKPGFKQEVTLQVERDSQVRKHMTAK--------QRREEKKKQ 782

Query: 856  GSSVVDPKVE---------REKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
                 + K E         +  + G D+S QP             + RGQK KLKK+KEK
Sbjct: 783  KQEDTEEKTEIPAGGSTNNQGSKSGGDSSQQP-------------LKRGQKNKLKKIKEK 829

Query: 907  YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
            Y DQDEE+R + M LLASAG  ++       E     K+ K    PV  P      +K  
Sbjct: 830  YKDQDEEDRELMMQLLASAGPTKEE-----KEKGKKGKKGKGKEEPVRKPPPQKPAQKPH 884

Query: 967  HLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTG 1026
            HL     + P+++    E          AE ++   +E D    G EE   L  +  LTG
Sbjct: 885  HLE---AKKPEEAVGKEEGEKGGEERGAAEQEE-KEDEADQDNPGAEETEDL--LTSLTG 938

Query: 1027 NPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
             P   D+LL+ +PVC PY+A+ +YK++VK+ PG+ KKGK  ++
Sbjct: 939  QPHSEDVLLFAVPVCAPYTALSNYKHKVKVTPGSQKKGKAARV 981


>gi|302854251|ref|XP_002958635.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
 gi|300256024|gb|EFJ40301.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
          Length = 744

 Score =  562 bits (1449), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 331/743 (44%), Positives = 424/743 (57%), Gaps = 94/743 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGE-SEKVLL 58
           MVK RM++ADVAAEV CLR R++G+R +N+YDL+PKTY+ KL        SGE  EKV L
Sbjct: 1   MVKQRMSSADVAAEVACLRQRILGLRVANIYDLTPKTYVIKL------ARSGEDGEKVYL 54

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           L+ESG R HTT       + PS FTLKLRKH RTRR+E VRQLG DR +    G G  A 
Sbjct: 55  LLESGSRFHTTKVGEKSSDLPSNFTLKLRKHCRTRRVEAVRQLGVDRCMELTLGSGPAAV 114

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++ILE+YAQGN++LTD ++ VLTLLRSHRDD KG+ IM+RH YP    R+  + T     
Sbjct: 115 HLILEMYAQGNVVLTDYKYEVLTLLRSHRDDAKGLVIMARHPYPMSAMRLASKVT----- 169

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                                GK  D        +   A   Q 
Sbjct: 170 -------------------------------------GKQLD----EAAAAAAAAGGAQA 188

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
             + +L   L YGP ++EH+ +D G  PN  +    +  +   +    A           
Sbjct: 189 NYRALLSAVLPYGPTIAEHVAMDAGFDPNAAVPLEGEEVEEEGEGAATAATAAAAAAAPP 248

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
             G  +P          + +        +   ++ EF PL L  +  +  ++  TFD AL
Sbjct: 249 GGGGALP--------ADVRRSLLAALVAAGELVFAEFSPLPLLPYSGQPCLELSTFDDAL 300

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIE QRA       E AA  KL+KI +DQ  R   L ++ +     A+LI YNLE
Sbjct: 301 DEFYSKIEGQRAGIARADAERAALSKLDKIKLDQGTRAEALLRQAEECELKAQLITYNLE 360

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            VDA +LAV   LA  M W  LA +V+ ER+AGNPVA LI  L LE N +S+LL+N LD+
Sbjct: 361 MVDAVLLAVNQMLATGMDWSALADLVRNERRAGNPVAALIASLELENNRVSVLLANTLDD 420

Query: 479 MDD---------------EEKTLPVEK-------------VEVDLALSAHANARRWYELK 510
             +                E+  P                V VDL+LSA ANA  ++E +
Sbjct: 421 TGEEGEEEAMTRKAVKVASEECFPQHTQRHTQRHTHTHILVFVDLSLSAAANASTYFEAR 480

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVA-NISHMRKVHWFEKFNWFISSENY 569
           ++  +K  KT+ A+  A  AAEKK   Q+ Q +     +  +RK  WFE+F+WFISSENY
Sbjct: 481 RRHLAKHAKTLAANEAALAAAEKKVEAQLKQVRAAPPALQPVRKPMWFERFHWFISSENY 540

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LV+SGRDAQQNE++VKRY  KGDVYVHA+LHG  +T+    R   P+PPLTL QAGC  V
Sbjct: 541 LVVSGRDAQQNELLVKRYFRKGDVYVHAELHG--TTICVRWRSGGPIPPLTLQQAGCACV 598

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C S+AWDSK+VTSAWWV+  QVSKTAPTGEYLT GSFMIRGKKNFLPP PL+MGFG LF+
Sbjct: 599 CRSRAWDSKLVTSAWWVHHQQVSKTAPTGEYLTTGSFMIRGKKNFLPPQPLVMGFGFLFK 658

Query: 690 LDESSLGSHLNERRVRG-EEEGM 711
           LD+SS+ +HL ER VRG + +GM
Sbjct: 659 LDDSSIPAHLGERAVRGLDPDGM 681


>gi|321467512|gb|EFX78502.1| hypothetical protein DAPPUDRAFT_305191 [Daphnia pulex]
          Length = 997

 Score =  560 bits (1442), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 394/1099 (35%), Positives = 583/1099 (53%), Gaps = 168/1099 (15%)

Query: 2    VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R  + D+ A +  +  +LIGMR + +YD+  KTY+ +L  S         EK +LL 
Sbjct: 1    MKARFTSIDIVAAIAEINLKLIGMRVNQIYDVDHKTYLIRLHRSE--------EKAMLLF 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG+R+HTT +   K   PSGF++KLRKH+  +RLE   Q+G DRII  QFG G  A++V
Sbjct: 53   ESGIRIHTTDFQWPKNPAPSGFSMKLRKHLNNKRLEMASQVGQDRIINLQFGTGEAAYHV 112

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHA 179
            I+ELY +GNI+L D E+ +L +LR  R + + V  + + +YP E   V +  T ++ L  
Sbjct: 113  IIELYDRGNIVLCDFEYVILNILRP-RTEGEDVRFLVKEKYPLEGTSVEDCITNTEVLEN 171

Query: 180  ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
             L+S+K  D                                                   
Sbjct: 172  WLSSAKTGD--------------------------------------------------N 181

Query: 240  LKTVLGEALGYGPALSEHIILDTGLVPNMKL-SEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            LK +L     YGPAL EH++L+ G  PN ++ ++ +   D  +  L LA+   +  +Q++
Sbjct: 182  LKKILVPKTNYGPALIEHVLLEFGFPPNSRIGTQFDITRD--LPKLHLALKSADSIMQNI 239

Query: 299  ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY--DEFCPLLLNQFRSREFVKFETFDA 356
             S   + +G ++ +     ++  PT SG +   +   EF P+L  Q  S  F++  +F+ 
Sbjct: 240  GS---ISKGIVVQK-----RESRPTPSGENQDFFTNQEFHPMLYKQHESHPFIELPSFNQ 291

Query: 357  ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
            A+DEF+SK+ESQ+ + +   +E  A  KL  I  D E R+  L   QE+D     A LIE
Sbjct: 292  AVDEFFSKMESQKLDLKVVQQERDAMKKLANIRQDHEKRLANLHHVQEIDEL--KARLIE 349

Query: 415  YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
             N   +D AI  VR ALAN++SW+++  +V+E  + G+PVA +I KL L  N +SL+LS+
Sbjct: 350  MNQPLIDHAIQVVRSALANQVSWKEIDELVEEATRKGDPVAKIIKKLKLSTNHISLMLSH 409

Query: 475  NLDEMD---DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
               E D   + +++   + V++DL L+A ANAR+++  KK    K++KTI +  KAFK+A
Sbjct: 410  PYAEQDSDSESDESYKPQLVDIDLDLTAFANARKYFGEKKNASKKEQKTIESSHKAFKSA 469

Query: 532  EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
            EKK +  + +   +A I   RKV WFEKF WFISS+NY+V+ GRD QQNE++VKRY+  G
Sbjct: 470  EKKAKQTLKESAAIATIRKARKVLWFEKFYWFISSDNYIVVGGRDRQQNELLVKRYLKAG 529

Query: 592  DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
            D+YVHADLHGASS ++KN      +PP TL +AG   V +S AW++K++T+AWWV   QV
Sbjct: 530  DIYVHADLHGASSVIVKNVSASNRIPPRTLQEAGLMAVGYSAAWEAKVMTTAWWVESSQV 589

Query: 652  SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
            SKTAP+GEYLT GSFMIRGKKNFLPP P+++GFGLLFRL+ESS+  HLN+R+ +  +   
Sbjct: 590  SKTAPSGEYLTTGSFMIRGKKNFLPPLPIVLGFGLLFRLEESSIARHLNDRKPKALD--- 646

Query: 712  DDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKT 771
                                  DE P+ ++ +V        S ++ S  D  E     K+
Sbjct: 647  ----------------------DESPILDTETVDEPV----SCSSDSESDGDEKNDYAKS 680

Query: 772  ISN-----GIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS-ISSTKHGIETTQFDLS 825
            I N     G+ S++ D A  VA P T     ++D +   G+ + I +       T     
Sbjct: 681  IENARALLGL-SRVTDNAE-VAFPDT-----VVDMSTSSGNRNKIKALNEDESYTIIGDV 733

Query: 826  EEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKT 885
                  +R   V+++P ISK+        Q S  +   + +E E  K+   QP S     
Sbjct: 734  LTINKTQREGKVKEEP-ISKS-------NQSSKKMTESITQETEGEKN--QQPTS----- 778

Query: 886  KIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKE 945
                    RGQKGK+KK+KEKY DQDE+ER ++M LL SAG  +  D     +     + 
Sbjct: 779  -------KRGQKGKMKKIKEKYKDQDEDERQLKMELLQSAGPAR--DKGKNKKKGKNTET 829

Query: 946  KKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVG------LDETAEMDK 999
            KK   S    P +    K+   +  +      D +   +     G      ++E AE D 
Sbjct: 830  KKVIFSKTTVPGL---KKEEILVETEAVPVKSDETPATQQPATDGQLATEQIEENAEGDG 886

Query: 1000 VAMEEEDIHEIGEEEKGRLNDVDYL---TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKI 1056
            +   +ED+      ++  + D D L   TG P   D LLYVIPV  PYS +  YKY+VKI
Sbjct: 887  I---DEDV------DQPVITDTDILNAMTGIPQLEDELLYVIPVVAPYSTLMPYKYKVKI 937

Query: 1057 IPGTAKKGKGIQIFYSLLL 1075
            +PG  K+GK  +   S+ L
Sbjct: 938  LPGQTKRGKASKTAMSVFL 956


>gi|307173031|gb|EFN64173.1| Serologically defined colon cancer antigen 1-like protein [Camponotus
            floridanus]
          Length = 988

 Score =  553 bits (1425), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 380/1097 (34%), Positives = 578/1097 (52%), Gaps = 173/1097 (15%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++LIGMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1    MKTRFNTYDLVCSVTELQKLIGMRVNQIYDIDHRTYLIRFQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG RLH T +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53   SGNRLHMTNFEWPKNVAPSGFSMKMRKHLKNKRLESLTQVGMDRIINLQFGSGEAAYHII 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LE+Y +GNI+LTD E  +L +LR H + DK +    R +YP +          + +H  +
Sbjct: 113  LEVYDRGNIILTDYEMVILYVLRPHTEGDK-IRFAVREKYPLDRAHSTTMPPINVIHEHI 171

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
              +KE            G+N                                      LK
Sbjct: 172  QKAKE------------GHN--------------------------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             VL   L +G A+ +H++L  G     K+ +   +  +  + L+LA+   ++ +    + 
Sbjct: 182  KVLNPLLEFGSAVIDHVLLKAGFTLGCKIGKDFHITKDMPK-LILALEDADNIMDH--AK 238

Query: 302  DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
              + +GYI+ +     K+   T+ G    I+   EF P L  Q++++ + +F++FDAA+D
Sbjct: 239  KHISKGYIIQK-----KEAKMTQDGKEDFIFANIEFHPFLFEQYKNQPYKEFDSFDAAVD 293

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            E++S +E Q+ + +   +E  A  KL ++  D + R+ TL++  +   + AELI  N   
Sbjct: 294  EYFSTMEGQKLDLKVLQQEREALQKLERVKKDHDQRLVTLEKSQELDKQKAELISRNQIL 353

Query: 420  VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
            VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++LLL +  ++ 
Sbjct: 354  VDNAILAIQSALANQMSWPDIQILLKEAQVIGDPVASAIKQLKLETNHITLLLHDPYEDS 413

Query: 480  DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
            D+E +  P+  +++DLA +A +NA+ +Y  KK    K +KTI +  KA K+AEKKT+  +
Sbjct: 414  DEESELKPM-LIDIDLAHTAFSNAKNYYSQKKSAARKHQKTIESQGKALKSAEKKTKQTL 472

Query: 540  LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
             + +T+  I+ +RK +WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVHADL
Sbjct: 473  KEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYVHADL 532

Query: 600  HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
             GASS VIKN   + PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+APTGE
Sbjct: 533  TGASSVVIKNPSGD-PVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKSAPTGE 591

Query: 660  YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG-EEEGMDD--FED 716
            YLT GSFMIRGKKN+L    LIMG G++FRL+ESS+  H NERRV+  +EE   D   ED
Sbjct: 592  YLTTGSFMIRGKKNYLTQSQLIMGLGVMFRLEESSIERHKNERRVKTIDEESEKDSIIED 651

Query: 717  SGHHK-----------ENSDI------ESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN 759
                +           EN D+      E +KD T+++  +ES +  N+ +      +  +
Sbjct: 652  DKEIEIEDDSDEDENLENKDMLKPIQEEDQKDLTEDQEKSESCT-KNNTNEDSCQEDDED 710

Query: 760  VDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIET 819
            V  ++FP  D  I   +      +  N   P+    +D  +  + LG             
Sbjct: 711  V-KYKFP--DTQIKIDLSGPKVKLHVNNNQPLIQMQKDTEENVVYLGD------------ 755

Query: 820  TQFDLSEEDKHVERTATVRDKPY-ISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQP 878
                               DKP  I+ + + K  K +    +  ++E+ ++  K+     
Sbjct: 756  -------------------DKPVIINTSTKEKYTKTKQKEHLIEEIEKMEKNDKNECDN- 795

Query: 879  ESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNE 938
                   K E     RGQKGKLKKMKEKY DQDEE+R + M +L SAG  +        E
Sbjct: 796  ------KKKEQPVFKRGQKGKLKKMKEKYKDQDEEDRRLSMLVLQSAGAAK--------E 841

Query: 939  NASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMD 998
            +   +K K P+      PK   K K                       P V L     +D
Sbjct: 842  DKRKNKVKDPS-----GPKQQGKKK-------------------TNSKPNVSLQSMQSID 877

Query: 999  KVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
             +  ++ED   I E     ++ +D LTG P   D LL+ +P+  PY+ +Q+YK++VK+ P
Sbjct: 878  NI--DDEDAGPIPE-----VDMLDQLTGKPFSEDELLFAVPIVAPYNTLQNYKFKVKLTP 930

Query: 1059 GTAKKGKGIQIFYSLLL 1075
            G  ++GK  +   ++ L
Sbjct: 931  GIGRRGKAAKTAMAVFL 947


>gi|195038845|ref|XP_001990823.1| GH19576 [Drosophila grimshawi]
 gi|193895019|gb|EDV93885.1| GH19576 [Drosophila grimshawi]
          Length = 983

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 391/1108 (35%), Positives = 577/1108 (52%), Gaps = 202/1108 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R N+ D+   V  L+RL+G+R + +YD+  KTY+F+L  S      G SEK  LL+E
Sbjct: 1    MKTRFNSYDIICGVAELQRLVGLRVNQIYDIDNKTYLFRLHGS------GASEKATLLLE 54

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R HTTA+   K   PSGF++KLRKH++ +RL+ VRQLG DRI+ FQFG G  A++V+
Sbjct: 55   SGTRFHTTAFEWPKNVAPSGFSMKLRKHLKNKRLQHVRQLGADRIVDFQFGTGEAAYHVL 114

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T+L +LR H + +  V    R +YP                  +
Sbjct: 115  LELYDRGNVILTDYEQTILYILRPHTEGE-SVRFAMREKYP------------------I 155

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
              +KE +    + ++ED      A ++ +   KGG+S                     L+
Sbjct: 156  DRAKEGNC---ETMSED------AMRQRIENSKGGES---------------------LR 185

Query: 242  TVLGEALGYGPALSEHIILDTGL---------------------VPNMKLSEVN------ 274
            ++L   L  GPA+ EH++++ G+                       N K ++ N      
Sbjct: 186  SILMPILDCGPAVIEHVLVEHGIENCIVNSAPDADEPAKEEMTKTQNPKKNKRNQKTCKT 245

Query: 275  KLED--NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY 332
            KL D    +Q L++A+    D ++   SG+    GYI+       K+  P ++ ++   Y
Sbjct: 246  KLFDLVTDLQKLMMAIKDARDIIEIGQSGN--SNGYIIQV-----KEEKPLDTENTEHFY 298

Query: 333  D--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHM 390
               EF P L  Q + + F K+ETF  A+DEF+S  ESQ+ + +   +E  A  KL+ +  
Sbjct: 299  RNVEFHPYLFVQNKDQPFKKYETFMEAVDEFFSTQESQKIDIKTLQQEREALKKLSNVKN 358

Query: 391  DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            D   R+  L +  D   + AELI  N   VD AILA++ A+A+++SW D+  +VKE +  
Sbjct: 359  DHTKRLDELNKLQDIDKRKAELITSNQSLVDKAILAIQSAIASQLSWPDIQELVKEAQTN 418

Query: 451  GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
            G+ VA  I +L LE N +SLLL++       E        V+VDLALSA ANARR+Y+ K
Sbjct: 419  GDVVASSIKQLKLEINHISLLLTDPY-----ECNDDDSIIVDVDLALSAWANARRYYDQK 473

Query: 511  KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            +    K++KTI A  KA K+AE+KT+  + + +T++NI+  RKV WFEKF WF+SSENYL
Sbjct: 474  RSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFVSSENYL 533

Query: 571  VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
            VI GRDAQQNE+IVKRYM   D+YVHAD+ GASS +I+N      +PP TL +AG   + 
Sbjct: 534  VIGGRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNATGGD-IPPKTLLEAGTMAIS 592

Query: 631  HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            +S AWD+K+VT+++WVY +QVSKTAP+GEYL  GSFMIRGKKNFLP   LIMG  LLF+L
Sbjct: 593  YSVAWDAKVVTNSYWVYSNQVSKTAPSGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKL 652

Query: 691  DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE-SEKDDTDEKPVAESLSVPNSAH 749
            +E  +  H  ER++R      DD  D     + ++I  +E D+  E   A+++    +A 
Sbjct: 653  EEGFVQRHAGERKIR----NTDDVADEDDKAQQAEITYTELDEISESNEADNVCANANAF 708

Query: 750  PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS 809
            P             E   E  T    + +++    R  + P T ++              
Sbjct: 709  P-----------DTEVKVEHDTGRITVKTELL---REDSKPKTVEI-------------- 740

Query: 810  ISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
              S ++ I      ++EE+  +      R K   +  +RR+ K      V   K + E+ 
Sbjct: 741  --SQENNI------INEEETVIIEAGPSRKKTQTTNKKRREAK------VRSDKADIER- 785

Query: 870  RGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK-- 927
                  SQ         I   K+ RGQK KLKKMK KY DQDEEER +RM +L S+GK  
Sbjct: 786  ------SQASVTEMLEPINASKVKRGQKAKLKKMKSKYRDQDEEERKMRMLILNSSGKDK 839

Query: 928  -VQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDN 986
             +  ND +                                          D+  + ++ N
Sbjct: 840  VITSNDNE------------------------------------------DEKPNTLKVN 857

Query: 987  PCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSA 1046
            P   LD     +++ ++E D   +  +     + +D LTG PL  D LL+ IPV  PY +
Sbjct: 858  PVETLDAPIAKNQIEIDENDDAPVIVDA----DLLDTLTGVPLDDDELLFAIPVVAPYQS 913

Query: 1047 VQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            +Q YK++VK+ PGT K+GK  ++  S+ 
Sbjct: 914  LQQYKFKVKLTPGTGKRGKAAKLALSIF 941


>gi|332016223|gb|EGI57136.1| Serologically defined colon cancer antigen 1 [Acromyrmex echinatior]
          Length = 990

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 386/1083 (35%), Positives = 575/1083 (53%), Gaps = 143/1083 (13%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1    MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R+H TA+   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A++VI
Sbjct: 53   SGNRIHITAFEWPKNVAPSGFSMKMRKHLKNKRLESLMQVGTDRIIKLQFGSGEAAYHVI 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LE+Y +GNI+LTD E  +L +LR H + DK +    + +YP +            +H  +
Sbjct: 113  LEVYDRGNIILTDHEMVILYVLRPHTEGDK-IRFAVKEKYPLDRAHSTTMPHIDVIHDHI 171

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
              +KE D                                                   LK
Sbjct: 172  QKAKEGD--------------------------------------------------NLK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAK-FEDWLQDVI 299
             VL   L +G A+ +H++L  G     K+  + +  ED    +L L  A    D+ +  +
Sbjct: 182  KVLNPLLEFGSAVIDHVLLKAGFNLGCKIGKDFHITEDMPRLILALEDANNIMDYAKKNV 241

Query: 300  SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAA 357
            S     +GYI+ +     K+   T+ G    I+   EF P L  Q+ ++ + +F +FDAA
Sbjct: 242  S-----KGYIIQK-----KESKLTQDGKEDFIFANIEFHPFLFEQYNNQPYKEFNSFDAA 291

Query: 358  LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEY 415
            +DE++S +E Q+ + +   +E  A  KL ++  D   R+ TL+  QE+D+  + AELI  
Sbjct: 292  VDEYFSMMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDK--QKAELISR 349

Query: 416  NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
            N   VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++L+L + 
Sbjct: 350  NQVLVDNAILAIQSALANQMSWPDIQVLLKEAQTRGDPVASAIKQLKLETNHIALMLHDP 409

Query: 476  LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
             ++ D+E K  P+  +++DLA +A +NA+++Y  KK    KQ+KTI +  KA K+AEKKT
Sbjct: 410  YEDSDEESKLKPM-MIDIDLAHTAFSNAKKYYSQKKSAAKKQQKTIESQGKALKSAEKKT 468

Query: 536  RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
            +  + + +T+  I+ +RK +WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YV
Sbjct: 469  KQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYV 528

Query: 596  HADLHGASSTVIKNHRPE-QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
            HADL GASS VIKN  P   PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+
Sbjct: 529  HADLTGASSVVIKN--PSGNPVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKS 586

Query: 655  APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDF 714
            APTGEYLT GSFMIRGKKN+L    LIMG G++FRL++SS+  H +ERRV+  +E  +  
Sbjct: 587  APTGEYLTTGSFMIRGKKNYLTHSQLIMGLGIMFRLEDSSIERHKDERRVKTVDEESEKA 646

Query: 715  EDSGHHKENSDIESEKD-DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS 773
            +         ++E + D D + +   ++L   N+ HP     +    +SH      K   
Sbjct: 647  DSIVEDDREIELEGDSDEDENLEKQEQNLENKNTLHPI-QEEDQEKSESHTTDYSVKKDI 705

Query: 774  NGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVER 833
             G D K  D       P T    DL          S    K  ++  Q  +  +    E 
Sbjct: 706  YGEDEKDTDEDTKYQFPDTQIKIDL----------SGPKVKIHVDNNQPLMQSQKNTKEN 755

Query: 834  TATV-RDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKI 892
               +  DKP I  A   +    Q +     K+E++ +   D            K E   +
Sbjct: 756  VVYLGDDKPIIINASTMEKHAKQKTKESTKKIEKDDKNEND----------NKKGEQPTL 805

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
             RGQKGKLKK+KEKY DQDEE+R + M +L SAG  +        E+   ++ K P+   
Sbjct: 806  KRGQKGKLKKIKEKYKDQDEEDRRLSMLVLQSAGAAK--------EDKRKNRAKDPS--- 854

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
               PK   + KK  +   +    P  S H +++                +++ED   I E
Sbjct: 855  --GPK--QQGKKKTNPKPNI---PSQSMHTIDN----------------IDDEDTGPIPE 891

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
                 ++ +D LTG P+  D LL+ +PV  PY+ +Q+YK++VK+ PG  K+GK  +   +
Sbjct: 892  -----VDMLDQLTGKPVSEDELLFAVPVVAPYNTLQNYKFKVKLTPGIGKRGKAAKTAIA 946

Query: 1073 LLL 1075
            + L
Sbjct: 947  VFL 949


>gi|147771936|emb|CAN75697.1| hypothetical protein VITISV_035984 [Vitis vinifera]
          Length = 431

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 316/589 (53%), Positives = 360/589 (61%), Gaps = 163/589 (27%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           RMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FK MNSSGVTESG             
Sbjct: 6   RMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKFMNSSGVTESG------------- 52

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
                                                G +++ILFQFGLG NA YVILEL
Sbjct: 53  -------------------------------------GSEKVILFQFGLGANAXYVILEL 75

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
            AQGNILLTDSEF V+TLL SHR+    +  M + R P E                    
Sbjct: 76  CAQGNILLTDSEFMVMTLLGSHRN----LRAMKQSR-PVE-------------------- 110

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
                         GN VS+A +E  G +KG KS + SKN+N    DGARAKQ TLKTVL
Sbjct: 111 -------------GGNKVSDAPREKQGNRKGAKSSEPSKNTN----DGARAKQATLKTVL 153

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
           GEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD V
Sbjct: 154 GEALGYGPALSEHIILDAGLIPNTKVTKDSKFDXDTIQRLAQSVAKFENWLEDVILGDQV 213

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
           PEGYILMQNK  GKD  P++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSK
Sbjct: 214 PEGYILMQNKIFGKDCRPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSK 273

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           IE QR+EQQ KAKE  A  KL+KI MDQENRVHTLK+E DR +KMAELIEYNLEDVDAAI
Sbjct: 274 IEGQRSEQQQKAKEVXAMQKLSKICMDQENRVHTLKKEDDRCIKMAELIEYNLEDVDAAI 333

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           LAVRVALAN M+WEDLARM                                         
Sbjct: 334 LAVRVALANGMNWEDLARM----------------------------------------- 352

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
                 VEVDLALSAHANAR WYE KK+QE+K+EKTI AH K  K  +++          
Sbjct: 353 ------VEVDLALSAHANARXWYEQKKRQENKREKTIIAHEKLLKLLKRRLA-------- 398

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
                           + F SS+NY VISGRDAQ NEMIVKRYMSKGD+
Sbjct: 399 ----------------SSFHSSKNYFVISGRDAQLNEMIVKRYMSKGDL 431


>gi|443707183|gb|ELU02895.1| hypothetical protein CAPTEDRAFT_151175 [Capitella teleta]
          Length = 1023

 Score =  550 bits (1416), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 304/705 (43%), Positives = 425/705 (60%), Gaps = 72/705 (10%)

Query: 2   VKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K +  T D+ A V +  RR IGMR +NVYD+  KTY+ KL        +   +K LL++
Sbjct: 1   MKTKFTTVDIRASVLEVKRRWIGMRVTNVYDIDNKTYLVKL--------AKPDQKALLVL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H+T +   K N+PSGF++KLRKH+R RRLE V+QLG DR++  QFG    A+++
Sbjct: 53  ESGSRFHSTEFDWPKNNSPSGFSMKLRKHLRGRRLESVQQLGADRVVDMQFGSNEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LELY +GN++LTD E+ +L LLR   D+ + V +     YP +  R  +     KLH+A
Sbjct: 113 VLELYDRGNLVLTDHEYNILNLLRVRTDESQDVKLAVHESYPLQTARQ-DTVDHDKLHSA 171

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L  +KE D                                                   L
Sbjct: 172 LLEAKEGDH--------------------------------------------------L 181

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YGPAL EH +   GL  N ++ +   ++++   +L  A+ + +  L+++  
Sbjct: 182 KRILNPLLPYGPALIEHSLRAAGLPENCRMGKEFIVQEHMASLLA-ALVEAQRILENM-- 238

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GYI+ + +   K    TE G     Y+EF P L  Q  S   ++FE+F  A+DE
Sbjct: 239 GSESSKGYIIQKKE---KKASSTE-GDELITYNEFHPYLYKQHESCPHLEFESFSKAVDE 294

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SKIESQ+ + +   +E +   KL  +  D   R+  L  E ++     +LIE NL  V
Sbjct: 295 FFSKIESQKLDMKTLQQEKSVLRKLENVRKDHAQRLQALANEQEKDNIKGQLIEMNLPLV 354

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + AIL V+ ALAN++ W D+ ++VKE +  G+PVA  I  L L+ N  +++L +  +   
Sbjct: 355 ERAILVVQSALANQLDWADINQLVKEAQAQGDPVASSISSLQLQSNHFTMMLRDCYE--G 412

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           DEE  LP +KV++DL LSA+ANAR++Y+ KK    K++KT+ A +KA K+AEKKT+  + 
Sbjct: 413 DEEDMLPAQKVQIDLGLSAYANARKYYDKKKHAAQKEQKTVAASTKALKSAEKKTKQTLK 472

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           + +  A I   RK HWFEKF WFISSENYLVI GRD QQNE++VKR++  GD+YVHADLH
Sbjct: 473 EVQVAATIRKQRKTHWFEKFLWFISSENYLVIGGRDQQQNELLVKRHLRPGDLYVHADLH 532

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASS +IKN      VPP TLN+AG   +CHS AWD+K+VTSAWWV+ HQVSKTAPTGEY
Sbjct: 533 GASSVIIKN---PSGVPPKTLNEAGTMALCHSAAWDAKVVTSAWWVHHHQVSKTAPTGEY 589

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           LT GSFMIRGKKNFLPP  LI GFG LF++D++S+  H +ER+VR
Sbjct: 590 LTTGSFMIRGKKNFLPPSYLIYGFGFLFKVDDTSIFRHQDERKVR 634


>gi|195504496|ref|XP_002099104.1| GE23561 [Drosophila yakuba]
 gi|194185205|gb|EDW98816.1| GE23561 [Drosophila yakuba]
          Length = 996

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 406/1117 (36%), Positives = 579/1117 (51%), Gaps = 207/1117 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1    MKTRFNTYDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+  QFG G  A++VI
Sbjct: 55   SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKIQQLGSDRIVDLQFGTGDAAYHVI 114

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 115  LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 156

Query: 182  TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +K+P    EPD + +   N  N                                   L
Sbjct: 157  -RAKQPTKELEPDALVKLLENARNGD--------------------------------YL 183

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN------------------------KL 276
            + +L   L  GPA+ EH++L  GL  ++   E                          KL
Sbjct: 184  RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKEATEETPEADDKPEKGGKKQRKKQQNTKL 243

Query: 277  EDNA------IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
            E         + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G    
Sbjct: 244  EQKPFDMVKDLPILQQAVKDAQELIAEGSSGK--SKGYIIQV-----KEEKPTENGKVEF 296

Query: 331  IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
             +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 297  FFRNIEFHPYLFTQFKNFETATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 356

Query: 389  HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
              D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 357  KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 414

Query: 447  ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANA 503
             +  G+ VA  I +L LE N +SL+LS+  D  +D++  L   +   V+VDLALSA ANA
Sbjct: 415  AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDDDLKAPELTVVDVDLALSAWANA 474

Query: 504  RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
            RR+Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WF
Sbjct: 475  RRYYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWF 534

Query: 564  ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
            ISSENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +
Sbjct: 535  ISSENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLE 593

Query: 624  AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
            AG   + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG
Sbjct: 594  AGSMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMG 653

Query: 684  FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAE 740
              LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D  P A 
Sbjct: 654  LSLLFKLEDSFIERHLGERKVR----SLDDDQIDQNVKETEVEHDLLSDNEDADTNPNA- 708

Query: 741  SLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDL 798
            +LS             +SN +   FP  +  I +       D  R +  +  + P+LE  
Sbjct: 709  NLS-----------EQSSNTEITAFPNTEVKIEH-------DTGRIIVRSDSLNPELE-- 748

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSS 858
                         +TK      +  L + D   E T  +   P      R+K        
Sbjct: 749  -------------ATKENEVVLEKILKKTDD--EETTIILAGP-----SRKK-------Q 781

Query: 859  VVDPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNI 917
            V   K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER I
Sbjct: 782  VSAKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREI 841

Query: 918  RMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD 977
            RM +L S+GK                  +KP  +            KA   S+  KE+  
Sbjct: 842  RMMILKSSGK------------------EKPQAN----------ADKAVEKSESTKEYVK 873

Query: 978  DSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYV 1037
                    NP V LD+  E+            +G    G ++ ++ LTG P   D LL+ 
Sbjct: 874  PEKSAAPKNP-VELDDGDEV-----------PVG----GDVDVLNSLTGQPHEGDELLFA 917

Query: 1038 IPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            IPV  PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 918  IPVVAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 954


>gi|32130521|ref|NP_079717.2| nuclear export mediator factor Nemf [Mus musculus]
 gi|47606756|sp|Q8CCP0.2|NEMF_MOUSE RecName: Full=Nuclear export mediator factor Nemf; AltName:
           Full=Serologically defined colon cancer antigen 1
           homolog
          Length = 1064

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 310/745 (41%), Positives = 431/745 (57%), Gaps = 100/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 415

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 416 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 536 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 595 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 654

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 655 FKVDESCVWRHRGERKVRVQDEDME 679


>gi|148704665|gb|EDL36612.1| mCG3169, isoform CRA_a [Mus musculus]
          Length = 1083

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 310/745 (41%), Positives = 431/745 (57%), Gaps = 100/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 20  MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 71

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 72  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 131

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 132 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 178

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 179 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 202

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 203 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 258

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 259 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 314

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 315 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 374

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 375 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 434

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 435 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 494

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 495 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 554

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 555 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 613

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 614 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 673

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 674 FKVDESCVWRHRGERKVRVQDEDME 698


>gi|431893718|gb|ELK03539.1| Serologically defined colon cancer antigen 1 [Pteropus alecto]
          Length = 1077

 Score =  543 bits (1400), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 308/747 (41%), Positives = 437/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPVDHARAVE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             ++NA K  L                             L
Sbjct: 165 LTLERLTEV------------IANAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYIK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -------------NLDEMDDEEKTLPVEK----------------VEVDLALSAHANARR 505
                        N+++++ E      +K                V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDINVEKIETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H +ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRSERKVRVQDEDME 681


>gi|194908933|ref|XP_001981863.1| GG11364 [Drosophila erecta]
 gi|190656501|gb|EDV53733.1| GG11364 [Drosophila erecta]
          Length = 994

 Score =  543 bits (1399), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 405/1117 (36%), Positives = 583/1117 (52%), Gaps = 209/1117 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1    MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+ FQFG G  A++VI
Sbjct: 55   SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLERIQQLGSDRIVDFQFGTGDAAYHVI 114

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 115  LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 156

Query: 182  TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +K+P    EP+ + +   N  N                                   L
Sbjct: 157  -RAKQPTKELEPEALVKLLENARNGD--------------------------------YL 183

Query: 241  KTVLGEALGYGPALSEHIILDTGL------------------------------VPNMKL 270
            + +L   L  GPA+ EH++L  GL                                N KL
Sbjct: 184  RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKEATEETPEADDKPEKGGKKQRKKQQNTKL 243

Query: 271  SE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSS 328
             +   + ++D  + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G  
Sbjct: 244  EQKPFDMIKD--LPILQQAVKDAQELITEGSSGK--SKGYIIQV-----KEEKPTENGKV 294

Query: 329  TQIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLN 386
               +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+
Sbjct: 295  EFFFKNIEFHPYLFIQFKNFEKATFESFMDAVDEFYSTQESQKIDIKTLQQEREALKKLS 354

Query: 387  KIHMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
             +  D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +V
Sbjct: 355  NVKNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELV 412

Query: 445  KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANA 503
            KE +  G+ VA  I +L LE N +SL+LS+  D  +D++   P +  V+VDLALSA ANA
Sbjct: 413  KEAQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPELTVVDVDLALSAWANA 472

Query: 504  RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
            RR+Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WF
Sbjct: 473  RRYYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWF 532

Query: 564  ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
            ISSENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +
Sbjct: 533  ISSENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLE 591

Query: 624  AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
            AG   + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG
Sbjct: 592  AGSMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMG 651

Query: 684  FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAE 740
              LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D   +  
Sbjct: 652  LSLLFKLEDSFIERHLGERKVR----NLDDDQIDPNVKETEVEHDLLSDNEDADAN-LNG 706

Query: 741  SLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDL 798
            +LS P           +SN +   FP  +  I +       D  R +  +  + P+LE  
Sbjct: 707  NLSEP-----------SSNTEITAFPNTEVKIEH-------DTGRIIVRSDSLNPELE-- 746

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSS 858
                         +TK      +  + + D   E T  +   P      R+K        
Sbjct: 747  -------------ATKENEVVIEKIVKKPDD--EETTIILAGP-----SRKK-------Q 779

Query: 859  VVDPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNI 917
            V   K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER I
Sbjct: 780  VSAKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREI 839

Query: 918  RMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD 977
            RM +L S+GK                  +KP  S   A KV  K       S+  KE+  
Sbjct: 840  RMMILKSSGK------------------EKPQAS---ADKVVEK-------SESTKEYVK 871

Query: 978  DSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYV 1037
                    NP            V +++ D   +G    G ++ ++ LTG P   D LL+ 
Sbjct: 872  PEKSAAPKNP------------VELDDADDVPVG----GDVDVLNSLTGQPHEGDELLFA 915

Query: 1038 IPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            IPV  PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 916  IPVVAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 952


>gi|291403822|ref|XP_002718277.1| PREDICTED: serologically defined colon cancer antigen 1
           [Oryctolagus cuniculus]
          Length = 1076

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 309/748 (41%), Positives = 435/748 (58%), Gaps = 104/748 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLCAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   RQLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSARQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIILTDYEYLILNILRFRTDEADDVKFAVRERYPLDHAR------------- 159

Query: 181 LTSSKEPDANEP----DKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
                   A EP    +++ E    +S+A K  L                          
Sbjct: 160 --------AAEPLLSLERLTE---VISSAPKGEL-------------------------- 182

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              LK VL   L YGPAL EH ++++G   N+K+ E  KLE   I+ ++  + K ED+++
Sbjct: 183 ---LKRVLNPLLPYGPALIEHCLMESGFPGNVKVDE--KLESKDIEKVLTCLQKAEDYMK 237

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
              + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD 
Sbjct: 238 --TTSNFRGKGYII-QKREIKPSLEVDKPSEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L  N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLHTNHVTMLLRNPY 414

Query: 475 ------------NLDEMDDEEKTLPVEK------------------VEVDLALSAHANAR 504
                       ++    +E + L  +K                  V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVTVEKNENEPLKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRIQDEDME 681


>gi|326921280|ref|XP_003206889.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
            [Meleagris gallopavo]
          Length = 1080

 Score =  542 bits (1397), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 389/1112 (34%), Positives = 569/1112 (51%), Gaps = 174/1112 (15%)

Query: 21   LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
            L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS
Sbjct: 32   LLGMRVNNVYDVDNKTYLIRLQKPDC--------KATLLLESGIRIHTTEFEWPKNMMPS 83

Query: 81   GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            GF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++I+ELY               
Sbjct: 84   GFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHLIIELY--------------- 128

Query: 141  TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
                     D+G  +++ H Y   I  +    T                       ++ +
Sbjct: 129  ---------DRGNIVLTDHEY--LILNILRFRT-----------------------DEAD 154

Query: 201  NVSNASKEN--LGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
            +V  A +E   +   K        +   +  +D  + +Q  LK VL   L YG  L EH 
Sbjct: 155  DVRFAVRERYPVDSAKAPTPLPSLERLTEIISDAPKGEQ--LKRVLNPHLPYGATLIEHC 212

Query: 259  ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
            +++ G    +K+ +  + ++N I+ ++ A+ K E+++   ++ D   +GYI+ Q K    
Sbjct: 213  LIEAGFSGYVKIDQHMESKEN-IEKVLSALEKAEEYM--TLTEDFNGKGYII-QKKEKKP 268

Query: 319  DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
               P +       Y+EF P L +Q     +++F++F+ A DEFYSK+E Q+ + +   +E
Sbjct: 269  SLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKAADEFYSKLEGQKIDLKALQQE 328

Query: 379  DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
              A  KL  +  D E R+  L+Q  +      ELIE NLE V+ AI  VR ALAN++ W 
Sbjct: 329  KQALKKLENVRRDHEQRLEALQQAQEVDKIKGELIEMNLEIVNRAIQVVRSALANQIDWT 388

Query: 439  DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------------------------ 474
            ++  +VKE +  G+PVA  I +L L+ N +++LL N                        
Sbjct: 389  EIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLRNPYVLSEEEEEGEDADLEKEETEEP 448

Query: 475  -------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                      ++   +K  P   V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KA
Sbjct: 449  KGKKKKNKNKQLKKPQKNKP-SLVDVDLSLSAYANAKKYYDHKRHAAKKTQKTVEAAEKA 507

Query: 528  FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
            FK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYLVI+GRD QQNE+IVKRY
Sbjct: 508  FKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYLVIAGRDQQQNELIVKRY 567

Query: 588  MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
            +  GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD+++VTSAWWV 
Sbjct: 568  LKPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVVTSAWWVS 626

Query: 648  PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
             +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+++ +
Sbjct: 627  HNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHREERKIKVQ 686

Query: 708  EEGMDDFEDSGHHKENSDIE--SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEF 765
            +E ++    S     + ++E     D + E+  AE        H AP    A        
Sbjct: 687  DEDLETVSSSASELVSEEVELLEGGDSSSEEDKAE-------CHEAPEDVEA-------- 731

Query: 766  PAEDKTISNGIDSKIFDIARN-VAAPVTPQLEDLIDRALGLG-------SASISSTKHGI 817
                 T  N  D  + D+ ++ V+ P  P  E + +   G          + +   +   
Sbjct: 732  -----TAENNGDENVADLDQDRVSTPPVP--EGVSEEDDGESEVEHPEPQSEVKEEEVNY 784

Query: 818  ETTQFDLS--EEDKHVERTATVRDKPYISKAE---RRKLK-----------KGQGSSVVD 861
              T  DLS  +  + +++T    ++P +S ++   RR L            +   S  +D
Sbjct: 785  PDTTIDLSHLQSQRSLQKTVPKEEEPNLSDSKSQGRRHLSAKERREMKKKKQQNDSENLD 844

Query: 862  PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
            P  ER+K+   +    P     K       I RGQK K+KKMKEKY DQDEE+R + M L
Sbjct: 845  PPEERQKD--TETQRPPPPNTTKGVPAPQPIKRGQKSKMKKMKEKYKDQDEEDRELIMKL 902

Query: 922  LASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSH 981
            L SAG          N+     K KK  +      K   K K   H +   KE       
Sbjct: 903  LGSAG---------SNKEEKGKKGKKGKMKEEPVKKQQQKSKAVHHGAGGGKEM------ 947

Query: 982  GVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLND----VDYLTGNPLPSDILLYV 1037
                    G     E +  A+EE+   E  E+++ +  D    +D LTG P   DILL+ 
Sbjct: 948  ------LPGGVLLHESEDPALEEQQ-DEKDEQDQDQPGDGTALLDSLTGQPHAEDILLFA 1000

Query: 1038 IPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
            +P+C PY+A+ +YKY+VK+ PGT KKGK  +I
Sbjct: 1001 VPICAPYTAMTNYKYKVKLTPGTQKKGKAAKI 1032


>gi|240978882|ref|XP_002403060.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215491284|gb|EEC00925.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 651

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 317/719 (44%), Positives = 430/719 (59%), Gaps = 92/719 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  LR RL+GMR   VYD   KTY+FKL        +   EK +LL+
Sbjct: 1   MKSRFSTVDIVAMICELRQRLVGMRVIQVYDADSKTYLFKL--------NRHDEKAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT +A  K  +PSGF++KLRKH+R +R+E V QLG DRI+  QFG+   A++V
Sbjct: 53  ESGVRLHTTDFAWPKNLSPSGFSMKLRKHLRNKRVESVSQLGADRIVDIQFGVNEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           ILELY +GN++LTD ++ +L +LR     DD  V  + R RYP +              +
Sbjct: 113 ILELYDRGNLVLTDGDYMILNILRPRTGKDDDDVKFVVRERYPVQ--------------S 158

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP- 238
           AL+ + + +A                                         D  R  +P 
Sbjct: 159 ALSPALDAEA---------------------------------------LTDILRFAKPA 179

Query: 239 -TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            TL+ +L   + YGPAL EH++   GL    K+++V+   D         VA   + LQD
Sbjct: 180 DTLRKLLTPKVSYGPALLEHVLRARGLSTGAKVADVDASRD---------VATLLECLQD 230

Query: 298 VIS----GDIVP-EGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVK 350
             +        P +GYIL++   + K   P + GS T+I  Y EF P L  Q      V+
Sbjct: 231 AEALMERARTEPSKGYILVR---VEKRVTPADDGS-TEITSYQEFHPFLWRQHEKERVVE 286

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
             +F AA+D+F+S +E QR   +   KE  A  KL  I MD E R+  L+Q        A
Sbjct: 287 LASFSAAVDQFFSSLEMQRISLKAHQKEKEALKKLENIRMDHEKRIVALEQVQREDKHKA 346

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           ELIE NL+ V+ A+L +R ALAN++ W ++  +++E ++ G+PVA  I +L L+ N  ++
Sbjct: 347 ELIEINLDLVERALLVLRSALANQIGWAEITELLREAQEQGDPVAQSIKQLKLDTNHFAM 406

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           LL +  +E  D   TL    V++DL LSA+ANARR+Y+ K+    KQ+KT+ + +KA+K+
Sbjct: 407 LLRDPYEE--DARDTL----VDIDLDLSAYANARRYYDQKRHAAGKQQKTLESSTKAYKS 460

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AEKKT+  + Q    +NI+  RK  WFEKF WFISSE+YLVI GRDAQQNEMIVKR+++ 
Sbjct: 461 AEKKTKEALKQVALTSNIARARKAFWFEKFFWFISSEDYLVIGGRDAQQNEMIVKRHLNP 520

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GDVYVHADLHGASS VIKN      VPP TLN+AG   +C+S AWD+K+VTSAWWV+ HQ
Sbjct: 521 GDVYVHADLHGASSIVIKNP-GGGSVPPKTLNEAGTMAICYSAAWDAKVVTSAWWVHHHQ 579

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           VSKTAPTG+YLT G+FMIRGKKN+LPP  LIMGFG L++LDE S+  H  ERRVR  EE
Sbjct: 580 VSKTAPTGQYLTPGAFMIRGKKNYLPPSYLIMGFGFLYKLDEDSVERHSGERRVRTAEE 638


>gi|355778566|gb|EHH63602.1| hypothetical protein EGM_16603 [Macaca fascicularis]
 gi|380817886|gb|AFE80817.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|383422753|gb|AFH34590.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|384950256|gb|AFI38733.1| nuclear export mediator factor NEMF [Macaca mulatta]
          Length = 1077

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 308/744 (41%), Positives = 431/744 (57%), Gaps = 96/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L S  E  A+ P                                           K   L
Sbjct: 167 LESLTEIVASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 --------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWYE 508
                         N  E    +K     K            V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMA 597

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 598 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 657

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E M+
Sbjct: 658 KVDESCVWRHRGERKVRVQDEDME 681


>gi|329664770|ref|NP_001192434.1| nuclear export mediator factor NEMF [Bos taurus]
          Length = 1076

 Score =  540 bits (1392), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 310/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|426233096|ref|XP_004010553.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Ovis
           aries]
          Length = 1076

 Score =  540 bits (1392), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 311/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDIEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|380024993|ref|XP_003696268.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
            homolog [Apis florea]
          Length = 970

 Score =  540 bits (1392), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 383/1101 (34%), Positives = 569/1101 (51%), Gaps = 199/1101 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R N+ D+A  +  L++LIGMR + VYD+  +TY+ +L  S         EK +LL+E
Sbjct: 1    MKTRFNSYDIACTINELQKLIGMRVNQVYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A+++I
Sbjct: 53   SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHII 112

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD E   +T+L   R   +G  I    R+      V E+    + H  +
Sbjct: 113  LELYDRGNIVLTDYE---MTILNILRPHTEGDKI----RFA-----VKEKYPMDRAHQNI 160

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                E              N+    +++L   K G++                     LK
Sbjct: 161  MPPIE--------------NI----QQHLQNAKIGEN---------------------LK 181

Query: 242  TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             +L   L +G A+ +H++L  G     K+     +E++ +  L+LA+    D +    + 
Sbjct: 182  KILNPLLEFGSAIIDHVLLKHGFTLGCKIGRDFNIEED-MSKLILALEYANDMMN--FAR 238

Query: 302  DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
              V +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD A+D
Sbjct: 239  QNVSKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDVAVD 293

Query: 360  EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
            E++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI  N 
Sbjct: 294  EYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELISRNQ 351

Query: 418  EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
              VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352  TLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478  EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            + D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412  DSDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538  QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
             + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471  TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKTGDIYVHA 530

Query: 598  DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
            DL GASS +IKN      VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531  DLTGASSVIIKNPGG-GSVPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658  GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR---GEEEGMDDF 714
            GEYLT GSFMIRGKKN+LPP  L+MG G LF L+ESS+  H +ER+VR    E E  + F
Sbjct: 590  GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFXLEESSIERHKDERKVRIIDDENEHTESF 649

Query: 715  EDSGHHKENSDIE-SEKDDTDEKPVAESLSVPNSAHPAPSHTNAS--------NVDSHE- 764
             +     E+ +IE  E  + DE+P  +     N+ +P    +           N DS+E 
Sbjct: 650  IE-----EDKEIELIEDSEEDEQPENK-----NNLNPIQEESKKDLFMEEKNINQDSNEE 699

Query: 765  ---FPAEDKTISNGIDSKIFDIARNVAAPVTPQ---LEDLIDRALGLGSASISST---KH 815
               F   D  I   I      +  +   P T     +ED+I   LG     + +T   K 
Sbjct: 700  DNPFQFPDTQIKIDISGSKVKLHVDNNQPTTISQEVVEDII--YLGDDKPVLINTMCKKK 757

Query: 816  GIETTQFDLSEEDKH-VERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDA 874
             +E  Q    +E+K  +E  +   D+  + + ++ +LKK              KE+ KD 
Sbjct: 758  DLEVKQKSFKKENKEKIEIDSKKNDQVILKRGQKGRLKKM-------------KEKYKD- 803

Query: 875  SSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGD 934
                                               QDEE+R + M +L SAG  +     
Sbjct: 804  -----------------------------------QDEEDRRLSMQVLQSAGNAK----- 823

Query: 935  PQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDET 994
               E+   ++ K P+      PK   K K         K  P  +   VE+         
Sbjct: 824  ---EDKKKNRNKDPS-----GPKQQTKKKSI------MKSVPPQNFQIVEN--------- 860

Query: 995  AEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRV 1054
                   +EEED     E     ++ +D LTG P+  D LL+ IPV  PY+ V +YK++V
Sbjct: 861  -------IEEEDTGPGPE-----IDMLDQLTGKPVTEDELLFAIPVVAPYNTVLNYKFKV 908

Query: 1055 KIIPGTAKKGKGIQIFYSLLL 1075
            K+ PGT K+GK  +   ++ +
Sbjct: 909  KLTPGTGKRGKAAKTAMAVFM 929


>gi|440907236|gb|ELR57405.1| Serologically defined colon cancer antigen 1 [Bos grunniens mutus]
          Length = 1077

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 310/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGVPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|167516076|ref|XP_001742379.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779003|gb|EDQ92617.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1051

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 380/1121 (33%), Positives = 566/1121 (50%), Gaps = 164/1121 (14%)

Query: 2    VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R +T D+  ++  L+ RL GMR +N+YD+  KTY+ +L  +         EK +LL+
Sbjct: 1    MKNRFSTLDLQVQLAELKPRLTGMRVANIYDIDNKTYLIRLQQTP--------EKAVLLI 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG+R HTT Y   K + PSGFT+K RKH+RTRRL D++QLG DR+I   FG    A+++
Sbjct: 53   ESGIRFHTTEYDWPKGDAPSGFTMKCRKHLRTRRLTDMKQLGVDRVIDLTFGSDEAAYHL 112

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            I+ELY +GNI+LT+S + +L LLR  R D + V      RYP E  +     T  +L AA
Sbjct: 113  IIELYDRGNIILTESTYNILALLR-RRTDSEDVKFAVGERYPIEASKQPSPITRERLEAA 171

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              SSK+ D                                                 P  
Sbjct: 172  FASSKKGD-------------------------------------------------PAR 182

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL-QDVI 299
            K  L   +  GP   EH +   G   N K+ +   + D+  +VL  A+ + ED L + + 
Sbjct: 183  KA-LNPIMECGPQAIEHCMQLHGFPNNAKVGKGLAIPDDLDRVLA-AMKQAEDLLFEKLK 240

Query: 300  SGDIVPEGYILMQNKHLGKDHPPTESG--SSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +GDI     ++   ++L  D      G  +   + D+  P ++ QF  R  +   +FD A
Sbjct: 241  AGDISVSATVV---QYLPIDTIRLAEGDEAPVLVLDDVIPFMMKQFEDRPHIHLPSFDRA 297

Query: 358  LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
            +D ++S++E+Q+ + +   +E AA  KL  +    E  V   +   + + + A+++E NL
Sbjct: 298  IDRYFSELETQKLQMRAMQQEAAALKKLEAVKASHEKHVEGYRLAQEANERKAQVLEANL 357

Query: 418  EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
            E VD AI  +R  +AN++ W ++A +VKE ++ G+P A +ID L L++N M++ L N   
Sbjct: 358  EQVDRAIEIIRSMVANKLDWVEIAELVKEAQQQGDPDARIIDGLKLDKNHMTIRLPNPEA 417

Query: 475  -----------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
                                            +      T P   +++DLAL+A+ANA  
Sbjct: 418  HAESSESDSSSASDSEEEEEEEEQKAIAAASKKRGTSSATDPFLTIDLDLALTAYANACN 477

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
             Y+ KK    K++K   A   A ++AE+KT+ Q+ Q      ++  RK++WFEKF WFIS
Sbjct: 478  MYQHKKISAVKEQKARDATELAIQSAERKTQQQLQQNNVTTAVNKQRKIYWFEKFLWFIS 537

Query: 566  SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
            SENYLVI GRD QQNE++V+RY+ KGDVYVHADLHGA+S ++KN R    VPP+TL +AG
Sbjct: 538  SENYLVIGGRDRQQNEILVRRYLKKGDVYVHADLHGAASVIVKNPRGGD-VPPITLQEAG 596

Query: 626  CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
               V +S +W+++M TSAWWV+  QVSKTAP GEYL+ GSFMIRGKKN+LP   L+MGF 
Sbjct: 597  HMAVIYSGSWEARMPTSAWWVHHDQVSKTAPAGEYLSTGSFMIRGKKNYLPKVELVMGFA 656

Query: 686  LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVP 745
            +LF++DE S+  H+NERR RG  E              S+  S       +PV  S S  
Sbjct: 657  ILFKVDEGSVARHVNERRPRGLGEA-------------SEASSPAVSRPPEPVEASSSGA 703

Query: 746  NSAHPAPSHTNASNVDSHE-------FPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
              A P  + + A +  + +        PA    ++  + ++        AA   P  E  
Sbjct: 704  GDASPVAAESEAGDSTATQNKNKAESQPAGTAVVAPEVPAESSSAMSTAAAMAFPDTEIS 763

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKK----- 853
            +D A    SAS+S T    +      SE D  V R+     K  +S  ++R+LKK     
Sbjct: 764  VDYASATPSASVSRTVSHAQ------SEADTAV-RSRMQGSKARLSAKQKRQLKKKGYTP 816

Query: 854  GQGSSVVDPKV-EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDE 912
             Q SS+   ++ E   E G+D  S+ E   R    +   +   ++GK KK ++KY +QDE
Sbjct: 817  AQMSSLTAAELQELTGESGED--SEGEDDQRNEHAQQPAVRG-KRGKKKKKQQKYAEQDE 873

Query: 913  EERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDC 972
            +ER +R+ LL SAG        PQ   A     +K             K       ++D 
Sbjct: 874  DERQLRLDLLGSAG--------PQLSRADKRARRKE------------KLAAKQQATRDP 913

Query: 973  KEHPDDSSHGVEDN-----PCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGN 1027
             E        V D        VGL  T +  K   +E+   +I  +E+ +L  +D LTG 
Sbjct: 914  SEAVLQQISSVTDRIMATAESVGLVTTEQTSK---QEKIDEQIQAQEEDQLTYLDALTGL 970

Query: 1028 PLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            P P D L++ +PV  PY AV+ Y+++ KI+PG  KKGK I+
Sbjct: 971  PHPDDELMFALPVVAPYGAVRQYRFKAKIVPGEQKKGKAIR 1011


>gi|348572143|ref|XP_003471853.1| PREDICTED: nuclear export mediator factor NEMF-like [Cavia
           porcellus]
          Length = 1076

 Score =  540 bits (1390), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 307/744 (41%), Positives = 430/744 (57%), Gaps = 96/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHARAAE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  D             +++A K  L                             L
Sbjct: 165 LTLERLTDV------------IASAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKEIEKVLVCMQKAEDYVK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEVDKPAEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D E R+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHETRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 -------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANARRWYE 508
                      ++  E  LP  K                   V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVSVEKNETELPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E  VPP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-VVPPRTLTEAGTMA 597

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 598 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 657

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E M+
Sbjct: 658 KVDESCVWRHRGERKVRVQDEDME 681


>gi|242018711|ref|XP_002429817.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
 gi|212514835|gb|EEB17079.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
          Length = 1024

 Score =  539 bits (1389), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 308/728 (42%), Positives = 431/728 (59%), Gaps = 90/728 (12%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V   ++ IG+R + VYD+  KTY+ +L  +         EKV++L+E
Sbjct: 1   MKTRFSTFDIVCSVAEFQKYIGLRVNQVYDIDHKTYLIRLQKTD--------EKVVILLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF +KLRKH+R +RLE ++QLG+DRI+  QFG G  A++V 
Sbjct: 53  SGTRIHTTDFEWPKNVAPSGFCMKLRKHLRNKRLESLKQLGFDRIVHLQFGTGDAAYHVF 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR-VFERTTASKLHAA 180
           LELY +GNI+LTD +  +L +LR H + DK +    R +YP    R V    T  ++   
Sbjct: 113 LELYDKGNIVLTDCDLIILNILRPHTEGDK-IRFAVREKYPINRARDVCNFPTEEQIKNI 171

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             S+K                                           SND        L
Sbjct: 172 FASAK-------------------------------------------SNDN-------L 181

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YGPAL EH++L   L    K+ +   L++N +  ++ A+ + +D +++   
Sbjct: 182 KKILNFNLDYGPALIEHVLLGVDLRGTEKIGQGFDLQNN-LSKIINALKEAQDIVENASL 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESG-SSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
              V +GYI+ +      +  PTESG S   +  EF P L  Q     F + ETF  A+D
Sbjct: 241 S--VSKGYIIQK-----VEKRPTESGMSDFHVNTEFHPFLFRQHVKNPFNECETFLKAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNL 417
            F+S +ESQ+ + +   +E  A  K+  +  D   R+  L   QE+DR +K AELI  NL
Sbjct: 294 SFFSSLESQKIDMKAINQEKEALKKIENVRRDHNQRLQQLFETQELDR-IK-AELITTNL 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             VD A+LA+R A+AN++SW D+  +VKE + AG+PVA  I KL L+ N ++L LS+   
Sbjct: 352 TLVDQAVLAIRTAIANQISWPDIDILVKEGKNAGDPVASSIKKLKLDINHITLQLSDPYR 411

Query: 475 ----------NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTI 521
                       +  DD+   + V K   V++DL L+A ANAR++Y++K+    KQ+KTI
Sbjct: 412 SDSSSSEEEEEEETNDDKPIKIKVPKIIDVDIDLDLTAFANARKYYDMKRSAAKKQQKTI 471

Query: 522 TAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            +  KA K+AEKKT+  + + KT+ NI+ +RK  WFEKF WFISSENYLVI+GRD  QNE
Sbjct: 472 ESQDKALKSAEKKTKQALKEMKTIVNITKVRKTFWFEKFFWFISSENYLVIAGRDMMQNE 531

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT 641
           ++VKRYM  GD+YVHAD+HGASS +IKN   E PVPP TLN+AG   + +SQAW++K+VT
Sbjct: 532 LLVKRYMKSGDLYVHADIHGASSVIIKNPSNE-PVPPKTLNEAGVMAISYSQAWEAKVVT 590

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE 701
           SAWWV+  QVSKTAPTGEYL  GSFMIRGKKN+LPP  LIMGF  LF+LD++SL  H ++
Sbjct: 591 SAWWVHNTQVSKTAPTGEYLGTGSFMIRGKKNYLPPANLIMGFSFLFKLDDNSLSRHKDD 650

Query: 702 RRVRGEEE 709
           R+VR  EE
Sbjct: 651 RKVRSLEE 658


>gi|296483277|tpg|DAA25392.1| TPA: hypothetical protein BOS_10863 [Bos taurus]
          Length = 1076

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 309/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+V+ ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVKVQDEDME 681


>gi|311245467|ref|XP_001924665.2| PREDICTED: nuclear export mediator factor NEMF [Sus scrofa]
          Length = 1076

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 309/747 (41%), Positives = 430/747 (57%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLCAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH++ RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKGRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEIIASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E+ +Q   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEECMQTTSS 241

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYIIQKREVKPSLEVDKPTVD----ILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -------------NLDEMDDEEKTLPVEK----------------VEVDLALSAHANARR 505
                        N ++ + E      +K                V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDINTEKNESEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E MD
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDMD 681


>gi|417405795|gb|JAA49597.1| Putative rna-binding protein [Desmodus rotundus]
          Length = 1081

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 306/742 (41%), Positives = 426/742 (57%), Gaps = 106/742 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP    R  E   T  +L  
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVGHARAVEPLPTLERLTE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            +TS+ E +                                                   
Sbjct: 173 VITSAAEGE--------------------------------------------------L 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK  L   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRALNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  E       Y+EF P L +Q     +++FE+FD 
Sbjct: 239 ASNFSGKGYIIQKREVKPSLEVDKPAEE----ILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+    D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNFRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L  VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LPVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 --------------NLDEMDDE-----------------EKTLPVEKVEVDLALSAHANA 503
                         N+++ + E                 +K  P+  V+VDL+LSA+ANA
Sbjct: 415 LLSEEEDDDVDGEINVEKSETEPPKGKKKKQKNKQLQRPQKNRPL-LVDVDLSLSAYANA 473

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
           +++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
           ISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +
Sbjct: 534 ISSENYLIIGGRDQQQNEVIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTE 592

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MG
Sbjct: 593 AGTMALCYSAAWDARIITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 652

Query: 684 FGLLFRLDESSLGSHLNERRVR 705
           F  LF+++ES    H  ERRVR
Sbjct: 653 FSFLFKVEESCAWRHRGERRVR 674


>gi|281362528|ref|NP_001163721.1| caliban, isoform B [Drosophila melanogaster]
 gi|281362530|ref|NP_651341.2| caliban, isoform C [Drosophila melanogaster]
 gi|332319785|sp|Q9VBX1.2|NEMF_DROME RecName: Full=Nuclear export mediator factor NEMF homolog; AltName:
            Full=Protein Caliban
 gi|157816462|gb|ABV82224.1| IP12923p [Drosophila melanogaster]
 gi|272477156|gb|ACZ95015.1| caliban, isoform B [Drosophila melanogaster]
 gi|272477157|gb|AAF56406.2| caliban, isoform C [Drosophila melanogaster]
          Length = 992

 Score =  537 bits (1384), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 403/1114 (36%), Positives = 579/1114 (51%), Gaps = 205/1114 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1    MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55   SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E   LT L   R   +G  +    R    + R  + T   +L A +
Sbjct: 115  LELYDRGNVILTDYE---LTTLYILRPHTEGENLRFAMREKYPVERAKQPTKELELEALV 171

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                               K  + ++N +             L+
Sbjct: 172  -----------------------------------KLLENARNGD------------YLR 184

Query: 242  TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
             +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 185  QILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272  EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
            +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245  QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPTENGTVEFF 297

Query: 332  YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298  FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390  MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358  NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448  RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
            +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416  QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507  YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
            Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 476  YDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 535

Query: 567  ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
            ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 536  ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAGS 594

Query: 627  FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  L
Sbjct: 595  MAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSL 654

Query: 687  LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESLS 743
            LF+L++S +  HL ER+VR     ++D +   + KEN    D+ S+ +D D        S
Sbjct: 655  LFKLEDSFIERHLGERKVR----SLEDDQIDPNVKENEVEHDLLSDNEDAD--------S 702

Query: 744  VPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLIDR 801
              N + P      +SN +   FP  +  I +       D  R +  +  V P++E+  + 
Sbjct: 703  NINLSEP------SSNTEITAFPNTEVKIEH-------DTGRIIVRSDSVNPEIEETKES 749

Query: 802  ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
             + L                      DK +++T        ++   R+K        V  
Sbjct: 750  EVVL----------------------DKILKKTDDEETTIILAGPSRKK-------QVSA 780

Query: 862  PKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMA 920
             K + +K R K +A+ Q    V        ++ RGQKGKLKKMK+KY DQD+EER IRM 
Sbjct: 781  KKTKEDKARAKQEAAKQEVPPVSSEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRMM 840

Query: 921  LLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSS 980
            +L S+GK                  +KP  S   A KV  K       S+  KE+     
Sbjct: 841  ILKSSGK------------------EKPQAS---ADKVVEK-------SESTKEYVKPEK 872

Query: 981  HGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPV 1040
                 NP V LD+  E+            +G    G ++ ++ LTG P   D LL+ IPV
Sbjct: 873  SAAPKNP-VELDDADEV-----------PVG----GDVDVLNSLTGQPHEGDELLFAIPV 916

Query: 1041 CGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
              PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 917  VAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 950


>gi|119586145|gb|EAW65741.1| serologically defined colon cancer antigen 1, isoform CRA_a [Homo
           sapiens]
          Length = 828

 Score =  537 bits (1383), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 310/747 (41%), Positives = 433/747 (57%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEIVASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHQGERKVRVQDEDME 681


>gi|312384850|gb|EFR29482.1| hypothetical protein AND_01485 [Anopheles darlingi]
          Length = 1109

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 388/1143 (33%), Positives = 582/1143 (50%), Gaps = 209/1143 (18%)

Query: 1    MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L  +         EKV+LL+
Sbjct: 1    MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLARNE--------EKVVLLL 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG+R HTT++   K   PSGFT+KLRKH++ +RLE ++QLG DRI+ FQFG G  A+++
Sbjct: 53   ESGLRFHTTSFEWPKNMAPSGFTMKLRKHLKNKRLESLQQLGVDRIVDFQFGSGEAAYHI 112

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113  ILELYDRGNILLTDCELRILNILRPHVEGEE-LRFAVREKYPK----------------- 154

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                        D+  +D               +G  S +  K + + ++ G      TL
Sbjct: 155  ------------DRAKQD---------------QGPPSVEQIKGAIEKAHPGD-----TL 182

Query: 241  KTVLGEALGYGPALSEHIILDTGLV-----------PNM-------------KLSEVNKL 276
            +T L   L YG ++ +H++ + GL             N+             + ++V +L
Sbjct: 183  RTALNPVLEYGASVIDHVLHEHGLFGCRIGGELPVDANLPKKAKRKQKNICKEFTKVFEL 242

Query: 277  EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--- 333
            E N  + L+ A+   E  LQ+    +  P GYI+ +     K+  P + G   + Y    
Sbjct: 243  E-NDFEPLISALNDAETMLQNA-RKEPSP-GYIIQK-----KEVRPAKEGEKEEYYFTNL 294

Query: 334  EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            E+ P + +Q++     +F++F +A+DEFYS +E+        A+E  A  KL+ +  D  
Sbjct: 295  EYQPYMYSQYQGEPCKEFDSFTSAVDEFYSSLETL-------AQEREALKKLSNVKTDHA 347

Query: 394  NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             R+  L +      K AELI  N + VD A+LAV+ ALA +MSW D+  +VK  +   +P
Sbjct: 348  KRIEELTKAQLGDRKKAELITRNQDLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDP 407

Query: 454  VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-------------LPVEKVEVDLALSAH 500
            VA  I +L LE N +SL LS+    +D+ E               L    V+VDLALSA 
Sbjct: 408  VASCIRQLKLEINHISLYLSDPYAFLDENESDNEEDSDREEDEEKLEPMVVDVDLALSAF 467

Query: 501  ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVHWFEK 559
            ANARR+Y+ ++    K++KTI + SKA K AE+KT +Q L++ +T   IS +RKV+WFEK
Sbjct: 468  ANARRYYDQRRFAARKEQKTIESSSKALKNAERKT-IQTLKDVRTQTTISKVRKVYWFEK 526

Query: 560  FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
            F WFISSENYL+I GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN    + +PP 
Sbjct: 527  FYWFISSENYLIIGGRDQQQNELIVKRYMRPNDIYVHAEIQGASSVIIKNPAGGE-IPPK 585

Query: 620  TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
            TL +AG   + +S AWD+K+VTSA+WV+  QVSKTAPTGEYLT GSFMIRG+KNFLPP  
Sbjct: 586  TLLEAGTMAISYSVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCH 645

Query: 680  LIMGFGLLFRLDESSLGSHLNERRVRG--EEEGMDDFEDSGHHKENSDIESEKDDTDEKP 737
            L++G   LF+L++SS+  H  ER+VR   EE  +   E+     E+ D E + DD  ++ 
Sbjct: 646  LVLGLSFLFKLEDSSVERHRGERKVRNFDEESVISKEEERSEISESVDQEIKLDDESDQE 705

Query: 738  VAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVA-----APVT 792
              E              TN           ED+   N +  K+  ++ + +     +P T
Sbjct: 706  EQEP------------ETN-----------EDQQPDNSLSQKVAGLSVSESQETEKSPST 742

Query: 793  PQLEDLIDRALGLGSASIS----STKHGIETTQFDLSEEDKHVERTATV---RDKPYISK 845
             Q +D  ++        I     + K  + T    L   +   +R A +    +KPYI +
Sbjct: 743  GQSDDEPEQGPQFPDTHIKVEHDTGKVSVRTDPI-LQRLNSETDRKAEIFLGDEKPYIIQ 801

Query: 846  AERRKLKKGQGSSVVDPKVEREKERGKDASSQP-ESIVRKTKIEG----GKISRGQKGKL 900
                +LK+          + + K++ KD   +  E  V   K EG    G++ RGQ+ K+
Sbjct: 802  PAAPRLKQ----------ISKSKQKAKDKEQKAKEKQVAPQKDEGQQKQGQLKRGQRAKM 851

Query: 901  KKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCY 960
            +K+KEKY DQDE++R + M +L SAG          N+  S    ++         +   
Sbjct: 852  RKIKEKYKDQDEDDRKMIMEILKSAG----------NQKPSEGAREEDEQHQQKQQQQKK 901

Query: 961  KCKKAGHLSKDCKEHPDDSSHGVEDNPCVG-LDETAEMDKVAMEEEDIHEIGEEEKGRLN 1019
            +    G+  K  K  P +     +D P V  LD    +    +EE+++            
Sbjct: 902  EWHGEGNAGKRLK--PGEFEEFGDDTPAVTDLDMLDALTGQPVEEDEL------------ 947

Query: 1020 DVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLS 1079
                           L+ +PV  PY ++ +YKY+VK+ PGT K+GK  ++   + L    
Sbjct: 948  ---------------LFAVPVVAPYQSLHNYKYKVKLTPGTGKRGKASKMALQIFLKDKQ 992

Query: 1080 LTP 1082
             TP
Sbjct: 993  CTP 995


>gi|384489957|gb|EIE81179.1| hypothetical protein RO3G_05884 [Rhizopus delemar RA 99-880]
          Length = 1044

 Score =  536 bits (1380), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 300/720 (41%), Positives = 426/720 (59%), Gaps = 111/720 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R N  DV A V  L+ RLIG+R  NVYD++ KT++FK         +   +K L+L 
Sbjct: 1   MKQRFNALDVRATVSNLKERLIGIRLQNVYDVNAKTFLFKF--------AKPDDKELVL- 51

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA--H 118
                                    +RKH+RTRRL +VRQLG DRI+ F+F  G  +  +
Sbjct: 52  -------------------------IRKHLRTRRLTNVRQLGVDRIVDFEFAGGEKSIGY 86

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++I E YA GNI+LTD E+ +L LLR+ +  +     +        +   F++  A +L 
Sbjct: 87  HIICEFYASGNIILTDHEYRILALLRAVQPTETLKMAVGEIYNIQSVLNDFQKVEAEQLR 146

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            AL+++   D                                                  
Sbjct: 147 NALSAAGPKD-------------------------------------------------- 156

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA--IQVLVLAVAKFEDWLQ 296
            LK +L     YGPA+ EHIIL++ L PNMK++      +N+  +Q L+    K +D ++
Sbjct: 157 NLKKILNIKFEYGPAMIEHIILESELDPNMKVASDFDTSENSPMMQALLEGFKKADDMIE 216

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
              +G+ VP+GYI++QN    +     +     +IYDEF P L  QF +R+F +F TFD 
Sbjct: 217 S--TGNSVPKGYIILQND--TRQTKNEKEEEEMEIYDEFHPHLYKQFSNRKFKEFSTFDQ 272

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S IE+Q+ E + + +E+AA  KL  + ++QE RV +L  +   + + A+LIE N
Sbjct: 273 AVDEFFSSIEAQKLELKTRRQEEAALKKLEAVKLEQEKRVESLLNQQLTNTRKAQLIELN 332

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
           L+ VDAAI  +R A+A++M W+DL  +VKEE++ GNP+A +ID L LE N ++LLL++  
Sbjct: 333 LQFVDAAITIIRNAVASQMDWQDLNDLVKEEKRRGNPIALIIDTLKLETNQVTLLLTDPE 392

Query: 477 DEMDDEEKTLP--------------VEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
           +  + E                   + K++VD+ L+A ANAR++YE KK   SK EKTI 
Sbjct: 393 EHEESESDDEEEEEEEEEKEEKPKEIFKIDVDIGLTAFANARKYYEQKKTTASKHEKTIE 452

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           A +KA K+AE+K R  + + K  A I+ +RK  WFEKF WFIS+E YLVI+GRD QQNEM
Sbjct: 453 ASTKALKSAERKIRKDLKETKITATINKIRKPFWFEKFQWFISTEGYLVIAGRDMQQNEM 512

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPE---QPVPPLTLNQAGCFTVCHSQAWDSKM 639
           +V+RY+SK DVYVHADLHGA+S ++KN +P+   QP+ P TL QAG  +VC S+AWDSK+
Sbjct: 513 LVRRYLSKDDVYVHADLHGAASVIVKN-KPQANGQPISPSTLYQAGIMSVCQSKAWDSKI 571

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           VTSA+WVYP QVSK+AP+GEYLT GSFMIRGKKNFLPP  L+ GFG LF+LDESS+G+H+
Sbjct: 572 VTSAYWVYPDQVSKSAPSGEYLTTGSFMIRGKKNFLPPVQLVYGFGYLFKLDESSIGNHI 631


>gi|405952718|gb|EKC20496.1| Serologically defined colon cancer antigen 1-like protein
           [Crassostrea gigas]
          Length = 1084

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 295/724 (40%), Positives = 426/724 (58%), Gaps = 84/724 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +  D+A  +K L+R  GMR  NVYD+  KTY+ KL            +K ++L+E
Sbjct: 1   MKSRFSKVDIAVVIKELKRFYGMRVVNVYDVDSKTYLIKL--------GKPDDKAVILIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+R+H T Y   K   PSGF++KLRKHI+ RRLE++ QLG DRI+  QFG G  A++VI
Sbjct: 53  SGIRIHGTEYDWPKNMAPSGFSMKLRKHIKGRRLENINQLGMDRIVDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD EFT+L +LR   D  + V    R  YP    +     +  KL   +
Sbjct: 113 LELYDRGNVVLTDFEFTILNILRPRTDTCQDVKFAVRETYPVSAAKQHSVPSNEKLREVI 172

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
            ++K  D                                                   LK
Sbjct: 173 LAAKVGDV--------------------------------------------------LK 182

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE---VNKLEDNAIQVLVLAVAKFEDWLQDV 298
            VL   L YGPA++EH +   G   N+K+ +   V +  D     + LA +  +   ++ 
Sbjct: 183 KVLLPHLDYGPAVTEHCLQCIGFPENVKVGKGFSVTEDMDKLTSAIELAESLLKTLSEEP 242

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVKFETFDA 356
             G       +++Q K   +     + G + ++  Y+EF P+L  QF ++    F+ F+ 
Sbjct: 243 CQG-------VIVQKK---EKRAAVKEGENAELLTYEEFHPMLFKQFENKPHSIFDNFNK 292

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           ++DEF+S+IESQ+ + +   +E +A  KL+ I  D E R+  L++E +  +    LIE N
Sbjct: 293 SVDEFFSQIESQKLDMKALQQEKSALKKLDNIKKDHEKRIEGLQKEQETDINKGRLIELN 352

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL---- 472
           L  VD A+L VR ALAN++ W ++  +V E +  G+PVA  I  L L+ N ++LLL    
Sbjct: 353 LPLVDQALLIVRSALANQIDWTEIENLVHEAQLQGDPVASCITGLKLDSNMITLLLRDPY 412

Query: 473 --SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             S++  + DD++  L   K+++D+++SA+ N+R++++ KK    K++KTI A +KA K+
Sbjct: 413 RYSDDEYDDDDDDDVLKPTKIDIDISMSAYGNSRKYFDKKKTAAKKEQKTIDASAKALKS 472

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AE+KT+  + +  T A I+  RK +WFEKF WFI+SENYLVI GRD QQNEMIVKRY+  
Sbjct: 473 AERKTKETLKEVATAATINKARKTYWFEKFLWFITSENYLVIGGRDQQQNEMIVKRYLRP 532

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHADLHGASS V+KN   E PVPP +LN+AG   +C+S AWD+K+VTSAWWVY  Q
Sbjct: 533 GDLYVHADLHGASSCVLKNPSGE-PVPPKSLNEAGTMAICNSVAWDAKVVTSAWWVYHDQ 591

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAP+GEYLT GSFMIRGKKN+LPP  L+ GFGLLF+L++ S+  H  ER+V     G
Sbjct: 592 VSKTAPSGEYLTTGSFMIRGKKNYLPPTHLVYGFGLLFKLEDDSIERHKGERKVH----G 647

Query: 711 MDDF 714
           +DD+
Sbjct: 648 VDDY 651


>gi|119586147|gb|EAW65743.1| serologically defined colon cancer antigen 1, isoform CRA_c [Homo
           sapiens]
          Length = 1001

 Score =  534 bits (1376), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 308/746 (41%), Positives = 425/746 (56%), Gaps = 109/746 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEIVASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV-- 298
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++    
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMKTTSN 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG + P    +      G              Y+EF P L +Q     +++FE+FD A+
Sbjct: 242 FSGKVAPCILTIYCCDLFG--------------YEEFHPFLFSQHSQCPYIEFESFDKAV 287

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+
Sbjct: 288 DEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQ 347

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---- 474
            VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    
Sbjct: 348 IVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLL 407

Query: 475 ----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRW 506
                           N  E    +K     K            V+VDL+LSA+ANA+++
Sbjct: 408 SEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKY 467

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISS
Sbjct: 468 YDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISS 527

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG 
Sbjct: 528 ENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGT 586

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 587 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 646

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 647 LFKVDESCVWRHQGERKVRVQDEDME 672


>gi|125858778|gb|AAI29514.1| LOC733300 protein [Xenopus laevis]
          Length = 906

 Score =  534 bits (1376), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 359/996 (36%), Positives = 521/996 (52%), Gaps = 173/996 (17%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELTDSLLGMRVHNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  +  E   +  +L  
Sbjct: 113 IVELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPIDHAKAPEPLLSVERLKE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L ++K+ D                                                   
Sbjct: 173 VLDNAKKGD--------------------------------------------------Q 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E ++   +
Sbjct: 183 LKKVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEGYMD--L 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +G+I+ Q +       P ++       +EF P L  Q  +  +++ ++F+  +D
Sbjct: 239 TQNFNGKGFII-QKREKKPSLEPDKASEDIFTNEEFHPFLFAQHANSTYIELDSFNKTVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+SK+E Q+ + +   +E  A  KL+ +  D E+R+ +L+   D      ELIE NL+ 
Sbjct: 298 EFFSKLEGQKIDIKALQQEKQALKKLDNVRKDHEHRLESLQYAQDADKAKGELIEMNLDI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++++L N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMMLKNPYVLS 417

Query: 475 ----------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
                                    +    +K  PV  V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 418 EEESEDEEDEKEEEPKGKKKKAKNKQPKKVQKNKPV-LVDVDLSLSAYANAKKYYDHKRH 476

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K +KTI A  KAFK+AEKKT+  + + +TV+ I   RKV+WFEKF WFISSENYL+I
Sbjct: 477 AAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENYLII 536

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +GRD QQNE+IVKRY++ GDVYVHADLHGA+S VIKN   E PVPP TL +AG   VC+S
Sbjct: 537 AGRDQQQNELIVKRYLNPGDVYVHADLHGATSCVIKNPTGE-PVPPRTLTEAGTMAVCYS 595

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWV+ +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGFG LF++DE
Sbjct: 596 AAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKVDE 655

Query: 693 SSLGSHLNERRVRGEEEGMDDFE--------------DSGHHKENSDIESEKDDTDEKPV 738
           + +  H  ER+V+  +E M+                 D+     NS  + EK DT E+P 
Sbjct: 656 TCVWRHKGERKVKQLDEDMESVTSSNIELAAEENIPLDAPEEDSNSSEDDEKSDTQEQPF 715

Query: 739 A-------------ESLSVPNSAHPAPSHTNASNVDSH----------EFPAEDKTISNG 775
           +             +S+      + APS  N+    SH          E   E       
Sbjct: 716 SGDGYSKEQKGPSTDSIVHKQRENMAPSDQNSDQESSHSEENNSTIKEEAETEPSYPDTA 775

Query: 776 IDSKIFDIARNV--AAPVTPQLEDLIDRAL---GLGSASISSTKHGIETTQFDLSEEDKH 830
           ID       R +  A P  P     +D  L     G   +S+ +      +   +++D++
Sbjct: 776 IDLSHLQTKRTLSKATPTEP-----VDAPLQNESSGRKHMSAKEKRELKKKKKPNDQDEY 830

Query: 831 VERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGG 890
                          +E+++L  G    V D +   +   G    SQP            
Sbjct: 831 -------------QPSEQKEL--GDKKDVADSQSAPQASTG----SQP------------ 859

Query: 891 KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
            + RGQK KLKK+KEKY DQDEE+R++ M LL SAG
Sbjct: 860 -MKRGQKSKLKKIKEKYKDQDEEDRDLIMQLLGSAG 894


>gi|330841435|ref|XP_003292703.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
 gi|325077022|gb|EGC30763.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
          Length = 1084

 Score =  534 bits (1375), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 302/753 (40%), Positives = 456/753 (60%), Gaps = 94/753 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+   V  L++ LIG+R +N+YDLSP+ ++ K         S    K  L++
Sbjct: 1   MKTRFSSIDIRTTVFNLQKSLIGLRLANLYDLSPRVFLLKF--------SRPDFKKNLII 52

Query: 61  ESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           ESG+R+H+T + RDK + TP+ F+L LRK+++T+RLE V+QLG DR++ F FG G+   +
Sbjct: 53  ESGIRIHSTNFIRDKGDHTPAPFSLTLRKYLKTKRLESVKQLGVDRVVDFTFGSGVAVQH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           +I+ELY+ GNI+LTD ++ +L             AI+  H+Y  +     E      ++ 
Sbjct: 113 LIIELYSIGNIILTDGDYRIL-------------AILRTHQYNQD-----ESVAVGDVYP 154

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
                     N+  K  E    + ++  EN                        + K+ T
Sbjct: 155 V---------NKAKKPTEFTTELIDSIIEN-----------------------TQDKKET 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK V  ++L +GP L EH IL  GL P++KL +     D+++    L ++ F++  Q + 
Sbjct: 183 LKQVFNKSLDFGPELIEHCILSAGLQPSLKLEQY----DHSVSSQAL-ISAFKEG-QKIY 236

Query: 300 SGDIVPEGYILMQNKHLGKD--------------HPPTESGSSTQIYDEFCPLLLNQFRS 345
              +  +GYI++++    K                PP E      +Y+EF P L  Q+ S
Sbjct: 237 DQSVASKGYIVLKDPKQQKPQQQKKQQQQTSTTAEPPKE----IVMYEEFVPFLYKQYES 292

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           ++++++++FD A+D+F+S+IESQ+ EQQ   +E     KL+K+  DQ+ R+ +L      
Sbjct: 293 KKYIEYDSFDGAVDQFFSEIESQKLEQQRIQQEQTVLKKLDKVKEDQQRRIDSLFANEAE 352

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYL 463
           +V+ AELIE NL++VD  IL +R  +AN M+W+ L +++KEE+K  NP  VA  I +L L
Sbjct: 353 NVRKAELIEANLQEVDQCILIIRSGVANSMNWDTLNQLLKEEKKK-NPYSVATKIQRLKL 411

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKT 520
           E N ++L L++     DDEE     +K   ++VD++LSA ANAR++Y+ KK+   K +KT
Sbjct: 412 ESNQITLALTDGFLYDDDEEVNKTNKKPTLIDVDISLSAFANARKYYDTKKQSHEKAQKT 471

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           I+    A KAAE KTR Q+ + K+  ++  MRKV WFEKF+WFISS+NY+V+SGRDAQQN
Sbjct: 472 ISQAEFALKAAESKTRQQLSEVKSKHSMIQMRKVFWFEKFHWFISSDNYIVVSGRDAQQN 531

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           E++ K+Y+ K DVYVHAD+ G++S VIKN    + +PP TL QAG  T+C+S AW +K+V
Sbjct: 532 ELLFKKYLEKDDVYVHADIFGSTSCVIKNPNGGE-IPPNTLIQAGTMTMCYSNAWSAKVV 590

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
           TSA+WVY HQVSKTAP+GEYLT GSFMIRGKKN+LP   L+MGFG +F++DES +G+HLN
Sbjct: 591 TSAYWVYSHQVSKTAPSGEYLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDESCIGNHLN 650

Query: 701 ERRVRGEEEGMDDFEDSGHHKEN-SDIESEKDD 732
           ER+      G ++ ED G    N S+I +  DD
Sbjct: 651 ERKPLL--SGSNNHEDDGDASNNSSEIVTTNDD 681


>gi|71679669|gb|AAI00005.1| Zgc:153813 protein [Danio rerio]
          Length = 881

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 359/962 (37%), Positives = 516/962 (53%), Gaps = 126/962 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K R H+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRMHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R  E   + +    
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENARAEEPIISLQRLTQ 172

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           + S                           G Q G +                      L
Sbjct: 173 VLS---------------------------GAQTGDQ----------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YG  L EH +   G+    K+     L   +++VL  A+   E+++Q   +
Sbjct: 184 KRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEEYMQK--T 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ A
Sbjct: 241 ANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNKA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E NL
Sbjct: 297 VDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELNL 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V  A+  VR ALAN++ W ++ RMV E + AG+PVA  I +L L+ N ++LLL N   
Sbjct: 357 PVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEA 416

Query: 475 ----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KA
Sbjct: 417 CPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKA 476

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY
Sbjct: 477 FKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRY 536

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV 
Sbjct: 537 LRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQ 595

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR-- 705
             QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++  
Sbjct: 596 HDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMKTL 655

Query: 706 ----------------GEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAH 749
                           GE E +   EDSG+ +EN+D  +  DD +E+ V +S        
Sbjct: 656 EEEEEEEDTTSTAEILGEGEEL-LAEDSGNEEENTDSRT-ADDDEEQQVCKSDEDDEEDQ 713

Query: 750 PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS 809
                    + D  E       + +  DS+       ++ P         D  + L    
Sbjct: 714 RVCREDEDEDEDEDEDALSAADVEDAADSEEEHPGAQISFP---------DTCISLSHLQ 764

Query: 810 ISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
           I+ T H  +TT     +E + V     V  K +++  +RR +KK Q       K E  ++
Sbjct: 765 INRTAH-TDTTD---PQESQQVNTDTQV--KKHLTAKQRRDMKKKQ-------KQENTED 811

Query: 870 RGKDASSQPESIVRK-TKIEGGK----ISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
             +  + QPE+  R  T   GG     + RGQ+ KLKKMK+KY DQDEE+R + M +L S
Sbjct: 812 LEEGDAKQPETASRTPTSKSGGAAAAPLKRGQRNKLKKMKDKYKDQDEEDREMMMKILGS 871

Query: 925 AG 926
           AG
Sbjct: 872 AG 873


>gi|284005983|gb|ADB57053.1| MIP15468p [Drosophila melanogaster]
          Length = 939

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 401/1103 (36%), Positives = 573/1103 (51%), Gaps = 205/1103 (18%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1    MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55   SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E   LT L   R   +G  +    R    + R  + T   +L A +
Sbjct: 115  LELYDRGNVILTDYE---LTTLYILRPHTEGENLRFAMREKYPVERAKQPTKELELEALV 171

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                               K  + ++N +             L+
Sbjct: 172  -----------------------------------KLLENARNGD------------YLR 184

Query: 242  TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
             +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 185  QILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272  EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
            +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245  QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPTENGTVEFF 297

Query: 332  YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298  FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390  MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358  NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448  RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
            +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416  QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507  YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
            Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 476  YDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 535

Query: 567  ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
            ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 536  ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAGS 594

Query: 627  FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  L
Sbjct: 595  MAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSL 654

Query: 687  LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESLS 743
            LF+L++S +  HL ER+VR     ++D +   + KEN    D+ S+ +D D        S
Sbjct: 655  LFKLEDSFIERHLGERKVR----SLEDDQIDPNVKENEVEHDLLSDNEDAD--------S 702

Query: 744  VPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLIDR 801
              N + P      +SN +   FP  +  I +       D  R +  +  V P++E+  + 
Sbjct: 703  NINLSEP------SSNTEITAFPNTEVKIEH-------DTGRIIVRSDSVNPEIEETKES 749

Query: 802  ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
             + L                      DK +++T        ++   R+K        V  
Sbjct: 750  EVVL----------------------DKILKKTDDEETTIILAGPSRKK-------QVSA 780

Query: 862  PKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMA 920
             K + +K R K +A+ Q    V        ++ RGQKGKLKKMK+KY DQD+EER IRM 
Sbjct: 781  KKTKEDKARAKQEAAKQEVPPVSSEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRMM 840

Query: 921  LLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSS 980
            +L S+GK                  +KP  S   A KV  K       S+  KE+     
Sbjct: 841  ILKSSGK------------------EKPQAS---ADKVVEK-------SESTKEYVKPEK 872

Query: 981  HGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPV 1040
                 NP V LD+  E+            +G    G ++ ++ LTG P   D LL+ IPV
Sbjct: 873  SAAPKNP-VELDDADEV-----------PVG----GDVDVLNSLTGQPHEGDELLFAIPV 916

Query: 1041 CGPYSAVQSYKYRVKIIPGTAKK 1063
              PY A+Q+YK++VK+ PGT K+
Sbjct: 917  VAPYQALQNYKFKVKLTPGTGKR 939


>gi|301617501|ref|XP_002938173.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Xenopus (Silurana) tropicalis]
          Length = 951

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 314/808 (38%), Positives = 449/808 (55%), Gaps = 111/808 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+G+R  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELSDSLLGLRVHNVYDVDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  ++QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSIKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  +  E   +  KL  
Sbjct: 113 IVELYDRGNIVLTDHEYLILNILRFRTDEADDVKFAVREHYPIDHAKAPEPLLSVEKLKE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L  +                            QKG +                      
Sbjct: 173 ILEKA----------------------------QKGDQ---------------------- 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E+++   +
Sbjct: 183 LKRVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEEYMD--V 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           +     +G+I+ Q +       P +        +EF P L  Q  +  +++ ++F+ A+D
Sbjct: 239 TQHFKGKGFII-QKREKKPSLEPDKPSEDIFTNEEFHPFLFAQHCNNTYIELDSFNKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+SK+E QR + +   +E  A  KL  +  D E R+ +L+   D      ELIE NL+ 
Sbjct: 298 EFFSKMEGQRIDLKALQQEKQALKKLENVRKDHEERLESLQHAQDADKAKGELIEMNLDI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W+++  +VKE +  G+ VA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWKEIGLIVKEAQIQGDSVALAIKELKLQTNHITMLLKNPYTLS 417

Query: 475 ----------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
                                    +    +K  PV  V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 418 EEGSEDEEEEKEEEPKGKKKKSKNKQPKKVQKNKPV-LVDVDLSLSAYANAKKYYDHKRH 476

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K +KTI A  KAFK+AEKKT+  + + +TV+ I   RKV+WFEKF WFISSENYLVI
Sbjct: 477 AAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENYLVI 536

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E PVPP TL +AG   VC+S
Sbjct: 537 AGRDQQQNELIVKRYLNPGDLYVHADLHGATSCVIKNPTGE-PVPPRTLTEAGTMAVCYS 595

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWV+ +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGFG LF++DE
Sbjct: 596 AAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKVDE 655

Query: 693 SSLGSHLNERRVRGEEEGMDDFE--------------DSGHHKENSDIESEKDDTDEK-- 736
             +  H  ERRV+  +E M+                 D+     NS  E EK DT E+  
Sbjct: 656 PCVWRHKGERRVKQLDEDMESVTSSNTELAAEENIPLDAAEEDSNSSEEDEKLDTQEEQR 715

Query: 737 -PVAESLSVPNSAHPAPSHTNASNVDSH 763
            P  +S+ +    +  P+  N+    S+
Sbjct: 716 GPCTDSMGLEQKEYMVPADQNSDQESSN 743


>gi|328781799|ref|XP_395865.4| PREDICTED: serologically defined colon cancer antigen 1 homolog
           isoform 1 [Apis mellifera]
          Length = 970

 Score =  526 bits (1356), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 296/720 (41%), Positives = 430/720 (59%), Gaps = 78/720 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   +  L++LIGMR + VYD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDITCTINELQKLIGMRVNQVYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A+++I
Sbjct: 53  SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP +           + H  +
Sbjct: 113 LELYDRGNIVLTDYEMTILNILRPHTEGDK-IRFAVKEKYPMD-----------RAHQNI 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
               E              N+    +++L   K G+S                     LK
Sbjct: 161 MPPIE--------------NI----QQHLQNAKIGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +      
Sbjct: 182 KILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSARQN 240

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
             + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + KF +FD A+D
Sbjct: 241 --ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKKFASFDVAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           E++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI  N 
Sbjct: 294 EYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELISRNQ 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352 SLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412 DSDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471 TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKTGDIYVHA 530

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DL GASS +IKN      VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531 DLTGASSVIIKNPGG-STVPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR---GEEEGMDDF 714
           GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H +ER+VR    E E M+ F
Sbjct: 590 GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKDERKVRIIDDENEHMESF 649



 Score = 95.5 bits (236), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 93/184 (50%), Gaps = 40/184 (21%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQKG+LKKMKEKY DQDEE+R + M +L SAG  +++    +N++ S          
Sbjct: 786  LKRGQKGRLKKMKEKYKDQDEEDRKLSMQVLQSAGNAKEDKKKNRNKDPS---------- 835

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
                PK   K K         K  P  +S  VE+                +EEED     
Sbjct: 836  ---GPKQQTKKKSI------MKSVPPQNSQIVEN----------------IEEEDTGPGP 870

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFY 1071
            E     ++ +D LTG P+  D LL+ IPV  PY+ V +YK++VK+ PGT K+GK  +   
Sbjct: 871  E-----IDMLDQLTGKPVTEDELLFAIPVVAPYNTVLNYKFKVKLTPGTGKRGKAAKTAM 925

Query: 1072 SLLL 1075
            ++ +
Sbjct: 926  AVFM 929


>gi|194742419|ref|XP_001953700.1| GF17891 [Drosophila ananassae]
 gi|190626737|gb|EDV42261.1| GF17891 [Drosophila ananassae]
          Length = 999

 Score =  526 bits (1355), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 327/803 (40%), Positives = 453/803 (56%), Gaps = 117/803 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F+L  +  V      EKV LL+E
Sbjct: 1   MKTRFNTYDIICGVAELQKLVGWRVNQIYDVDNKTYLFRLQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNVAPSGFSMKLRKHLKNKRLEKIQQLGADRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTDSE T L +LR H + +  +    R +YP E  +              
Sbjct: 115 LELYDRGNLILTDSELTTLYILRPHTEGEH-LRFAMREKYPVERAK-------------- 159

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                   S E L  +   +  + +KN +             L+
Sbjct: 160 -----------------------QSSEGLKAEALEQLLENAKNGD------------NLR 184

Query: 242 TVLGEALGYGPALSEHIILDTGL----VPNMKLSE----------------------VNK 275
            +L   L  GP++ EH++L+ GL    +   K SE                        K
Sbjct: 185 QILMPNLDCGPSVIEHVLLEQGLENRIIEKEKSSEDAQESEEKPEKGGKKQKKGRNQQTK 244

Query: 276 LED------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSST 329
           +E       N + +L  AV   ED L +  SG    +GYI+       K+  PTE+G   
Sbjct: 245 VEQKPFDVANDLPLLQQAVKSAEDLLTEGASGKT--KGYIVQV-----KEEKPTENGKVE 297

Query: 330 QIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             +   EF P    QF+  E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ 
Sbjct: 298 FFFRNIEFHPYQFVQFKDFECATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSN 357

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
           +  D   R+  L +  D   K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 VKNDHAKRLEELTKVQDEDRKKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 417

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+   E +DE+   P V  V+VDLALSA ANARR+
Sbjct: 418 QANGDAVASSIKQLKLETNHISLILSDPYGENEDEDLDTPEVTVVDVDLALSAWANARRY 477

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+LK+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 478 YDLKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 537

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 538 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPTGEE-IPPKTLLEAGS 596

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG  L
Sbjct: 597 MAISYSVAWDAKVVTNSYWVTSEQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLIMGLSL 656

Query: 687 LFRLDESSLGSHLNERRVRG-EEEGMD-DFEDSGHHKENSDIESE-KDDTDEKPVAESLS 743
           LF+L++S +  HL ER+VR  ++E  D DF++S      +D+ SE  DD++  PVA    
Sbjct: 657 LFKLEDSFIARHLGERKVRSIDDEPTDQDFKESDVA---NDLLSEPSDDSEATPVA---- 709

Query: 744 VPNSAHPAPSHTNASNVDSHEFP 766
             N + P      +SN D   FP
Sbjct: 710 --NMSEP------SSNTDITAFP 724



 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/166 (31%), Positives = 76/166 (45%), Gaps = 44/166 (26%)

Query: 909  DQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHL 968
            DQD+EER IRM +L S+GK          E A  + EK           V  K       
Sbjct: 836  DQDDEEREIRMMILKSSGK----------EKAQPNSEK-----------VVEK------- 867

Query: 969  SKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNP 1028
            S   KE P    +    NP            + +++ D    G    G ++ ++ LTG P
Sbjct: 868  SVALKEEPKQPKNAPPKNP------------IELDDADDAPAG----GDVDILNSLTGQP 911

Query: 1029 LPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
               D LL+ IPV  PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 912  AEGDELLFAIPVVAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 957


>gi|159155700|gb|AAI54741.1| Zgc:153813 protein [Danio rerio]
          Length = 883

 Score =  526 bits (1355), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 359/961 (37%), Positives = 515/961 (53%), Gaps = 130/961 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R  E   + +    
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENARAEEPIISLQRLTQ 172

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           + S                           G Q G +                      L
Sbjct: 173 VLS---------------------------GAQTGDQ----------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YG  L EH +   G+    K+     L   +++VL  A+   ED++Q   +
Sbjct: 184 KRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEDYMQK--T 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ A
Sbjct: 241 ANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNKA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E NL
Sbjct: 297 VDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELNL 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V  A+  VR ALAN++ W ++ +MV E + AG+PVA  I +L L+ N ++LLL N   
Sbjct: 357 PVVQRALQVVRSALANQVDWVEIGQMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEA 416

Query: 475 ----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KA
Sbjct: 417 CPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKA 476

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY
Sbjct: 477 FKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRY 536

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV 
Sbjct: 537 LRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQ 595

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG- 706
             QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++  
Sbjct: 596 HDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMKTL 655

Query: 707 ----------------EEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHP 750
                           EE      EDSG+ +E++D  +  DD +++       V  S   
Sbjct: 656 EEEEEEEDTTSTAEILEEGEELLAEDSGNEEEDTDSRTADDDEEQQ-------VCKSDED 708

Query: 751 APSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASI 810
                     D  E   ED+   +  DS+       ++ P         D  + L    I
Sbjct: 709 DEKDQRVCREDEDEDEDEDEDAVSAADSEEEHPGAQISFP---------DTCISLSHLQI 759

Query: 811 SSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKER 870
           + T H  +TT     +E + V     V  K +++  +RR +KK Q       K E  ++ 
Sbjct: 760 NRTAH-TDTTD---PQESQQVNTDTQV--KKHLTAKQRRDMKKKQ-------KQENTEDL 806

Query: 871 GKDASSQPESIVRK-TKIEGGK----ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASA 925
            +  + QPE+  R  T   GG     + RGQ+ KLKKMK+KY DQDEE+R + M +L SA
Sbjct: 807 EEGDAKQPETASRTPTSKSGGAAAAPLKRGQRNKLKKMKDKYKDQDEEDREMMMKILGSA 866

Query: 926 G 926
           G
Sbjct: 867 G 867


>gi|383852746|ref|XP_003701886.1| PREDICTED: nuclear export mediator factor NEMF homolog [Megachile
           rotundata]
          Length = 970

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 298/711 (41%), Positives = 425/711 (59%), Gaps = 81/711 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   +  L++LIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDIVCTITELQKLIGMRVNQIYDIDHRTYLIRLQRSE--------EKSVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A++VI
Sbjct: 53  SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQVGVDRIIDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP +  R  + T         
Sbjct: 113 LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKQKYPMD--RAHQNTMPP------ 163

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                        + E  N++ NA        K G+S                     LK
Sbjct: 164 -------------IEEIQNHLQNA--------KAGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLED--NAIQVLVLAVAKFEDWLQDV 298
            +L   L +G A+ +H++L  G     K+  + N +E   N I  L  A    E   ++V
Sbjct: 182 KILNPLLEFGSAVIDHVLLKHGFSLGCKIGKDFNIVEHMPNLISALQCADEMMETAKKNV 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                  +GYI+ +     K+  P   G+   IY   EF P L  Q++   F +F++FDA
Sbjct: 242 ------SKGYIIQK-----KEVKPVVDGTEEFIYTNIEFHPYLFEQYKDYPFKEFDSFDA 290

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           ++DE++S +E Q+ + +   +E  A  KL+ +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 291 SVDEYFSTMEGQKLDMKVLQQEREALKKLDNVKKDHDQRLITLEKTQELDK--QKAELIS 348

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L L+ N +SLLL +
Sbjct: 349 RNQMLVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLDTNHISLLLHD 408

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
             +E D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKK
Sbjct: 409 PYEESDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKK 467

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
           T+  + + + + +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+Y
Sbjct: 468 TKQTLKEVQAIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIY 527

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           VHADL GASS VIKN     PVPP TL +AG   V +S AWD+K+V  AWWV   QVSKT
Sbjct: 528 VHADLTGASSVVIKNPGG-GPVPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKT 586

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           APTGEYLT GSFMIRGKKN+L P  L+MG G LFRL+ESS+  H +ERR+R
Sbjct: 587 APTGEYLTTGSFMIRGKKNYLSPCQLVMGLGFLFRLEESSIERHKDERRIR 637



 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 95/184 (51%), Gaps = 40/184 (21%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQKG+LKKMKEKY DQDEE+R + M +L SAG  +        E+   ++ K P+  
Sbjct: 786  LKRGQKGRLKKMKEKYKDQDEEDRRLFMQVLQSAGAAK--------EDKKKNRNKDPS-- 835

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
                PK   K K  G  ++     P ++   + DN               +EEED     
Sbjct: 836  ---GPKQQTKKKGTGKPAQ-----PQNTQ--IVDN---------------IEEEDTGPGP 870

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFY 1071
            E     ++ +D LTG P+  D LL+ +PV  PY+ + +YK++VK+ PGT K+GK  +   
Sbjct: 871  E-----VDMLDQLTGKPVAEDELLFAVPVVAPYNTLLNYKFKVKLTPGTGKRGKAAKTAV 925

Query: 1072 SLLL 1075
            ++ +
Sbjct: 926  AVFM 929


>gi|345306303|ref|XP_001515044.2| PREDICTED: nuclear export mediator factor NEMF [Ornithorhynchus
           anatinus]
          Length = 1076

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 298/730 (40%), Positives = 428/730 (58%), Gaps = 98/730 (13%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARD 74
           +  L  L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   
Sbjct: 17  LASLNSLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLLESGIRIHTTEFEWP 68

Query: 75  KKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTD 134
           K   PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++I+ELY +GNI+LTD
Sbjct: 69  KNMMPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTD 128

Query: 135 SEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDK 194
            E+ +L +LR   D+   V    R RYP ++ +                     A EP  
Sbjct: 129 YEYLILNILRFRTDEADDVKFAVRERYPVDLAK---------------------APEP-- 165

Query: 195 VNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPAL 254
                                   F L + +   SN  A   +P LK VL   L YG  L
Sbjct: 166 -----------------------LFTLERLTEIISN--APKGEP-LKRVLNPHLPYGATL 199

Query: 255 SEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK 314
            EH ++++G   N+K+    +++D  I+ +++ + K E++++  I+ +   +GYI+ Q +
Sbjct: 200 IEHCLIESGFPGNVKVDPQFEIKD--IEKVLVCLQKAEEYMK--ITTNFSGKGYII-QKR 254

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQH 374
                  P +       Y+EF P L +Q     +V+FE+FD A+DEFYSK+E Q+ + + 
Sbjct: 255 EKKPSLEPDKPAEDILTYEEFHPFLFSQHSKYPYVEFESFDKAVDEFYSKLEGQKIDLKA 314

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNLEDVDAAILAVRVALA 432
             +E  A  KL  +  D E+R+  L   QE+D+ VK  ELIE NL+ VD AI  VR ALA
Sbjct: 315 LQQEKQALKKLENVRKDHEHRLEALHQAQEIDK-VK-GELIEMNLQIVDRAIQVVRSALA 372

Query: 433 NRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------------------ 474
           N++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N                  
Sbjct: 373 NQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLKNPYVMSEEEDDDGEDIEKE 432

Query: 475 ------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                          ++   +K  P+  V+VDL+LSA+ANA+++Y+ K+    K +KT+ 
Sbjct: 433 ETEEPKGKKKKQKDKQLKKPQKNKPL-VVDVDLSLSAYANAKKYYDHKRHAARKTQKTVE 491

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYL+I GRD QQNEM
Sbjct: 492 AAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEM 551

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           IVKRY++ GD+YVHADLHGA+S VIKN   E  +PP TL +AG   +C+S AWD++++TS
Sbjct: 552 IVKRYLNSGDIYVHADLHGATSCVIKNPTGEA-IPPRTLTEAGTMALCYSAAWDARVITS 610

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           AWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+++E+ +  H  ER
Sbjct: 611 AWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVEETCVWRHRGER 670

Query: 703 RVRGEEEGMD 712
           +V+ ++E M+
Sbjct: 671 KVKVQDEDME 680


>gi|391330989|ref|XP_003739933.1| PREDICTED: nuclear export mediator factor NEMF homolog [Metaseiulus
           occidentalis]
          Length = 956

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 302/750 (40%), Positives = 432/750 (57%), Gaps = 79/750 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K +  +AD+ A V  L+ L+GMR   VYD+  KTY+FKL+         + EK +L+ E
Sbjct: 1   MKAKFTSADIVAMVGELKALVGMRVKQVYDVDSKTYLFKLVR--------QEEKAVLIFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+R+HTT Y   K   PSGF+ KLRKH++ +RL  + QLG DRI+  QFG+   A++VI
Sbjct: 53  SGIRIHTTEYDWPKGMAPSGFSSKLRKHLKNKRLATISQLGVDRIVDLQFGINEAANHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD+ F +L +LR  +   + V    R +YP              +  A+
Sbjct: 113 VELYDRGNVVLTDNNFIILNILRPRQAGSEDVRFAVREKYP--------------IAGAI 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
               EP         +D      A+KE                              T+K
Sbjct: 159 QEVPEPS-------QQDVIEWLTAAKET----------------------------DTVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            ++   + +GPA+ EH++L   +  N KL +   L  +  + +  ++ +   +L+ +   
Sbjct: 184 KIIVPKVFFGPAVLEHVLLSREISANTKLRKA-VLTPDFFKSIHSSIVEGNAFLEKLKQP 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGS-STQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           D+   G I ++ +   K     E GS     Y+EF P L  Q        F TF  A+D 
Sbjct: 243 DL-STGIISLKVEPRVK---AAEDGSMEIASYNEFHPFLFKQLEGSRVEHFATFGQAVDA 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+S  E Q+ + +    E  A  KL  + +D E R++ L+      ++ A LIE NLE V
Sbjct: 299 FFSMQEQQKIDLRAHNLEKEAVKKLENVKLDHEKRLNALEGTQRTDLEKAMLIENNLELV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + A+ AVR  +A++ SW+++  M+KE +  G+PVA  I  L+L+RN   +LLSN+     
Sbjct: 359 EKALYAVRSFVASQYSWDEIGHMIKEAQHMGDPVACTIKALHLDRNQFGMLLSNSF---- 414

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
             E  L    V++D+ LSA+ANARR++++KK    KQ+KTI + +KA K+A+KKT+  + 
Sbjct: 415 --ENDLSPSVVDIDIDLSAYANARRYFDMKKHAARKQQKTIESSAKALKSAQKKTKEILK 472

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           Q +   NI+  RK +WFEKF WFISSENYLVI GRDAQQNE+IVK+YM+KGD+YVHADLH
Sbjct: 473 QVELTTNIARTRKSYWFEKFFWFISSENYLVIGGRDAQQNEVIVKKYMTKGDIYVHADLH 532

Query: 601 GASSTVIKN----HR----PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           GASS VIKN    HR        +PP TLN+AG   +C+S AW++K+VTSAWWV+ HQV+
Sbjct: 533 GASSVVIKNPSVTHRFLSVSGGEIPPKTLNEAGTMAICYSAAWEAKVVTSAWWVHHHQVT 592

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KTAP+GEYLT GSFMIRGKKN+LPP  LIMGFG +FRLDE S+ +H N+R+V   +E   
Sbjct: 593 KTAPSGEYLTAGSFMIRGKKNYLPPLYLIMGFGFMFRLDEESVPAHQNDRKVWTADE-TT 651

Query: 713 DFEDSGHHKENSDIESEKD-DTDEKPVAES 741
             ED+    E  D ++E D  T E    ES
Sbjct: 652 AVEDNAIEPEGVDEQNEIDVSTSEDEAGES 681



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 116/219 (52%), Gaps = 36/219 (16%)

Query: 863  KVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            +V+R+K +G+   + P +  + ++ E  +  +  K K++K+K++YGDQD+EER +RM +L
Sbjct: 725  QVDRKKVKGQKKGAPPPA-AKASEGEQKQPKKLSKAKMRKIKQRYGDQDDEERELRMKIL 783

Query: 923  ASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHG 982
            ASAGK        Q++N  T +               Y C+  G   + C +  +DS   
Sbjct: 784  ASAGK--------QSQNTETEE--------------GYDCRSGGQ-KEACDD--EDSEQK 818

Query: 983  VEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVD-------YLTGNPLPSDILL 1035
              D     L E+ + +    E++D  E  ++    L   D        LTG PLP D+LL
Sbjct: 819  TTDRT---LPESTKTEARTEEQQDGVEDEDDADEDLPSTDDLTAILNSLTGTPLPEDVLL 875

Query: 1036 YVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            Y +PVC PYS + +YK++VK+ PGTAK+GK  +I  ++ 
Sbjct: 876  YGVPVCAPYSIMTNYKFKVKVTPGTAKRGKAAKIALNMF 914


>gi|195107152|ref|XP_001998180.1| GI23827 [Drosophila mojavensis]
 gi|193914774|gb|EDW13641.1| GI23827 [Drosophila mojavensis]
          Length = 962

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 382/1114 (34%), Positives = 560/1114 (50%), Gaps = 235/1114 (21%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R ++ D+   +  L+RLIG+R + +YD+  KTY+F+L         G SEK +    
Sbjct: 1    MKTRFSSYDIICGIAELQRLIGLRVNQIYDIDNKTYLFRLHGG------GSSEKNM---- 50

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                             PSGF +K RKH++ +RLE + QLG DRI+ FQFG G  A++V 
Sbjct: 51   ----------------APSGFCMKFRKHLKNKRLEHINQLGADRIVDFQFGSGEAAYHVF 94

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T+L +LR H + +  +    R +YP +  ++             
Sbjct: 95   LELYDRGNVILTDYEKTILYILRPHTEGE-SIRFAVREKYPVDRAKI------------- 140

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                             GN     S+            ++ +NSN+           +LK
Sbjct: 141  -----------------GNCELRESEMR----------EIIENSNEGD---------SLK 164

Query: 242  TVLGEALGYGPALSEHIILDTGL------------------------VPNMKLSEVN--- 274
             +L   L  GPA+ EH++++ GL                          N K SE+N   
Sbjct: 165  RILMPILDCGPAVIEHVLIEHGLENHLIRGSVDQEKGQVESSKKQSTKKNRKSSEINPSD 224

Query: 275  -KLEDNAIQV--LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
             +  D A  +  L+LA+    D +   I  +   +G+I+       K+   T + ++   
Sbjct: 225  IQFFDLAADLPQLMLAIKSAYDIM--AIGRNGSSKGFIIQV-----KEEKLTNAENTEHF 277

Query: 332  YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
            Y   EF P L +Q++   F ++ETF  A+DEF+S  ESQ+ + +   +E  A  KL+ + 
Sbjct: 278  YRNIEFHPYLFSQYKKLPFKEYETFMEAVDEFFSSQESQKIDIKTLQQEREALKKLSNVK 337

Query: 390  MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             D   R+  L +  D   K AELI  N   VD AILA++ A+A+++SW D+  +VKE + 
Sbjct: 338  KDHTKRLEELNRVQDDDKKKAELITSNQCLVDKAILAIQSAIASQLSWPDIQELVKEAQA 397

Query: 450  AGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
             G+ VA  I +L LE N +SLLLS+   N +E D+ +  +    V++DLALSA ANARR+
Sbjct: 398  NGDIVASSIKQLKLEINHISLLLSDPYKNENENDNADSVI----VDIDLALSAWANARRY 453

Query: 507  YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
            Y+LK+    K++KTI A  KA K+AE+KT+  + + +T++NI+  RK+ WFEKF WFISS
Sbjct: 454  YDLKRSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKIFWFEKFFWFISS 513

Query: 567  ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
            ENYLVI GRDAQQNE+IVKRYM   D+YVHAD+ GASS +I+N   E+ +PP TL +AG 
Sbjct: 514  ENYLVIGGRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNTTGEE-IPPKTLLEAGT 572

Query: 627  FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              + +S AWD+K+VT+++WVY HQVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG  L
Sbjct: 573  MAISYSVAWDAKVVTNSYWVYSHQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSL 632

Query: 687  LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL---- 742
            LF+L++S L  H  ER++R  E+ ++     G   E  +I S  D  +     ES+    
Sbjct: 633  LFKLEDSFLQRHAGERKIRTTEDIIN-----GDKIEQPEI-SSTDLNEINEACESINEYG 686

Query: 743  --SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLID 800
              S PN+       T    V +      +KT  + +D +  DI                 
Sbjct: 687  KNSFPNTEVKIEHDTGRITVKTDLLDETNKT--DAVDQQSLDI----------------- 727

Query: 801  RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
                                   +++ED  + + A  R K   +K  R   ++ + +++ 
Sbjct: 728  -----------------------INDEDTVIIQPAPSRKKNQSTKKRREDKERSEKANIE 764

Query: 861  DPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMA 920
               V             PE+    +K++     RGQKGKLKK+K KY DQD+EER IRM 
Sbjct: 765  MVYV-----------GSPETDKSSSKVK-----RGQKGKLKKIKLKYRDQDDEERKIRMM 808

Query: 921  LLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSS 980
            +L S+GK      D    N    +EK     P    K+         L+K+  E  D   
Sbjct: 809  ILNSSGK------DKPIANNERQEEK-----PTSLTKITTVEASENILTKNQVEIED--- 854

Query: 981  HGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPV 1040
              ++D+P      T + D                      +D LTG P   D LL+ IPV
Sbjct: 855  --IDDSPI-----TVDTDL---------------------LDSLTGVPFDDDELLFAIPV 886

Query: 1041 CGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
              PY A+Q YK++VK+ PGT K+GK  ++  S+ 
Sbjct: 887  VAPYQALQQYKFKVKLTPGTGKRGKASKLALSIF 920


>gi|345495372|ref|XP_001603770.2| PREDICTED: nuclear export mediator factor NEMF homolog [Nasonia
           vitripennis]
          Length = 972

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 289/711 (40%), Positives = 426/711 (59%), Gaps = 72/711 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+GMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1   MKNRFNTYDLVCSVTELQKLVGMRVNQIYDIDHRTYLIRFQRSE--------EKSILLIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR++  QFG    A++++
Sbjct: 53  SGNRIHTTEFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRVVDLQFGSNEAAYHIV 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTDSE T+L +LR H + DK + +  + RYP           A + H  +
Sbjct: 113 LELYDRGNIVLTDSEMTILNILRPHTEGDK-IRLAVKERYP-----------AFRAHTKV 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             ++E                              +  D+ KN+ +           +LK
Sbjct: 161 IPTRE------------------------------ELQDIIKNAKQGE---------SLK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            +L   L  G A+ +H++L+ G     K+  E +  +D  +  L  A+   E  L +   
Sbjct: 182 KILNPHLEVGAAVIDHVLLEVGFQLGCKIGKEFDVAKD--VDKLYSALENAEKMLNNAKK 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
              V +GYI+ +     K+  P + G    +Y   EF P L  Q +++ + ++ETFD A+
Sbjct: 240 D--VSKGYIIQK-----KEEKPIKDGEEEFMYANIEFHPFLFEQCKNQHYKEYETFDKAV 292

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DE++S +E Q+ + +   +E  A  KL+ +  D + R+ TL +  +   + AELI  N E
Sbjct: 293 DEYFSTMEGQKLDLKVLQQERDALKKLDNVKKDHDQRLVTLGKTQEADKQKAELITRNQE 352

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            VD AILA++ ALAN+MSW+D+  ++KE +  G+PVA  I  L LE N +++LLS+  ++
Sbjct: 353 LVDNAILAMQSALANQMSWQDIQTLLKEAQAKGDPVASAIKHLKLESNHITMLLSDPYED 412

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            DD+E  L    V++DLA SA +NA R+Y+ K+    KQ+KTI +  KA K+AE+KT+  
Sbjct: 413 SDDDEPELKPMTVDIDLAHSAFSNATRYYDQKRSAAKKQQKTIESQGKALKSAERKTKQT 472

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + + + + +I+  RKV+WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GDVYVHAD
Sbjct: 473 LKEVQAIHSINKARKVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRSGDVYVHAD 532

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           L GASS V+KN     PVPP +L +AG   V +S AW++K++  ++WV   QVSKTAPTG
Sbjct: 533 LTGASSVVVKNPNG-GPVPPKSLAEAGTMAVAYSIAWEAKVIAGSYWVNSDQVSKTAPTG 591

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           EYLT GSFMIRGKKN+LPP  LIMG G LFRL++SS+  H +ERRVR  EE
Sbjct: 592 EYLTTGSFMIRGKKNYLPPCQLIMGLGFLFRLEDSSIERHKDERRVRTLEE 642



 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 65/186 (34%), Positives = 96/186 (51%), Gaps = 42/186 (22%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKE--KKPA 949
            + RGQ+GKLKK+KEKY DQDEE+R + M +L SAG  +++    +N++ S  K+  KK  
Sbjct: 786  LKRGQRGKLKKIKEKYKDQDEEDRKLLMTVLQSAGAAKEDKRKSKNKDPSGPKQQGKKKG 845

Query: 950  ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
            + P                                 NP    ++ AE     ++EED   
Sbjct: 846  VPP-------------------------------RINPAQQQNQVAE----NLDEEDAGP 870

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
              E     ++ +D LTG PLP D LL+ +PV  PYS +QSYK++VK+ PGT K+GK  + 
Sbjct: 871  GPE-----VDMLDQLTGKPLPEDELLFSVPVVAPYSTLQSYKFKVKLTPGTGKRGKAAKT 925

Query: 1070 FYSLLL 1075
              ++ L
Sbjct: 926  AVAVFL 931


>gi|195573753|ref|XP_002104856.1| GD21177 [Drosophila simulans]
 gi|194200783|gb|EDX14359.1| GD21177 [Drosophila simulans]
          Length = 972

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 393/1115 (35%), Positives = 566/1115 (50%), Gaps = 227/1115 (20%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V              
Sbjct: 1    MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV-------------- 46

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                        +K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 47   ------------EKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 94

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 95   LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 136

Query: 182  TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +K+P    EP+ + +   N  N                                   L
Sbjct: 137  -RAKQPTKELEPEALVKLLENARNGD--------------------------------YL 163

Query: 241  KTVLGEALGYGPALSEHIILDTGL------------------------------VPNMKL 270
            + +L   L  GPA+ EH++L  GL                                N KL
Sbjct: 164  RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKL 223

Query: 271  SEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
             +      N + +L  AV   ++ + +  SG    +GYI+       K+  P E+G+   
Sbjct: 224  EQKPFDMVNDLPILQQAVKDAQELIAEGNSGK--GKGYIIQ-----VKEEKPAENGTVEF 276

Query: 331  IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
             +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 277  FFRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 336

Query: 389  HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
              D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 337  KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 394

Query: 447  ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARR 505
             +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLA+SA ANARR
Sbjct: 395  AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPEVTVVDVDLAMSAWANARR 454

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
            +Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFIS
Sbjct: 455  YYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFIS 514

Query: 566  SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
            SENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG
Sbjct: 515  SENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAG 573

Query: 626  CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
               + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  
Sbjct: 574  SMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLS 633

Query: 686  LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESL 742
            LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +DTD   +  +L
Sbjct: 634  LLFKLEDSFIERHLGERKVR----SLDDDQIDPNVKETEVEHDLLSDNEDTD---LNTNL 686

Query: 743  SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLID 800
            S P           +SN +   FP  +  I +       D  R    +  V P++E+  +
Sbjct: 687  SEP-----------SSNTEITAFPNTEVKIEH-------DTGRITVRSDSVNPEIEETKE 728

Query: 801  RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
              + L                      DK + + A V +   I  A  RK        V 
Sbjct: 729  SEVVL----------------------DK-ILKKADVEETTIILAAPSRK------KQVS 759

Query: 861  DPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRM 919
              K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER IRM
Sbjct: 760  AKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRM 819

Query: 920  ALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
             +L S+GK                  +KP  S            K    S+  KE+    
Sbjct: 820  MILKSSGK------------------EKPQAS----------ADKVVETSESTKEYVKPE 851

Query: 980  SHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIP 1039
                  NP V LD+  E+            +G    G ++ ++ LTG P   D LL+ IP
Sbjct: 852  KSAAPKNP-VELDDADEV-----------PVG----GDVDVLNSLTGQPHEGDELLFAIP 895

Query: 1040 VCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            V  PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 896  VVAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 930


>gi|195354790|ref|XP_002043879.1| GM17806 [Drosophila sechellia]
 gi|194129117|gb|EDW51160.1| GM17806 [Drosophila sechellia]
          Length = 972

 Score =  520 bits (1340), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 395/1115 (35%), Positives = 569/1115 (51%), Gaps = 227/1115 (20%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V              
Sbjct: 1    MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV-------------- 46

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                        +K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 47   ------------EKNMAPSGFSMKLRKHLKNKRLEQVQQMGSDRIVDFQFGTGDAAYHVI 94

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 95   LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 136

Query: 182  TSSKEP-DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +K+P +  EP+ + +   N  N                                   L
Sbjct: 137  -RAKQPTNELEPEALVKLLENARNGD--------------------------------YL 163

Query: 241  KTVLGEALGYGPALSEHIILDTGL------------------------------VPNMKL 270
            + +L   L  GPA+ EH++L  GL                                N KL
Sbjct: 164  RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKL 223

Query: 271  SEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
                    N + +L  AV   ++ + +  SG    +GYI+       K+  P E+G+   
Sbjct: 224  EHKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPAENGTVEF 276

Query: 331  IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
             +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 277  FFRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 336

Query: 389  HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
              D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 337  KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 394

Query: 447  ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARR 505
             +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR
Sbjct: 395  AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPEVTVVDVDLALSAWANARR 454

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
            +Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFIS
Sbjct: 455  YYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFIS 514

Query: 566  SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
            SENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG
Sbjct: 515  SENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAG 573

Query: 626  CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
               + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  
Sbjct: 574  SMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLS 633

Query: 686  LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESL 742
            LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D   +  +L
Sbjct: 634  LLFKLEDSFIERHLGERKVR----SLDDDQIDPNVKETEVEHDLLSDNEDAD---LNTNL 686

Query: 743  SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLID 800
            S P           +SN +   FP  +  I +       D  R    +  V P++E+  +
Sbjct: 687  SEP-----------SSNTEITAFPNTEVKIEH-------DTGRITVRSDSVNPEIEETKE 728

Query: 801  RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
              + L                      DK +++T  V +   I  A  RK        V 
Sbjct: 729  SEVVL----------------------DKILKKT-DVEETTIILAAPSRK------KQVS 759

Query: 861  DPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRM 919
              K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER IRM
Sbjct: 760  AKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRM 819

Query: 920  ALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
             +L S+GK                  +KP  S   A KV  K       ++  KE+    
Sbjct: 820  MILKSSGK------------------EKPQAS---ADKVVEK-------TESTKEYVKPE 851

Query: 980  SHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIP 1039
                  NP V LD+  E+            +G    G ++ ++ LTG P   D LL+ IP
Sbjct: 852  KSAAPKNP-VELDDADEV-----------PVG----GDVDVLNSLTGQPHEGDELLFAIP 895

Query: 1040 VCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            V  PY A+Q+YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 896  VVAPYQALQNYKFKVKLTPGTGKRGKAAKLALNIF 930


>gi|340374096|ref|XP_003385574.1| PREDICTED: nuclear export mediator factor Nemf-like [Amphimedon
           queenslandica]
          Length = 1137

 Score =  517 bits (1332), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 295/718 (41%), Positives = 419/718 (58%), Gaps = 81/718 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A ++ L RRL GMR +N+YD+  KTY+ KL  S         EK++LL+
Sbjct: 1   MKERFTTVDLLASIEYLNRRLTGMRVANIYDVDHKTYLLKLARSE--------EKIVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RLHTT +   K   PSGF +KLRKH+RT+RL  + QLG DR+I   FG G  AH++
Sbjct: 53  ESGCRLHTTEFEWPKHLQPSGFAMKLRKHLRTKRLISITQLGVDRVIDMVFGSGEYAHHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD  + +L+LLR+  D D  V    R  +  +  +  +   + +  A 
Sbjct: 113 IIELYDRGNIILTDHTYLILSLLRTRTDADADVRFAVREHFSMDTIKQEQILPSIEQVAG 172

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           +  S +P                       G Q                          L
Sbjct: 173 ILGSAKP-----------------------GDQ--------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           + +L     YG +L  H ++  GL  N KL   N    +  QVL  A+ +  +  Q   S
Sbjct: 184 RHILNPHFVYGTSLLTHCLIGIGLTENTKLPATNDSPIDPDQVLK-ALLEAHEIFQSFRS 242

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD---------EFCPLLLNQFRSREFVKF 351
             +  +GY++ +     KD  PT   +++             EF PLL  Q  S  + + 
Sbjct: 243 --MPSKGYLIQK-----KDVAPTVGVATSDTPTTSTEVTTNIEFHPLLYRQHLSSCYKEV 295

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           ETFD A+DEF+S   SQ+ + +    + +A  KL  I  D E R+  L++  D     AE
Sbjct: 296 ETFDRAVDEFFSSKSSQKQDVKVIQLQKSAVKKLENIKQDHEKRIEALRKSQDEDRYKAE 355

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE+N + V+ A L +R A+A+ M W D+  +V + +  G+PVA  I  L L  N ++L 
Sbjct: 356 LIEWNTDLVERACLVIRSAVASSMDWGDIELLVHDAQGRGDPVANSIQGLKLHSNLITLW 415

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           L    +E DD+       KV++DL LS +ANARR+Y++KK+   K++KT  + +KA K+A
Sbjct: 416 LKAPYEEDDDDSI-----KVDIDLGLSVYANARRYYDMKKQAAKKEQKTSESSNKALKSA 470

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E+KT+  + +   ++ I+  RKVHWFEKF WFISSEN++VI GRD QQNE++VK+Y+++ 
Sbjct: 471 ERKTKQTLKEAAVISRITKARKVHWFEKFYWFISSENFVVIGGRDQQQNELLVKKYLNEH 530

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           DVYVHADLHGA+S ++KNH    PVPP TLN+AG   VC+S AW++K+VTSAWWVY +QV
Sbjct: 531 DVYVHADLHGATSVIVKNHSG-GPVPPKTLNEAGVMAVCYSSAWEAKIVTSAWWVYANQV 589

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           SKTAP+GEYLT GSFMIRGKKNFLPP  L++GF ++F++DESSL +H+NERRVR  +E
Sbjct: 590 SKTAPSGEYLTTGSFMIRGKKNFLPPCHLVLGFSIMFKVDESSLANHINERRVRSADE 647



 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/71 (49%), Positives = 53/71 (74%), Gaps = 1/71 (1%)

Query: 996  EMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVK 1055
            E  +  + +E+I ++ EE +  + D+D LTG+PLP+D LLY IPVC PYSA+ +YK++VK
Sbjct: 1016 EEKRAILADENILQL-EEAQKEMFDLDSLTGSPLPNDELLYAIPVCAPYSAMHNYKFKVK 1074

Query: 1056 IIPGTAKKGKG 1066
            +IPGT ++GK 
Sbjct: 1075 LIPGTNRRGKA 1085



 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 56/96 (58%), Gaps = 18/96 (18%)

Query: 842 YISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES------IVRKTKIEGGKISRG 895
           +IS  ER+ LKK Q SS           +G +ASS P S           + +  +  RG
Sbjct: 754 HISAKERKLLKK-QSSS-----------KGHEASSTPASSKPHPKPQPLPQPQSQQYKRG 801

Query: 896 QKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKN 931
           QK K KK+K+KYGDQDEEER +RM LLAS+G ++++
Sbjct: 802 QKSKQKKIKDKYGDQDEEEREMRMNLLASSGALKES 837


>gi|322784867|gb|EFZ11647.1| hypothetical protein SINV_03144 [Solenopsis invicta]
          Length = 985

 Score =  516 bits (1330), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 293/713 (41%), Positives = 429/713 (60%), Gaps = 77/713 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDNRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT++   K   PS F++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53  SGNRIHTTSFEWPKNVAPSSFSMKMRKHLKNKRLESLMQVGTDRIIKLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE+Y +GNI+LTD E  +L +LR H + DK +    + +YP +            +H  +
Sbjct: 113 LEVYDRGNIILTDHEMVILYVLRPHTEGDK-IRFAVKEKYPLDRAHSTTMPPIDVIHEHI 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE +                                                  +LK
Sbjct: 172 QKAKEGE--------------------------------------------------SLK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            VL   L +G A+ +H++L  G     K+  + N  ED  +  L+LA+    + +   ++
Sbjct: 182 KVLNPLLEFGSAVIDHVLLKAGFNFGCKIGKDFNIAED--MPKLILALEDANNMMD--LA 237

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
              V +GYIL +     K+   T+ G    I+   EF P L +Q+ ++ + +F++FDAA+
Sbjct: 238 KKTVSKGYILQK-----KESKLTQDGKEDFIFANIEFHPFLFDQYNNQPYKEFDSFDAAV 292

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYN 416
           DE+YS +E Q+ + +   +E  A  KL ++  D   R+ TL+  QE+D+  + AELI  N
Sbjct: 293 DEYYSTMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDK--QKAELISRN 350

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
              VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++LLL +  
Sbjct: 351 QALVDNAILAIQSALANQMSWPDIQVLLKEAQARGDPVASAIKQLKLETNHIALLLHDPY 410

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           ++ D+E +  P+  +++DLA +A +NA+++Y  KK    KQ+KTI +H KA K+AEKKT+
Sbjct: 411 EDSDEESELKPM-IIDIDLAHTAFSNAKKYYSQKKSAAKKQQKTIESHGKALKSAEKKTK 469

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             + + +T+  I+ +RK++WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVH
Sbjct: 470 QTLKEVQTIHTINKLRKMYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYVH 529

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           ADL GASS VIKN     PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+AP
Sbjct: 530 ADLTGASSVVIKNPSG-NPVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKSAP 588

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           TGEYLT GSFMIRGKKN+L    LIMG GL+FRL++SS+  H NERRV+  +E
Sbjct: 589 TGEYLTTGSFMIRGKKNYLTQSQLIMGLGLMFRLEDSSIERHKNERRVKAVDE 641



 Score = 73.6 bits (179), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 97/185 (52%), Gaps = 40/185 (21%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQKGKLKKMKEKY DQDEE+R + M +L SAG  +        E+   ++ K P+  
Sbjct: 799  LKRGQKGKLKKMKEKYKDQDEEDRRLSMLVLQSAGAAK--------EDKRKNRAKDPS-- 848

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEH-PDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
                PK      + G    + K + P  S+H + DN               +++ED   I
Sbjct: 849  ---GPK------QQGKKKTNPKPNIPLQSTHTIMDN---------------IDDEDTGPI 884

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIF 1070
             E     ++ +D LTG PL  D LL+ +PV  PY+ +Q+YK++VK+ PG  K+GK  +  
Sbjct: 885  PE-----VDMLDQLTGKPLSEDELLFAVPVVAPYNTLQNYKFKVKLTPGIGKRGKAAKTA 939

Query: 1071 YSLLL 1075
             ++ L
Sbjct: 940  VAVFL 944


>gi|115529351|ref|NP_001070202.1| uncharacterized protein LOC767767 [Danio rerio]
 gi|115313121|gb|AAI24465.1| Zgc:153813 [Danio rerio]
          Length = 694

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 295/718 (41%), Positives = 416/718 (57%), Gaps = 79/718 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R             
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    +    V +      G Q G +                      L
Sbjct: 160 --------AEEPIISLQRLTQVLS------GAQTGDQ----------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YG  L EH +   G+    K+     L   +++VL  A+   E+++Q   +
Sbjct: 184 KRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEEYMQK--T 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ A
Sbjct: 241 ANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNKA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E NL
Sbjct: 297 VDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELNL 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V  A+  VR ALAN++ W ++ RMV E + AG+PVA  I +L L+ N ++LLL N   
Sbjct: 357 PVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEA 416

Query: 475 ----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KA
Sbjct: 417 CPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKA 476

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY
Sbjct: 477 FKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRY 536

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV 
Sbjct: 537 LRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQ 595

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
             QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++
Sbjct: 596 HDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMK 653


>gi|291190355|ref|NP_001167106.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
 gi|223648156|gb|ACN10836.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
          Length = 1069

 Score =  514 bits (1324), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 304/751 (40%), Positives = 431/751 (57%), Gaps = 104/751 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFNTVDIRAVIAEINANYLGMRVNNVYDIDTKTYLIRLQKPDT--------KSILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+T +   K   PSGF +K RKH+++RRL  V+QLG DRI+  QFG    A+++
Sbjct: 53  ESGLRIHSTDFEWPKNMMPSGFAMKCRKHLKSRRLTQVKQLGVDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           I+ELY +GNI+L D E+T+L LLR    + ++ V I  R RYP E               
Sbjct: 113 IVELYDRGNIILADHEYTILNLLRFRTAEGEEDVKIAVRERYPVE--------------- 157

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
                   +A  P+ +          S E L       +  LSK +N             
Sbjct: 158 --------NARPPEPL---------ISLERL-------TEVLSKATNGEQ---------- 183

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +K VL   L YG  L EH +++ GL   +K+        +A ++L  A+   ED+++   
Sbjct: 184 VKRVLNPHLPYGATLIEHCLMEVGLPGFIKVDSQYDAARDAPKILD-ALQMAEDYMEKTA 242

Query: 300 SGDIVPEGYILMQNKHLGKDHPPT---ESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           S D   +GYI+ +      D  P+   E       Y+EF P L  Q  +  +V+F+TFD 
Sbjct: 243 SFD--GKGYIIQKC-----DKKPSLAPEKPEELLTYEEFHPFLFAQHANSHYVEFDTFDK 295

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ--EVDRSVKMAELIE 414
           A+DE+YSK+ESQR + +   +E  A  KL+ +  D   R+  L Q  EVDR     EL+E
Sbjct: 296 AVDEYYSKMESQRIDVKALQQEKQALKKLDNVKRDHVQRLEALHQLQEVDRL--RGELVE 353

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            NL  V+ A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L+ N +++LL N
Sbjct: 354 MNLPIVERALQVVRSALANQVDWAEIGLIVKEAQAAGDPVACAIKELKLQTNHITMLLKN 413

Query: 475 ----------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
                                       +  +    +K  P+  V+VDL+LSA+ANA+++
Sbjct: 414 PYIVPDEVEEEDVAEVAEEKKGKKNKNKDKGQKGKPKKDQPM-LVDVDLSLSAYANAKKY 472

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+ K+    K++KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISS
Sbjct: 473 YDHKRTAAKKEQKTVEAAQKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISS 532

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYL+I+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN     P+PP TL +AG 
Sbjct: 533 ENYLIIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNASG-VPIPPRTLTEAGT 591

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             VC+S AWD+K++TSAWWV+ HQV+K+APTGEYLT GSFMIRGKKNF+PP  L+MGF  
Sbjct: 592 MAVCYSAAWDAKVITSAWWVHHHQVTKSAPTGEYLTTGSFMIRGKKNFMPPSYLMMGFSF 651

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
           LF++DE  +  H  ER+V+  +E M D   S
Sbjct: 652 LFKVDEQCVFRHRGERKVKTIDEDMADVTSS 682


>gi|198422494|ref|XP_002122733.1| PREDICTED: similar to serologically defined colon cancer antigen 1
           [Ciona intestinalis]
          Length = 1103

 Score =  514 bits (1323), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 293/719 (40%), Positives = 421/719 (58%), Gaps = 87/719 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  +  +++GMR  NVYD+  KTY+FKL        +    K +LL+
Sbjct: 1   MKSRFSTLDICAVLTEINEKVVGMRLVNVYDIDHKTYLFKL--------AKPDHKAMLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H + +   K   PS F++KLRKH+R RRL    QLG DRI+  QFG    +++V
Sbjct: 53  ESGIRIHLSEFDWPKNPMPSNFSMKLRKHLRGRRLVSASQLGIDRIVDLQFGSEDASYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD---------DDKGVAIMSRHRYPTEICRVFER 171
            +ELY +GNI L+D    +L LLR  +D         ++  V +     YP        R
Sbjct: 113 FVELYDRGNIALSDCNDVILNLLRFRKDLHKPDAEQQENSDVKVAVHEPYP--------R 164

Query: 172 TTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND 231
            TA ++   ++  K                     KE L   K G               
Sbjct: 165 NTARQVEPFISIEK--------------------LKEILQSAKNGS-------------- 190

Query: 232 GARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
                   +K +L   L YG A  EH I++ G  P++KL    + E +  + L  ++   
Sbjct: 191 -------LVKRILNPHLPYGAACIEHAIINAGFSPDVKLGGEFQFERDC-EKLHESLKSC 242

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           E+ +Q   S  +  +GYI+ + +        T+S    +   EF P + NQ + R   +F
Sbjct: 243 EEMMQTAKS--LQCKGYIVQKIE--------TKSDGELKTNVEFHPFVFNQHKHRNLQEF 292

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+F+ A+DEF+  +ESQ+ + +   +E AA  KL  +  D E+R+  L+ E +     A 
Sbjct: 293 ESFNKAVDEFFGSLESQKNDMKSLQRERAAMRKLENVRKDHESRLSGLRSEQESDEMKAA 352

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL  VD +IL VR A+AN++ W+++  +VKE +  G+PVA  I  L LE N M + 
Sbjct: 353 LIETNLHLVDQSILVVRSAIANQVDWDEIKLLVKEAQGRGDPVASCIKTLKLETNSMVMA 412

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           L ++ D  DD++ T    K+E+DL+LSA+ANAR++Y  K+    K++KTI A +KAFK+A
Sbjct: 413 LRSHDD--DDQKPT----KIEIDLSLSAYANARKYYGRKRNAAKKEQKTIDASTKAFKSA 466

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           EKKT+  + +   V NI   RKV+WFEKF WFISSENYLVI GR+AQQNE++VK+Y+++G
Sbjct: 467 EKKTKQTLKEAAAVRNILKARKVYWFEKFLWFISSENYLVIGGREAQQNEVLVKKYLNQG 526

Query: 592 DVYVHADLHGASSTVIKNHRPE-QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           D+YVHADLHGA+S +IKN  P  QP+PP TLN+AG    CHS AWD+K+VTSAWWV+  Q
Sbjct: 527 DIYVHADLHGATSCIIKN--PSGQPIPPKTLNEAGTMATCHSAAWDAKVVTSAWWVHHDQ 584

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           VSKTAP+GEYLT GSF+IRGKKN+LPP  L+ GFG LF++DE+ +  H  ERRVR  ++
Sbjct: 585 VSKTAPSGEYLTTGSFLIRGKKNYLPPSYLVYGFGFLFKVDETCVWKHKGERRVRTNDD 643



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 91/169 (53%), Gaps = 35/169 (20%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RGQK KLKK++EKY DQDEEER ++M LL SA   +      +N+     K+K P   P 
Sbjct: 903  RGQKKKLKKIREKYKDQDEEERQLKMELLQSAKSPKPKKE--KNKVEVKPKKKAPTPQP- 959

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
            +AP          H ++D K   DD +   ED+P  G DE            + H++ + 
Sbjct: 960  EAPL---------HTNQDIK---DDITK--EDDP--GSDE------------ERHQVLKA 991

Query: 1014 EKGRLNDVD----YLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
            E   ++ VD     LTG P   DI+++ IPVC PY+A+ +YK++VK+ P
Sbjct: 992  EHLTMDPVDDIIDTLTGCPAADDIIMFAIPVCAPYNAMLNYKFKVKLTP 1040


>gi|66804841|ref|XP_636153.1| DUF814 family protein [Dictyostelium discoideum AX4]
 gi|60464500|gb|EAL62645.1| DUF814 family protein [Dictyostelium discoideum AX4]
          Length = 1268

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 298/785 (37%), Positives = 448/785 (57%), Gaps = 156/785 (19%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+   V  L++ LIG+R +N+YDLSP+ ++ K         S    K  L++
Sbjct: 1   MKTRFSSIDIRTTVVNLQKSLIGLRLANLYDLSPRVFLLKF--------SKPDCKKNLII 52

Query: 61  ESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           ESG+R+H+T + RDK + TP+ F+L LRK+++T+RLE V+QLG DR++ F FG G+   +
Sbjct: 53  ESGIRIHSTNFVRDKGDHTPAPFSLNLRKYLKTKRLESVKQLGVDRVVDFTFGSGVAVQH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +I+ELY+ GNI+LTD E+ +L +LR+H+ + D+ VA+     YP +  +V    T S + 
Sbjct: 113 LIVELYSIGNIILTDGEYRILAILRTHQYNQDESVAVGDV--YPIDKVKVPTEFTESLI- 169

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                         D++ E                              N+ D    K+ 
Sbjct: 170 --------------DQIIE------------------------------NTVD----KKE 181

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN------KLEDNAIQVLVLAVAKFE 292
           TLK V  ++L +GP L EH +L  GL P+ KL + +       L D+ IQ          
Sbjct: 182 TLKQVFNKSLDFGPELIEHCLLSAGLQPSTKLEQYDHSKFSKSLRDSFIQG--------- 232

Query: 293 DWLQDVISGDIVPEGYILMQNK-----------------------HLGKDHPPT------ 323
              Q +    I  +GYI++++                         +  D          
Sbjct: 233 ---QKIFDNSIQSKGYIVLKDPKQLKPQQQQKQQKQQQQQQSNTLKISNDLSSNNNNNNN 289

Query: 324 -----ESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
                E      IY+EF P L  Q+  ++F++FE+FDAA+D+F+S+IESQ+ EQQ  A+E
Sbjct: 290 NNNNLEEKKEMVIYEEFVPYLYKQYELKKFIEFESFDAAVDQFFSEIESQKVEQQRIAQE 349

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
                KL+K+  DQ+ R+ +L      +++ A+LIE NL++VD  IL +R  +A+ M WE
Sbjct: 350 QVVLKKLDKVKEDQQRRIDSLFANEVENIRKAQLIEANLQEVDQCILIIRSGVASSMDWE 409

Query: 439 DLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNL-------------------- 476
            L +++KEE+K  NP  VA  I +L LE N ++L L++N                     
Sbjct: 410 TLNQLLKEEKKK-NPYSVATKIHRLKLESNQITLSLTDNFLYDDNDGDDDDDDEESDEES 468

Query: 477 -------------DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
                         +  +++  L    ++VD++LSA ANAR++Y+ KK+   K +KTI+ 
Sbjct: 469 DEEDQNTKKSIKKSKTSNQKPNL----IDVDISLSAFANARKYYDTKKQSHEKAQKTISQ 524

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
              A KAAEKKTR Q+ + K+  ++  MRK+ WFEKF+WFISS+NY+V+SGRDAQQNE++
Sbjct: 525 AEFALKAAEKKTRQQLSETKSKNSMIAMRKIFWFEKFHWFISSDNYIVVSGRDAQQNELL 584

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
            K+Y+ K D+YVHAD+ G++S VIKN    + +PP TL QAG  T+C+S AW +K+VTSA
Sbjct: 585 YKKYLEKDDIYVHADIFGSTSCVIKNPNGGE-IPPNTLIQAGTMTMCYSNAWSAKVVTSA 643

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
           +WVY HQVSKTAP+GE+LT GSFMIRGKKN+LP   L+MGFG +F++D+S LG+HLNER+
Sbjct: 644 YWVYSHQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDDSCLGNHLNERK 703

Query: 704 -VRGE 707
            + GE
Sbjct: 704 PIYGE 708



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 50/92 (54%), Gaps = 9/92 (9%)

Query: 998  DKVAMEEEDIHEIGEEEKGR--------LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQS 1049
            DK+  EE++I  + EEE           + ++D LTG P   DIL + IPV  PYS   +
Sbjct: 1063 DKIK-EEQEIKRLLEEENSSKAVDDQKDITNIDTLTGQPRDDDILHFAIPVVAPYSVFNN 1121

Query: 1050 YKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            YK++VK+ PG  K+GK  +    ++L    LT
Sbjct: 1122 YKFKVKLTPGHLKRGKAAKQAAQVILTNPQLT 1153


>gi|449681046|ref|XP_002157080.2| PREDICTED: nuclear export mediator factor NEMF-like, partial [Hydra
            magnipapillata]
          Length = 1467

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 288/710 (40%), Positives = 422/710 (59%), Gaps = 79/710 (11%)

Query: 18   LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
            L   IG+R +NVYD+  KT++ +L +       GE  K  +L+ESG R+H T Y   K  
Sbjct: 477  LNSSIGLRVANVYDIDNKTFLVRLTH-------GEI-KSTILVESGNRIHLTEYDWPKSM 528

Query: 78   TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
             PSGF++K RKH++ RRL  + QLG DRI+   FG    A+++I+ELY +GNI+L D E+
Sbjct: 529  MPSGFSMKCRKHLKGRRLASINQLGVDRIVDMTFGYDEAAYHLIVELYDRGNIVLADFEY 588

Query: 138  TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHAALTSSKEPDANEPDKVN 196
             +L LLR   D++  V    R +YP E+ R  E   + +KL   +               
Sbjct: 589  NILQLLRVRTDENADVKFAVREKYPVELARKEEPLLSINKLEEII--------------- 633

Query: 197  EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
                             K GKS D                  +LK VL   L +GP+L E
Sbjct: 634  -----------------KSGKSTD------------------SLKQVLNPLLIFGPSLLE 658

Query: 257  HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
            H +L+ G  P+ KLS++N  +   I  L  ++   ++ L+++ S +   EGY++ +    
Sbjct: 659  HCLLEGGFSPSTKLSQINTSDKQEISKLYSSLQIGDNILKNISSKE--GEGYLIQK---- 712

Query: 317  GKDHPPTESGSS-TQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHK 375
             K+      G     IY EF P L +Q +S  F+ F +F+  +DEF+SK+ESQ+ + +  
Sbjct: 713  -KESNANAVGEKDLLIYTEFHPFLYHQHKSLPFIHFHSFNKCVDEFFSKLESQKIDLKAL 771

Query: 376  AKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM 435
             +E AA  +L  +  D E R+H+LK+  D+  + A+LIE NL  ++ AI+ V  A+AN++
Sbjct: 772  QQEKAALKRLENVREDHEKRIHSLKETQDKEARRAKLIELNLPLIERAIIIVNSAIANQL 831

Query: 436  SWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDL 495
             WE++  ++KE +  G+PVA +I  L L+ N +++ + N+ +E + +E  L    + +DL
Sbjct: 832  DWEEIEDLLKEAKLKGDPVANIIKSLQLKTNQITISV-NDEEETESDEDDLDEVDIIIDL 890

Query: 496  ALSAHANARRWYEL----KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
             L+A  NARR+Y +    ++   +K+EKTI A  KA K+AE KT+  + + +    I+  
Sbjct: 891  GLTAFGNARRYYYILHDKRRNAATKEEKTIQASKKALKSAEYKTKETLKEVQNAKIINKT 950

Query: 552  RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
            RK  WFEKF WFISSENYLVI GRD QQNE++VKRY+  GD+YVHADLHGASS +IKN  
Sbjct: 951  RKTFWFEKFYWFISSENYLVIGGRDQQQNEILVKRYLKAGDLYVHADLHGASSVIIKNST 1010

Query: 612  PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
                VPP TLN+AG   +C+S AW+++++TSAWWVY +QVSKTAP+GEYLT GSFMIRGK
Sbjct: 1011 G-LDVPPKTLNEAGTMAICYSAAWEARVITSAWWVYHNQVSKTAPSGEYLTTGSFMIRGK 1069

Query: 672  KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG------MDDFE 715
            KNFLPP  LIMGF +LF+LDES +  H+N+RRV+  ++       ++DFE
Sbjct: 1070 KNFLPPSYLIMGFSVLFKLDESCISRHVNDRRVKSNDDQENKSIEVEDFE 1119



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 65/229 (28%), Positives = 96/229 (41%), Gaps = 51/229 (22%)

Query: 840  KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVR-------KTKIEGGKI 892
            KP IS  +RR LKK         KV++      D   +  + ++       K K+   K 
Sbjct: 1260 KPRISAKQRRDLKK---------KVKQNDNEDNDTVPESSNNIKEKLESTTKNKVPANKS 1310

Query: 893  ----------SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENAST 942
                       RG K KLKK+ EKY DQDEE+R +   ++ S                  
Sbjct: 1311 VAEVKTCDPPKRGAKAKLKKINEKYKDQDEEDRQLFQEIIRS------------------ 1352

Query: 943  HKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAM 1002
                +PA  P    K   K K    + +  +E   ++   +      G D++   + +A 
Sbjct: 1353 ---NEPARPPKKTGKNKIKEKNTKQVQQQKREVKKNTVETIIIEQPEG-DQSLTNNIIA- 1407

Query: 1003 EEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYK 1051
               D     E EK  +  +D LTG PL  DILL+ IP+C PYS++Q+YK
Sbjct: 1408 --NDEEPDEEIEKENITIIDSLTGCPLEDDILLHAIPLCAPYSSLQNYK 1454


>gi|390331684|ref|XP_003723334.1| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 1116

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 287/713 (40%), Positives = 428/713 (60%), Gaps = 75/713 (10%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +  +L+G+R  NVYD++ KTY+ +L         G  +KV+LL 
Sbjct: 1   MKSRFTTIDLRAILYEIGSKLLGLRVLNVYDVNNKTYLIRL--------GGTDQKVVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+HTT++   K   PS F++KLRKH+++RRL +++QLG DR++  QFG    A++V
Sbjct: 53  ESGTRMHTTSFDWPKSQMPSNFSMKLRKHLKSRRLTEIKQLGVDRVVDLQFGSDEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GN+ LTD E+T+LTLLR+ R D + V    R RYP +                
Sbjct: 113 IVELYDRGNVALTDHEYTILTLLRT-RKDSEDVRFAVRERYPVDT--------------- 156

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A  PD +           +E L   K G +                     +
Sbjct: 157 --------ARHPDPIPS-----LERIQEILAAGKPGDN---------------------I 182

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           + +L     YGPAL EH +L+ G   N K +    ++ +  +V+  ++++ E +++   S
Sbjct: 183 RKLLNPHFIYGPALIEHCLLNQGFPSNAKGNNGFDIQQDMSRVMT-SLSEGEQYVEK--S 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GYI+ + +   K    ++ G   ++  EF  L  N   S+ +++F+TFD A DE
Sbjct: 240 GSEC-KGYIVQKRE---KKPAASQDGEDAELLTEFI-LYTN---SQPYLEFDTFDQAADE 291

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SK+ESQ+ + +   +E  A  KL+ +  D E R+ +L+Q  + + K   LIE NL  V
Sbjct: 292 FFSKMESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLV 351

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + A+  VR A+AN++ W+++  ++KE +  G+PVA  I  L L+ N   +LL +   + D
Sbjct: 352 EQALRVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIKSLRLDTNHFQMLLRDPYKQYD 411

Query: 481 D----EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           D    EE       V++D+A SA+ANAR+++  KK  + K++KT+ + SKA K+AEKKT 
Sbjct: 412 DADEGEEDVARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIKSAEKKTM 471

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             +    TVA+I+  RK +WFEK+ W ISSENY++I+GRD QQNE++VK+Y+S GD+YVH
Sbjct: 472 QALKDVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVKKYLSPGDIYVH 531

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           AD+HGASS +IKN +   PVPP TL +AG   VC+S AWD+K++TSAWWV   QVSKTAP
Sbjct: 532 ADIHGASSVIIKNPKG-GPVPPKTLQEAGTMAVCYSVAWDAKVITSAWWVRHDQVSKTAP 590

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           TGE+LT GSFM+RGKKNFLPP  L+MGFG L ++DES    H +ERR+RG +E
Sbjct: 591 TGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKIDESCAWRHKDERRIRGTDE 643



 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 53/122 (43%), Gaps = 26/122 (21%)

Query: 933  GDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLD 992
            GD Q ENA   +E   +I PV         KK     +  +E  DD     E N  +   
Sbjct: 1021 GDEQKENAEQSEES--SIKPVIKTHTWQAKKKKETTDEQNQEESDDEVDAAEANSKL--- 1075

Query: 993  ETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKY 1052
                     MEE             ++ +D LTG P P D+LL+ IPVC PY+ + SYK+
Sbjct: 1076 ---------MEES------------VSVLDTLTGCPDPEDLLLFAIPVCAPYNVMNSYKF 1114

Query: 1053 RV 1054
            +V
Sbjct: 1115 KV 1116


>gi|449504623|ref|XP_002200475.2| PREDICTED: nuclear export mediator factor Nemf [Taeniopygia
           guttata]
          Length = 1213

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 299/731 (40%), Positives = 416/731 (56%), Gaps = 98/731 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+H T +   K   PS
Sbjct: 158 LLGMRVNNVYDVDNKTYLIRLQKPEC--------KATLLLESGIRIHLTEFEWPKNMMPS 209

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F +K RKH+RTRRL  VRQLG DR++  QFG    A+++ILELY +GN++LTD E+ +L
Sbjct: 210 SFAMKCRKHLRTRRLVSVRQLGVDRVVDLQFGSEQAAYHLILELYDRGNVVLTDHEYLIL 269

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            +LR   D+   V    R RYP E         ++K    L +         D++ E   
Sbjct: 270 NILRFRTDEADDVRFAVRERYPVE---------SAKAAVPLPTL--------DRLTEI-- 310

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIIL 260
            +SNA K   G Q                          LK VL   L YG +L EH ++
Sbjct: 311 -ISNAPK---GEQ--------------------------LKRVLNPLLPYGSSLIEHCLI 340

Query: 261 DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW--LQDVISGDIVPEGYILMQNKHLGK 318
           + G    +K+ +  + ++N  +VL  A+ K E++  L D  SG    +GY++ Q +    
Sbjct: 341 EAGFSGAVKIDQHLEKKENLEKVLS-ALEKAEEYMALTDNFSG----KGYVI-QKREKKP 394

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
              P +       Y+EF P L +Q     +++F++F+ A DEFYSK+E Q+ + +   +E
Sbjct: 395 SLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKATDEFYSKLEGQKIDLKALQQE 454

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
             A  KL  +  D E+R+  L+Q  +      ELIE NL  VD AI  VR ALAN++ W 
Sbjct: 455 KQALKKLENVRRDHEHRLEALQQAQEADKLKGELIEMNLAVVDRAIQVVRSALANQIDWT 514

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------------------------ 474
           ++  +VKE +  G+PVA  I +L L+ N +++LL N                        
Sbjct: 515 EIGAIVKEAQAQGDPVATAIKELKLQTNHITMLLRNPYVLSEEEEEEDDADIEKEETEEP 574

Query: 475 -------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                     ++   +K  P   V+VDL LSA+ANA+++Y+ K+    K +KT+ A  KA
Sbjct: 575 KGKKKKNKTKQLKKPQKNKP-SLVDVDLNLSAYANAKKYYDHKRHAAKKTQKTVEAAEKA 633

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYLVI+GRD QQNE+IVKRY
Sbjct: 634 FKSAEKKTKQTLREVQTVTTIQKARKVYWFEKFLWFISSENYLVIAGRDQQQNELIVKRY 693

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD+++VTSAWWV 
Sbjct: 694 LKPGDIYVHADLHGATSCVIKNPSGE-PIPPRTLTEAGTMALCYSAAWDARVVTSAWWVS 752

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
             QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+V+ +
Sbjct: 753 HSQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHREERKVKVQ 812

Query: 708 EEGMDDFEDSG 718
           +E +D    S 
Sbjct: 813 DEDLDTVSSSA 823



 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 79/179 (44%), Positives = 99/179 (55%), Gaps = 18/179 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            I RGQK K+KKMKEKY DQDEE+R + M LL SAG   + D   + +   T +E      
Sbjct: 1004 IKRGQKSKMKKMKEKYRDQDEEDRELIMKLLGSAGS-NREDKGKKGKKGKTKEEA----- 1057

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEH-PDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
               A K   K K   H +   KE  P          P  GLDE  E DK   EE+D  + 
Sbjct: 1058 ---AKKQQQKTKPLRHAAGGGKETLPAGIVLHEAQEP--GLDELQE-DK---EEQDQEQP 1108

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
            G EE   L  +D LTG P P DILL+ +P+C PY+A+ +YKY+VK+ PGT KKGK  +I
Sbjct: 1109 GLEESEAL--LDSLTGQPHPEDILLFAVPICAPYTAMANYKYKVKLTPGTQKKGKAAKI 1165



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 8/65 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+H T +   K   PS
Sbjct: 92  LLGMRVNNVYDVDNKTYLIRLQKPEC--------KATLLLESGIRIHLTEFEWPKNMMPS 143

Query: 81  GFTLK 85
            F +K
Sbjct: 144 SFAMK 148


>gi|224101505|ref|XP_002312308.1| predicted protein [Populus trichocarpa]
 gi|222852128|gb|EEE89675.1| predicted protein [Populus trichocarpa]
          Length = 309

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 249/326 (76%), Positives = 269/326 (82%), Gaps = 22/326 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGVTESGESEKVLLLMESGVR
Sbjct: 1   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVTESGESEKVLLLMESGVR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           LHTTAY RDK NTPSGFTLKLRKHIR RRLEDVRQLGYDRI+LFQFGLG NAHYVILELY
Sbjct: 61  LHTTAYVRDKSNTPSGFTLKLRKHIRARRLEDVRQLGYDRIVLFQFGLGANAHYVILELY 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSK 185
           +QGNI+L DSEF VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER+TA KL  ALTS K
Sbjct: 121 SQGNIILADSEFMVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERSTAEKLQKALTSLK 180

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
           E +  +  K     ++V                       +KN+N+G R KQ TLKTVLG
Sbjct: 181 ELENKKQGKNKGGKSSV----------------------PSKNTNEGNRVKQATLKTVLG 218

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
           E LGYGPALSEHIILD GLVPN K S+ NKL+D  IQVLV AVAKFE+WLQD+ISGD VP
Sbjct: 219 EVLGYGPALSEHIILDAGLVPNTKFSKDNKLDDETIQVLVKAVAKFENWLQDIISGDKVP 278

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQI 331
           EGYILMQNK+LGKD PP++SGSS Q+
Sbjct: 279 EGYILMQNKNLGKDCPPSDSGSSVQV 304


>gi|291230458|ref|XP_002735180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 834

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 285/718 (39%), Positives = 417/718 (58%), Gaps = 69/718 (9%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +R RLIG+R  NVYDL  KTY+ +L        +    K  LL 
Sbjct: 8   MKARFTTFDILAIIPEIRARLIGLRVLNVYDLDNKTYLIRL--------AKPDVKDALLF 59

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+  T +   K   PSGF++KLRKH+R RRL  V QLG DRI+  QFG    A+++
Sbjct: 60  ESGQRIQCTDFDWPKNAMPSGFSMKLRKHLRGRRLVKVEQLGVDRIVDLQFGEEEAAYHL 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT-TASKLHA 179
           I+ELY +GN++LTD ++T+L LLR   D  + V    R  YP E  +  E   +  KLH 
Sbjct: 120 IVELYDRGNVVLTDHQYTILNLLRVRTDQSQDVKFAVREPYPLESAKQPEPVLSIEKLHD 179

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L ++K+ D                                                   
Sbjct: 180 ILVAAKDGD--------------------------------------------------Q 189

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L  GP++ EH +L  G     K+ +   +  +  +++  A+   E+ L+ ++
Sbjct: 190 LKRVLNPHLVCGPSVIEHCLLKQGFDDGCKVGQNVDISTDLPRIMA-ALQDMENVLKKIV 248

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
                 +GY++ Q K         +       Y E+ P+L  Q +   +++ E+F  A+D
Sbjct: 249 ESP--SKGYVI-QKKEKKTSKLSGDVPEELITYAEYHPMLFEQHQKSLYIELESFGKAVD 305

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+S++ +Q+ + +   +E +A  KL  +  D E R+  L+   +  +  A+LIE NL  
Sbjct: 306 EFFSQMGTQKLDIKALQQEKSAIKKLENVKKDHEKRIQQLQASQNVDMVKAQLIEINLPL 365

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL--D 477
           VD AI  V+ A+AN++ W ++  +VKE +  G+ VA  I  L L++N ++LLL +     
Sbjct: 366 VDRAIQVVQSAIANQIDWAEIWDIVKEAQTQGDEVAKSIKSLKLDKNHITLLLRDPFVSS 425

Query: 478 EMDDEEKTLPVE--KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
           ++DDE+K   +   K+++DL LSA+ANAR++YE KK    K++KT+ A  KA K+AE KT
Sbjct: 426 DVDDEDKHSGIGPLKIDIDLDLSAYANARKYYEAKKHSAVKEQKTLAASQKALKSAEIKT 485

Query: 536 RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  +    TV +I+  RK +WFEKF WFISSENYL+I GRD QQNE++V++Y++KGD+YV
Sbjct: 486 KQTLKDVATVTSINKARKTYWFEKFIWFISSENYLIIGGRDQQQNEIVVRKYLNKGDIYV 545

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
           HADLHGASS +IKN      +PP TLN+AG   +C+S AW +++VTSAWWVY +QVSKTA
Sbjct: 546 HADLHGASSVIIKNPTGAD-IPPKTLNEAGSMAICYSAAWQARVVTSAWWVYHNQVSKTA 604

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           PTGEYLT GSFM+RGKKN+LPP  L+MGFG LF++DE SL  H +ER+V+  EE ++D
Sbjct: 605 PTGEYLTTGSFMVRGKKNYLPPSYLVMGFGFLFKVDEDSLWRHKDERKVKSLEEELED 662


>gi|281200297|gb|EFA74518.1| DUF814 family protein [Polysphondylium pallidum PN500]
          Length = 1134

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 290/722 (40%), Positives = 426/722 (59%), Gaps = 107/722 (14%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R ++ D+   V  L+R +IG+R +NVYDLSP+ ++FKL        S    K  L+
Sbjct: 1   MPKTRFSSVDIRTTVSNLQRTVIGLRLANVYDLSPRVFLFKL--------SKPELKKQLI 52

Query: 60  MESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +ESG+R+H+T + RDK + TP+ F++ ++          V+QLG DRII F FG G+   
Sbjct: 53  IESGIRVHSTNFTRDKGDHTPAPFSITVK---------SVKQLGVDRIIDFTFGSGVATQ 103

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           ++I+EL++ GNI+LTD ++ V+ +LR+H+  ++  +A+     YP E             
Sbjct: 104 HLIIELFSIGNIILTDGDYKVIAILRTHQFTENDNIAVGDV--YPVE------------- 148

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
                 +K+P    P+ +NE                       L + S K  N       
Sbjct: 149 -----KAKKPTTFTPELINE-----------------------LLEKSEKKDN------- 173

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQ 296
             LK +  +AL +GP L EH +LD GL PN KL   ++  +   IQ  V          Q
Sbjct: 174 --LKQIFNKALDFGPELIEHCLLDAGLSPNQKLESYDRANNEKLIQAFVEG--------Q 223

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
            + +  +   GYI+++        PP     +T IY+EF P L  Q+ S+   ++++FD 
Sbjct: 224 KIFNVTMQSRGYIVLR--------PPKTPTDTTVIYEEFVPFLYKQYHSKPNQEYDSFDQ 275

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+D+F+S+IE+QR EQQ  A+E     KL+K+  DQ+ R+ +L      +V+ A+LIE N
Sbjct: 276 AVDQFFSEIEAQRVEQQRIAQEQTVLKKLDKVREDQQRRIDSLFAAEADNVRKAQLIEAN 335

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSN 474
           L++VD  I  ++  +   M W  L++++KEE+K  NP  VA +I KL LE N + L L++
Sbjct: 336 LQEVDQCITIIKSGVNASMDWTALSQLLKEEKKK-NPYSVANIIHKLKLESNQIQLALND 394

Query: 475 NLDEMDDEEKTLPVEK--------------VEVDLALSAHANARRWYELKKKQESKQEKT 520
           N D+  DE++    E+              V+V++AL+A+ANAR +Y+ KK    K  KT
Sbjct: 395 NYDDDYDEDEDDDEEEEKKQQKKDKKKPTLVDVNIALTAYANAREYYDSKKHANEKANKT 454

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           I     A KAAEKKTR Q+ + K  + +  MRKV WFEKF+WF+SS+NYLVISG+DAQQN
Sbjct: 455 IQQAEFAMKAAEKKTRQQLSEVKAKSAMIQMRKVFWFEKFHWFLSSDNYLVISGKDAQQN 514

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           EM+ K+Y+ K D+YVHAD+ G++S VIKNH     +PP TL QAG  T+C+S AW +K+V
Sbjct: 515 EMLFKKYLEKDDIYVHADIFGSTSCVIKNHGG-GAIPPNTLIQAGTMTMCYSNAWSAKVV 573

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
           TSA+WVY +QVSKTAP+GE+LT GSFMIRGKKN+LP   L+MGFG +F+LDES + +H+ 
Sbjct: 574 TSAYWVYANQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKLDESCIANHIG 633

Query: 701 ER 702
           ER
Sbjct: 634 ER 635


>gi|195388566|ref|XP_002052950.1| GJ23608 [Drosophila virilis]
 gi|194151036|gb|EDW66470.1| GJ23608 [Drosophila virilis]
          Length = 966

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 394/1121 (35%), Positives = 568/1121 (50%), Gaps = 245/1121 (21%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R N+ D+   V  L+RLIG+R + VYD+  KTY+F+L         G SEK ++   
Sbjct: 1    MKTRFNSYDITCGVAELQRLIGLRVNQVYDIDNKTYLFRLHGG------GASEKNVV--- 51

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                             PSGF++KLRKH++ +RLE + QL  DRI+ FQFG G  A++V+
Sbjct: 52   -----------------PSGFSMKLRKHLKNKRLERISQLATDRIVDFQFGTGEAAYHVL 94

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LELY +GNI+LTD E  +L +LR H + +  +    R +YP+   +V             
Sbjct: 95   LELYDRGNIILTDYEQIILYILRPHTEGE-CLRFAVREKYPSGRAQV------------- 140

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                             GN     S+E L            +   + SN G   K+  L 
Sbjct: 141  -----------------GN--IELSEEAL------------REIIEQSNVGEGLKR-ILL 168

Query: 242  TVLGEALGYGPALSEHIILDTGL--------------------VPNMKLSEVN----KLE 277
             VLG     GPA+ EH++++ G+                      N + S+++    KL 
Sbjct: 169  PVLG----CGPAVIEHVLIEHGIENCVVSAQQEQTETSKANRCKKNRRSSQISRADTKLF 224

Query: 278  DNA--IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-- 333
            D A  + +LV A+    D +     G+   +G+I+       K+  P+ + S+   Y   
Sbjct: 225  DFATDLPLLVKAIQSARDIMDLGQKGNC--KGFIIQI-----KEEKPSSTESTDHFYRNV 277

Query: 334  EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            EF P L +Q +   F ++ TF  A+DEF+S  ESQ+ + +   +E  A  KL+ +  D  
Sbjct: 278  EFHPYLFSQHKKMPFKEYNTFMEAVDEFFSTQESQKIDMKTLQQEREALKKLSNVKNDHT 337

Query: 394  NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             R+  L +  D   K AELI  N   VD AILA++ A+A+++SW D+  +VKE +  G+ 
Sbjct: 338  RRLEELNKVQDLDKKKAELITCNQSLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDI 397

Query: 454  VAGLIDKLYLERNCMSLLLS----------NNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
            VA  I KL LE N +SLLL+          N+ +  D+++  L    ++VDLALSA ANA
Sbjct: 398  VARSIKKLKLEINHISLLLTDPYKCGNEYLNDENGADNDDSLL----IDVDLALSAWANA 453

Query: 504  RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
             R+Y+LK+    K++KTI A  KA K+AE+KT+  + + +T++NI+  RKV WFEKF WF
Sbjct: 454  CRYYDLKRSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFFWF 513

Query: 564  ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
            +SSENYL+I GRDAQQNE+IVKRYM   DVYVHAD+ GASS +I+N      +PP TL +
Sbjct: 514  VSSENYLIIGGRDAQQNELIVKRYMRPKDVYVHADIQGASSVIIRNSTGGD-IPPKTLLE 572

Query: 624  AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
            AG   + +S AWD+K+VT+++WVY  QVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG
Sbjct: 573  AGTMAISYSVAWDAKVVTNSYWVYSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMG 632

Query: 684  FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLS 743
              +LF+L++S +  H+ ER++R  E+ +D                           E++ 
Sbjct: 633  LSILFKLEDSFIQRHVGERKIRSTEDAIDQ--------------------------ENVK 666

Query: 744  VPNSAHPAPSHT---NASNVDS--HEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
             P   +  P+     N +N DS  + FP  +  + +       D  R     +T + E  
Sbjct: 667  QPEITYTDPNQITELNDANSDSAINVFPNTEVKVEH-------DTGR-----ITIKTE-- 712

Query: 799  IDRALGLGSASISSTKHGIETTQFD--LSEEDKHVERTATVRDKPYISKAERRKLKKGQG 856
                LG         K  I  +Q D  ++EED  + + A  R K   +K  +RK  KG  
Sbjct: 713  ---LLG------EDIKTNIIESQHDNPINEEDAVIIKAAPSRKKNQQTK--KRKECKGHM 761

Query: 857  SSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERN 916
                  K + E+ +      QP        I   K+ RGQKGKLKKMK+KY DQD+EER 
Sbjct: 762  E-----KADLERLQNNSPEIQP--------INSSKVKRGQKGKLKKMKQKYKDQDDEERE 808

Query: 917  IRMALLASAGKVQ---KNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCK 973
            IRM +L S+GK +    ND D          E KP IS   AP                 
Sbjct: 809  IRMMILNSSGKDKLKINNDKD----------EDKPNISNKIAP----------------- 841

Query: 974  EHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDI 1033
                     VE      L+     +++ +EE D   I  +     + +D LTG P   D 
Sbjct: 842  ---------VEK-----LETAIPKNQIEIEENDDLPITTDA----DLLDSLTGVPFDDDE 883

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            +L+ IPV  PY A+Q YK++VK+ PGT K+GK  ++  S+ 
Sbjct: 884  VLFAIPVVAPYQALQQYKFKVKLTPGTGKRGKAAKLALSIF 924


>gi|157116544|ref|XP_001658543.1| hypothetical protein AaeL_AAEL007639 [Aedes aegypti]
 gi|108876416|gb|EAT40641.1| AAEL007639-PA [Aedes aegypti]
          Length = 995

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 310/794 (39%), Positives = 438/794 (55%), Gaps = 111/794 (13%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L+ +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLVRNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R HTTA+   K   PSGFT+KLRKH++ +RLE ++QLG DRI+ FQFG G  A++V
Sbjct: 53  ESGNRFHTTAFEWPKNVAPSGFTMKLRKHLKNKRLESMKQLGVDRIVDFQFGTGEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD +  +L +LR H + ++ V    R +YPT                 
Sbjct: 113 ILELYDRGNILLTDCDLKILNILRPHVEGEE-VRFAVREKYPT----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  ED               KG  + +  K +   ++ G      TL
Sbjct: 155 ------------DRAKED---------------KGPPAMEKVKETIAKAHPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL----------------VPNM----------KLSEVN 274
           +T L   L YG ++ +H++   GL                VP            + S+V 
Sbjct: 183 RTALNPILEYGASVIDHVLHKYGLYGCRIGGELPAEAMAEVPKKAKKKQKAIAKEFSKVF 242

Query: 275 KLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD- 333
            +E++ +  L+ A+   E  L+  +     P    ++Q K L     P +     + Y  
Sbjct: 243 NIEED-MTALMCAINDAETMLRKAMKE---PSRGFIIQKKELK----PAKDKEQEEFYFT 294

Query: 334 --EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
             E+ P L NQ++     +F++F AA+DEFYS +E Q+ + +  A+E  A  KL+ +  D
Sbjct: 295 NLEYHPFLYNQYKEDPVKEFDSFTAAVDEFYSTLEGQKIDLKAFAQEREALKKLSNVRTD 354

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
              R+  L +      K AELI  N   VD+AILAV+ ALA++MSW D+  +VK  +   
Sbjct: 355 HAKRLEDLTKAQLEDRKKAELITRNQNLVDSAILAVQSALASQMSWSDIQDLVKAAQANN 414

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-------------LPVEKVEVDLALS 498
           +PVA  I +L LE N +SL+L +    +D++ +              L    V+VDLA++
Sbjct: 415 DPVASCIKQLKLEINHISLMLKDPYGALDEDFEDDDDEEEREDGEGKLEPMVVDVDLAMT 474

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
           A ANARR+Y+ ++    K++KTI + SKA K AEKKT   +   +T   IS  RKV+WFE
Sbjct: 475 AFANARRYYDQRRFAARKEQKTIESSSKALKNAEKKTMQTLKDVRTQTTISKARKVYWFE 534

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF WFISSENYLVI GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN   E  +PP
Sbjct: 535 KFYWFISSENYLVIGGRDQQQNELIVKRYMRPSDIYVHAEIQGASSVIIKNPSGED-IPP 593

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   + +S AWD+K+VTSA+WV   QVSKTAPTGEYLT GSFMIRGKKNFLPP 
Sbjct: 594 KTLLEAGTMAISYSVAWDAKVVTSAYWVKSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 653

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRG--EEEGMDDFEDSGHHKENSDIESEKDDTDEK 736
            L++G   +F+L+ESS+  H  ER+VR   EE  M   + S    +   +  E+D+ +++
Sbjct: 654 HLVLGLSFMFKLEESSIERHKGERKVRTFDEESIMSKEDRSEEQVKLLSLNKEEDEIEKQ 713

Query: 737 PVAESLSVPNSAHP 750
            V       +S  P
Sbjct: 714 GVVSDSDTDDSEGP 727



 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 90/187 (48%), Gaps = 53/187 (28%)

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPA 949
            G++ RGQK K++K+KEKY DQDEEER + M +L SAG          N N    KE++  
Sbjct: 814  GQLKRGQKAKMRKIKEKYKDQDEEERKLMMEILKSAG----------NRNTQNQKEEEAG 863

Query: 950  ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEM--DKVAMEEEDI 1007
             S                   D K++P     G +  P +   E  E   D  A  + D+
Sbjct: 864  GS-------------------DQKKYP-----GKKPQPRLKPGEFEEFGDDTPAAADVDM 899

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK-- 1065
                         +D LTG P+  D LL+ IPV  PY ++ +YK++VK+ PGT K+GK  
Sbjct: 900  -------------LDSLTGQPMEEDELLFAIPVVAPYQSLHNYKFKVKLTPGTGKRGKAS 946

Query: 1066 --GIQIF 1070
               +QIF
Sbjct: 947  KMALQIF 953


>gi|297736763|emb|CBI25964.3| unnamed protein product [Vitis vinifera]
          Length = 403

 Score =  502 bits (1292), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 248/328 (75%), Positives = 282/328 (85%), Gaps = 4/328 (1%)

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           E GN VS+A +E  G +KG KS + SKN+N    DGARAKQ TLKTVLGEALGYGPALSE
Sbjct: 63  EGGNKVSDAPREKQGNRKGAKSSEPSKNTN----DGARAKQATLKTVLGEALGYGPALSE 118

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
           HIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD VPEGYILMQNK  
Sbjct: 119 HIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVAKFENWLEDVILGDQVPEGYILMQNKIF 178

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
           GKD PP++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSKIE QR+EQQ KA
Sbjct: 179 GKDCPPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKA 238

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE  A  KL+KI MDQENRVHTLK+E DR +KMAELIEYNLEDVDAAILAVRVALAN M+
Sbjct: 239 KEVTAMQKLSKICMDQENRVHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMN 298

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           WEDLARMVKE++K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMDD+EKTL V+KVEVDLA
Sbjct: 299 WEDLARMVKEKKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMDDDEKTLHVDKVEVDLA 358

Query: 497 LSAHANARRWYELKKKQESKQEKTITAH 524
           LSAHANARRWYE KK+QE+K+EKTI AH
Sbjct: 359 LSAHANARRWYEQKKRQENKREKTIIAH 386


>gi|395838618|ref|XP_003792209.1| PREDICTED: nuclear export mediator factor NEMF [Otolemur garnettii]
          Length = 1056

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 288/744 (38%), Positives = 414/744 (55%), Gaps = 117/744 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHART------------ 160

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A  P  +      ++NA K  L                             L
Sbjct: 161 --------AEPPLTLERLTEIIANAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   ++K+ E  KLE   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFASSVKVDE--KLESKDIEKVLVCLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFNGKGYII-QKREIKPSLEADKPAEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVATAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 ------NLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYE 508
                   D   ++ +T P +                     V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVVDDVSVEKNETEPSKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTMA 576

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 577 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 636

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E ++
Sbjct: 637 KVDESCVWRHRGERKVRVQDEDVE 660


>gi|328864957|gb|EGG13343.1| DUF814 family protein [Dictyostelium fasciculatum]
          Length = 1244

 Score =  501 bits (1289), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 296/773 (38%), Positives = 449/773 (58%), Gaps = 134/773 (17%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN- 77
           + +IG+R +N+YDLSP+ ++FKL        S    K  L++ESG+R+H+T + RDK + 
Sbjct: 69  KNVIGLRLANIYDLSPRVFLFKL--------SRPDFKKTLIIESGIRIHSTNFIRDKGDH 120

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
           TP+ F++ LRK+++T+RLE VRQLG DRI+ F FG G+   +VI+EL++ GNI+LTD ++
Sbjct: 121 TPAPFSITLRKYLKTKRLESVRQLGVDRIVDFTFGSGVATQHVIVELFSIGNIILTDGDY 180

Query: 138 TVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVN 196
            VL +LR+H+  ++  +A+     YP +  R                        P  V 
Sbjct: 181 KVLAILRTHQYTENDNIAVGDV--YPVDKAR------------------------PPSV- 213

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
                 + A  +N+  Q                   A  K+ TLK V  ++L +GP L E
Sbjct: 214 -----FTEALVDNIIQQ-------------------AADKKDTLKQVFNKSLDFGPELIE 249

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ---- 312
           H IL  GL P++K+   N  E++A ++    +  F++  Q +    +  +G+I+++    
Sbjct: 250 HCILMAGLSPSLKIESYNH-EEHASKL----IEAFKEG-QKIFDVAVQSKGFIVLKPPKV 303

Query: 313 --------------NKHLGKDHPPTESGSSTQ----------IYDEFCPLLLNQFRSREF 348
                          + L KD     +GS  +          +Y+EF P L  Q++ +++
Sbjct: 304 ESKQQQQQKKKAAEQQQLKKD---AIAGSGEEAATEEKKELVVYEEFVPYLYKQYQDKKY 360

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           +++++FD A+D+F+S+IESQ+ EQQ  ++E     KL+K+  DQ+ R+ +L      ++K
Sbjct: 361 LEYDSFDLAVDQFFSEIESQKVEQQRMSQEQTVLKKLDKVREDQQRRIDSLYASEGENIK 420

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERN 466
            A+LIE NL+DVD  IL +R  +A  M W +L +++KEE+K  NP  VA  I KL L+ N
Sbjct: 421 KAQLIESNLQDVDQCILIIRSGVAASMDWGNLNQLLKEEKKK-NPYSVANKIHKLKLDTN 479

Query: 467 CMSLLLSN------------------------NLDEMDDEEKTLPVEKVEVDLALSAHAN 502
            ++L L++                           +   +    PV  ++VD++LSA+AN
Sbjct: 480 QITLSLTDLHLDDDEDEEDEDENSDDDSEDEEKKKKNQKKNAKKPVF-IDVDISLSAYAN 538

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           AR +Y+ KK+   K EKTI     A KAAEKK R Q+ + KT +++  MRKV WFEKF+W
Sbjct: 539 ARNFYDSKKQSHEKAEKTIQQADFALKAAEKKARQQLSEVKTKSSMQQMRKVFWFEKFHW 598

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+NY+VISG+DAQQNE++ K+Y+ K DVYVHAD+ G++S VIKN +  + +PP TL 
Sbjct: 599 FISSDNYIVISGKDAQQNELLFKKYLDKDDVYVHADIFGSTSCVIKNPKGGE-IPPNTLI 657

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG  T+C+S AW +K+VTSA+WVY HQVSKTAP+GE+LT GSFMIRGKKN+LP   L+M
Sbjct: 658 QAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVM 717

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS--GHHKENSDIESEKDDT 733
           GFG +F++D+S + +HL ER       G     DS  G H E+  +E   DD+
Sbjct: 718 GFGFMFKIDDSCIANHLGER-----SSGSSLLRDSMDGDHDEDMRMEELPDDS 765



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 44/64 (68%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLM 1077
             +++D LTGNPL +DIL + IPV GPY+   +YKY+VK+ PG  K+GK  +   + LL +
Sbjct: 1098 FSNIDTLTGNPLENDILHFAIPVVGPYTIFNNYKYKVKLTPGHQKRGKAAKQAAATLLGL 1157

Query: 1078 LSLT 1081
             ++T
Sbjct: 1158 KNIT 1161


>gi|426233098|ref|XP_004010554.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Ovis
           aries]
          Length = 1055

 Score =  500 bits (1288), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 295/747 (39%), Positives = 418/747 (55%), Gaps = 123/747 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDIEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAG 573

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 574 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 633

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 634 FLFKVDESCVWRHRGERKVRVQDEDME 660


>gi|357620683|gb|EHJ72794.1| hypothetical protein KGM_20428 [Danaus plexippus]
          Length = 1001

 Score =  500 bits (1288), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 294/728 (40%), Positives = 426/728 (58%), Gaps = 84/728 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RL+GMR + VYD+  KTY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNTYDIVCMVSELQRLVGMRVNQVYDIDNKTYVIRLQRSE--------EKAVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGFT+KLRKH++ +RLE + QLG DRI+  QFG G  A++VI
Sbjct: 53  SGNRFHTTQFEWPKNVAPSGFTMKLRKHLKNKRLEKLSQLGIDRIVELQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E+T+L +LR H + DK V    + +YP              L  A 
Sbjct: 113 LELYDRGNIVLTDCEWTILNVLRPHVEGDK-VRFAVKEKYP--------------LDRAK 157

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T    P+                A KE LG  K G +                     LK
Sbjct: 158 TDYAAPN--------------EGALKEILGKSKPGDN---------------------LK 182

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE-VNK--LEDNAIQVLVLAVAKFEDWLQDV 298
            +L   L YG ++ +H++L  GL  N+K+S+  NK    +  +  L  A+ + E  +++ 
Sbjct: 183 KILNPNLEYGASIIDHVLLQNGLSGNLKISQDPNKGFYVERDLGTLANALRQAETMIEN- 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-EFCPLLLNQFRSREFVKFETFDAA 357
              + + +GYI+ +     +D P  + G    + + EF PLL  Q + + +V++ETFD A
Sbjct: 242 -GKNQMAKGYIIQKR----EDRPNQDGGPDFFLTNQEFHPLLYLQNKDQVYVEYETFDRA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYS +E Q+ + +    E  A  KL  I  D E R+  L++      + AE+I  N 
Sbjct: 297 VDEFYSALEGQKIDLKTIQVEREAMKKLQNIRTDHEKRLSNLEKVQLEDRRAAEMIARNE 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V+ A LA++ A+AN+MSW+D+  +VK  +   +PVA  I +L L  N ++LLL +   
Sbjct: 357 PLVEQARLAIQTAIANQMSWDDIKLLVKAAQDNKDPVASAIKQLKLNTNHITLLLKDPYD 416

Query: 475 ----------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                     + D   D+E+  P+  V++DL+L+A ANARR+Y+ K+    KQ+KT+ + 
Sbjct: 417 DDDDDDDDDDDNDGGGDKERLEPM-MVDIDLSLTAFANARRYYDQKRSAAKKQQKTLESA 475

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            KA K+AEKKT+  + + + +++IS  R+ +WFEKF WFISS+NYLVI+GRD QQNE++V
Sbjct: 476 DKALKSAEKKTKQTLKEAQAISSISKARRNYWFEKFYWFISSDNYLVIAGRDQQQNELLV 535

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           KRYM   DVYVHAD+ GASS VIK   P  P PP TL++AG   V +S AW++K++T AW
Sbjct: 536 KRYMRSTDVYVHADVSGASSVVIKC--PSGPPPPRTLSEAGQAAVAYSVAWEAKVLTRAW 593

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WV+ HQVSK+APTGEYL+ GSFMIRGKKN+L P  L  GF  +FRL++SS+  H ++R+ 
Sbjct: 594 WVHGHQVSKSAPTGEYLSTGSFMIRGKKNYLLPEHLQFGFSFMFRLEDSSIDRHRDDRKA 653

Query: 705 RGEEEGMD 712
              ++  D
Sbjct: 654 VQADDASD 661



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            LTG PL  D LL+ +PV  PYS++  YK++VK+ PG+ K+GK     +Q+F
Sbjct: 909  LTGAPLDEDELLFAVPVVAPYSSLLQYKFKVKLTPGSNKRGKAAKTAVQVF 959


>gi|119586150|gb|EAW65746.1| serologically defined colon cancer antigen 1, isoform CRA_f [Homo
           sapiens]
          Length = 1010

 Score =  497 bits (1280), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 299/747 (40%), Positives = 420/747 (56%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I        I L D    VLT        D    I++  R+ T+                
Sbjct: 113 I--------IELYDRGNIVLT--------DYEYVILNILRFRTD---------------- 140

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                            + ++V  A +E         +  L           +  K   L
Sbjct: 141 -----------------EADDVKFAVRERYPLDHARAAEPLLTLERLTEIVASAPKGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYIIQKREIKPCLEADKPVED----ILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHQGERKVRVQDEDME 681


>gi|347968346|ref|XP_312244.5| AGAP002680-PA [Anopheles gambiae str. PEST]
 gi|333468048|gb|EAA08148.6| AGAP002680-PA [Anopheles gambiae str. PEST]
          Length = 1053

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 298/750 (39%), Positives = 430/750 (57%), Gaps = 114/750 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L  +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLARNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT++   K   PSGFT+K+RKH++ +RLE ++QLG DRI+ FQFG G  A+++
Sbjct: 53  ESGLRFHTTSFEWPKNVAPSGFTMKMRKHLKNKRLESLQQLGVDRIVDFQFGTGEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113 ILELYDRGNILLTDCELRILNILRPHVEGEE-LRFAVREKYPK----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  +D                G  S +  K + + +  G      TL
Sbjct: 155 ------------DRAKQDN---------------GPPSMEQIKEAIQKAQPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL--------VPN----------------MKLSEVNKL 276
           +T L   L YG ++ +H++   GL        +PN                 + ++V  +
Sbjct: 183 RTALNPILEYGASVIDHVLHRQGLFGCRIGGELPNDPALPKKVKKKQKNIAKEFAKVFDM 242

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--- 333
           E + +  L+ A+ + E  L++  +      GYI+ +     K+  PT+ G   + Y    
Sbjct: 243 ETD-LGPLMSAINEAETMLRE--AQKRPSPGYIIQK-----KEVKPTKQGDEEEYYFTNL 294

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E+ P + NQ++   F  F++F  A+DEFYS +ESQ+ + +  A+E  A  KL+ +  D  
Sbjct: 295 EYQPYMYNQYQGEPFKAFDSFTTAVDEFYSSLESQKIDLKAFAQEREALKKLSNVKTDHA 354

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L +      K AELI  N + VD A+LAV+ ALA +MSW D+  +VK  +   +P
Sbjct: 355 KRIEELTKAQLEDRKRAELITRNQDLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDP 414

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEE-----------------KTLPVEKVEVDLA 496
           VA  I +L LE N +SL L++    +D++                  K +P+  V+VDLA
Sbjct: 415 VASCIRQLKLEINHISLHLTDPYASLDEQASDEEEEEEDSEREDDEAKLVPM-VVDVDLA 473

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVH 555
           LSA ANARR+Y+ ++    K++KTI + SKA K AE+KT +Q L++ +T   IS +RKV+
Sbjct: 474 LSAFANARRYYDQRRFAARKEQKTIESSSKALKNAERKT-IQTLKDVRTQTTISKVRKVY 532

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFEKF WF+SSENYLVI GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN    + 
Sbjct: 533 WFEKFYWFVSSENYLVIGGRDQQQNELIVKRYMRPTDIYVHAEIQGASSVIIKNPAGGE- 591

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL +AG   + +S AWD+K+VTSA+WV+  QVSKTAPTGEYLT GSFMIRG+KNFL
Sbjct: 592 IPPKTLLEAGTMAISYSVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFL 651

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           PP  L++G   LF+L++SS+  H  ERRVR
Sbjct: 652 PPCHLVLGLSFLFKLEDSSVERHRGERRVR 681



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 87/188 (46%), Gaps = 49/188 (26%)

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPA 949
            G++ RGQK K++K+KEKY DQD+++R + M +L                           
Sbjct: 865  GQLKRGQKAKMRKIKEKYKDQDDDDRKLIMEIL--------------------------- 897

Query: 950  ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVA-MEEEDIH 1008
                         K AG+  +D  E   D +   +D    G        +   ++  +  
Sbjct: 898  -------------KSAGNRKQD--EGTKDDADQRQDGAGGGKGGGGVGKRTPRLKPGEFE 942

Query: 1009 EIGEEEKGR--LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK- 1065
            E+G++      L+ +D LTG P+  D LL+ IPV  PY ++ +YKY+VK+ PGT K+GK 
Sbjct: 943  ELGDDTPAAADLDMLDTLTGQPVEEDELLFAIPVVAPYQSLHNYKYKVKLTPGTGKRGKA 1002

Query: 1066 ---GIQIF 1070
                +QIF
Sbjct: 1003 SKMALQIF 1010


>gi|348544245|ref|XP_003459592.1| PREDICTED: nuclear export mediator factor Nemf-like [Oreochromis
           niloticus]
          Length = 1074

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 303/748 (40%), Positives = 416/748 (55%), Gaps = 107/748 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    IGMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIRAVIAEINANYIGMRVYNVYDIDNKTYLIRLQKPDS--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H+T +   K   PSGF +K RKH++TRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTQIKQLGIDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY                        D+G  I++ H Y       F    A  +  A
Sbjct: 113 IIELY------------------------DRGNIILADHEYTILNLLRFRTAEAEDVKIA 148

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           +      ++  P +           S E L       +  LSK  N             +
Sbjct: 149 VRERYPVESARPPE--------PLISLERL-------TEILSKAPNGEQ----------V 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKL-SEVNKLED-----NAIQVLVLAVAKFEDW 294
           K VL   L YG  L EH +++ GL  ++K+ S+V+  +       A+Q+    + K E++
Sbjct: 184 KRVLNPHLPYGATLIEHSLIEAGLSGSIKIDSQVDSAQVAPKILEALQIAETYMEKTENF 243

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
                SG    +GYI+ Q         P +       YDEF P L  Q     +++F+TF
Sbjct: 244 -----SG----KGYII-QKTEKKPSLTPGKPSEELLTYDEFHPFLFAQHAKSPYLEFDTF 293

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAEL 412
           D A+DEF+SK+ESQ+ + +   +E  A  KL  +  D E R+  L   QEVDR +K  EL
Sbjct: 294 DKAVDEFFSKMESQKIDLKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDR-IK-GEL 351

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           +E NL  VD A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L+ N +++LL
Sbjct: 352 VEMNLPVVDRALQVVRSALANQVDWTEIGVLVKEAQAAGDPVACAIKELKLQTNHITMLL 411

Query: 473 SN---------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
            N                              +    ++  P+  V+VDL LSA+ANA++
Sbjct: 412 KNPYISEEDQEEEEKKEIVETKGKKNKNKEKGQNKKLQRNKPM-LVDVDLGLSAYANAKK 470

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+  E K++KTI A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS
Sbjct: 471 YYDSKRSAEKKEQKTIEAADKAMKSAEKKTQQTLKEVQTVTTIQKARKVYWFEKFLWFIS 530

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN     P+PP TL +AG
Sbjct: 531 SENYLVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSG-NPIPPRTLTEAG 589

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              VC+S AWD+K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGFG
Sbjct: 590 TMAVCYSAAWDAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFG 649

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDD 713
            LF++D+ S+  H  ER+VR  EE M++
Sbjct: 650 FLFKVDDQSVFRHQGERKVRTVEEDMEE 677


>gi|170055538|ref|XP_001863626.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
 gi|167875449|gb|EDS38832.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
          Length = 995

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 301/747 (40%), Positives = 423/747 (56%), Gaps = 111/747 (14%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L+RL+GMR + +YD+  KTY+ +L+ +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQRLVGMRVNQIYDIDNKTYLIRLVRNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R HTTA+   K   PSGFT+K+RKH++ +RLE +RQLG DRI+ FQFG G  A+++
Sbjct: 53  ESGNRFHTTAFEWPKNVAPSGFTMKMRKHLKNKRLESLRQLGVDRIVDFQFGSGEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113 ILELYDRGNILLTDCELKILNILRPHVEGEE-LRFAVREKYPE----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  +D               +G    +  + +   +N G      TL
Sbjct: 155 ------------DRAKQD---------------RGPPPMEKVRETIAKANPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL-----------VP--------------NMKLSEVNK 275
           +T L   L YG ++ +H +   GL           VP                + ++V  
Sbjct: 183 RTALNPILEYGASVIDHALTKYGLFGCRIGGKLNPVPPEVSKKVKKKQKAIAKEFAKVFN 242

Query: 276 LEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-- 333
            E++ +  L+ A+   E  L+    G   P    ++Q K L     P + G   + Y   
Sbjct: 243 PEED-MTALMCAINDAETMLR---QGMREPSKGFIIQKKELR----PAKEGEPEEYYLTN 294

Query: 334 -EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ 392
            E+ P L NQ++   + +F +F AA+DEFYS +E Q+ + +  A+E  A  KL+ +  D 
Sbjct: 295 LEYQPYLYNQYKDEPYQEFASFTAAVDEFYSTLEGQKIDLKSFAQEREALKKLSNVRTDH 354

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
             R+  L +      K AELI  N   VD+A+LAV+ ALA++M+W D+  +VK  +   +
Sbjct: 355 AKRLDDLIKAQLEDRKKAELITRNQNLVDSALLAVQSALASQMAWSDIQDLVKAAQANND 414

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDE-------------EKTLPVEKVEVDLALSA 499
           P+A  I +L LE N +SLLL +    +D+E             +K  P+  V+VDLALSA
Sbjct: 415 PIASCIRQLKLEINHISLLLKDPYAVLDEEEEEEEDSDREDDEQKLEPM-VVDVDLALSA 473

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVHWFE 558
            ANAR++Y+ ++    K++KTI + SKA K AEKKT LQ L++ +T   IS  RKV+WFE
Sbjct: 474 FANARKYYDQRRFAARKEQKTIESSSKALKNAEKKT-LQTLKDVRTQTTISKARKVYWFE 532

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF WFISSENYLVI GRD QQNE++VKRYM   D+YVHA++ GASS VIKN    + +PP
Sbjct: 533 KFYWFISSENYLVIGGRDQQQNELLVKRYMRPADIYVHAEIQGASSVVIKNPSGAE-IPP 591

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   + +S AWD+K+VTSA+WV   QVSKTAPTGEYLT GSFMIRGKKNFLPP 
Sbjct: 592 KTLLEAGTMAISYSVAWDAKVVTSAYWVRSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 651

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVR 705
            L++G   +F+L+ESS+  H  ER+VR
Sbjct: 652 HLVLGLSFMFKLEESSVERHKGERKVR 678



 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 88/185 (47%), Gaps = 48/185 (25%)

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPA 949
            G + RGQK KL+K+KEKYGDQDEEER + M +L SAG V       QNE AS     K  
Sbjct: 813  GPLKRGQKAKLRKIKEKYGDQDEEERKLMMDILKSAGNVPTKPA--QNEEASGSDPAKKY 870

Query: 950  ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
                  P+     +KA     D +E PDD+    +                         
Sbjct: 871  PGKKPPPR-----QKAA----DLEEVPDDTPAAAD------------------------- 896

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK---- 1065
                    ++ +D LTG P   D LL+ IPV  PY ++ SYK++VK+ PGT K+GK    
Sbjct: 897  --------VDMLDSLTGCPHEEDELLFAIPVVAPYQSLHSYKFKVKLTPGTGKRGKASKT 948

Query: 1066 GIQIF 1070
             +QIF
Sbjct: 949  ALQIF 953


>gi|26333303|dbj|BAC30369.1| unnamed protein product [Mus musculus]
          Length = 641

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 287/703 (40%), Positives = 400/703 (56%), Gaps = 100/703 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 415

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 416 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 536 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 595 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 637


>gi|324502310|gb|ADY41017.1| Serologically defined colon cancer antigen 1 [Ascaris suum]
          Length = 958

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 285/707 (40%), Positives = 395/707 (55%), Gaps = 80/707 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  LR L G+R +NVYD+  KTY+ ++            EK  ++ME
Sbjct: 1   MKSRFSTLDVFAVVHDLRALEGLRVTNVYDVDSKTYLIRMHIPD--------EKCFIMME 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+RLH T++   K   PS F++KLRKHI+ +RL  V QLG DR++  QFG    A +VI
Sbjct: 53  SGMRLHKTSFEWPKAQFPSSFSMKLRKHIKQKRLTKVEQLGVDRVVDLQFGTDDRASHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GNILLTD ++ +L +LR   D +  V    R  YP E  R            A+
Sbjct: 113 VELYDRGNILLTDHQYVILNVLRPRTDKNTDVRFSVRETYPIENAR----------QEAM 162

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             SK                      E L   K G+S                     ++
Sbjct: 163 VPSKARLI------------------EMLATTKKGES---------------------VR 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLV----LAVAKFEDWLQD 297
             L     YGPAL EH +   G+  N ++       +  IQ L+    +A   F++  Q+
Sbjct: 184 RALAPLTQYGPALIEHSLRLAGICSNAQIGVNISNSEEDIQKLLNAMDIAQIVFDELRQN 243

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
              G I+   Y L               G S + Y EF P    QF S    +FE F   
Sbjct: 244 RSHGFII---YKL----------DTRADGHSFESYQEFHPYRFKQFESENLREFENFSEC 290

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DE++SKIESQRA+Q+    E  A  KL  +  DQ+ R+ +L+       +MAELIE N 
Sbjct: 291 VDEYFSKIESQRADQRALNAEREALKKLENVKRDQQERIESLELAQVEKRQMAELIELNS 350

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + VD A+L +R A+AN++SWE +  M  +  +AG+P+A  I  L L  N M+L L +   
Sbjct: 351 DLVDKALLIIRSAIANQLSWEMIEEMRIKASEAGDPIASSIVGLNLNSNEMTLSLRDPYH 410

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D   K +P+     D+ALSA+ N+R+++  KK    K++KTI++ +KA K+A+ K + 
Sbjct: 411 D-DSSPKKVPI-----DIALSAYQNSRKFHSEKKAAVDKKQKTISSSAKALKSAQLKAKE 464

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            +   +  A++   R+  WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+YVHA
Sbjct: 465 TLATVRAKADVVKSRRQMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRTGDIYVHA 524

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           D+ GASS VI+N      +PP TLN+AG   VC+S +W++K++ +AWWVY HQVS+TAPT
Sbjct: 525 DVRGASSVVIRNKVNGGEIPPKTLNEAGSMAVCYSSSWEAKVIAAAWWVYHHQVSRTAPT 584

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           GEYLT GSFMIRGKKNFLP   L MGFGL+F+LDE S+  H  ERRV
Sbjct: 585 GEYLTPGSFMIRGKKNFLPSCQLQMGFGLMFKLDEDSVERHRGERRV 631



 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 73/160 (45%), Gaps = 17/160 (10%)

Query: 907  YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
            Y DQ+E+ER +R   L S     K   + Q+E       +K      D  +         
Sbjct: 767  YADQEEDERIMRANWLGSREVAAK---EYQDEEGVKETSRKNGTKIADVSR--------- 814

Query: 967  HLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTG 1026
               K+     D+   G ED   V      + ++  ++E D+  +GEEE   L   D LT 
Sbjct: 815  --QKNTTADFDERKQGKEDRDIVRATAKVQEEEEEVDESDLRSMGEEETKML---DSLTW 869

Query: 1027 NPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             PLP D LL+ + V  PY  + ++KY+VK+ PGT K+GK 
Sbjct: 870  RPLPGDTLLHAVVVVAPYQTMLNFKYKVKLTPGTGKRGKA 909


>gi|339236819|ref|XP_003379964.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316977305|gb|EFV60421.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 789

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 295/757 (38%), Positives = 423/757 (55%), Gaps = 82/757 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +  D+ A V+ LR+ IGMR + VYD++PKTY+ KL        S   +KV+++ E
Sbjct: 1   MKGRFSLIDLLAVVQELRQYIGMRLNLVYDINPKTYLLKL--------SKPDKKVMIIFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+RLH+T Y   K   PSGFT+KLRKH+R +RLED+  +G DRI+  +FG G  A ++I
Sbjct: 53  SGIRLHSTEYGWSKNIMPSGFTMKLRKHLRDKRLEDISVVGLDRIVDMRFGNGPTACHLI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE--RTTASKLHA 179
           +ELY +GN++LTDSE+ +L +LR+   +   V    R  Y  E+ R FE  R TA +   
Sbjct: 113 IELYDRGNVVLTDSEYVILNILRARTIETDNVRYAVRETYLVEV-REFEEYRRTADE--- 168

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP- 238
                            E  N + +A                               QP 
Sbjct: 169 -----------------EMANRLLHAC------------------------------QPG 181

Query: 239 -TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            TL   L     YGP L EH +L+  L   MK+  V   +     + +     FE  L +
Sbjct: 182 DTLHKCLVPHFPYGPLLLEHCLLENKLSLRMKVQAVIGDQSLVSALALSLSLAFE--LFE 239

Query: 298 VISGDIVPE-GYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           +I  +  P  GY+ M  +          +G   +I+ EF P   +QF + E  +F+TF+ 
Sbjct: 240 MIRKE--PSCGYLKMTVEE-------NAAGERIEIFHEFHPYFFSQFANSECKQFDTFNG 290

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DE++SK++SQ+ +Q+   +E AA  +L  +  D E R+  L+ +     +MA  +E N
Sbjct: 291 AVDEYFSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELN 350

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
            E V+ A+  +R A+A ++ W  +  M+++ R  G+PVAG I  L LERN   + L  ++
Sbjct: 351 SETVEQALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPVDV 410

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
            + D E        VE+DLALS+H N+RRW+   K+   KQ+KTI A  KA K+AE +T+
Sbjct: 411 FDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAAGGKALKSAELRTK 470

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            Q+   +   NI  +RK+ WFEKF+WF SS+  LVI+GRDA+QNE++VKRY+  GD+YVH
Sbjct: 471 EQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEILVKRYLKPGDLYVH 530

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           ADL GA+S VIK    + P+PP TLN+A    VC S AW+SK+VTSAWWV   QVSK+AP
Sbjct: 531 ADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSAWWVKHDQVSKSAP 590

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----RVRGEEEGM 711
           +GEYL  G FMIRGKKN+L    L+MGFGLLFRLD  S   HL +R      + GEE   
Sbjct: 591 SGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAARHLEKRCQAEDELDGEEANC 650

Query: 712 DDFEDSGHHKENSDIESEKDDTDEKPV-AESLSVPNS 747
           D+ +D    K+   + SE  +     V +E  S P++
Sbjct: 651 DNLQDE-QKKQKKLVRSELSEQSFNSVNSEEFSYPDN 686



 Score = 73.6 bits (179), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 65/120 (54%), Gaps = 12/120 (10%)

Query: 965  AGHLSKDCKEHPDDSSHGVEDNPCVGL-DETAEMDKVA---MEEEDIHEIGEEEKG---- 1016
            A HL K C+   +D   G E N C  L DE  +  K+    + E+  + +  EE      
Sbjct: 630  ARHLEKRCQ--AEDELDGEEAN-CDNLQDEQKKQKKLVRSELSEQSFNSVNSEEFSYPDN 686

Query: 1017 -RLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
              L+ V  LTGNP   D +L+ +PVC PY+A+ +YK++VK+ PGT KKGK I+    L +
Sbjct: 687  ETLDAVQCLTGNPTEDDNILFALPVCAPYAALTNYKFKVKLTPGTTKKGKAIKTAIDLFM 746


>gi|432938285|ref|XP_004082515.1| PREDICTED: nuclear export mediator factor Nemf-like [Oryzias
           latipes]
          Length = 1089

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 292/746 (39%), Positives = 412/746 (55%), Gaps = 103/746 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    +GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIRAAIAEINANYVGMRVYNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+T +   K   PSGF +K RKH++TRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTHIKQLGIDRIVDMQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY                        D+G  I++ H Y       F    A  +  A
Sbjct: 113 IVELY------------------------DRGNIILADHEYTILNLLRFRNAEAEDVKIA 148

Query: 181 LTSSKEP--DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           +   + P  +A  P+ +          SK   G Q                         
Sbjct: 149 V-RERYPVENARSPEPLISLEQLTEILSKAPKGEQ------------------------- 182

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV---LVLAVAKFEDWL 295
            +K +L   L YG  L EH  ++ GL  ++K+      ++NA +V   +  A+   E ++
Sbjct: 183 -VKRILNPHLSYGATLIEHSFIEAGLPGSIKVDS----QENAAEVAPKIREALQIAESYM 237

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
           +   + +   +G+I+ Q         P +       YDEF P L  Q     F++ ++F+
Sbjct: 238 EK--TENFNGKGFII-QKSEKKPSVAPGKPAEELLTYDEFHPFLFVQHAKSPFLELDSFN 294

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELI 413
            A+DEF+SK+E Q+ + +   +E  A  KL  +  D E R+  L   QEVDR     EL+
Sbjct: 295 KAVDEFFSKMEGQKIDMKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRL--KGELV 352

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL  V+ A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L  N +++LL 
Sbjct: 353 EINLAVVERALQVVRSALANQVDWAEIGHIVKEAQAAGDPVACAIKELKLHSNHITMLLK 412

Query: 474 N-----------------------NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWY 507
           N                       N +    ++K L   K   V+VDL LSA+ANA+++Y
Sbjct: 413 NPYISEEEQEDEEMKDAVEEKGKKNKNRDKGQKKKLQRNKPMLVDVDLGLSAYANAKKYY 472

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+  E KQ+KT+ A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS+E
Sbjct: 473 DHKRSAEKKQQKTLEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAE 532

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYLVI+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN   + P+PP TL +AG  
Sbjct: 533 NYLVIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNPSGD-PIPPRTLTEAGTM 591

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC+S AWD+K++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGFG L
Sbjct: 592 AVCYSAAWDAKIITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGFL 651

Query: 688 FRLDESSLGSHLNERRVRGEEEGMDD 713
           F+++E S+  H  ER+V+  EE MDD
Sbjct: 652 FKVEEQSVFRHRGERKVKSVEEEMDD 677



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 110/224 (49%), Gaps = 23/224 (10%)

Query: 847  ERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGK----ISRGQKGKLKK 902
            E +K+K+ Q  S V+ K E         SS    + +  K  GG     + RGQK KLKK
Sbjct: 834  EDKKMKQKQEGSDVEEKTE--------TSSAGPVLDQGPKSGGGPSQPPLKRGQKNKLKK 885

Query: 903  MKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKC 962
            MKEKY DQDEE+R + M LL SAG V+        +     K+ K    PV  P    + 
Sbjct: 886  MKEKYKDQDEEDRELMMQLLGSAGPVKDE-----KDKGKKAKKGKGKEDPVRKPAPQKRQ 940

Query: 963  KKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVD 1022
             K    S + K         +E+ P  G D  A   +   ++ D    G EE   L  + 
Sbjct: 941  PKG---SAEKKPEQTGGVEVLEEKPP-GEDGAAADQEDKEDDIDQDNPGVEEAENL--LT 994

Query: 1023 YLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             LTG P   D+LL+ +PVC PY+A+ +YK++VK+ PG+ KKGK 
Sbjct: 995  SLTGQPHCEDVLLFAVPVCAPYTALSNYKHKVKLTPGSQKKGKA 1038


>gi|290975413|ref|XP_002670437.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
 gi|284083996|gb|EFC37693.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
          Length = 1146

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 294/816 (36%), Positives = 439/816 (53%), Gaps = 147/816 (18%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K RM+  D+   V  LR +LIGMR +N+YD++ KTY+ K   +         EK+++L+
Sbjct: 1   MKNRMSVVDIRCIVAELREQLIGMRLANLYDINKKTYLLKFAKTD--------EKIVVLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+TA+ RDK   PS F LK+RKHIRTRRLE + QLG DR++ F FG    A+++
Sbjct: 53  ESGIRIHSTAFERDKSKMPSPFVLKMRKHIRTRRLEKLEQLGVDRVVDFTFGAEEKAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+E +A+GN++LTD ++ ++++LR+H  + +         YP  I    +    SK   A
Sbjct: 113 IVEFFAKGNVVLTDYQYKIISILRTHSKEAEAGLFAVGETYP--ITTRLQSDGISKPTLA 170

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP-- 238
            T            + ++ N     ++EN             +N         + K P  
Sbjct: 171 QTIKT--------AIEKERNAALAPTEEN------------PENPQPTQKKKQQKKAPAV 210

Query: 239 ---TLKTVLGEALGYGPALSEHIILDTG-LVPNMKL------------SEVNKL------ 276
              T+K +L   L YG    EH +L    L  N+ L            SEV+ +      
Sbjct: 211 PTLTVKNLLNNYLDYGTGFVEHCLLTADVLASNLNLLDNAHPDTLKLISEVDNVIASSNV 270

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--------------QNKHLGKDHPP 322
           E   +  LV A  + +D++  + +      GYIL+              +N  L     P
Sbjct: 271 ETPILDKLVSAFKQVDDFIMRIKTEK--QRGYILLKEIVQQQVLDEVTVENPFLPPKKEP 328

Query: 323 TESGSST------------------QI---------------YDEFCPLLLNQFRSR--- 346
           TE+G  +                  QI               YD+F P L  Q R +   
Sbjct: 329 TENGEPSSEEPVVEPEIVLNDLQLKQIELMKQEKRLSIKRDQYDDFTPFLFEQVRRKIPA 388

Query: 347 -----EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
                + ++F++FD + DEF+S IE+++ E Q  + E+    K++K+  +QE ++  L+ 
Sbjct: 389 DKNQIKVIEFDSFDRSADEFFSAIEAKKIESQKSSIENTVEKKMSKVKREQELKLQELQA 448

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL 461
             D+   +A LIE + E VD AI  +  ALA   SWE + +++KE R   +P+A +I KL
Sbjct: 449 SFDKYETIATLIETHYEIVDQAIQVICSALAQSQSWETIKQIIKEHRDV-DPIAAMIHKL 507

Query: 462 YLERNCMSLLL--------------------------------SNNLDEMDDEEKTLPVE 489
            LE + +++ L                                     + D ++K  P+ 
Sbjct: 508 KLESSQITVTLPPPSIDDDDEDEFEYEESDEENDDEDEESDDEEKKEKKSDKKKKEEPM- 566

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS 549
           ++++D++L+AHANA ++Y L+KK    +EK   A  KA K  E+KT     + +  + I+
Sbjct: 567 RIDIDISLTAHANAAKYYSLRKKSGENKEKAAFASKKAIKKTEQKTLESAKKSQIKSEIT 626

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
             RK  WFEKF WFI+SENYLV+ GRDAQQNE++VKRYM KGD+Y+HAD+HGASS +IKN
Sbjct: 627 IRRKRFWFEKFYWFITSENYLVLGGRDAQQNELVVKRYMRKGDIYIHADVHGASSCIIKN 686

Query: 610 HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
              E P+PPL+L +AG F VC S AWD+K+++SA+WVY HQVSKTAPTGEYLTVGSFMIR
Sbjct: 687 PTGE-PIPPLSLQEAGMFCVCRSVAWDNKVMSSAYWVYDHQVSKTAPTGEYLTVGSFMIR 745

Query: 670 GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           GKKNFLPP PL+MGF ++F++DES + +H+ ER+ R
Sbjct: 746 GKKNFLPPSPLVMGFAVMFKVDESCIPNHIQERKPR 781



 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 51/73 (69%), Gaps = 3/73 (4%)

Query: 995  AEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRV 1054
            AE+ K  + +E+I  + +E+K +L ++D LTG P   DI L+ IPVC PY+ +++Y Y+V
Sbjct: 1009 AEIKKF-LADENIPFMDDEDKEKLTEIDSLTGQPRDDDIFLFAIPVCAPYTCLKNYTYKV 1067

Query: 1055 KIIP-GT-AKKGK 1065
            K++P GT  KKGK
Sbjct: 1068 KLVPAGTNTKKGK 1080


>gi|312082754|ref|XP_003143575.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 899

 Score =  484 bits (1246), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 287/737 (38%), Positives = 407/737 (55%), Gaps = 81/737 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  L+ L G R +NVYD+  KTY+ ++            EK  +++E
Sbjct: 1   MKNRFSTLDVFAVVHDLKELTGQRVANVYDVDSKTYLIRIQKPD--------EKCFIMLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+H T +   K   PS FT+KLRKHIR +RLE V QLG DRII  QFG   +A +VI
Sbjct: 53  SGCRIHRTTFDWPKAQFPSSFTMKLRKHIRHKRLECVTQLGVDRIIDMQFGFDEHACHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            ELY +GN++LTD+ +T+L +LR   D +  +    + RYP E  R              
Sbjct: 113 AELYDRGNVVLTDNNYTILNVLRPRTDKETDMRFSVQERYPLEAAR-------------- 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                              NVS  +K+ L         +  K + K           ++K
Sbjct: 159 ------------------QNVSCPTKDEL--------MERLKTAKKGE---------SVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L     YGP L EH +   G+  N ++     +E++    L  A+ +  D + +VI  
Sbjct: 184 RFLAPLTQYGPTLIEHSLRTVGVAQNAQIGVNIGMEESGAMKLFEAL-QLADQIFNVIRC 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +   +G+++ +             G   + Y EF P + +QF   +   F++F   +DEF
Sbjct: 243 N-AAQGFLVYR-------EDARMDGVIVETYQEFHPFMFSQFSDMQTKHFDSFSECVDEF 294

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +SK+E Q+A+ +    E  A  KLN +  DQ++R+  LK       +MAELIE N + VD
Sbjct: 295 FSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNSDLVD 354

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R A+AN++SWE +  M     +AGNP+A  I  L L  N M+LLL       D 
Sbjct: 355 KALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR------DP 408

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQI 539
               +  +KV +D+ALS++ NAR+ +  KK  + K++KTI A SKA K+ + K +  L +
Sbjct: 409 YRPEIDPKKVTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKVKIKETLNV 468

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +  K  A +   R+V WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+Y+HAD 
Sbjct: 469 VHSK--AEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADT 526

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            GASS +I+N      +PP TLN+A    V +S AW++K+ ++AWWV+ HQVS+TAPTGE
Sbjct: 527 RGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTSAAWWVHQHQVSRTAPTGE 586

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV-----RGEEEGMDDF 714
           YLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V     + +    DD 
Sbjct: 587 YLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHAEERKVAPVVTKDDTVNQDDG 646

Query: 715 EDSGHHKENSDIESEKD 731
           ED G     S  E EKD
Sbjct: 647 EDDGISLTGSGSEDEKD 663



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 95/197 (48%), Gaps = 37/197 (18%)

Query: 888  EGGKI---SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHK 944
            E GK+   +R QK K +K+K+KYGDQDEEER +R+ LL+S  K   N         S  K
Sbjct: 729  ESGKVRPMTRRQKHKAEKIKKKYGDQDEEERQLRLMLLSSKPKDTGNFEKKNMNEKSLEK 788

Query: 945  EKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEE 1004
             KK                + G ++ D  EH         +   + ++E AE   +  EE
Sbjct: 789  TKKNV--------------QDGKMT-DQYEH---------EGKALTIEEKAEHSTIPKEE 824

Query: 1005 ED-------IHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKII 1057
            ED       +  +  EE   LN    LT  PL  D+LLY + V  PY  +Q++KY+VK+ 
Sbjct: 825  EDDQLLEADMAVMDAEETKMLNS---LTWRPLDGDVLLYALVVVAPYQTMQNFKYKVKLT 881

Query: 1058 PGTAKKGKGIQIFYSLL 1074
            PGT K+GK  +   +L 
Sbjct: 882  PGTGKRGKAAKSAIALF 898


>gi|393907053|gb|EJD74501.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 1568

 Score =  484 bits (1246), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 287/737 (38%), Positives = 407/737 (55%), Gaps = 81/737 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  L+ L G R +NVYD+  KTY+ ++            EK  +++E
Sbjct: 1   MKNRFSTLDVFAVVHDLKELTGQRVANVYDVDSKTYLIRIQKPD--------EKCFIMLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+H T +   K   PS FT+KLRKHIR +RLE V QLG DRII  QFG   +A +VI
Sbjct: 53  SGCRIHRTTFDWPKAQFPSSFTMKLRKHIRHKRLECVTQLGVDRIIDMQFGFDEHACHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            ELY +GN++LTD+ +T+L +LR   D +  +    + RYP E  R              
Sbjct: 113 AELYDRGNVVLTDNNYTILNVLRPRTDKETDMRFSVQERYPLEAAR-------------- 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                              NVS  +K+ L         +  K + K           ++K
Sbjct: 159 ------------------QNVSCPTKDEL--------MERLKTAKKGE---------SVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L     YGP L EH +   G+  N ++     +E++    L  A+ +  D + +VI  
Sbjct: 184 RFLAPLTQYGPTLIEHSLRTVGVAQNAQIGVNIGMEESGAMKLFEAL-QLADQIFNVIRC 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +   +G+++ +             G   + Y EF P + +QF   +   F++F   +DEF
Sbjct: 243 N-AAQGFLVYRED-------ARMDGVIVETYQEFHPFMFSQFSDMQTKHFDSFSECVDEF 294

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +SK+E Q+A+ +    E  A  KLN +  DQ++R+  LK       +MAELIE N + VD
Sbjct: 295 FSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNSDLVD 354

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R A+AN++SWE +  M     +AGNP+A  I  L L  N M+LLL       D 
Sbjct: 355 KALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR------DP 408

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQI 539
               +  +KV +D+ALS++ NAR+ +  KK  + K++KTI A SKA K+ + K +  L +
Sbjct: 409 YRPEIDPKKVTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKVKIKETLNV 468

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +  K  A +   R+V WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+Y+HAD 
Sbjct: 469 VHSK--AEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADT 526

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            GASS +I+N      +PP TLN+A    V +S AW++K+ ++AWWV+ HQVS+TAPTGE
Sbjct: 527 RGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTSAAWWVHQHQVSRTAPTGE 586

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV-----RGEEEGMDDF 714
           YLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V     + +    DD 
Sbjct: 587 YLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHAEERKVAPVVTKDDTVNQDDG 646

Query: 715 EDSGHHKENSDIESEKD 731
           ED G     S  E EKD
Sbjct: 647 EDDGISLTGSGSEDEKD 663



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 95/197 (48%), Gaps = 37/197 (18%)

Query: 888  EGGKI---SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHK 944
            E GK+   +R QK K +K+K+KYGDQDEEER +R+ LL+S  K   N         S  K
Sbjct: 729  ESGKVRPMTRRQKHKAEKIKKKYGDQDEEERQLRLMLLSSKPKDTGNFEKKNMNEKSLEK 788

Query: 945  EKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEE 1004
             KK                + G ++ D  EH         +   + ++E AE   +  EE
Sbjct: 789  TKKNV--------------QDGKMT-DQYEH---------EGKALTIEEKAEHSTIPKEE 824

Query: 1005 ED-------IHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKII 1057
            ED       +  +  EE   LN    LT  PL  D+LLY + V  PY  +Q++KY+VK+ 
Sbjct: 825  EDDQLLEADMAVMDAEETKMLNS---LTWRPLDGDVLLYALVVVAPYQTMQNFKYKVKLT 881

Query: 1058 PGTAKKGKGIQIFYSLL 1074
            PGT K+GK  +   +L 
Sbjct: 882  PGTGKRGKAAKSAIALF 898


>gi|440797731|gb|ELR18808.1| isoform 2 of serologically defined colon cancer antigen 1 family
           protein [Acanthamoeba castellanii str. Neff]
          Length = 1138

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 291/773 (37%), Positives = 410/773 (53%), Gaps = 143/773 (18%)

Query: 5   RMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           R  + D++A  + LR +++G+R +NVYDL  KTY  KL             K  L+ ESG
Sbjct: 3   RFTSLDISAITRELREKVVGLRIANVYDLGKKTYQLKLAKPD--------HKQYLVFESG 54

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           VRLHTT + R+++  PS F LKLR+++RT+R+EDVRQLG DR+I    G G   H++I+E
Sbjct: 55  VRLHTTKFQRERQTVPSVFCLKLRRYLRTKRIEDVRQLGIDRVIDITIGSGEAQHHLIIE 114

Query: 124 LYAQGNILLTDSEFTVLTLLRSHR-----DDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           LYA GNI+L D  + + TL+RS++     DD+  VA+ +R  YP +  R    TT  +L 
Sbjct: 115 LYASGNIILVDKNYAIETLIRSYKTGEGTDDEVSVAVGTR--YPVDKARQLVPTTVDRLR 172

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             L S  E                                               RAK+ 
Sbjct: 173 EVLHSVPEEQ---------------------------------------------RAKE- 186

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +K VL   L  GP L EH +L   L P+ K+SE ++ +  A+   +           + 
Sbjct: 187 AVKDVLNRHLDLGPTLFEHCLLCADLKPHAKVSEYDEAKTEALHRAIQHA--------ES 238

Query: 299 ISGDIVPEGYILMQNKHL--------------GKDH------------------------ 320
           +  D   +GYI++++                 GKD                         
Sbjct: 239 LYSDPTLKGYIVLKDAKPDAAPAASAKALQGKGKDKETQPQPPPQQQQQQQEGRAEEEAQ 298

Query: 321 ---------------PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
                          P  E    ++++  F P +  QF  R  ++F +FD A+D F+SK 
Sbjct: 299 SPVVPATPAPQDAAKPDGEEDYDSRLFMMFVPYVYKQFEGRPRLEFPSFDEAVDIFFSKA 358

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           + Q+ E + + +E         +  D E R+  L +  +  +K A LIE N+ DVDAAI 
Sbjct: 359 QEQQVEVKKEQQE-------KTVKKDHETRIAALTKAEEECIKKAHLIETNVSDVDAAIK 411

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT 485
                LA  M W  L R+VKE +KAG+P+A LI  L    N ++LLL + L+   D    
Sbjct: 412 VTCSELARGMDWAQLTRVVKEAKKAGDPIANLIHSLDFANNRITLLLVDPLEAAADASGA 471

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
           +  +KVEVD+  +A+ANA+ +Y   K++  K  KT+ +   A KAAEKK R +I      
Sbjct: 472 M--QKVEVDIGQTAYANAQEFYAEAKRRAHKHAKTVASSQMAVKAAEKKARREIKDVGVK 529

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
           A I  +RK +WFEKF+WFISSENY+VISGRDAQQNE+IVKRY+ KGD YVHADLHGA++ 
Sbjct: 530 AAIQKVRKAYWFEKFHWFISSENYVVISGRDAQQNELIVKRYLRKGDAYVHADLHGAATC 589

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           V+KN  P++P+P LTL +AG  T+           TSAWWV+P QVSKTAP+GEYL  GS
Sbjct: 590 VVKNPHPDKPIPALTLAEAGSMTI----------PTSAWWVHPEQVSKTAPSGEYLVTGS 639

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
           FMIRGKKNFLPP  L+MGF  +F++D +S+ +H+NER VR   E + + E +G
Sbjct: 640 FMIRGKKNFLPPSQLVMGFAYMFKVDPTSVANHVNERAVRTLVE-LSELEGAG 691


>gi|403374308|gb|EJY87098.1| DUF3441 multi-domain protein [Oxytricha trifallax]
          Length = 1126

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 275/744 (36%), Positives = 441/744 (59%), Gaps = 88/744 (11%)

Query: 27  SNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKL 86
           +NVYD+S + Y+ KL        S  + K  LL+ESG+R+HTT + R+KK+ PSGF++KL
Sbjct: 23  ANVYDVSGRLYLLKL--------SKANRKEHLLIESGIRIHTTEFLRNKKDVPSGFSMKL 74

Query: 87  RKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSH 146
           RKH+RT++L ++ QLG DR+I  QFG G NA+++++ELYA GN++LTD E+T+L+LLRSH
Sbjct: 75  RKHLRTKKLCNITQLGVDRVIDLQFGQGENAYHILVELYASGNVILTDFEYTILSLLRSH 134

Query: 147 RDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD------ANEPDKVNEDGN 200
           + D+    I  + +YP      F       + +   S   PD        EP+K  E G 
Sbjct: 135 KFDETS-KIQIKEKYP------FTAAAGMTIDSIFVS---PDDIKRFIEGEPEK--EQGQ 182

Query: 201 NVSNASKENLGGQKGGKSFDL------SKNSNKNS----------------NDGARAKQP 238
              N +K  + GQ+     +       +++ NK                  +   + K+ 
Sbjct: 183 KEDNLNK--IEGQENNNEENAAAQPKPAEDKNKKGLSEKQQQKQDKKQKNQDKKDKKKEV 240

Query: 239 TLKTVLGEALGY-GPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            +K++L + + Y     +EH++   G  PN K +++ + +     VL+ A  + +  ++D
Sbjct: 241 NMKSILTKMVPYINFPYAEHVLKLLGQDPNAK-AQIEQSD-----VLIQAAMQCQQLVRD 294

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSST---------------------QIYDEFC 336
           + + + + +G+++   K + +   P  + ++                      ++  +F 
Sbjct: 295 LETSEEI-KGFLIYSEKPIEEKKVPVLTTTTAVALPQVEQLEQETEQDIKFKGKLLKDFG 353

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P+ L QF S   +++ +FD  +DE++S+++ QR + ++  KED  + K+++I  DQ  R+
Sbjct: 354 PIPLAQFASDPCLEYASFDQCVDEYFSQLDKQREQSKYSNKEDEIWKKMSRIKDDQAKRI 413

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             L++E D S   A+L++  + +V A I  ++V   + +SW D+ RMVKEE+KAGNP+A 
Sbjct: 414 QGLQKEQDLSEFKAQLLQKYIYEVQALIDILQVMQTSGISWNDIQRMVKEEKKAGNPLAD 473

Query: 457 LIDKLYLERNCMSLLL-SNNLDEMDDE-------EKTLPVEKVEVDLALSAHANARRWYE 508
           LI K+  E+N ++L+L + N ++ ++E       E   PV +V+VDL +SA  N R+++E
Sbjct: 474 LIYKMNFEKNSVTLMLDACNEEDAENEFAVDEKFENFDPVVRVDVDLHISAQMNIRKYFE 533

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           +KKK   K+ KT TA   AFK AE     +I++ +    I  MRKV+WFEKF+WFISSEN
Sbjct: 534 IKKKSYEKEVKTKTAADIAFKDAETNALKEIVKHRQTQKIDRMRKVYWFEKFDWFISSEN 593

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL ISG++AQ NE++VKRYM KGD+++H D+ GA+ T+IKN      VPP+TLN+A  F 
Sbjct: 594 YLCISGKNAQLNEVLVKRYMDKGDLFMHTDMPGAAVTIIKNPSG-LIVPPITLNEAAIFE 652

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +CHS+AW+ K+VTS +WV+  QVSKT PTG Y+  GSFMIRGK+N + P  L +GF ++F
Sbjct: 653 LCHSKAWEGKIVTSVYWVHADQVSKTPPTGLYIPTGSFMIRGKRNIMTPSKLELGFTIMF 712

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
            L+E S+ +H+ ERR R  +E MD
Sbjct: 713 TLNEESIANHMGERRPRLLQEEMD 736


>gi|110735863|dbj|BAE99907.1| hypothetical protein [Arabidopsis thaliana]
          Length = 329

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 237/333 (71%), Positives = 265/333 (79%), Gaps = 26/333 (7%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121 ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181 LTSS--KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           LT+   K+ DA + +             KE  GG+KGGK           SND   AKQ 
Sbjct: 181 LTAFVLKDHDAKQIE------------PKEQNGGKKGGK-----------SNDSTGAKQY 217

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK +LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218 TLKNILGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           I+G  VPEGYILMQ + L  D   +ESG   ++
Sbjct: 278 INGQKVPEGYILMQKQILAND-TTSESGGVKKV 309


>gi|195151655|ref|XP_002016754.1| GL21904 [Drosophila persimilis]
 gi|194111811|gb|EDW33854.1| GL21904 [Drosophila persimilis]
          Length = 966

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 374/1120 (33%), Positives = 546/1120 (48%), Gaps = 243/1120 (21%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R +T D+   V  L++L+G R + +YD+  KTY+F+L  +                 
Sbjct: 1    MKTRFSTYDIICGVAELQKLVGWRVNQIYDIDNKTYLFRLQGNG---------------- 44

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                      A  K   PSGF++KLRKH++ +RLE + QLG DRI+ FQFG G       
Sbjct: 45   ----------AWPKNVAPSGFSMKLRKHLKNKRLEKISQLGVDRIVDFQFGSG------- 87

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
                        D+ + VL  L      D+G  I++           FE TT   L    
Sbjct: 88   ------------DAAYHVLLELY-----DRGNLILTD----------FELTTLYIL---- 116

Query: 182  TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT-- 239
                        + + +G N+  A +E    ++     D  + S  +  D      P   
Sbjct: 117  ------------RPHTEGENIRFAVREKYPIERAKHQDD--EFSLDHLADLLEKAPPGVH 162

Query: 240  LKTVLGEALGYGPALSEHIIL----DTGLVPNMKLSEVNKLED----------------- 278
            L+ +L   L  GPA+ EH++L    +  ++P    S V+  E                  
Sbjct: 163  LRQILMPVLNCGPAVVEHVLLLHDLENRVMPQGTTSNVDGPEQPLKKAQNSKKQRKERNL 222

Query: 279  -------------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTES 325
                         N +  L +AV +  + ++D  +G+   +GYI+    H+ K+  P E 
Sbjct: 223  QNAKSEVKVFDMVNDLPTLKMAVKRALNLIKDGNNGE--SKGYII----HV-KEEKPIED 275

Query: 326  GSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
            G         EF P L  QF+  EF  FE+F  A+DEFYS  ESQ+ + +   +E  A  
Sbjct: 276  GKIEYFLRNIEFQPFLFAQFKDNEFSMFESFLEAVDEFYSTQESQKIDMKTLQQEREALK 335

Query: 384  KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
            KL+ +  D   R+  L +  D   K AELI  N   VD AI AV+ A+A++++W D+  +
Sbjct: 336  KLSNVKKDHAKRLEELTKVQDDDKKKAELITSNQSLVDNAIRAVQSAIASQLTWPDIHEL 395

Query: 444  VKEERKAGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKTLPVEKVEVDLALSAH 500
            VKE +  G+ VA  I +L LE N +SL+LS+   + +E D E+ T+    V+VDLALSA 
Sbjct: 396  VKEAQTNGDVVASSIKQLKLEINHISLILSDPYVSQNEKDCEDLTV----VDVDLALSAW 451

Query: 501  ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKF 560
            ANARR+Y+LK+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF
Sbjct: 452  ANARRYYDLKRSAAQKEQKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKF 511

Query: 561  NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
             WFISSEN+LVI GRDAQQNE+IVKRYM   D+YVHA++ GASS VI+N   E  +PP T
Sbjct: 512  YWFISSENFLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVVIRNTTGED-IPPKT 570

Query: 621  LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            L +AG   + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L
Sbjct: 571  LVEAGSMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHL 630

Query: 681  IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD--FEDSGHHKENSDIESEKDDTDEKPV 738
             MG  LLF+L+ES +  HL ER+VR     +DD  FE+S    + +D+   + + D +  
Sbjct: 631  TMGLSLLFKLEESFVARHLGERKVR----SIDDAPFENSFKQNDLTDMLLNEVNEDLE-T 685

Query: 739  AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
             + +S+P   H           D+ +FP  +  I +       D  R    P +      
Sbjct: 686  QQVVSIPEEDH---------RNDNSDFPNTEVKIEH-------DTGRITVKPNS------ 723

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHV--ERTATVRDKPYISKAERRKLKKGQG 856
                                     L+ EDK +  E T+ +   P   K +  K  K   
Sbjct: 724  -------------------------LNVEDKPITDEETSIILAGPSRKKQQNAKKNKENK 758

Query: 857  SSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERN 916
            +    P+++   +   DA     S V++          GQKGK+KKMK KY DQD+EER 
Sbjct: 759  ARSSHPEIKLSDKGSLDAEPSISSQVKR----------GQKGKIKKMKSKYKDQDDEERE 808

Query: 917  IRMALLASA--GKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKE 974
            IRM +L S+  GKV  N      ++ S ++E+KP    V  PK   +            +
Sbjct: 809  IRMMILNSSGKGKVCINTSKDVAKSVSANEEEKPKKIVVPNPKNQMEL-----------D 857

Query: 975  HPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDIL 1034
              DD   GV+                                 ++ ++ LTG P+  D L
Sbjct: 858  ENDDMPAGVD---------------------------------MDILNSLTGQPIEGDEL 884

Query: 1035 LYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            L+ IPV  PY A+Q YK++VK+ PGT K+GK  ++  ++ 
Sbjct: 885  LFAIPVVAPYQALQHYKFKVKLTPGTGKRGKAAKLALNIF 924


>gi|313211850|emb|CBY15998.1| unnamed protein product [Oikopleura dioica]
          Length = 699

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 277/763 (36%), Positives = 427/763 (55%), Gaps = 101/763 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A +  +R  L+     N+YD+  KTY+ KL   +         K +LL 
Sbjct: 1   MKTRFTVLDIKAALAEIRDNLLHHYVLNIYDIDSKTYLLKLRKCAS--------KHVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG--MNAH 118
           ESG R+H T     K   PSGF++KLRKH++ +RL +  QLG+DRII  QFG    ++  
Sbjct: 53  ESGNRVHPTEMEWPKNTAPSGFSMKLRKHLKGKRLINATQLGFDRIIDLQFGTSACLDEF 112

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++I+ELY +GNI+L D E+T+L LLR+  D         R  YP           A  L 
Sbjct: 113 HLIIELYDRGNIILCDQEYTILNLLRARTDKTTDERFAVRESYPV--------GQAQPLK 164

Query: 179 AALTSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
               S++E + N +P ++               G +K  K+  ++K              
Sbjct: 165 EPFLSTEELEENIKPPQIQ--------------GNKKKNKNLTIAKQ------------- 197

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
                 L   LGYG  L EH +++ GL   +  + V++  D  ++ L       ++  + 
Sbjct: 198 ------LNSCLGYGTDLIEHFLIEEGL--EVATASVSQDADEILECL-------QNCYEF 242

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           + SG    +G+I             T++  +   Y ++ P L NQ +    ++ E F  A
Sbjct: 243 LNSGKTKFQGFI------------STKTNDNVLQYVDYQPFLFNQSQLDSTIELEKFSLA 290

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +D+FY +I+SQ+AEQ+    E +A  KL  + +D   R+ +LK     +V+ A+LIE NL
Sbjct: 291 VDKFYGEIQSQKAEQKMMQAEKSAMKKLENVKLDHMKRLESLKLAQADNVRKAQLIEMNL 350

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + VD+A+  VR A+A+++ WE++   ++E +  G+PV+  I +L L+ N + ++LS  + 
Sbjct: 351 DLVDSALNQVRSAVASQIGWEEIEDFLEEGQDEGDPVSIAIRELKLKTNQIVMMLSEPMY 410

Query: 478 EMDDE--------------------EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ 517
           +  D                     E +  +  + +DL+LSA  NA+ +Y+ K+    K+
Sbjct: 411 DDSDSSSEEEENPSESEYTKSARVTEGSEIIIYIFLDLSLSAFGNAKAFYDSKRAAADKE 470

Query: 518 EKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
            KTI A  KA K+AEKKT   +   +TV  ++ +RK  WFEKF WFISSENYLVI+G+DA
Sbjct: 471 SKTIDASKKALKSAEKKTNESLKNIQTVRQVTKVRKQMWFEKFFWFISSENYLVIAGKDA 530

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           QQNE IVK+Y+  GDVYVHAD+HGASS ++KN  P +PV P+TL++ G   VCHS AW++
Sbjct: 531 QQNETIVKKYLKNGDVYVHADIHGASSCIVKNIDPSKPVSPVTLHEVGHAAVCHSAAWNA 590

Query: 638 KMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           K++TSAWWV+ +QVSKTAP+GEYL+ GSFMIRGKKN+LPP  L++GFG LF+LD++ +  
Sbjct: 591 KVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLDDACVAR 650

Query: 698 HLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
           H  ER+++G    ++D E+    KE S++   K++ + +P  E
Sbjct: 651 HAGERKIKG---LVNDVEE----KEQSELGEIKEENENEPQLE 686


>gi|28416669|gb|AAO42865.1| At5g49930 [Arabidopsis thaliana]
          Length = 324

 Score =  461 bits (1185), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 232/328 (70%), Positives = 260/328 (79%), Gaps = 26/328 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           MNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLMESGVR
Sbjct: 1   MNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLMESGVR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           LHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYVILELY
Sbjct: 61  LHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYVILELY 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS- 184
           AQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +LT+  
Sbjct: 121 AQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQESLTAFV 180

Query: 185 -KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
            K+ DA + +             KE  GG+KGGK           SND   AKQ TLK +
Sbjct: 181 LKDHDAKQIE------------PKEQNGGKKGGK-----------SNDSTGAKQYTLKNI 217

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
           LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+I+G  
Sbjct: 218 LGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDIINGQK 277

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQI 331
           VPEGYILMQ + L  D   +ESG   ++
Sbjct: 278 VPEGYILMQKQILAND-TTSESGGVKKV 304


>gi|339260826|ref|XP_003368211.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316964832|gb|EFV49764.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 749

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 274/704 (38%), Positives = 390/704 (55%), Gaps = 72/704 (10%)

Query: 54  EKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL 113
           +KV+++ ESG+RLH+T Y   K   PSGFT+KLRKH+R +RLED+  +G DRI+  +FG 
Sbjct: 5   KKVMIIFESGIRLHSTEYGWSKNIMPSGFTMKLRKHLRDKRLEDISVVGLDRIVDMRFGN 64

Query: 114 GMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE--R 171
           G  A ++I+ELY +GN++LTDSE+ +L +LR+   +   V    R  Y  E+ R FE  R
Sbjct: 65  GPTACHLIIELYDRGNVVLTDSEYVILNILRARTIETDNVRYAVRETYLVEV-REFEEYR 123

Query: 172 TTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND 231
            TA +                    E  N + +A                          
Sbjct: 124 RTADE--------------------EMANRLLHAC------------------------- 138

Query: 232 GARAKQP--TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                QP  TL   L     YGP L EH +L+  L   MK+  V   +     + +    
Sbjct: 139 -----QPGDTLHKCLVPHFPYGPLLLEHCLLENKLSLRMKVQAVIGDQSLVSALALSLSL 193

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
            FE  L + I  +    GY+ M  +          +G   +I+ EF P   +QF S E  
Sbjct: 194 AFE--LFEKIRKE-PSRGYLKMTVEE-------NAAGERIEIFHEFHPYFFSQFASSECK 243

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F+TF+ A+DE++SK++SQ+ +Q+   +E AA  +L  +  D E R+  L+ +     +M
Sbjct: 244 QFDTFNGAVDEYFSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERM 303

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           A  +E N E V+ A+  +R A+A ++ W  +  M+++ R  G+PVAG I  L LERN   
Sbjct: 304 AVAVELNSETVEQALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFV 363

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           + L  ++ + D E        VE+DLALS+H N+RRW+   K+   KQ+KTI A  KA K
Sbjct: 364 MRLPVDVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAAGGKALK 423

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           +AE  T+ Q+   +   NI  +RK+ WFEKF+WF SS+  LVI+GRDA+QNE++VKRY+ 
Sbjct: 424 SAELHTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEILVKRYLK 483

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            GD+YVHADL GA+S VIK    + P+PP TLN+A    VC S AW+SK+VTSAWWV   
Sbjct: 484 PGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSAWWVKHD 543

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----RV 704
           QVSK+AP+GEYL  G FMIRGKKN+L    L+MGFGLLFRLD  S   HL +R      +
Sbjct: 544 QVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAARHLEKRCQAEDEL 603

Query: 705 RGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV-AESLSVPNS 747
            GEE   D+ +D    K+   + SE  +     V +E  S P++
Sbjct: 604 DGEEANCDNLQDE-QKKQKKLVRSELSEQSFNSVNSEEFSYPDN 646



 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 65/120 (54%), Gaps = 12/120 (10%)

Query: 965  AGHLSKDCKEHPDDSSHGVEDNPCVGL-DETAEMDKVA---MEEEDIHEIGEEEKG---- 1016
            A HL K C+   +D   G E N C  L DE  +  K+    + E+  + +  EE      
Sbjct: 590  ARHLEKRCQ--AEDELDGEEAN-CDNLQDEQKKQKKLVRSELSEQSFNSVNSEEFSYPDN 646

Query: 1017 -RLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
              L+ V  LTGNP   D +L+ +PVC PY+A+ +YK++VK+ PGT KKGK I+    L +
Sbjct: 647  ETLDAVQCLTGNPTEDDNILFALPVCAPYAALTNYKFKVKLTPGTTKKGKAIKTAIDLFM 706


>gi|219109751|ref|XP_002176629.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411164|gb|EEC51092.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1238

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 336/989 (33%), Positives = 499/989 (50%), Gaps = 123/989 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDL-SPKTYIFKLMNSSGVTESGESE----- 54
           VKVR +  DV A V  + RRL+G +  NVYD  + +TY+FKL +S G T S  +      
Sbjct: 12  VKVRFDGLDVTAMVSHVQRRLLGRKIINVYDGDNGETYVFKLDSSGGTTISNNNNNTSNS 71

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           K  LL+ESG+R H   +       P+ F  KLRKH+R  RLE + Q+G DR+IL QFG G
Sbjct: 72  KEFLLLESGIRFHPLEHFESNLPMPTPFCAKLRKHLRGLRLEQISQIGTDRVILLQFGSG 131

Query: 115 MNAHYVILELYAQGNILLTDS-EFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTT 173
            + H +ILELYA+GNI+LT+   +T+L LLRSH  +   VA+     YP       ++  
Sbjct: 132 ASRHALILELYAKGNIILTEGIHYTILALLRSHVYEKDQVAVQVGQVYPVTYATSVQKDN 191

Query: 174 ASKLHAALTSSKEPDANEPDKVN---------EDGNNVSNASKENLGGQKGGKSFDLSKN 224
            +  +A   +  +P+ N+P   +         ++ N + N S E +  Q           
Sbjct: 192 QTVANAVAATDTQPE-NDPSPTSRIMDTACAAKNKNGILNMSIEEI--QASLALLLEPAP 248

Query: 225 SNKNSNDGARAKQPTLKTVLGE----ALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA 280
            +  +  G +     LKT+L +       YGPAL EH IL   L+P+  + E        
Sbjct: 249 VSATTKKGKKGSPLNLKTLLLQPQWGVSQYGPALLEHCILQANLLPHASIKET------- 301

Query: 281 IQVLVLAVAKFEDW-----------LQDVISGDIVPEGYILMQNK---HLGKDHPPTESG 326
               VL  A +E             + ++ S  I   GYIL Q +    +    P +E+ 
Sbjct: 302 ----VLQAADWERLQTSLSEQGPAIMYNLHSAAIDTPGYILYQPRVEEDIVNGKPHSENL 357

Query: 327 SST------------QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQH 374
           SS             ++  EF P LL Q ++   ++++ F AA+ +F++ + +Q+   + 
Sbjct: 358 SSAVAVVAKELAHADKVLLEFQPHLLAQHQNCPRLEYKHFGAAVADFFAHMVAQKRLLKV 417

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
           +A E A   KL K+  DQ +RV  L+++       A++++ N E+VD A+L +  AL + 
Sbjct: 418 QASEMAVQEKLRKVQQDQADRVMALERDQQTLQAYAQVVKNNAENVDKALLVINSALDSG 477

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-LDEMDDEEKTLPVEKVEV 493
           M W+ L  +V  E+   NP+A LI +L LE   M L L  +  DE+ D      V  V V
Sbjct: 478 MDWDQLIELVSVEQANRNPIANLIVRLELENEIMILRLPRDPFDELSD------VLNVNV 531

Query: 494 DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ-----EKTVANI 548
            L  SAHANA   +   +  + K +KT+ + SKA +AAE+  + Q+++     ++TVA +
Sbjct: 532 SLKDSAHANASALFAKYRASKEKTQKTLESSSKALQAAEESAQRQLIEAQRRTKQTVAAV 591

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
              RK  W+EKF+WF++S+NYLV+ G+DA QNE++VKRY+  GD Y+HA++HGA+S +++
Sbjct: 592 K--RKPAWYEKFHWFVTSDNYLVLGGKDAHQNELLVKRYLRAGDAYLHAEVHGAASCILR 649

Query: 609 NHRPEQP-----VPPLT---LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
             R   P       PL+   L +AG FT+C S AW S+MVTSAWWV  HQVSKTAP+GE+
Sbjct: 650 AKRRRLPNGATQSIPLSDQALREAGNFTICRSSAWASRMVTSAWWVESHQVSKTAPSGEF 709

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERRVRGEEEGMDDFEDSGH 719
           LTVGSFM+RGKKNFLPP PL MG  +LFRL D+ S+  H  ERR         DF     
Sbjct: 710 LTVGSFMVRGKKNFLPPSPLEMGLAVLFRLGDDDSIARHKTERR---------DF----- 755

Query: 720 HKENSDIESEKDDTDEKPVAESLSV-PNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDS 778
               + IE E    D      S  + P +       T   +   HE        S+ +  
Sbjct: 756 ----ALIELENSSVDVLDAVSSFQMEPKTNIEGQEATTHRDTTEHEG-------SDLVSD 804

Query: 779 KIF-DIARNVAAPVTPQLEDLIDRAL----GLGSASISSTKHGIETTQFDLSEEDKHVER 833
           +++  + + + +  T   E+LI+         GS      K G  T + +     K +  
Sbjct: 805 EVWMTLPKVIVSNSTSSAENLINDPTRDDGSCGSDGNEEAKKGSTTNEGNGRRTKKGLSV 864

Query: 834 TATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS 893
               + K Y S  E RKL     S+V   K   E   G+    QP        I+  K+ 
Sbjct: 865 KERKQMKKYGSLGEARKLH----STVAVDKSSTEDTHGQ----QPVLPSLDGLIDASKLK 916

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           RG++ K K+   KY DQD+E+R + M  L
Sbjct: 917 RGKRAKAKRAMLKYMDQDDEDRELAMLAL 945



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 2/79 (2%)

Query: 999  KVAMEEEDI--HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKI 1056
            K  MEEE +   ++ E+      ++  L+G P   D++LY +PVC PY  +  Y YRVK+
Sbjct: 1103 KQTMEEEGVVGSDLDEDAVDDTIELSKLSGMPQAEDLVLYAVPVCAPYQTLSKYTYRVKL 1162

Query: 1057 IPGTAKKGKGIQIFYSLLL 1075
             PG+ K+GK ++    + L
Sbjct: 1163 TPGSTKRGKAVKQCVDMFL 1181


>gi|332237024|ref|XP_003267700.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Nomascus leucogenys]
          Length = 1058

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 277/746 (37%), Positives = 401/746 (53%), Gaps = 119/746 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNVMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +  ++                            AK   L
Sbjct: 160 --------AAEPLLTLERLTEIVAST----------------------------AKGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 --------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWYE 508
                         N  E    +K     K            V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHW--FEKFNWFISS 566
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+W  F K    +S 
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWXVFSKLLGRLSQ 538

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           EN+L   G D Q+ E+++                     VI    P +P+PP TL +AG 
Sbjct: 539 ENHLNPGGEDLQRTEVLI------------------LCIVI----PGEPIPPRTLTEAGT 576

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 577 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 636

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 637 LFKVDESCVWRHRGERKVRVQDEDME 662



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 93/176 (52%), Gaps = 19/176 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG-KVQKNDGDPQNENASTHKEKKPAI 950
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG   ++     +         KK   
Sbjct: 850  MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGSNKEEKGKKGKKGKTKDELVKKQPQ 909

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
             P    +V    KK     +         +H ++D     +D+  + DK   EE+D+ + 
Sbjct: 910  KPRGGQRVSDNIKKETLFLEVI-------THELQD---FAVDDPHD-DK---EEQDLDQQ 955

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 956  GNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 1007


>gi|407928362|gb|EKG21221.1| protein of unknown function DUF814 [Macrophomina phaseolina MS6]
          Length = 1094

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 347/1129 (30%), Positives = 547/1129 (48%), Gaps = 192/1129 (17%)

Query: 21   LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
            L  +R +N+YDLS + ++FK    +         +  LL++SG R H T++AR    TPS
Sbjct: 21   LCSLRVANIYDLSTRIFLFKFQKPN--------HREQLLIDSGFRCHLTSFARSTPATPS 72

Query: 81   GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
             F ++LRK ++TRR+  + Q+G DRII  QF  G+  + + LE YA GNI+LTD+E  +L
Sbjct: 73   PFVVRLRKFLKTRRVTSITQIGTDRIIELQFSDGL--YRLYLEFYAGGNIILTDNELNIL 130

Query: 141  TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            +LLRS    D+G                +ER      +           N  ++ N  G 
Sbjct: 131  SLLRSV---DEGPE--------------YERVKVGIKY-----------NLTERQNYGG- 161

Query: 201  NVSNASKENL--GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG-YGPALSEH 257
             V   +KE +  G QK      L +          +  +  L+  L  ++    P L +H
Sbjct: 162  -VPELTKERVREGLQKA-----LDRQQEATDKKAKKRGKDALRKALAVSITELPPMLVDH 215

Query: 258  IILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
                TG   ++K  +V  LED ++   L+ A+A+ ++   ++ S +I  +GYI+   K  
Sbjct: 216  AFASTGFDSSLKPEQV--LEDESLLDNLMKALAEAKNVDAEITSAEIA-KGYIVA--KKT 270

Query: 317  GKDHPP--TESGSSTQ------IYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKI 365
            G+  P   +E GS  +      +Y++F P    QF +     F++FE F+  +DEF+S I
Sbjct: 271  GQPAPTEVSEEGSEEKAPAEKLLYEDFHPFKPKQFEADPTLTFLEFEGFNKTVDEFFSSI 330

Query: 366  ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
            E Q+ E + + +E+ A  KL +   +   R+  L++  + +++ AE I+ N++ V  A++
Sbjct: 331  EGQKLESRLQEREENAKRKLEQAKQEHLKRLGGLQRAQELNIRKAEAIQANVDRVQEAVM 390

Query: 426  AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDE-- 482
            AV   +   M W ++ R+++ E+  GNPVA +I   L L  N ++LLL     E D+E  
Sbjct: 391  AVNGLIDKGMDWIEIDRLIEREQTHGNPVAQMIKVPLKLRENTVTLLLDEPGVEEDEEDF 450

Query: 483  -----------------EKTLPVEK-------VEVDLALSAHANARRWYELKKKQESKQE 518
                             ++  P  K       +++DL LS  ANA+ +++ KK   +K+E
Sbjct: 451  EGSETESEPSDDEEEQQQRKKPAVKPQDNRLTIDIDLGLSPWANAKTYFDQKKTAAAKEE 510

Query: 519  KTITAHSKAFKAAEKKTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
            +T+ A  KA K+ +KK    +     QEK +  +  +RK  WFEKF +FISS+ YLVI G
Sbjct: 511  RTLEASQKALKSTQKKIEADLKKGLKQEKEL--LRPVRKQFWFEKFIYFISSDGYLVIGG 568

Query: 575  RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHS 632
            +DAQQNE++ +R++ KGD+YVHADL  A+  +IKN    P+ P+PP TL+QAG  +V  S
Sbjct: 569  KDAQQNEILYRRHLKKGDIYVHADLSAAAVVIIKNRPSTPDDPIPPSTLSQAGNLSVSTS 628

Query: 633  QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
             AWDSK V SAWWV   QVSKT  +GEYL  G F+I GKKNFLPP  L++GF ++F++ E
Sbjct: 629  TAWDSKAVMSAWWVNADQVSKTTSSGEYLAAGGFVINGKKNFLPPAQLLLGFAVMFQITE 688

Query: 693  SSLGSH-------------------LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
             S  +H                    +E   + E  G DD  DS     ++ +ES  D++
Sbjct: 689  ESKKNHNKHRLAEANMASKPAAPQPTHEEASKEETVGQDDASDSDEDFPDAKLESASDES 748

Query: 734  DEKPVAESLSV-PNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVT 792
            D +    S  +  N    A    + S  +  E   E    S G++       +    P+ 
Sbjct: 749  DNEQHQRSNPLQSNGVADAADEGSGSGSELEEAAEEQPQTSEGVEG-----VKEEPLPLA 803

Query: 793  PQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLK 852
            P                       +E     + +E K V +      K ++S  ERR L+
Sbjct: 804  P-----------------------VEEAGEQIHQEPKKV-KQEKAGGKRHLSARERRLLR 839

Query: 853  KGQGSSVVDP------KVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
            KG   S +        + + E +    A ++  + V   K +   + RG++GK KKM  K
Sbjct: 840  KGVNPSELTTAGGSANESDDEDDAVSVAPTEATTQVSSQKSKQTPLPRGKRGKAKKMALK 899

Query: 907  YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVC-----YK 961
            Y +QDEEER + + LL  A    K   +P+    S  +E       + A K        +
Sbjct: 900  YAEQDEEERELALRLLG-AKPTGKESAEPEKPKPSVQEE-------LQAQKQRRREQHQR 951

Query: 962  CKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDV 1021
             ++ G   ++ +    + + G +++P V                      +EE  RL  V
Sbjct: 952  AQEKGKAEEERRRAALEGALGEDNDPAV----------------------DEEIQRLESV 989

Query: 1022 --DYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
              D  TG PL  D L+  IPVC P+SA+ +YKY+VK+ PG  KKGK ++
Sbjct: 990  GLDAFTGRPLAGDELVAAIPVCAPWSALATYKYKVKLQPGAQKKGKAVK 1038


>gi|119586149|gb|EAW65745.1| serologically defined colon cancer antigen 1, isoform CRA_e [Homo
           sapiens]
          Length = 628

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 270/686 (39%), Positives = 386/686 (56%), Gaps = 102/686 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             V++A K  L                             L
Sbjct: 165 LTLERLTEI------------VASAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQV 651
              +C+S AWD++++TSAWWVY HQV
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQV 620


>gi|330929686|ref|XP_003302734.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
 gi|311321722|gb|EFQ89181.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
          Length = 1133

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 359/1140 (31%), Positives = 553/1140 (48%), Gaps = 161/1140 (14%)

Query: 2    VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
            +K R ++ DV A  +   +L  +R +NVYDLS + ++ K              +  LL++
Sbjct: 1    MKQRFSSLDVKATHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLID 52

Query: 62   SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            SG R H T YAR    TPSGF  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + + 
Sbjct: 53   SGFRCHLTEYARTTAGTPSGFVAKLRKYLKTRRITSVAQIGTDRILEFQFSDGL--YRLY 110

Query: 122  LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLH 178
            LE YA GNI+LTD+E  VL+LLR+  + ++   +    +Y   I + +      T  ++ 
Sbjct: 111  LEFYAGGNIVLTDAELNVLSLLRNVDEGEEHEKLRVGLKYNLTIRQNYGGAPELTKERVR 170

Query: 179  AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             AL  + +   N+P+   +     S                                   
Sbjct: 171  QALQKAVDRQQNQPEATGKKAKKASKD--------------------------------- 197

Query: 239  TLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            +L+  L  ++   P  L +H +        +K  EV  + D+++   +++V +    + D
Sbjct: 198  SLRKALAVSITECPPLLVDHALHVANFDSTLKPEEV--IADDSLMEKLVSVLQDARKITD 255

Query: 298  VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFD 355
             I+     +GYIL +       +    S  S  +YD+F P    QF + +  F++F+ F+
Sbjct: 256  EITTADQIKGYILAKPNPSAPTNVDESSDKSRLLYDDFHPFRPQQFENSDYTFLEFDGFN 315

Query: 356  AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
             A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+  L+Q  + + + AE I  
Sbjct: 316  KAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILA 375

Query: 416  NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-- 472
            N+  V  A  AV   +   M W D+AR+++ E+ +GN VA LI   L L  N ++LLL  
Sbjct: 376  NVHRVTEATEAVNGLIRQGMDWVDIARLIEREQNSGNAVAQLIKLPLKLNENTITLLLDE 435

Query: 473  ---------------SNNLDEMDDEE----------KTLPVE-------KVEVDLALSAH 500
                           ++++ E  DEE          K+ PV+        +++DL+L+A 
Sbjct: 436  TNWEEGQEVEDEGNETSSVSEDSDEEAAGEEDGAKKKSAPVKVSARPQLAIDIDLSLTAW 495

Query: 501  ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
            AN+  +++ KK   +K+++T+ A ++A K+ EKK     +  + QEK V  +  +RK HW
Sbjct: 496  ANSTEYFDQKKTAANKEDRTLQASTRALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQHW 553

Query: 557  FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
            FEKF +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ 
Sbjct: 554  FEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDA 613

Query: 615  PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            P+PP TL+QAG   +C S AWDSK V SAWWV   QVSKT  TGE+L  G F ++GKK F
Sbjct: 614  PIPPSTLSQAGNLCICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNVKGKKEF 673

Query: 675  LPPHPLIMGFGLLFRLDESSLGSHLNER---RVRGEEEGMDDFEDSGHHKENSDIESEKD 731
            LPP  L++G  ++F + ESS  +H   R         E +D+  D     + +  ++  D
Sbjct: 674  LPPAQLVVGLAVMFEISESSKANHQKHRIQETAVSAAEMVDEATDETKAADATKTDNSDD 733

Query: 732  DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVA--- 788
            D D  P A+  S      P      A   D+    A  +  SN + S+  D AR+ +   
Sbjct: 734  DED-FPDAKIESDSEDDFPDAKMGQAEESDAESEAAAPR--SNPLQSRRTD-ARDESDDE 789

Query: 789  --APVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKA 846
               PV  + ++    A+  G    S+ +   E T    S  D   E+T+    +  +S  
Sbjct: 790  DEPPVAHKGDEF---AMSGGRNGSSANEEPQEDTG---SVAD--TEQTSKSTGRRQLSAR 841

Query: 847  ERRKLKKGQGSSVVDPKVEREKERG------KDASSQPESIVR-KTKIEGGKISRGQKGK 899
            ERR  +KGQ   +  P+V  +          +D SS  E   + + K+ G   S+G K K
Sbjct: 842  ERRLARKGQLPEL--PQVPSDAAPAADDAAHEDGSSAEEGSAKTRGKVPGTATSQGTKQK 899

Query: 900  -----------LKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKP 948
                        KK   KY  QDEE+R + M LL S        G    E A+  K +K 
Sbjct: 900  NTPLPRGKRAKAKKQAAKYAAQDEEDRELAMRLLGS------KSGQQAAEAAAQEKRQKE 953

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
              +  D  +     ++  HL          +    E+     L+            ED  
Sbjct: 954  EQAQADKQR-----RREQHLRAQA------AGKAAEEARLRALENA----------EDDD 992

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            E  E  K +L ++D  TG PLP+D +L  IPVC P+SA+ SYKY+ KI PG+ K+GK ++
Sbjct: 993  EGDEVLKTKLQNLDAFTGRPLPNDEILSAIPVCAPWSALSSYKYKAKIQPGSTKRGKAVK 1052


>gi|256080624|ref|XP_002576579.1| hypothetical protein [Schistosoma mansoni]
 gi|353229334|emb|CCD75505.1| hypothetical protein Smp_052790 [Schistosoma mansoni]
          Length = 1009

 Score =  444 bits (1141), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 262/725 (36%), Positives = 402/725 (55%), Gaps = 69/725 (9%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K+   T DV   V  ++ R++G R +N+YD+  KTY+ KL ++         +K +LL+
Sbjct: 1   MKLLYTTFDVMVSVSEIKNRILGYRVNNIYDVDNKTYLLKLASTKS------DDKTILLL 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RLH T +   K   PSGF++KLRKHIR +++ D+ Q+G DR++    G   +A+++
Sbjct: 55  ESGSRLHITDFDWPKNIMPSGFSMKLRKHIRNKKIVDISQIGADRVVDIHIGYESSAYHL 114

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR-VFERTTASKLHA 179
           I+ELY +GN+LLTD  FT+L LLR   D ++ +   +  +YPT  CR + E     K   
Sbjct: 115 IVELYDRGNMLLTDESFTILHLLRPRTDKNQNIRFAAHEKYPTTSCRQILECFRDLKDQK 174

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
           +L                   ++ N         KG  +      SN  + D      P+
Sbjct: 175 SL------------------KDIENFLIPLFQSSKGPWT------SNPQTCDS-----PS 205

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV-NKLEDNAIQ-----VLVLAVAKFED 293
           +   L   L YG  + EH +     V   K+ ++ N  ED  +Q     ++ L V  F  
Sbjct: 206 INKTLSSELPYGNVIIEHCMR----VAQNKIKQMRNHKEDFQLQSEKTDLIELYVEHFAV 261

Query: 294 WLQDVI------SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE 347
            L+D++           P GYI       GK +  ++ G     Y+EF P +  Q+R + 
Sbjct: 262 VLRDILLEPFLCDRQATPHGYIF------GKSYQSSDEGLRN--YEEFHPFMFEQYRDKP 313

Query: 348 FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
            + F++F+ A+D ++SKIESQ+  +Q    E  A  K+  I  DQE R+  LK E +  +
Sbjct: 314 HLAFDSFNKAVDAYFSKIESQKTLEQISRNEQKASRKVENIKKDQERRLMLLKTEQELDM 373

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + A L+E N   VD  I+ +  AL+N++ W++L  +V++ ++  +P+A  I +L L+ + 
Sbjct: 374 RKAYLLEANRRLVDNIIIMINHALSNQIDWKELELIVEDAKQRDDPLACHIVELKLQTSQ 433

Query: 468 MSLLLSNNLDEMDDEEKTL-------PVEKVEVDLALSAHANARRWYELKKKQESKQEKT 520
             + L +  +   D ++TL          +V VD+ ++A  NAR++Y+ K+    K+EKT
Sbjct: 434 AVIRLKDPFESSSDVDETLVRSGNKDEYTEVVVDIDVNALTNARKYYDKKRAASKKEEKT 493

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           I    K  K+A     +++   KTVA I+ +RK  WFEKF WFISSENYLV++G D+QQN
Sbjct: 494 INVSRKVLKSAIHNAEIKMKTAKTVAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQN 553

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIK-NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           E++VKRY+  GD++VHAD+HGAS+ +IK  H   + V      +AG   V  S AW S +
Sbjct: 554 EVLVKRYLKPGDLFVHADIHGASTVIIKARHLTSEEVDSPNHQEAGNMAVVLSSAWQSHV 613

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           +T AWWV+  QVSKTAP+GEYLT G+FMIRGKKN+LPP P   GFG++F+L E S+  H 
Sbjct: 614 LTRAWWVHHDQVSKTAPSGEYLTSGAFMIRGKKNYLPPCPFDYGFGIMFKLHEDSIAKHK 673

Query: 700 NERRV 704
            ERR+
Sbjct: 674 GERRI 678



 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 116/220 (52%), Gaps = 23/220 (10%)

Query: 863  KVEREKERGKDASSQPESIVRKTKIE------GGK--ISRGQKGKLKKMKEKYGDQDEEE 914
            KV++ K   K A+   E+I  K K+        G+  + RGQK K+KK+K+KY +QD+EE
Sbjct: 746  KVDKLKPAKKTANLNRETIEAKEKVNEPLLPSAGQPILKRGQKAKIKKIKQKYKEQDDEE 805

Query: 915  RNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDA----PKVCYKCKKAGHLSK 970
            RN+RM +L      Q +D  P   +    ++    +  +       +V Y      +   
Sbjct: 806  RNLRMKIL------QGDDAKPSQYHQILERDNLSNLIKIPQCVLDTQVVYNSDSIQNNQP 859

Query: 971  DC--KEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE-IGEEEKGRLNDV-DYLTG 1026
            DC   E  +DS+  +E+N  V  DE+ E++ V   + D +E +  E K  L  + + LTG
Sbjct: 860  DCDNNESFNDSNSEIENN-SVKSDESEEVNHVKSNDNDDNEDMPVESKDDLTSLLNSLTG 918

Query: 1027 NPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             P   D+LLY IPVC PYS +  YK+RVK+ PGT K+GK 
Sbjct: 919  QPNDDDLLLYAIPVCAPYSVLLKYKFRVKLNPGTTKRGKA 958


>gi|341901167|gb|EGT57102.1| hypothetical protein CAEBREN_19463 [Caenorhabditis brenneri]
          Length = 920

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 264/706 (37%), Positives = 383/706 (54%), Gaps = 81/706 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLQGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHDWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELVFGTEDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD E T+L +LR   D D  V    R +Y                    
Sbjct: 113 VELYDRGNVVLTDHELTILNILRVRTDKDTSVRWAVREKY-------------------- 152

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T ++E                   S+E    + G   F+    +     DG   K+  L 
Sbjct: 153 TFTEE------------------ISEETANSRHGKFKFEDFAKAVSAIPDG---KEEQLG 191

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ----- 296
            ++ +    G  +++ I+   GL    K+S  NK + + I        KFED L+     
Sbjct: 192 RIVSQFTRCGNPVTKEILCKCGLKAEQKIS--NKSDLSGI------TEKFEDILKATEEI 243

Query: 297 -DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            +++  +  P+G I            P+ + +  Q+Y EF P+ +    S+   +  +F 
Sbjct: 244 WEMVEEN--PKGVI-------SYTEVPSPTSAPIQLYQEFNPIPM-PLTSKFTKELPSFC 293

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            ++DEFYS+IE+Q+ EQ+    E  A  KL  +  DQ++R+  L+   ++   MA  I  
Sbjct: 294 ESVDEFYSRIETQKQEQKAINMEKQALKKLENVEKDQKDRIEALQMTQEQREHMANRIIL 353

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+ A+L +R ALAN+ SW+ +  M K   K G+PVA  ID    E N   + L   
Sbjct: 354 NQELVEKALLLIRSALANQFSWQTIEEMKKTAAKNGDPVAKSIDSFKFESNEFVMTLG-- 411

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
            D  DDE + L   KV +D++++A  NA+R +  KK    K +KT+ +  KA K A++K 
Sbjct: 412 -DPYDDEAEIL---KVPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKA 467

Query: 536 RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + Q K V  +   RK  WFEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+
Sbjct: 468 KSTLEQVKIVTEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYM 527

Query: 596 HADLHGASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           HAD+ GASS VI+N   E  Q +PP TL +A    VC+S AW++ +  SAWWV P QVS+
Sbjct: 528 HADVRGASSVVIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVRPEQVSR 587

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           TAPTGEYL  GSFMIRGKKNF+PP  L+MG G+LFR+DE S+  H+
Sbjct: 588 TAPTGEYLPSGSFMIRGKKNFMPPSQLVMGLGVLFRMDEESIERHV 633



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 27/51 (52%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            LT  PL  D LL+ +PV  PYSA+ +YKYRVKI PG  K+GK     I++F
Sbjct: 831  LTAQPLDEDTLLFAVPVVAPYSALSTYKYRVKITPGIGKRGKATKSAIELF 881


>gi|348681953|gb|EGZ21769.1| hypothetical protein PHYSODRAFT_557667 [Phytophthora sojae]
          Length = 1063

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 283/770 (36%), Positives = 425/770 (55%), Gaps = 116/770 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL-------SPKTYIFKLMNSSGVTESGE 52
           M K RM+  D+ A V  +R  +  MR +N+YD+       + KTYI KL           
Sbjct: 1   MKKTRMSIDDIRAMVGSIRANVQNMRVTNIYDVQGQGESGAAKTYILKLHQPP------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKN---TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
             KV LL+ESGVR HT+ YARD K     PS FT+KLRKH+R +RL  +RQL  DR++ F
Sbjct: 54  FPKVFLLLESGVRFHTSKYARDAKAGSALPSQFTMKLRKHLRGKRLSGLRQLEGDRVVDF 113

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FG      ++ILELYA GNI+LTD ++ +L+LLR+HR D+  V +  +  YP ++    
Sbjct: 114 TFGQDALQCHLILELYASGNIVLTDGDYRILSLLRTHRFDE-NVKMAVKQVYPVQLLGDQ 172

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           E+  A +  A L                       A  +    ++  K+        +  
Sbjct: 173 EKQRAIQTPAQLA----------------------AFVDKWFVEQEAKAAVALPGKTQKK 210

Query: 230 NDGARAKQPTL--KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVL 284
                 KQ  L  ++  G   G GP + EH ++  G+ P +KL   +E + L D+ +  L
Sbjct: 211 KKAQTIKQLLLVKESTFG---GLGPVIIEHCLVRAGISPTLKLKNAAEFSALGDDKLAAL 267

Query: 285 VLAVAKFEDW-----LQD----------------VISGDIVPE----------------- 306
           +  +   E W     LQD                V +GD   E                 
Sbjct: 268 LAEIQ--EGWKLLERLQDEQTSVNGPVPVQNDDTVDAGDSDEEEAAPVAKAPSSASSQKC 325

Query: 307 GYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS--REFVKFETFDAALDEFYSK 364
           G+I++++         +   ++ + ++EF P L  Q +   ++   F+TFD A+DE++S+
Sbjct: 326 GFIILKD---------SADENAPEQFEEFTPFLYAQHQQAHKKVKSFDTFDEAVDEYFSR 376

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
            E+  AE   ++ + AA +KL K+  +Q+ ++  L++  ++S + A+LIE N +DV+  +
Sbjct: 377 FEADTAEVAKQSAQLAAENKLAKLKKNQQQQLAQLREVQEQSFQHAQLIEANQQDVENVL 436

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           L +R ALA+ M W  L  +V+ E+K GNPVA LI KL LE N +++LL +  D+ + E+ 
Sbjct: 437 LVIRSALASGMDWRGLEELVRYEQKNGNPVASLIHKLDLEHNRVAILLCDEEDDDEGEDG 496

Query: 485 TLPVEK-------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
                +       + +DL+LSA ANAR  Y  KKK   K +K   A  KA   AEK T+ 
Sbjct: 497 GDGTGEEDKQAHVIWIDLSLSALANAREIYTKKKKAGEKVKKATEATDKAIALAEKNTKK 556

Query: 538 QILQEKTVANISHMR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            + +++T  N+ + R K  WFEKF+WF+++E YLV++G+DA QNE++VKRY+ KGDVYVH
Sbjct: 557 TLEKQQTKRNVIYQRRKTLWFEKFHWFLTNEKYLVVAGKDAHQNELLVKRYLRKGDVYVH 616

Query: 597 ADLHGASSTVIKNH-----RPEQPVPPL---TLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
           ADLHGA++ +++NH     +  Q +PP+   TL QAGC +VC S AW S+++  A+WV+ 
Sbjct: 617 ADLHGAATCIVRNHATVKDKKTQELPPIPVATLEQAGCMSVCRSNAWTSQVIAGAYWVHA 676

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
            QVSKTAP GEYLT GSFMIRGKKN++ P  L MG  +LFR+DESS+ +H
Sbjct: 677 DQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAVLFRIDESSISNH 726



 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 75/239 (31%), Positives = 112/239 (46%), Gaps = 52/239 (21%)

Query: 840  KPYISKAERRKLKKGQGSSVVDP-----KVEREKERGKDASSQPESIVRKTKIEGGKISR 894
            K  +S  ERR LKK +  S  D        ++++ +GKD    P S  ++ K       R
Sbjct: 767  KKRLSAKERRDLKKSKLPSRDDSIDEQHPAQQKRAKGKDKDKGPASAPQQKKS-----VR 821

Query: 895  GQKGKLKKMKEKYGDQDEEERNIRMALLASA-------GKVQKNDGDPQNENASTHKEKK 947
            G+KGK+KKMK+KY DQDEE+R +RM  L  A        +    DGD   E +    E+ 
Sbjct: 822  GKKGKMKKMKKKYADQDEEDRRLRMEALGHAVEEDQEEEEEPSKDGDDSAEQSGDENEEA 881

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
               +P          KK    S++   H                 +  + +K   E+ED 
Sbjct: 882  ADSTP---------SKKEA--SEEYIRH-----------------QREKKEKYLDEQED- 912

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
                 E +G  +  D  TG PL  DI+L+ +P+C PY+++  +KY+VK+ PG+ KKGK 
Sbjct: 913  -----EAEG-ADFFDAFTGEPLADDIVLFAMPMCAPYASLIKFKYKVKLTPGSQKKGKA 965


>gi|354506443|ref|XP_003515270.1| PREDICTED: nuclear export mediator factor Nemf, partial [Cricetulus
           griseus]
          Length = 699

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 229/500 (45%), Positives = 321/500 (64%), Gaps = 42/500 (8%)

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
           EA  YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V K ED++++  + +   
Sbjct: 18  EAESYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQKAEDYMKE--TANFHG 73

Query: 306 EGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A+DEFY
Sbjct: 74  KGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVDEFY 129

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           SKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD 
Sbjct: 130 SKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDR 189

Query: 423 AILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LDEMD 480
           AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L E +
Sbjct: 190 AIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEE 249

Query: 481 DEEKTLPVEK----------------------------VEVDLALSAHANARRWYELKKK 512
           D++    VE                             V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 250 DDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRY 309

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+I
Sbjct: 310 AAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLII 369

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
            GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S
Sbjct: 370 GGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYS 428

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DE
Sbjct: 429 AAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDE 488

Query: 693 SSLGSHLNERRVRGEEEGMD 712
           S +  H  ER+VR ++E ++
Sbjct: 489 SCIWRHRGERKVRAQDEDIE 508


>gi|451850505|gb|EMD63807.1| hypothetical protein COCSADRAFT_182004 [Cochliobolus sativus ND90Pr]
          Length = 1128

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 358/1140 (31%), Positives = 547/1140 (47%), Gaps = 166/1140 (14%)

Query: 2    VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1    MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +SG R H T YAR     PSGF  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 53   DSGFRCHLTEYARTTAAAPSGFVAKLRKYLKTRRVTSISQIGTDRILEFQFSDGL--YRL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKL 177
             LE YA GNI+LTD++  VL+LLR+  + ++   +    +Y   + + +      T  ++
Sbjct: 111  YLEFYAGGNIILTDADLNVLSLLRNVDEGEEHEKLRVGLKYNLTLRQNYGGAPELTKERV 170

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              AL  + +   ++P      G     A K++L                          +
Sbjct: 171  CQALQKAVDKQQDQPVAA---GRKAKKAGKDSL--------------------------R 201

Query: 238  PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
              L   + E     P L +H +       ++K  EV  L D+ +   ++ V +    + D
Sbjct: 202  KALAVSITEC---PPLLVDHALHVASYDSSLKPEEV--LADDGLVKRLVEVLQDARKITD 256

Query: 298  VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFD 355
             I+     +GYIL +            S  S  +YD+F P    QF + +  F++F+ F+
Sbjct: 257  EITKTDQIKGYILAKPNPSASKPDDESSDKSRLLYDDFHPFRPQQFENTDYTFLEFDGFN 316

Query: 356  AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
             A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+  L+Q  + + + AE I  
Sbjct: 317  KAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILA 376

Query: 416  NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-- 472
            N+  V  A  AV   +   M W D+ R+++ E+ +GN VA LI   L L  N ++LLL  
Sbjct: 377  NVHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQLIRLPLKLHENTITLLLNE 436

Query: 473  -------------------SNNLDEMDDE-EKTLPVEKV-------EVDLALSAHANARR 505
                               S + D+ DD   KT P + V       ++DL LSA AN+  
Sbjct: 437  TNWEKGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARPQLAIDIDLGLSAWANSTE 496

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
            +++ KK    K+ +T+ A SKA K+ EKK     +  + QEK V  +  +RK HWFEKF 
Sbjct: 497  YFDQKKTAADKEGRTLQASSKALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQHWFEKFI 554

Query: 562  WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
            +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ P+PP 
Sbjct: 555  YFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPS 614

Query: 620  TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
            TL+QAG  ++C S AWDSK V SAWWV   QVSKT  TGE+L  G F I+GKK FLPP  
Sbjct: 615  TLSQAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNIKGKKEFLPPAQ 674

Query: 680  LIMGFGLLFRLDESSLGSH----LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDE 735
            L++G  ++F + +SS  +H    + E  V   E  M D E +   KE + +++++ D DE
Sbjct: 675  LVVGLAVMFEISDSSKANHHKHRVQETAVSAAE--MTD-EPTNESKEAAAMKTDESDDDE 731

Query: 736  KPVAESLSVPNSAHPAP--SHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTP 793
             P A+  S      P     HT  S+ +S    +     SN + S      RN       
Sbjct: 732  FPDAKINSDSEDDFPDAKMEHTEESDAESEAAASR----SNPLQSST----RNAKEDSDE 783

Query: 794  QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKA------- 846
            + E L+ +            +H        +  +++  E   ++ D   ISK+       
Sbjct: 784  EEEPLVGK---------RGAEHAKPGENNGVVAKEEPPENEGSIADSESISKSMGRGKLS 834

Query: 847  --ERRKLKKGQ----------GSSVVDPKVEREKERGKDASSQPESIVRKT------KIE 888
              ERR  +KGQ             VVD   + E++  +  S++  + V +T      K +
Sbjct: 835  ARERRLARKGQLPELPQVPSDTVPVVDGADQDERDSTEGGSTKAATKVDETVTSQMNKQK 894

Query: 889  GGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKP 948
               + RG++ K KK   KY  QDEE+R + M LL S        G    E A+  K +K 
Sbjct: 895  NPPLPRGKRAKAKKQAAKYAAQDEEDRELAMRLLGS------KSGQQAAEAAAQEKRQKE 948

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
              +  D  +     ++  HL          +    E+     L+            ED  
Sbjct: 949  EQAQADKQR-----RREQHLRAQA------AGKAAEEARLRALENA----------EDDD 987

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            E  E  K  L ++D  TG PLP+D L+  IPVC P+SA+ +YKY+ K+ PG+ K+GK ++
Sbjct: 988  EGDEVLKTNLQNLDAFTGRPLPNDELISAIPVCAPWSALSTYKYKAKMQPGSTKRGKAVK 1047


>gi|149051344|gb|EDM03517.1| rCG61611 [Rattus norvegicus]
          Length = 899

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 228/503 (45%), Positives = 321/503 (63%), Gaps = 36/503 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V + ED+L+   
Sbjct: 17  LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQRAEDYLEK-- 72

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 73  TANFNGKGYII-QKREVKPSLDANKPAEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVD 131

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 132 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 191

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LD 477
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L 
Sbjct: 192 VDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLLRNPYLLS 251

Query: 478 EMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWYEL 509
           E +D +    +E                             V+VDL+LSA+ANA+++Y+ 
Sbjct: 252 EEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDH 311

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENY
Sbjct: 312 KRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENY 371

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           L+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   + P+PP TL +AG   +
Sbjct: 372 LIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGTMAL 430

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AWD++++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+
Sbjct: 431 CYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFK 490

Query: 690 LDESSLGSHLNERRVRGEEEGMD 712
           +DES +  H  ER+VR ++E M+
Sbjct: 491 VDESCVWRHRGERKVRVQDEDME 513



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 100/202 (49%), Gaps = 18/202 (8%)

Query: 865  EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
            E++KE+     S+ +    K    G  + RGQK K+KKMKEKY DQD+E+R + M LLAS
Sbjct: 665  EKDKEKESAVHSEADQNTSKNVAAGQPMKRGQKSKMKKMKEKYKDQDDEDRELIMKLLAS 724

Query: 925  AGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVE 984
            AG    N  +   +      + +P       P       + G    D  +          
Sbjct: 725  AG---SNKEEKGKKGKKGKTKDEPVKKNPQKP-------RGGQRVLDVVKETPSLQASTP 774

Query: 985  DNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPY 1044
            D     +DE  + DK   EE D+ + G EE    N  D LTG P P D+L++ IP+C PY
Sbjct: 775  DLQDFAVDEPHD-DK---EEHDLDQQGNEE----NLFDSLTGQPHPEDVLMFAIPICAPY 826

Query: 1045 SAVQSYKYRVKIIPGTAKKGKG 1066
            + + +YKY+VK+ PG  KKGK 
Sbjct: 827  TIMTNYKYKVKLTPGVQKKGKA 848


>gi|281604208|ref|NP_001164057.1| serologically defined colon cancer antigen 1 [Rattus norvegicus]
          Length = 1065

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 228/503 (45%), Positives = 321/503 (63%), Gaps = 36/503 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V + ED+L+   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQRAEDYLEK-- 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TANFNGKGYII-QKREVKPSLDANKPAEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LD 477
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L 
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLLRNPYLLS 417

Query: 478 EMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWYEL 509
           E +D +    +E                             V+VDL+LSA+ANA+++Y+ 
Sbjct: 418 EEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDH 477

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENY
Sbjct: 478 KRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENY 537

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           L+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   + P+PP TL +AG   +
Sbjct: 538 LIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGTMAL 596

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AWD++++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+
Sbjct: 597 CYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFK 656

Query: 690 LDESSLGSHLNERRVRGEEEGMD 712
           +DES +  H  ER+VR ++E M+
Sbjct: 657 VDESCVWRHRGERKVRVQDEDME 679



 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 103/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L N           K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNANLLGMRVNNVYDVDNKTYLIRLQNPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHARAAE 162



 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 100/202 (49%), Gaps = 18/202 (8%)

Query: 865  EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
            E++KE+     S+ +    K    G  + RGQK K+KKMKEKY DQD+E+R + M LLAS
Sbjct: 831  EKDKEKESAVHSEADQNTSKNVAAGQPMKRGQKSKMKKMKEKYKDQDDEDRELIMKLLAS 890

Query: 925  AGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVE 984
            AG    N  +   +      + +P       P       + G    D  +          
Sbjct: 891  AG---SNKEEKGKKGKKGKTKDEPVKKNPQKP-------RGGQRVLDVVKETPSLQASTP 940

Query: 985  DNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPY 1044
            D     +DE  + DK   EE D+ + G EE    N  D LTG P P D+L++ IP+C PY
Sbjct: 941  DLQDFAVDEPHD-DK---EEHDLDQQGNEE----NLFDSLTGQPHPEDVLMFAIPICAPY 992

Query: 1045 SAVQSYKYRVKIIPGTAKKGKG 1066
            + + +YKY+VK+ PG  KKGK 
Sbjct: 993  TIMTNYKYKVKLTPGVQKKGKA 1014


>gi|73962860|ref|XP_851229.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Canis
           lupus familiaris]
          Length = 1077

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREV----KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E  LP  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  140 bits (352), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAILAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|344273431|ref|XP_003408525.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor
           NEMF-like [Loxodonta africana]
          Length = 1000

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 232/508 (45%), Positives = 326/508 (64%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++ ++
Sbjct: 183 LKRVLNPLLPYGPALIEHCLMENGFSGNVKVGE--KFESKDIEKVLVCLQKAEDYMKTML 240

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
             +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 241 --NFSGKGYIIQKREV----KPSLEIDKPTEDILTYEEFHPFLFSQHLQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGSIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 --NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANAR 504
             + +E DD +  + +EK                            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDISIEKNETEPLKGKKKKQKNKQLQKPQKNKPLPVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTAGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHWGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHARAAE 162



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/64 (50%), Positives = 43/64 (67%), Gaps = 4/64 (6%)

Query: 1003 EEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAK 1062
            +E+D+ + G EE    N  D LTG P P DILL+ IP+C PY+ + +YKY+VK+ PG  K
Sbjct: 890  DEQDLDQQGNEE----NLFDSLTGQPHPEDILLFAIPICAPYTTMANYKYKVKLTPGVQK 945

Query: 1063 KGKG 1066
            KGK 
Sbjct: 946  KGKA 949


>gi|189211034|ref|XP_001941848.1| serologically defined colon cancer antigen 1 [Pyrenophora
            tritici-repentis Pt-1C-BFP]
 gi|187977941|gb|EDU44567.1| serologically defined colon cancer antigen 1 [Pyrenophora
            tritici-repentis Pt-1C-BFP]
          Length = 1151

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 347/1125 (30%), Positives = 544/1125 (48%), Gaps = 168/1125 (14%)

Query: 20   RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
            +L  +R +NVYDLS + ++ K              +  LL++SG R H T YAR    TP
Sbjct: 38   KLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLIDSGFRCHLTEYARTTAGTP 89

Query: 80   SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
            SGF  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + + LE YA GNI+LTD+E  V
Sbjct: 90   SGFVAKLRKYLKTRRITSVAQIGTDRILEFQFSDGL--YRLYLEFYAGGNIVLTDAELNV 147

Query: 140  LTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLHAALTSSKEPDANEPDKVN 196
            L+LLR+  + ++   +    RY   + + +      T  ++  AL  + +   N+P    
Sbjct: 148  LSLLRNVDEGEEHEKLRVGLRYNLTLRQNYGGAPELTKERVRQALQKAMDRQQNQPAATG 207

Query: 197  EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPA-LS 255
            +        S                                 L+  L  ++   P  L 
Sbjct: 208  KKAKKAGKDS---------------------------------LRKALAVSITECPPLLV 234

Query: 256  EHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKH 315
            +H +        +K  EV  + D+ +   +++V +    + D I+     +GYIL +   
Sbjct: 235  DHALHVADFDSTLKPEEV--IADDGLMEKLVSVLRDARKITDEITTTNQIKGYILAKPNP 292

Query: 316  LGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFDAALDEFYSKIESQRAEQQ 373
                +    S  +  +YD+F P    QF + +  F++F+ F+ A+DEF+S IE Q+ E +
Sbjct: 293  SAPTNEDESSDKARLLYDDFHPFRPQQFENSDYTFIEFDGFNKAVDEFFSSIEGQKLESK 352

Query: 374  HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALAN 433
               +E  A  KL K   + E+R+  L+Q  + + + AE I  N+  V  A  AV   +  
Sbjct: 353  LTEREQQAKRKLEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQ 412

Query: 434  RMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-----------------SNN 475
             M W D+AR+++ E+ +GN VA LI   L L  N ++LLL                 +++
Sbjct: 413  GMDWVDIARLIEREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGEEVEDEGNETSS 472

Query: 476  LDEMDDEE---------KTLPVE-------KVEVDLALSAHANARRWYELKKKQESKQEK 519
            + E  DE+         K+ PV+        +++DL+L+A AN+  +++ KK   +K+++
Sbjct: 473  VSEDSDEDAGEEDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDR 532

Query: 520  TITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGR 575
            T+ A ++A K+ EKK     +  + QEK V  +  +RK  WFEKF +FISS+ YLV+ G+
Sbjct: 533  TLQASTRALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQQWFEKFIYFISSDGYLVLGGK 590

Query: 576  DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQ 633
            DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ P+PP TL+QAG   +C S 
Sbjct: 591  DAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSD 650

Query: 634  AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
            AWDSK V SAWWV   QVSKT  TGE+L  G F ++GKK FLP   L++G  ++F + ES
Sbjct: 651  AWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNVKGKKEFLPLAQLVVGLAVMFEISES 710

Query: 694  SLGSH----LNERRVRGEE---EGMDDFEDSGHHK-ENSD---------IESEKDD---- 732
            S  +H    + E  V   E   E  D+ + + H K +NSD         IES+ +D    
Sbjct: 711  SKANHHKHRIQETAVSAAEMVDEPTDETKAADHTKTDNSDDDEDFPDAKIESDSEDDFPD 770

Query: 733  ----TDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARN-V 787
                  E+  AES +    ++P  S T  +  +S +   ++ +++   D       RN  
Sbjct: 771  AKMGQTEESDAESEAAAPRSNPLQSRTTDARDESDD--GDEPSVAQKDDEFAMSGGRNRS 828

Query: 788  AAPVTPQLEDLIDRALGLGSASISST-KHGIETTQFDLSEEDKHVERTATVRDKPYI--- 843
            +A   PQ +D           S++ T K    T +  LS  ++ + R   + + P +   
Sbjct: 829  SANEEPQEDD----------GSVADTEKTSKSTGRRQLSARERRLARKGQLPELPQVPSN 878

Query: 844  SKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKM 903
            +          +GSS  +   +   +    A+SQ       TK +   + RG++ K KK 
Sbjct: 879  AAPADDDAAHEEGSSAEEGSAKTPGKVPGTATSQ------GTKQKNTPLPRGKRAKAKKQ 932

Query: 904  KEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCK 963
              KY  QDEE+R + M LL S        G    E A+  K +K   +  D  +     +
Sbjct: 933  AAKYAAQDEEDRELAMRLLGS------KSGQQAAEAAAQEKRQKEEQAQADKQR-----R 981

Query: 964  KAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDY 1023
            +  HL          +    E+     L+            ED  E  E  K  L ++D 
Sbjct: 982  REQHLRAQA------AGKAAEEARLRALENA----------EDDDEGDEVLKTNLQNLDA 1025

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             TG PLP+D +L  IPVC P+SA+ SYKY+ K+ PG+ K+GK ++
Sbjct: 1026 FTGRPLPNDEILSAIPVCAPWSALSSYKYKAKMQPGSTKRGKAVK 1070


>gi|297736760|emb|CBI25961.3| unnamed protein product [Vitis vinifera]
          Length = 321

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/367 (62%), Positives = 261/367 (71%), Gaps = 50/367 (13%)

Query: 484 KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEK 543
           +   V+KVEVDLALSAHANARRWYE KK+QE+KQEKTI AH KAFKAAEKK+ +Q+ Q  
Sbjct: 4   RHFHVDKVEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQVG 63

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
                     +HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLHGAS
Sbjct: 64  E-------HYIHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGAS 116

Query: 604 STVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
                                 CFTVCHSQAWDSK+VTSAWWVYPHQVSKTA TGEYLTV
Sbjct: 117 R---------------------CFTVCHSQAWDSKIVTSAWWVYPHQVSKTASTGEYLTV 155

Query: 664 GSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKEN 723
           GSFMIRGK NFLPPHPL+MGFGLLF LDESSLGSHLNERRVRGEEEG  DFE++   K N
Sbjct: 156 GSFMIRGK-NFLPPHPLMMGFGLLFCLDESSLGSHLNERRVRGEEEGAQDFEENESLKGN 214

Query: 724 SDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDI 783
           SD ESEK++TDEK  AES S+ +                   P+  + I  G  S+I DI
Sbjct: 215 SDSESEKEETDEKRTAESKSIMD-------------------PSTHQPILEGF-SEINDI 254

Query: 784 ARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYI 843
           +    + V PQLEDLIDRAL LGS + S  K+ +ET+Q DL EE  H +R A VR+KPY 
Sbjct: 255 SGIHVSSVNPQLEDLIDRALELGSNTASGKKYALETSQVDL-EEHNHEDRKAKVREKPYT 313

Query: 844 SKAERRK 850
           S   +RK
Sbjct: 314 SYQSQRK 320


>gi|55640675|ref|XP_509934.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           troglodytes]
 gi|410223614|gb|JAA09026.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263654|gb|JAA19793.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263656|gb|JAA19794.1| nuclear export mediator factor [Pan troglodytes]
 gi|410299008|gb|JAA28104.1| nuclear export mediator factor [Pan troglodytes]
 gi|410354861|gb|JAA44034.1| nuclear export mediator factor [Pan troglodytes]
          Length = 1076

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 323/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPSLEADKPVEDIFT----YEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|426376840|ref|XP_004055190.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Gorilla
           gorilla gorilla]
          Length = 1077

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRADEADDVKFAVRERYPLDHARAAE 162


>gi|269849764|sp|O60524.4|NEMF_HUMAN RecName: Full=Nuclear export mediator factor NEMF; AltName:
           Full=Antigen NY-CO-1; AltName: Full=Serologically
           defined colon cancer antigen 1
          Length = 1076

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|397523542|ref|XP_003831788.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           paniscus]
          Length = 1076

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|403277932|ref|XP_003930596.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 1077

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/508 (45%), Positives = 322/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G + N+K+ E  KLE   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFLGNVKVDE--KLETKDIEKILVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKRE----TKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHVRAAE 162


>gi|355693257|gb|EHH27860.1| hypothetical protein EGK_18167 [Macaca mulatta]
          Length = 1077

 Score =  431 bits (1107), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 230/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHRGERKVRVQDEDME 681



 Score =  139 bits (350), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|301773240|ref|XP_002922036.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Ailuropoda melanoleuca]
          Length = 1077

 Score =  431 bits (1107), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 230/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + + ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLKQAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPSLEVDKPTEDIFTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E   P  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 101/170 (59%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP    R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVGHARAGE 162


>gi|194375658|dbj|BAG56774.1| unnamed protein product [Homo sapiens]
          Length = 999

 Score =  431 bits (1107), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 322/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 141 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 196

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD 
Sbjct: 197 TSNFSGKGYIIQKREIKPCLEADKPVED----ILTYEEFHPFLFSQHSQCPYIEFESFDK 252

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 253 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 312

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 313 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 372

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 373 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 432

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 433 KYYDHKRYAAKKTQKTVEAAGKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 492

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 493 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 551

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 552 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 611

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 612 SFLFKVDESCVWRHQGERKVRVQDEDME 639



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 71/170 (41%), Gaps = 51/170 (30%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG+R+HTT +   K   PS F +K                                   
Sbjct: 53  KSGIRIHTTEFEWPKNMMPSSFAMK----------------------------------- 77

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
                  GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 78  -------GNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 120


>gi|32130516|ref|NP_004704.2| nuclear export mediator factor NEMF [Homo sapiens]
 gi|119586148|gb|EAW65744.1| serologically defined colon cancer antigen 1, isoform CRA_d [Homo
           sapiens]
 gi|148922399|gb|AAI46282.1| Serologically defined colon cancer antigen 1 [synthetic construct]
 gi|151556560|gb|AAI48733.1| Serologically defined colon cancer antigen 1 [synthetic construct]
          Length = 1076

 Score =  431 bits (1107), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 322/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPCLEADKPVED----ILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|281343421|gb|EFB19005.1| hypothetical protein PANDA_010972 [Ailuropoda melanoleuca]
          Length = 1058

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 230/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + + ED+++   
Sbjct: 164 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLKQAEDYMK--T 219

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 220 TSNFSGKGYII-QKREI---KPSLEVDKPTEDIFTYEEFHPFLFSQHSQCPYIEFESFDK 275

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 276 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 335

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 336 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 395

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E   P  K                   V+VDL+LSA+ANA+
Sbjct: 396 LLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 455

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 456 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 515

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 516 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 574

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 575 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 634

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 635 SFLFKVDESCVWRHRGERKVRVQDEDME 662



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 91/150 (60%), Gaps = 8/150 (5%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           LIGMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS
Sbjct: 2   LIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPS 53

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L
Sbjct: 54  SFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYLIL 113

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
            +LR   D+   V    R RYP    R  E
Sbjct: 114 NILRFRTDESDDVKFAVRERYPVGHARAGE 143


>gi|296214948|ref|XP_002753922.1| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Callithrix jacchus]
          Length = 1077

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 233/513 (45%), Positives = 323/513 (62%), Gaps = 39/513 (7%)

Query: 233 ARA-KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           ARA K   LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K 
Sbjct: 175 ARAPKGELLKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKILVCLQKA 232

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     +++F
Sbjct: 233 EDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEF 289

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      E
Sbjct: 290 ESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGE 349

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++L
Sbjct: 350 LIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTML 409

Query: 472 LSN--------------------NLDEMDDEEKTLPVEK------------VEVDLALSA 499
           L N                    N  E    +K     K            V+VDL+LSA
Sbjct: 410 LRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSA 469

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEK 559
           +ANA+++Y+ K+    K +KT+ A  KAF++AEKKT+  + + +TV +I   RKV+WFEK
Sbjct: 470 YANAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEK 529

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP 
Sbjct: 530 FLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPR 588

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  
Sbjct: 589 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 648

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 649 LMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|297736751|emb|CBI25952.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 232/388 (59%), Positives = 277/388 (71%), Gaps = 28/388 (7%)

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           V+KVEVDLALSAHANARRWYE KK+QE+KQEKTI AH KAFKAAEKK+ +Q+ Q      
Sbjct: 8   VDKVEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQVGE--- 64

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH--ADLHGASST 605
                 +HWFEKFNWFISSENYLVISGRDAQQN+MIVKRYMSKGD+++H  +  + +SST
Sbjct: 65  ----HYIHWFEKFNWFISSENYLVISGRDAQQNKMIVKRYMSKGDLFIHFKSTNNNSSST 120

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
            +   R       + LN +  F VCHSQAWDSK+VTSAWWVYPHQVSKTA TGEYLTVGS
Sbjct: 121 FLFFQRHLNTCCRIPLNYSSLFIVCHSQAWDSKIVTSAWWVYPHQVSKTASTGEYLTVGS 180

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSD 725
           FMIRGK NFLPPHPL+MGFGLLF LDESSLGSHLN+RRVRGEEEG  DFE++   K NSD
Sbjct: 181 FMIRGK-NFLPPHPLMMGFGLLFCLDESSLGSHLNDRRVRGEEEGAQDFEENESLKGNSD 239

Query: 726 IESEKDDTDEK---------------PVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDK 770
            ESEK++TDEK               P+ E  S  +SAH   + +N  +++  E P E++
Sbjct: 240 SESEKEETDEKRTAESKSIMDPSTHQPILEGFSEISSAHNELTTSNVGSINLPEVPLEER 299

Query: 771 TISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDK 829
            + NG DS+ I DI+    + V PQLED IDRAL LGS + S  K+ +ET+Q DL EE  
Sbjct: 300 NMLNGNDSEHIDDISGIHVSSVNPQLEDFIDRALELGSNTASGKKYALETSQVDL-EEHN 358

Query: 830 HVERTATVRDKPYIS-KAERRKLKKGQG 856
           H +R A VR+KPY S + E   +  GQG
Sbjct: 359 HEDRKAKVREKPYTSYQREVIYISHGQG 386


>gi|255083452|ref|XP_002504712.1| predicted protein [Micromonas sp. RCC299]
 gi|226519980|gb|ACO65970.1| predicted protein [Micromonas sp. RCC299]
          Length = 1219

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 281/782 (35%), Positives = 402/782 (51%), Gaps = 103/782 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPK-TYIFKLMNSSGVTESGESEKVLL 58
           M K + N+ D+AA    LR +++G   +N++DL  K T + K   S G TESGE EK  +
Sbjct: 1   MPKQKFNSHDIAASCATLRAKVLGAWLANIFDLDDKRTLLLKFTRSGGATESGEGEKTTV 60

Query: 59  LMESGVRLHTTAYARDKK-NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L+ESG R HTT+YAR++K + PS F  KLR H+R +RL  V Q+G DR + F FG G   
Sbjct: 61  LLESGARFHTTSYARERKADQPSKFNAKLRMHLRGKRLNGVNQMGADRAVAFTFGAGDTE 120

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H+++LELYAQGNI+L D E+ +LTLLR HRDD + + ++  H YP E  R   R  A+ L
Sbjct: 121 HHLVLELYAQGNIVLCDREWRILTLLRPHRDDARSLVLLGNHPYPRERFRSHVRVDAAAL 180

Query: 178 HAALTS--SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
            AAL      +P   +P +           ++E                         R 
Sbjct: 181 VAALEGRHDDDPLGPKPIEGEGVEGEGIEGAREK------------------------RR 216

Query: 236 KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL 295
              T++  L +A G+GP + +      G+V     +    L+D  +  L  ++   +DW 
Sbjct: 217 APGTVREALCKAFGFGPPVVDRAARMAGIVDGS--AAKTPLDDAQVTALGASLGAIDDWF 274

Query: 296 QDVISGDIVPEGYILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLN------QFRSRE 347
           + V  G + P G +  + K    G+D     S S    +++F P   +      QF  + 
Sbjct: 275 EGVTDGRVEPRGVVTWRIKEGESGEDGATASSPSLDADFEDFSPFPADDVPPPAQFDPKV 334

Query: 348 FVKFET---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           F   E    FDAALD F++  E++R   + +   +AA  KL K+  DQE RV  L++E +
Sbjct: 335 FRTTEISGGFDAALDLFFASFEARRDRSRREKSANAAAKKLEKVRRDQEARVRALEKERE 394

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
                A LIEYNL  VDA + AV  ALA  M+W+DL  M+KEE +AGNPVA L+  L L 
Sbjct: 395 SQELAATLIEYNLTQVDAVLAAVNGALAGGMAWDDLTLMIKEEARAGNPVARLVKTLDLP 454

Query: 465 RNCMSLLLSNNLDEMDDEEKTL------------------PVEK---------VEVDLAL 497
           +N +++ L N+LD  DDE                      P  +         VE+DLAL
Sbjct: 455 KNKVTVTLKNHLDVDDDEGDDDGDDGDGGDADDVGEGDAKPRSRRLKRDGGVSVELDLAL 514

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL---QILQEKTVANISHMRKV 554
            AHANAR  ++ KKK ++K  KT+  + +A  AAEKK +    ++  + T   I+  R  
Sbjct: 515 GAHANAREHFDRKKKHDAKHGKTLAQNKRAVAAAEKKAKEAGARMASKGTGMGIARARVP 574

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN---HR 611
            WFEKF+WFI++EN LV+S RDA Q + +V +Y+   D +VHAD  GA  T++K      
Sbjct: 575 EWFEKFHWFITTENCLVLSARDAAQADALVVKYLGPDDAFVHADSPGAPVTIVKAPPVRS 634

Query: 612 PEQP---------------------------VPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           P  P                           VPP++L QAG   +C S AWDS+ V SA+
Sbjct: 635 PALPEAEASMSRLSLSATRVVGSSADGWCGGVPPVSLIQAGAACLCRSAAWDSRHVVSAF 694

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERR 703
           W+ P  V K  P G+ L  G     G K +LPP PL+MGFG +F L DE  + +H+ +R 
Sbjct: 695 WIPPENVRKVTPDGDPLAPGVVWHVGAKTYLPPAPLVMGFGCVFLLRDEDGVRAHVGDRT 754

Query: 704 VR 705
           V+
Sbjct: 755 VK 756



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 39/65 (60%)

Query: 1001 AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGT 1060
             + EED       +  R+  VD+ TG P   D + + + V  PY+A+QS++Y+VK+ PGT
Sbjct: 1062 GVAEEDPAAFARRDAERVARVDWFTGCPTFPDAIDFAVCVVAPYAALQSFRYKVKLTPGT 1121

Query: 1061 AKKGK 1065
             KKGK
Sbjct: 1122 QKKGK 1126


>gi|308480173|ref|XP_003102294.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
 gi|308262220|gb|EFP06173.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
          Length = 917

 Score =  428 bits (1100), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 259/714 (36%), Positives = 380/714 (53%), Gaps = 73/714 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLQGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHEWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELVFGTDDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD+E  +L +LR   D D  V    R +Y                    
Sbjct: 113 VELYDRGNVVLTDNELIILNILRVRTDKDTSVRWAVREKY-------------------- 152

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T ++E +        E G    +     +GG   GK   L +                  
Sbjct: 153 TFNEEAE-------RERGGVTMDDVTRAIGGIPEGKEEQLGR------------------ 187

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            V+ +    G  +++ I+   G+   MK+S    +E      L     + E   + V   
Sbjct: 188 -VMSQLTKCGNPITKEILAACGMKAEMKVSRKTDVETEFRGKLEEIRKETEHVWEQV--- 243

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +  P G+I        +   PT      Q+Y+EF P+ +  F S+   +  +F  ++DEF
Sbjct: 244 EEQPRGFI-----SYTEILSPT--SQPIQLYNEFNPIPM-PFTSKLQKELPSFCESVDEF 295

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           YS+IE+Q+ EQ+    E  A  KL  +  DQ+ R+  L+   ++   MA  I  N + V+
Sbjct: 296 YSRIETQKQEQKAVNMEKQALKKLENVEKDQKERIEALQLTQEQREHMANRIILNQDLVE 355

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R ALAN+ SW+ +  M K     G+ VA  ID    E N   +    NL +  D
Sbjct: 356 KALLLIRSALANQFSWQTIEEMRKNAAMNGDLVAKSIDSFRFENNEFFM----NLGDPYD 411

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           EE  L   KV +D++++A  NA+R +  KK    K +KT+ +  KA K A++K +  + Q
Sbjct: 412 EEAELL--KVPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQ 469

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
            K V  +   RK  WFEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+HAD+ G
Sbjct: 470 VKIVTEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRG 529

Query: 602 ASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           ASS +I+N   E  Q +PP TL +A    VC+S AW++ +  SAWWV+P+QVS+TAPTGE
Sbjct: 530 ASSVIIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPNQVSRTAPTGE 589

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           YL  GSFMIRGKKNF+PP  L+MG G+LFR+D+ S+  H    + +  EE  D+
Sbjct: 590 YLPSGSFMIRGKKNFMPPSQLVMGLGVLFRMDDESIERHAALEKAKKSEENPDE 643


>gi|410962212|ref|XP_003987668.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Felis catus]
          Length = 1080

 Score =  427 bits (1099), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 233/511 (45%), Positives = 324/511 (63%), Gaps = 47/511 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDLEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           A+DEFYSKIE Q+ +    Q       A  KL+ +  D ENR+  L+Q  +      ELI
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQVWYXKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 354

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL 
Sbjct: 355 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 414

Query: 474 N----NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHA 501
           N    + +E DD +  + VEK                            V+VDL+LSA+A
Sbjct: 415 NPYLLSEEEDDDVDGDITVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 474

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF 
Sbjct: 475 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 534

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           WF+SSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL
Sbjct: 535 WFVSSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTL 593

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
            +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+
Sbjct: 594 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 653

Query: 682 MGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 654 MGFSFLFKVDESCIWRHRGERKVRVQDEDME 684



 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|119586146|gb|EAW65742.1| serologically defined colon cancer antigen 1, isoform CRA_b [Homo
           sapiens]
          Length = 1067

 Score =  427 bits (1098), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 231/507 (45%), Positives = 315/507 (62%), Gaps = 51/507 (10%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV- 298
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMKTTS 240

Query: 299 -ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
             SG + P    +      G              Y+EF P L +Q     +++FE+FD A
Sbjct: 241 NFSGKVAPCILTIYCCDLFG--------------YEEFHPFLFSQHSQCPYIEFESFDKA 286

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 287 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 346

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 347 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 406

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 407 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 466

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 467 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 526

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 527 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 585

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 586 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 645

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 646 FLFKVDESCVWRHQGERKVRVQDEDME 672



 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|336276025|ref|XP_003352766.1| hypothetical protein SMAC_01600 [Sordaria macrospora k-hell]
 gi|380094654|emb|CCC08036.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1086

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 351/1121 (31%), Positives = 538/1121 (47%), Gaps = 174/1121 (15%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1    MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLI 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53   ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
             LE +A GNI+LTDS+  +L LL   R+  +G A     + P  I   +           
Sbjct: 111  YLEFFASGNIILTDSDLKILALL---RNVPEGEA-----QEPQRIGLTYTLENRQNFGGV 162

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             T +KE               + +A +  +  QK        K   K   D  R    T 
Sbjct: 163  PTLTKE--------------RLRDALQSTV--QKVAADQAAGKKIKKKGADELRRGLATT 206

Query: 241  KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
             T L       P L +H+   T   P+ K +E+  LED+++   +    +    + D ++
Sbjct: 207  ITELP------PILVDHVFRLTSFDPSTKPAEI--LEDDSLLDRLFDTLQKAREILDEVT 258

Query: 301  GDIVPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKF 351
               V  GYI+       ++  +  D PP E   +  +Y++F P L  QF + +    + F
Sbjct: 259  DSSVANGYIIAKPRPGFEDAEVVVDAPPAEKAKNL-LYEDFQPFLPKQFENNKDYRILPF 317

Query: 352  ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
              ++  +DEF+S +E QR E +   +E AA  KL    MDQ  R+  L++    + + A 
Sbjct: 318  VGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAKRIEGLQEMEMLNYRKAA 377

Query: 412  LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
             I+ N+E V  A+ AV   L   M W D+ +++++E+K GNPVA +I   + L+ N ++L
Sbjct: 378  TIQANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPVAEIIKLPMKLKENTITL 437

Query: 471  LL---------------------SNNLDEMD--DEEKTLPVEKVEVD--LALSAHANARR 505
            LL                     S++ DE D  + +  +PV ++E+D  L LS   NAR 
Sbjct: 438  LLGEGVEEEEEGDEDKEDDEFDYSDDEDEGDVGEPKDKVPVNRLEIDINLTLSVWNNARE 497

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
            +Y+ K+    K +KT+     A K+AE+K     R  + QEK V  +  +RK  WFEKF 
Sbjct: 498  YYDQKRTAAHKAQKTVQQSVIALKSAEQKISEDLRKGLKQEKPV--LQPIRKAMWFEKFT 555

Query: 562  WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
            WFISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+S +IKN+   P+ P+PP 
Sbjct: 556  WFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAASVIIKNNPKTPDAPIPPS 615

Query: 620  TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
            TL QAG  +VC S AWDSK    AWWV   QVSK+APTGEYL VGSFM+RGK+N LPP  
Sbjct: 616  TLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPVGSFMVRGKRNLLPPAL 675

Query: 680  LIMGFGLLFRLDESSLGSH-------LNERRVRGEEEGMDDFEDSG----HHKENSDIES 728
            L +GFGLLFR+ + S   H         E + R   + +D   + G      K  +  +S
Sbjct: 676  LTLGFGLLFRISDDSKSKHTRNRVYDFGEAKTRDRADSLDVLSEHGESLHEQKPEAGQKS 735

Query: 729  EKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVA 788
            E DD DE       +        P H+  S         +DK++ +         A   A
Sbjct: 736  ESDDEDED------AANQKGRSNPLHSQRS--------VQDKSVESD--------AGQGA 773

Query: 789  APVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
             P T +L DL I++       S+S           +L E++K     +    +P +++ E
Sbjct: 774  EPPTEELADLEINK-----DESVS-----------NLDEDNK-----SPAEPEPAVAQDE 812

Query: 848  RRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKY 907
            + +    +      P     K+ G  +SS      ++  ++     RGQ+GK KK+  KY
Sbjct: 813  KEEGDDDEDEDSHQPS---SKQAGTPSSSTAPQ--KQQPLKKAPAKRGQRGKQKKIAAKY 867

Query: 908  GDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGH 967
             DQDEE+R +   L+      +K + +   +  +  +                + ++   
Sbjct: 868  KDQDEEDRALMEELMGVKAAREKAEAEAVAKAKAEAEAAA---------ARERRRQQQER 918

Query: 968  LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGN 1027
            + K+  EH +     +E+   + +DE+    ++AME              +  ++ L G 
Sbjct: 919  VKKEIAEHEEVRRLMMEEGEDMPVDES----EMAME--------------MAPLETLVGT 960

Query: 1028 PLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            PL  D +L V+P+C P++A+   KY+ K+ PG  KKGK ++
Sbjct: 961  PLGGDEILEVVPICAPWNALNKVKYKTKLQPGNTKKGKAVK 1001


>gi|268571229|ref|XP_002640975.1| Hypothetical protein CBG11722 [Caenorhabditis briggsae]
          Length = 894

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 254/685 (37%), Positives = 370/685 (54%), Gaps = 86/685 (12%)

Query: 24  MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
           MR +NVYD+  KTY+ KL            EK ++L ESGVRLH T +   K  TPS F+
Sbjct: 1   MRVNNVYDIDNKTYLIKLTRPD--------EKAVILFESGVRLHQTFHDWPKSQTPSSFS 52

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           +KLRKHI  +RL  +R +G+DRI+   FG     + + +ELY +GN++LTD E T+L +L
Sbjct: 53  MKLRKHINQKRLTSIRVVGFDRIVELIFGTEDRENRLYVELYDRGNVILTDHEMTILNIL 112

Query: 144 RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVS 203
           R   D D  V    R +Y                    T S + +  +P           
Sbjct: 113 RVRTDKDTSVRWAVREKY--------------------TCSGDAEQQDP----------- 141

Query: 204 NASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTG 263
                     +G KS D+ +   ++  DG   K   L  +L      G  +++ I+   G
Sbjct: 142 ----------RGFKSDDVIRRI-QSIPDG---KDEQLGRILSGFTKCGNPITKEILSKIG 187

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPT 323
           L    KL+  + + + + +   +  A  E W  D +  D  P+G+I     +L  + P  
Sbjct: 188 LKWEQKLNAKSDVAEISAKFEEIKKATEEIW--DTVEHD--PKGFI----SYL--EIPSA 237

Query: 324 ESGSSTQIYDEFCPL-------LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
            S +  +IY EF P+       L  + RS        F  ++DEFYS+IE+Q+ EQ+   
Sbjct: 238 TSSTPIEIYSEFNPISMPLTLKLQKELRS--------FCESVDEFYSRIETQKQEQKAVN 289

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
            E  A  KL  +  DQ+ R+  L+   ++   MA  I  N E V+ A+L +R ALAN+ S
Sbjct: 290 MEKQALKKLENVEKDQKERIEALQLTQEQREHMANRIILNQELVEKALLLIRSALANQFS 349

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           W+ +  M K     G+PVA  ID    E N   + L    D  D+E + L   KV +D++
Sbjct: 350 WQTIEEMRKSAAANGDPVAKSIDSFKFENNEFFMKLG---DPYDEEAELL---KVPIDIS 403

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHW 556
           ++A  NA+R +  KK    K +KT+ +  KA K A++K +  + Q K V  +   RK  W
Sbjct: 404 MNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKCTLEQVKIVTEVKKSRKTMW 463

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP- 615
           FEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+HAD+ GASS +I+N   E+  
Sbjct: 464 FEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVIIRNKSFEESM 523

Query: 616 -VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            +PP TL +A    VC+S AW++ +  SAWWV+P QV++TAPTGEYL  GSFMIRGKKNF
Sbjct: 524 EIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPSQVTRTAPTGEYLPSGSFMIRGKKNF 583

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHL 699
           +PP  L+MG G+LFR+DE S+  H+
Sbjct: 584 MPPSQLVMGLGILFRMDEESIERHV 608



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            LT  PL  D LL+ +PV  PYSA+ +YKYRVK+ PG  K+GK     I++F
Sbjct: 805  LTAQPLDEDTLLFAVPVVAPYSALSTYKYRVKVTPGIGKRGKATKQAIELF 855


>gi|328723949|ref|XP_001945685.2| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Acyrthosiphon pisum]
          Length = 987

 Score =  421 bits (1082), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 270/742 (36%), Positives = 398/742 (53%), Gaps = 71/742 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V  +++  GMR   VYD+  KTY+FK            +EK +LL+E
Sbjct: 1   MKTRFSTLDIMCVVNEIQKYKGMRLQRVYDIDHKTYLFKF--------QLNNEKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T Y   K   PS F++KLRKH+  +RLE + Q+G+DRII  QFG+G  A++VI
Sbjct: 53  SGVRLHVTNYEWTKNEAPSSFSMKLRKHLSNKRLEKLTQMGFDRIIDLQFGVGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY                        DKG  I++   Y   +  +    T  +     
Sbjct: 113 LELY------------------------DKGNIILADKDYI--MINILRPHTEDEKQKFF 146

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P++   +++N    +      +        K F  S   N               
Sbjct: 147 VKEVYPNSRPKNRLNPPTEDSLIQILKTAKHSTNLKKFIFSNFPN--------------- 191

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
                 L YG  L EH+++  G   N ++  E N   D  IQ L+      E +L ++ +
Sbjct: 192 -----CLDYGNCLLEHMLISGGFPTNTRIGIEFNI--DTDIQKLMNCFCIAEKFLDNITT 244

Query: 301 GDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
              + EG+I+ + ++ L  D    E  ++     E+ P L  Q +      +E+F+ A+D
Sbjct: 245 ---LKEGFIIQKIDQQLLPDGIMKELCTN----QEYHPFLFAQHQKLPSKTYESFNEAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYS +ESQ+ + +   +E  A  KL  I  D E R+  L+   D     AELI  NL+ 
Sbjct: 298 EFYSNLESQKYDVKCMQQEKGAVKKLQNIVKDHEERLKKLQDTQDEHKFKAELITNNLDL 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLLLSNNLDE 478
           VD  I  VR A+A ++ W+++  M+++     +     ++  L L  N ++L L +  +E
Sbjct: 358 VDNTIQFVRQAVAKQLHWDEIWDMIRQLNFEDDGCTYAIVKNLKLSVNHITLQLFDPYNE 417

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            +  E+   +  +++DL  SA  NA R+Y  KK+   K++KTI + S   K AEKKT+  
Sbjct: 418 ENKNEENSQL--IDIDLGQSAFGNAERYYGSKKQSAIKEKKTIDSSSTVLKMAEKKTKQT 475

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           +   + VA+I+ +RK +WFEKF WFISSENYLVI+GRDA QNE+IVKRYM   DVYVHA 
Sbjct: 476 LKDMQVVASINKVRKTYWFEKFYWFISSENYLVIAGRDAHQNEVIVKRYMKSSDVYVHAG 535

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPT 657
             GA++ +IKN    QPVPP TLN+A    + +S +W  K+ + +A+WV P QVSKTAPT
Sbjct: 536 FSGATTVIIKN-PINQPVPPATLNEAAVMAISYSVSWTMKINLQNAFWVKPEQVSKTAPT 594

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE-EGMDDFED 716
           GEYLT GSFMIRGKKN+LP   LI+G   LF+L++SS+  H NER+++G E EG+D+ E 
Sbjct: 595 GEYLTTGSFMIRGKKNYLPATHLILGLSFLFKLEDSSIPRHANERKIKGIECEGLDNIEQ 654

Query: 717 SGHHKENSDIESEKDDTDEKPV 738
           +    EN   E++ D+  EK +
Sbjct: 655 NNDEFENIPSENDSDEDLEKNI 676



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 36/55 (65%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            +D LTG P   D LL+ +PV  PY+A+ +YKY++K+ PG  K+GK  +   +L L
Sbjct: 891  LDSLTGVPYAEDELLFAVPVVAPYTALTNYKYKLKLTPGNTKRGKASKTCLNLFL 945


>gi|322693747|gb|EFY85597.1| serologically defined colon cancer antigen 1 [Metarhizium acridum
            CQMa 102]
          Length = 1063

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 352/1124 (31%), Positives = 540/1124 (48%), Gaps = 201/1124 (17%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV      L + L+ +R +NVYDLS K  +FK              K  L++
Sbjct: 1    MKQRFSSLDVKVIAHELNQSLVTLRLANVYDLSSKILLFKFAKPDN--------KKQLVV 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            ++G R H T ++R     PS F  +LRK ++TRRL  V Q+G DRI+  QF  G   + +
Sbjct: 53   DTGFRCHLTKFSRTTAAAPSAFVARLRKLLKTRRLTSVSQVGTDRILQLQFSDGQ--YRL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
             LE +A GNI+LTD++  +L+L R+  + D         +Y  E  + F      T  ++
Sbjct: 111  FLEFFASGNIILTDADLKILSLARNVSEGDGQEPQRVGLQYSLENRQNFHGIPPLTRERV 170

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              AL S+ +  A  P            ASK+   G+ GG   DL K              
Sbjct: 171  QVALQSAVDKAAATP------------ASKKP-KGKPGG---DLRK-------------- 200

Query: 238  PTLKTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
              L   + E     P L +HI+     DT + P  ++ E   L D  +++L  A +  E 
Sbjct: 201  -CLAVSITE---LPPVLVDHILQSNNFDTAVNP-AEILENEVLLDELVKLLSEAKSSVEG 255

Query: 294  WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ--------IYDEFCPLLLNQFR- 344
                 I+   +  GYI  + +    D  P +    ++        +Y++F P + ++ + 
Sbjct: 256  -----ITSSEICTGYIFAKRR----DGNPIKEAQGSEAATNRGELLYEDFHPFIPHKLQR 306

Query: 345  --SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
              S + ++F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQ  R+  L+  
Sbjct: 307  DPSIKALEFKGYNQTVDEFFSSLEGQKLETRLNEREAAAKRKLDAAKADQAKRIEGLQDA 366

Query: 403  VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
               +++ A  IE N+E V  A+ AV   LA  M W D+ ++V+ E+K  NPVA +I   L
Sbjct: 367  QTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLVEREKKRKNPVADIIVLPL 426

Query: 462  YLERNCMSLLL---------------SNNLDEMDDEEKTLPVEK---------VEVDLAL 497
             L  N ++L L               +++ D  D+ E +   +K         VE++L L
Sbjct: 427  NLAENLITLSLAEEEEEEAEEADPFETDDSDSEDENEASTISKKSEKPAKGLNVEINLKL 486

Query: 498  SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
            S  +NAR +YE ++    K+EKT    S+A K AE+K     +  + QEK +  +  +RK
Sbjct: 487  SPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIVEDLKKGLKQEKAL--LQPIRK 544

Query: 554  VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
              WFEKF WFISS+ YLV+ G+DAQQNE++ KRY+ KGDVY HADL GA S +IKN+   
Sbjct: 545  QLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCHADLRGAPSVIIKNNPST 604

Query: 612  PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
            P+ P+PP TL QAG  +VC S+AWD K    AWWV   QVSK+ P G++L  G+FM+RG+
Sbjct: 605  PDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKSGPAGDFLPTGNFMVRGQ 664

Query: 672  KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKD 731
            KNFL P  L++G G++F++ E S   H+  R        + D + +      SD+ + ++
Sbjct: 665  KNFLAPAQLLLGLGIMFKISEESKARHVKHR--------IHDVDSA----LGSDVATSRN 712

Query: 732  DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEF-PAEDKTISN--GIDSKIFDIARN-- 786
            D                      + AS  DS E  P +D T S+    D +  + AR   
Sbjct: 713  DM--------------------QSLASVADSQEKEPEDDVTQSDNESDDGREQEDARANP 752

Query: 787  VAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKA 846
            + AP   + +D +D A G  S S++ T+    T + D   ED+  E T T RD+  ++  
Sbjct: 753  LQAPDAAE-DDEVDEATGAVS-SLNLTEQ--PTGEGD--GEDEAAE-TGTSRDESELATE 805

Query: 847  ERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
                  K   S+                ++ P S  +K     G   RGQ+GK KK+  K
Sbjct: 806  ASEAPTKTSDSTT-------------QTAATPSSHSKK-----GPPKRGQRGKAKKIALK 847

Query: 907  YGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKA 965
            Y DQDEE+R    AL+ A+ G         Q    +  K K    + +DA +   + +  
Sbjct: 848  YKDQDEEDRAAAEALIGATVG---------QKRQEAEAKAKADRQAELDAARERRRAQHQ 898

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEE-DIHEIGEEEKGRLNDVDYL 1024
                K+  EH                    E+ +V M+E  D+ +  E EK     +D L
Sbjct: 899  -RQQKEIAEH-------------------EEVRRVMMDEGIDVLDADEAEKA--TPLDAL 936

Query: 1025 TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             G PLP D +L  IPVC P++A+  +KY+ K+ PG  KKGK  +
Sbjct: 937  VGTPLPGDEILEAIPVCAPWNALGKFKYKAKLQPGAVKKGKATK 980


>gi|452981583|gb|EME81343.1| hypothetical protein MYCFIDRAFT_114319, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 1087

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 321/963 (33%), Positives = 469/963 (48%), Gaps = 128/963 (13%)

Query: 14  EVKCL-----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VKC+       L  +R +NVYDLS + ++ K              +  LL++SG R H 
Sbjct: 9   DVKCIAHELSNSLTTLRLANVYDLSTRIFLLKFQKPE--------HREQLLVDSGFRCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T +AR     PS F  +LRK ++TRR   V+Q+G DR+I  QF  G  A+ + LE YA G
Sbjct: 61  TKFARATAAAPSPFVARLRKFLKTRRCTAVKQIGTDRVIELQFSDG--AYRLFLEFYAGG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           NI+LT                D  + I++  R  +E     +     K + +L  + E  
Sbjct: 119 NIVLT----------------DNELTILALLRSVSEGAEHEQYRQGLKYNLSLRQNHE-- 160

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG--------ARAKQPTL 240
                        V + +KE L          L K   K   +          +A  P  
Sbjct: 161 ------------GVPSLTKEWL-------KESLQKTVEKQQAEAQKPGKKIKKKAGDPLR 201

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +      + P L +H +  +G+   ++   V  LE + +   VL   K  + +   I+
Sbjct: 202 KALAVTTTQFPPVLLDHALHVSGVDRELQPERV--LEHDELLEKVLQALKQAESVVAEIT 259

Query: 301 GDIVPEGYILMQNKHLGK--DHPPTESGSSTQIYDEFCPLLLNQF---RSREFVKFETFD 355
              V +GYIL + K   K  D   T       +Y+ F P    Q    +S  F++++ F+
Sbjct: 260 SQPVAKGYILGKRKQSSKQEDTDGTADEGKDVMYEHFHPFKPAQLAEDQSFVFLEYDGFN 319

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DEF+S IE Q+ E + + +ED A  ++     +QE R+  L+Q  +  V+ A+ IE 
Sbjct: 320 VAVDEFFSSIEGQKLESRLQEREDNAKKRIEHARKEQEQRIEGLQQVQELHVRKAQAIEA 379

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN 474
           N+E V+ A  AV   +A  M W D+  +++ E+   NPVA LI   L L  N ++LLLS 
Sbjct: 380 NVERVEEATAAVNGLIAQGMDWADIGSLIENEQARHNPVAELIKLPLKLHENTITLLLSE 439

Query: 475 ---------------NLDEMDDEEKTLPVEK------VEVDLALSAHANARRWYELKKKQ 513
                            D  D + +T P         V++DLA SA +NAR++Y+ K+  
Sbjct: 440 IGRDADEEMDVTDSEPSDSEDGDAETAPARAEDKRLTVDIDLAASAWSNARQYYDQKRTA 499

Query: 514 ESKQEKTITAHSKAFKAAEK----KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            SKQE+T  A  KA K+ E+    K +  + QEK V  +  +RK  WFEKF +FISS+ Y
Sbjct: 500 ASKQERTEAASKKALKSTEQNVMAKLKKDLKQEKDV--LRPVRKQFWFEKFIYFISSDGY 557

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCF 627
           LV++GRD  QNEM+ +R++ KGDVYVHADL+GASS VIKN  H P  P+PP TL QAG  
Sbjct: 558 LVLAGRDDLQNEMLYRRHLRKGDVYVHADLNGASSVVIKNSPHTPCAPIPPSTLAQAGDL 617

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AWDSK V SAWWV   QVSKTA TGEYL VGSF+IRGKKNFLPP  L++GFG++
Sbjct: 618 VVCRSSAWDSKAVMSAWWVNAEQVSKTADTGEYLAVGSFIIRGKKNFLPPARLLLGFGVM 677

Query: 688 FRLDESSLGSHLNERRVRGEE-EGMDDFEDSGHHKENS-----DIESEKDDTDEKPVAES 741
           F++ E S   H+  R +R +  +   D  D+    E+S       +   DD  + P A  
Sbjct: 678 FQISEESKARHVKHRLLRQDSYQATPDLTDAETIAESSAAGEPSDDGSDDDFPDAPPAPR 737

Query: 742 LSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDR 801
           +   +   P  ++      D  E     ++  N + S  FD A +         +D  D 
Sbjct: 738 IEDEDDGFPDRTYGTPDYNDDEE--EHSRSQRNPLQSSAFD-AHDNDDHEDEDGDDEKDE 794

Query: 802 ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
             G    S +  + G E T+  ++  D+  +   +      +S  ER+ L K +      
Sbjct: 795 ETGSVEGSTNGAELGREDTESTVTPADQEEQPETSA----PLSNKERKALAKFE------ 844

Query: 862 PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
                     KD   QP    +  +I+   + RG++GK KK+ EKY DQDEE+R I M L
Sbjct: 845 ----------KDKKPQPSQKAKAKQIKA--LVRGKRGKAKKLAEKYADQDEEDREIAMRL 892

Query: 922 LAS 924
           L S
Sbjct: 893 LGS 895



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 33/51 (64%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +D L G PL  D +L  IP+C P++A+  +KY+ K+ PG  KKGK ++
Sbjct: 958  LTQLDTLIGQPLAGDEILEAIPICAPWAALGRFKYKAKMQPGQQKKGKAVR 1008


>gi|301106825|ref|XP_002902495.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262098369|gb|EEY56421.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1051

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 272/775 (35%), Positives = 421/775 (54%), Gaps = 95/775 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL-------SPKTYIFKLMNSSGVTESGE 52
           M K RM+  D+ A V  +R  ++ MR +N+YD+       + KTYI KL           
Sbjct: 1   MKKTRMSIDDIHAMVGSIRANVVNMRVTNIYDVQGQGDSGAAKTYILKLHQPP------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKN---TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
             KV LL+ESGVR HT+ YARD K     PS FT+KLRKH+R +RL  + QL  DR++ F
Sbjct: 54  FPKVFLLLESGVRFHTSKYARDAKAGNALPSQFTMKLRKHLRGKRLSALTQLEGDRVVDF 113

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FG      ++ILELYA GNI+LTD ++ +L             +++  HR+   +    
Sbjct: 114 TFGQDALKCHLILELYASGNIILTDGDYRIL-------------SLLRTHRFDENVKMAV 160

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           ++    +L       K+      +++ E  N            Q+  K+        +  
Sbjct: 161 KQEYPVQLLG--DQEKQRGIQTTEQLTEFVNRWFE--------QQEAKAAIALPGKTQKK 210

Query: 230 NDGARAKQPTL--KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVL 284
                 KQ  L  ++  G   G GP + EH ++   + P +K+   +E   L ++ +  L
Sbjct: 211 KKAQTIKQLLLVKESTFG---GLGPVIIEHCLVRAAISPTLKIKNAAEFTTLGEDKLAAL 267

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG-------KDHPPTESGSST-------- 329
           +  + +    L+ +        G + +Q+           +D  P     ST        
Sbjct: 268 LAEIQEGWKLLERLQDEQTSVNGPVPLQSDDTADTGDSDEEDAAPVAKDPSTTSQKCGFI 327

Query: 330 -----------QIYDEFCPLLLNQ-FRSREFVK-FETFDAALDEFYSKIESQRAEQQHKA 376
                      + ++EF P L  Q  ++ + VK F+TFD A+DE++S+ E++ AE   ++
Sbjct: 328 ILKDVAGENAPEQFEEFTPYLYAQHLQAYKKVKSFDTFDEAVDEYFSRFEAETAEVAKQS 387

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
            + AA +KL K+  +Q+ ++  L++  ++S + A+LIE N +DV+  +L +R ALA+ M 
Sbjct: 388 AQLAAENKLAKLKKNQQQQLAQLREVQEQSFQDAQLIEANQQDVENVLLVIRSALASGMD 447

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK------ 490
           W  L  +V+ E+K GNPVA LI +L LE N +++LL ++ ++  ++      E+      
Sbjct: 448 WRGLEELVRYEQKNGNPVASLIHQLDLEHNRVAILLCDSDEDDYEDGGDGTGEEDKKAHV 507

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           + +DL+LSA ANAR  Y  KKK  +K +K   A  KA   AEK T+  + +++T  N+ +
Sbjct: 508 IWIDLSLSALANAREIYTKKKKAGAKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIY 567

Query: 551 MR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
            R K  WFEKF+WF+++E YLV++G+DA QNE++VKRY+ KGDVYVHADLHGA++ +++N
Sbjct: 568 QRRKTLWFEKFHWFLTNEKYLVVAGKDAHQNELLVKRYLRKGDVYVHADLHGAATCIVRN 627

Query: 610 HRP-------EQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           H         E P +P  TL QAGC +VC S AW S+++  A+WV+  QVSKTAP GEYL
Sbjct: 628 HATVKDKKTQELPSIPVATLEQAGCMSVCRSNAWTSQVIAGAYWVHADQVSKTAPAGEYL 687

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE---RRVRGEEEGMDD 713
           T GSFMIRGKKN++ P  L MG  +LFR+D+S +G+H  +   R +R  E   DD
Sbjct: 688 TTGSFMIRGKKNYIQPSRLEMGLAILFRIDDSCIGNHARQGEGRDLRVAEGPEDD 742



 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 116/228 (50%), Gaps = 39/228 (17%)

Query: 840  KPYISKAERRKLKKGQGSSVVDPKVEREKER-GKDASSQPESIVRKTKIEGGKISRGQKG 898
            K  +S  ERR LKKG+   ++  +   +++R GKD +S    ++   K    K  RG+KG
Sbjct: 767  KKRLSVKERRDLKKGKTPELIGEQPPAQQQRKGKDKAS----VLTAQK----KSVRGKKG 818

Query: 899  KLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKV 958
            K+KKMK+KY DQD+E+R +RM  L    + ++ D +P  EN        PA    D    
Sbjct: 819  KMKKMKKKYADQDDEDRRLRMEALGHVVEEEQEDEEPTKEN-------DPAEQSGD---- 867

Query: 959  CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRL 1018
                              +D  +    N    + E     +   +E+ + E  +E +G +
Sbjct: 868  ------------------EDGEYVAGGNAQTEVSEEYIRQQREKKEKYLDEQEDEAEG-V 908

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            +  D  TG PLP+DI+L+ +P+C PY+++  +KY+VK+ PG+ KKGK 
Sbjct: 909  DFFDAFTGEPLPNDIVLFAMPMCAPYASLTKFKYKVKLTPGSQKKGKA 956


>gi|145509741|ref|XP_001440809.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408037|emb|CAK73412.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1071

 Score =  414 bits (1064), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 249/719 (34%), Positives = 393/719 (54%), Gaps = 89/719 (12%)

Query: 3   KVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           K+R+   D+ A V  L+ +LIG R SN+Y++  KTY+FK         S +  K  L++E
Sbjct: 5   KIRLTALDIMALVTELKQKLIGTRLSNIYNIDAKTYVFKF--------SLQESKSYLVIE 56

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G+R + +    +K   PSGFT+K RK +R+RRLE + Q+G +R+++F FG   + +Y+I
Sbjct: 57  NGLRFNLSD-TIEKNKVPSGFTMKFRKFLRSRRLESIEQIGVERVVVFTFGREDHTYYLI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY+QGNI+L D ++ ++ L R H +  + V +     YP      FE T  + L    
Sbjct: 116 LELYSQGNIILADKDYRIIQLTRQH-EFSENVKVAPNEIYP------FEYTATNYLEKFD 168

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           TS +       +K                                     G + K+   K
Sbjct: 169 TSMERIQKVISEK------------------------------------QGQKLKEVVFK 192

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVI 299
            V        P L + +  D     NM  +E  VN+ +         +V K  D+  D I
Sbjct: 193 LV--------PCLHQSLTDDIIQQLNMNQNEKIVNQFD---------SVKKVVDFAMDYI 235

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           +       Y      +L     P ++    + +D F       ++ +  V+  TF+ A+ 
Sbjct: 236 NKYRAQTQY----KGYLCAKEAPKDAEQKPKFFD-FAADQPAYYQGKYVVETPTFNQAVH 290

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           +++  ++  R E+  ++ ED A+ K   I  DQ +R+  L++E D  +  A LI+ N+ D
Sbjct: 291 QYFLVVD--RQEENKQSIEDIAWKKFENIKQDQMSRIQKLQEEQDEYIMKAGLIQENIND 348

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           V A I  ++  + N + W+ + RM+ + +K GNP++ +I  + L++N +++LL N  DE 
Sbjct: 349 VQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMNLKQNKVTILLGNKDDEY 408

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
            D      + ++E+D+  SA+ NAR++YE KKK   K+ KT  A  +A K AEK    +I
Sbjct: 409 SD------LIQIEIDITQSAYQNARKYYESKKKNRDKEIKTKEAVEQALKQAEKTALKEI 462

Query: 540 LQEKT-VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            +EK  +  + + RK +WFEKF WFISS+ YLVISG+D QQNEMIVKRYM+K D+Y+HAD
Sbjct: 463 EREKNKIQKVQNQRKKYWFEKFFWFISSDGYLVISGKDVQQNEMIVKRYMNKDDIYMHAD 522

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           ++G++ST++KN   E P+P  T+ QA   T+C S++WD+K+V SAWWV+  QVSK+APTG
Sbjct: 523 IYGSASTIVKNP-SEGPIPEATIMQAATATICRSKSWDAKIVVSAWWVHASQVSKSAPTG 581

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER--RVRGEEEGMDDFE 715
             +  GSFMI GKKNF+ P  L MG  +L++LD+ S+  H  ER  ++R E+  +D+ E
Sbjct: 582 MNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQDSIKRHEEERKKKLREEQSQVDESE 640



 Score = 44.7 bits (104), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            E+E    +++  L       D  L +IP+  PYS + +YK+++KI PG+ KKGK 
Sbjct: 953  EDENIEYSEMQKLVSYLYADDKYLSLIPMVAPYSVLGNYKFKIKIAPGSLKKGKA 1007


>gi|195451571|ref|XP_002072981.1| GK13887 [Drosophila willistoni]
 gi|194169066|gb|EDW83967.1| GK13887 [Drosophila willistoni]
          Length = 1004

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 293/782 (37%), Positives = 423/782 (54%), Gaps = 118/782 (15%)

Query: 306  EGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYS 363
            +GYI+       K+  PT+ G     +   EF P L +Q +  E  + ETF  A+DEF+S
Sbjct: 284  KGYIMQV-----KEEKPTDGGDVDYFFRNVEFHPFLFSQLKHLEVEEHETFMTAVDEFFS 338

Query: 364  KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVD 421
            K ESQR + +   +E  A  KL+ I  D   R+  L   Q VD+  + AELI  N   VD
Sbjct: 339  KQESQRIDMKTLGQERDALKKLSNIKNDHAQRLEDLNKVQSVDK--RKAELITCNQSLVD 396

Query: 422  AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS-----NNL 476
             AILAV+ A+A+++ W D+ ++VKE +  G+ VA  I +L LE N +SL+LS     N+ 
Sbjct: 397  KAILAVQSAIASQLPWPDIRQLVKEAQANGDIVANSIKQLKLETNHISLILSDPYSANDS 456

Query: 477  DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
            DE DDEE   P+  V+VDLALSA ANARR+Y+LK+    K++KT+ A  KA K+AE+KT+
Sbjct: 457  DEDDDEESEEPM-IVDVDLALSAWANARRYYDLKRSAAKKEQKTVDASEKALKSAERKTQ 515

Query: 537  LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
              + + +T++NI+  RKV WFEKF WFISSENYLVI GRDAQQNE+IVKRYM   D+YVH
Sbjct: 516  QTLKEVRTISNIAKARKVFWFEKFYWFISSENYLVIGGRDAQQNELIVKRYMRPKDIYVH 575

Query: 597  ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
            A++ GASS +I+N   ++ +PP TL +AG   + +S AWD+K++T+A+WV   QVSKTAP
Sbjct: 576  AEIQGASSVIIRNPNADE-IPPKTLLEAGTMAISYSVAWDAKVITNAYWVTSDQVSKTAP 634

Query: 657  TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
            TGEYL  GSFMIRGKKNFLP   LIMG   LF+L++S +  H  ER++R  +E  +D + 
Sbjct: 635  TGEYLGTGSFMIRGKKNFLPSCHLIMGLSFLFKLEDSFVQRHAGERKIRSTDEDPNDIDL 694

Query: 717  SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI 776
                  N  +    +D       ESL       P+ +  N  N D+              
Sbjct: 695  KQCDIANDGLPEISED------GESL-------PSQNVNNIENADN-------------- 727

Query: 777  DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
                            P  E  I+   G  +    S   G E      +E +  + + AT
Sbjct: 728  --------------AFPDTEVKIEHDTGRVTIRTDSYPQGSEPA----TEPENDLTKNAT 769

Query: 837  VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIE----GGKI 892
              ++  I  A   + K+ + ++      +R+ ++G+   ++P++ V + ++E     G +
Sbjct: 770  EDEETTIIAAAPARQKQQKSNN------KRKDDKGR--KNKPQNQVTEVEVEPKPNTGVL 821

Query: 893  SRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISP 952
             RGQK KLKKMK KY DQDEEER +RM +L S+G                   K+ A +P
Sbjct: 822  KRGQKSKLKKMKLKYKDQDEEERKLRMMILNSSG-------------------KETAKAP 862

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
              +     +  KA  + +D    P          P V +D+  +M   A    D+  +  
Sbjct: 863  NSSVDEKTEVTKAAEVKRDRNPMP---------KPQVEIDDNEDMPTGA----DVEML-- 907

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYS 1072
                     + LTG PL  D LL+ IPV  PY A+Q YK++ K+ PGT K+GK  ++  +
Sbjct: 908  ---------NTLTGQPLEDDELLFAIPVVAPYQALQQYKFKAKLTPGTGKRGKAAKLALN 958

Query: 1073 LL 1074
            + 
Sbjct: 959  MF 960



 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 109/163 (66%), Gaps = 6/163 (3%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V  L+RL+G+R + +YD+  KTY+ +L  + G     E+EKV LL+E
Sbjct: 1   MKTRFSTYDIICGVAELQRLVGLRVNQIYDIDNKTYLIRLQGTGG-----ETEKVTLLIE 55

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTTA+   K   PSGF++KLRKH++ +RLE + QLG DRI+  QFG G  A++VI
Sbjct: 56  SGTRFHTTAFEWPKNVAPSGFSMKLRKHLKNKRLEHIHQLGADRIVDLQFGTGDAAYHVI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           LELY +GN++LTD E T+L +LR H + +  +    R +YP +
Sbjct: 116 LELYDRGNVILTDYEQTILYILRPHTEGE-ALRFAVREKYPID 157


>gi|303312187|ref|XP_003066105.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240105767|gb|EER23960.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1125

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 334/1123 (29%), Positives = 536/1123 (47%), Gaps = 172/1123 (15%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +         ++
Sbjct: 1    MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------FIV 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53   DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQIGTDRIVHIEFSDGY--YHL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
             LE +A GNI+LTD+E+ ++ LLR   + +    +    +Y  +  + +E     +  +L
Sbjct: 111  FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171  KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238  PTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
              L+  L  +LG   Y P L EH +     D+ L P+  L   +++ D      ++ V +
Sbjct: 198  EALRRAL--SLGFPEYPPVLLEHALHVTGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291  FEDWLQDVISGDIVPEGYILMQNKHLGKDHPP--TESGSSTQIYDEFCPLLLNQFRSR-- 346
              + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250  EVESVSNELSTTEQTRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPFAPRQFADGND 309

Query: 347  -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
               + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310  ISILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406  SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
              + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370  HTRKAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465  RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
             N +++LL                   +E D E KT P    V  V++DL L+  ANA +
Sbjct: 430  ENTVTVLLPEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
            +Y+ KK    K+EKTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490  YYDQKKTAAVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562  WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPP 618
            +FISS+ YLV+ G D +QNE++  R++ KGDVYVHAD+ GA   ++KN +P   + P+PP
Sbjct: 548  FFISSDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN-KPGASDAPIPP 606

Query: 619  LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
             TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +IRG KN L P 
Sbjct: 607  GTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPG 666

Query: 679  PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV 738
             LI+GF ++F++   S+ +H    R R EE    +      H+  +   SE +  +E P 
Sbjct: 667  QLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEPSEMEKLEESP- 722

Query: 739  AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
                           +T   N    +   E K   N  D     + ++    + PQ+++ 
Sbjct: 723  ---------------NTAVDNCSIGKVGMEQKPRENTWD---LPVEQSAQTGIAPQVKE- 763

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQG-- 856
                   G A +S            LS+ D   +  A      ++S  ERR +K+G G  
Sbjct: 764  -----PQGEAGLSREDKDT------LSDPDLQQQLAAFGATTKHVSAQERRLMKRGAGLH 812

Query: 857  -SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-RGQKGKLKKMKE 905
             S++ +  ++ E E  ++  S P +          ++ T     ++  RG++GK KK+  
Sbjct: 813  ASALPELGLDEEDEDEEENQSTPSTFKPSGTPTLSIQSTSTSKSQLPVRGKRGKAKKLAS 872

Query: 906  KYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKA 965
            KY DQDEE+R + + LL S  K        ++  A    +K              + ++A
Sbjct: 873  KYKDQDEEDRELALRLLGSTPKTTTPKKTKEDREAEIQAQK--------------ERRRA 918

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLT 1025
             H          D +   E        +  E    A++  D  ++ E+    L+ +  L 
Sbjct: 919  QH----------DKAAQAERRRQESFQKRPEGQNQALDMADAEQVVED----LSSLPALV 964

Query: 1026 GNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            G P+  D ++  IPVC P+SA+  YKYR K+ PG   KGK ++
Sbjct: 965  GTPVLGDEIISAIPVCAPWSALGQYKYRAKLQPGPTGKGKIVK 1007


>gi|397618049|gb|EJK64734.1| hypothetical protein THAOC_14501 [Thalassiosira oceanica]
          Length = 1217

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 288/788 (36%), Positives = 425/788 (53%), Gaps = 102/788 (12%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPK----------TYIFKLMNSSG----- 46
           K R +  DVA+    L+R ++G + +N+YD S             Y+FKL + SG     
Sbjct: 5   KTRFDGLDVASMCSHLKRTMMGFKLANIYDGSSLGVSGGSDSKGVYMFKLADPSGGSAAT 64

Query: 47  ------VTESG---ESEKVLLLMESGVRLHTTAY---ARDKKNTPSGFTLKLRKHIRTRR 94
                  TE G   ES++ +LL+ESGVR H T +   +      PS F +KLRKH+R  R
Sbjct: 65  GKSNTSSTEDGGEAESKRAMLLIESGVRFHPTTHFSQSSSSSAMPSPFAMKLRKHLRNLR 124

Query: 95  LEDVRQLG-YDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSH------- 146
           LE+V QLG  DR++ F+FG G   H++ILELY+QGN++LTD E+ +L LLR+H       
Sbjct: 125 LENVTQLGNLDRVVDFRFGSGSYTHHLILELYSQGNLVLTDGEYRILALLRTHEYEVKDG 184

Query: 147 -RDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA-LTSSKEPDANEPDKVNEDGNNVSN 204
            +D+ +GV    +      +  V+  T A+ L     T +   D N+   ++    N   
Sbjct: 185 KKDEREGV---EKEEVKVRVGNVYPVTLATTLSMDDRTENSGEDGNKSGLLSMSAENAFE 241

Query: 205 ASKENL-GGQKGGKSFDLSKNSNKNSNDGARAK---QPTLKTVL---GEAL-GYGPALSE 256
            +K  L   Q+  ++ +  ++  K    G + +      LK +L   G  +  YGP+L E
Sbjct: 242 WAKSELVATQQRARTVNSQQHGGKGKKKGKKKQLDENLVLKALLLRPGSGVYHYGPSLVE 301

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ-----------DVISGDIVP 305
           H IL  GL P +KL+  N +E        L    + D L+           ++ S D   
Sbjct: 302 HCILFAGLEPTLKLNADN-IE------YTLPSGSWGDLLESLRDEGSVVLGNLQSPDSAG 354

Query: 306 EGYILMQNKHLGKDHPPTESGSST--------QIYDEFCPLLLNQFRSREFVKFETFDAA 357
            GYIL + K   +     ++ + T        +   EF P LL Q +++  + + TF  A
Sbjct: 355 SGYILYKPKETKESLQEQKNDAQTAPQNPHSDKTLLEFQPHLLIQHKNQPHLTYSTFATA 414

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
            DEF+S + SQ+A  +  A E AA  +L KIH DQ  RV  L +E D+    A L+E + 
Sbjct: 415 TDEFFSNLSSQKAAARADAAESAARERLAKIHADQARRVDGLVREQDKFRDAARLVELHA 474

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           +DVD A+  +  AL + M W+ L ++V  E+   NP+A LI KL L+++ + L L + +D
Sbjct: 475 DDVDRALGVINGALQSGMDWDQLEQLVTVEQGNENPIALLIHKLVLDKDEIMLALPD-ID 533

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH------------S 525
             +DE +  P+  V V++  SAH NAR  Y + +  + K+ KTI A              
Sbjct: 534 NWEDESEAPPIVIVTVNIKESAHGNARAKYAVYRASKEKERKTIEASETALKAAEAKAKQ 593

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHW-FEKFNWFISSENYLVISGRDAQQNEMIV 584
           +  +A ++K R Q+    +V +  +   + +   KF WFI+S+NYLV++G+DAQQNE +V
Sbjct: 594 QLAEAQKRKARKQL----SVNSQVYQGNLQFCLNKFAWFITSDNYLVVAGKDAQQNEQLV 649

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQ--------PVPPLTLNQAGCFTVCHSQAWD 636
           KRY+  GD Y+HA++HGA++ V++  R  +        P+    L +AG FT+C S AW 
Sbjct: 650 KRYLRPGDAYLHAEIHGAATCVLRAKRRRRKDGKTQVMPLSDQALREAGTFTICRSSAWS 709

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSL 695
           SKMVTSA+WV  HQVSKTAPTGEYLTVGSFMIRG+KNFLP   L MG G+LFRL D+ S+
Sbjct: 710 SKMVTSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPASTLEMGVGVLFRLGDDVSV 769

Query: 696 GSHLNERR 703
             H NERR
Sbjct: 770 ARHANERR 777



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 31/43 (72%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            LTG P+  D+LL+ +PV  PY+ +  YKYRVK+ PG+ K+GK 
Sbjct: 1106 LTGKPVGQDLLLHALPVVAPYNVLSQYKYRVKLTPGSVKRGKA 1148


>gi|145494650|ref|XP_001433319.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400436|emb|CAK65922.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1070

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 248/717 (34%), Positives = 391/717 (54%), Gaps = 85/717 (11%)

Query: 3   KVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           K+R+   D+ A V  L+ +LIG R SN+Y++  KTY+FK         S +  K  L++E
Sbjct: 5   KIRLTALDIMALVTELKQKLIGTRLSNIYNIDAKTYVFKF--------SLQESKSYLVIE 56

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G+R + +    +K   PSGFT+K RK +R+RRLE + Q+G +R+++F FG   + +Y+I
Sbjct: 57  NGLRFNLSD-TIEKNKVPSGFTMKFRKFLRSRRLESIEQIGVERVVVFTFGREDHTYYLI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY+QGNI+L D ++ ++ L R H +  +   +     YP      FE T  + L    
Sbjct: 116 LELYSQGNIILADKDYRIIQLTRQH-EFSENAKVAPNEIYP------FEYTATNYLEKFD 168

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           TS +                +     E  G +     F L                P L 
Sbjct: 169 TSMER---------------IQKVVSEKAGQKLKEVVFKLV---------------PCLH 198

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
                      +L++ II    +  N K+  VN+ E+         V K  D+  + I+ 
Sbjct: 199 Q----------SLTDDIIQQLQMNQNEKI--VNQFEN---------VKKVVDYAMEYINK 237

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
                 Y      +L     P ++    + +D F       ++ +  ++  TF+ A+ ++
Sbjct: 238 YRAQTQY----KGYLCAKEAPKDAEQKPKFFD-FAADQPAYYQGKYVIETPTFNEAVHQY 292

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +  ++  R E   ++ ED A+ K   I  DQ +R+  L+ E D  +  A LI+ N+ DV 
Sbjct: 293 FLVVD--RQEDNKQSIEDIAWKKFENIKQDQMSRIQKLQSEQDEYIMKAGLIQENINDVQ 350

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
           A I  ++  + N + W+ + RM+ + +K GNP++ +I  + L++N +++LL N  DE  D
Sbjct: 351 AIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMNLKQNKVTILLGNKEDEYSD 410

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
                 + ++E+D+  SAH NAR++YE KKK   K+ KT  A  +A K AEK    +I +
Sbjct: 411 ------LIQIEIDITQSAHQNARKYYESKKKNRDKEIKTKEAVEQALKQAEKTALKEIER 464

Query: 542 EKT-VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           EK  +  + + RK +WFEKF WFISS+ YLVISG+D QQNEMIVKRYM+K D+Y+HAD++
Sbjct: 465 EKNKIQKVQNQRKKYWFEKFFWFISSDGYLVISGKDVQQNEMIVKRYMNKDDIYMHADIY 524

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           G++ST++KN   E P+P  T+ QA   T+C S++WD+K+V SAWWV+  QVSK+APTG  
Sbjct: 525 GSASTIVKNPN-EGPIPEATIMQAATATICRSKSWDAKIVVSAWWVHASQVSKSAPTGMN 583

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER--RVRGEEEGMDDFE 715
           +  GSFMI GKKNF+ P  L MG  +L++LD+ S+  H  ER  ++R E+  +D+ E
Sbjct: 584 IPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQDSIKRHEEERKKKLREEQSQVDESE 640



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 7/82 (8%)

Query: 985  DNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPY 1044
            DN    +D   E  +  +E+ED   I   E  +L  V YL     P D  L +IP+  PY
Sbjct: 932  DNKQEEIDSDDEKQEKQVEQED-ENIEYTEMQKL--VSYL----YPDDKYLSLIPMVAPY 984

Query: 1045 SAVQSYKYRVKIIPGTAKKGKG 1066
            + + +YK+++KI PG+ KKGK 
Sbjct: 985  TVIGNYKFKIKIAPGSLKKGKA 1006


>gi|453084374|gb|EMF12418.1| hypothetical protein SEPMUDRAFT_149103 [Mycosphaerella populorum
           SO2202]
          Length = 1130

 Score =  411 bits (1056), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 265/751 (35%), Positives = 390/751 (51%), Gaps = 93/751 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L  +R SNVYDLS + ++ K      +          L++
Sbjct: 1   MKQRFSSLDVKVIAHELNASLTSLRLSNVYDLSSRIFLLKFQKPDQIRHQ-------LIV 53

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PS F  +LRK +RTRR   VRQ+G DR+I   F      + +
Sbjct: 54  DSGFRCHLTQFVRATAAQPSPFVARLRKFLRTRRCVSVRQIGTDRVIELCFSHAEGVYRL 113

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GN++LTD E+ +L LLRS  + ++        +Y  E                
Sbjct: 114 FLEFYAGGNVILTDHEYHILGLLRSVNEGEEHEQYRVGLKYDLE---------------- 157

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP-- 238
                        + N  G  V + +K  L       +  L + +N+ ++     K+   
Sbjct: 158 ------------KRQNYAGEGVPDLTKVWLKEALQRTATKLVEQANREASKKKVVKKKKG 205

Query: 239 -TLKTVLG-EALGYGPALSEHIILDTGLVPNMKLSEV---NKLEDNAIQVLVLAVAKFED 293
            +L+  L      + P L +H I    +   ++  +V    +L D  +  L +A    ED
Sbjct: 206 DSLRKALAVTTTQFPPVLLDHAIFVAKVDRELEAQQVVDSEELLDQVLSALRIAEGVMED 265

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTES-----------GSSTQIYDEFCPLLLNQ 342
                I+   + +GYIL Q K  G   P                +S  +YD+F P    Q
Sbjct: 266 -----ITSQPIAKGYILAQRKK-GMATPEKAEEEGEEEGRDADSTSGLMYDDFHPFKPAQ 319

Query: 343 FR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL 399
                +  F++ E F+ A+DEF+S IE Q+ E +   K+++A  ++     +QE R++ L
Sbjct: 320 LAEDPANVFLEHEGFNIAVDEFFSSIEGQKLESKLAEKQESARKRIEHAKKEQEQRINGL 379

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           +Q  +  V+ A+ IE N+E V+ A  AV   +A  M WED+ R++++E+K  NPVA LI 
Sbjct: 380 QQVQELHVRKAQAIEANVERVEEATAAVNGLIAQGMDWEDIGRLIEQEQKRHNPVAELIK 439

Query: 460 -KLYLERNCMSLLLS----NNLDEMDDEEKTLPVEK------------------VEVDLA 496
             L L  N M+LLLS    ++ DE  +E  + P +                   +++DLA
Sbjct: 440 LPLKLHENTMTLLLSELGADDEDEEANETDSEPSDSEDEGTNAAQVKHDAKRLTIDIDLA 499

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMR 552
            SA  NAR++Y+ K+    KQEKT+ A  KA K+ E+K     +  + QEK V  +  +R
Sbjct: 500 GSAWVNARQYYDQKRTAAVKQEKTVLASKKAIKSTEQKVMATLKKDLKQEKDV--LRPVR 557

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH-R 611
           K  WFEKF +F+SS+ YLV++G+DAQQNE++ +RY+ KGDVY+HADL GA+S +IKN   
Sbjct: 558 KQFWFEKFIYFVSSDGYLVLAGKDAQQNEILYRRYLKKGDVYIHADLDGAASVIIKNKLN 617

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           PE P+PP TL Q G   VC S AWDSK V SAWWV   QVSKTAPTGEYL  G F++RGK
Sbjct: 618 PEDPIPPSTLAQGGDLAVCTSSAWDSKAVMSAWWVNADQVSKTAPTGEYLAAGGFIVRGK 677

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           KNFLPP  L++GFG++F++ E S   H+  R
Sbjct: 678 KNFLPPAKLLLGFGVMFQISEESKAQHVKHR 708


>gi|361131825|gb|EHL03460.1| putative Nuclear export mediator factor Nemf [Glarea lozoyensis
            74030]
          Length = 1063

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 348/1130 (30%), Positives = 525/1130 (46%), Gaps = 178/1130 (15%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV      L   L+ +R SN+YDLS K ++ K              K  +++
Sbjct: 1    MKQRFSSLDVKVIAHELSNALLTLRVSNIYDLSSKIFLIKFAKPE--------HKQQIII 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +SG R H T YAR   +  S F  KLRK ++TRR+  V Q+G DRII FQF  G+   Y 
Sbjct: 53   DSGFRCHLTDYARATASDQSDFVKKLRKVLKTRRVTSVCQIGTDRIIEFQFSDGLYKLY- 111

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
             LE YA GNI        +LT        DK + I++  R P       E     +L   
Sbjct: 112  -LEFYAAGNI--------ILT--------DKELNILALLR-PVPAGEGQE-----ELRVG 148

Query: 181  LTSSKEPDANEPDKVNEDGNNVSNASKENLGG------QKGGKSFDLSKNSNKNSNDGAR 234
            L  S E   N         + V   +KE L         KG +     K + K   D  R
Sbjct: 149  LQYSLENRQNY--------HGVPGLTKERLQNALQRAVDKGDEGLVAGKKAKKKGADALR 200

Query: 235  AKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
                  K +      + P + +H +  T     +K + V + +D+ +  L+ A+ + +  
Sbjct: 201  ------KALAVSITEFPPMVVDHAMRVTSFDSTLKPAGVLQ-KDSLVDDLMKALQEAQKV 253

Query: 295  LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKF 351
            +++V S ++   G+I+ + K   +++   E  S   +YD+F P    QF S     F+++
Sbjct: 254  MEEVTSCEVAT-GFIIAKKKEGYEENSDPEHSSKNVLYDDFHPFRPAQFESDPATVFLQY 312

Query: 352  ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            E F+  +DEF+S IE QR E + + +E  A  K+     DQE R+  L+     + + A 
Sbjct: 313  EGFNKTVDEFFSSIEGQRLESKLEERELNAQRKIQAARQDQERRLDGLQAVQSLNERKAS 372

Query: 412  LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
             I+ N+E V  A+ AV   +A  M W ++ ++++ E+K  NPVA +I   L LE N ++L
Sbjct: 373  AIQANVERVQEAMDAVNGLVAQGMDWVEIGKLIEVEKKRSNPVASMIKLPLKLEENTITL 432

Query: 471  LL-------------------SNNLDEM--DDEEKTLPVEK---VEVDLALSAHANARRW 506
            LL                   S++ DE+    E K   VEK   V++DL L+   NAR +
Sbjct: 433  LLDEEVFDEDEDSAYETDDAPSDSEDEVTKQKEPKEKGVEKRLTVDIDLGLTPWKNAREY 492

Query: 507  YELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNW 562
            ++ K++  +K++KT+ + +KA K+ E K     +  + QEK V  +  +R++ WFEKF W
Sbjct: 493  FDEKRQAATKEQKTLESSTKALKSQEAKIAHDLKKGLQQEKAV--LRPVRRLMWFEKFIW 550

Query: 563  FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLT 620
            FISS+ YLV+ G+DAQQNEM+ K+YM KGD ++HAD+ GA++ V++N    P+ P+PP T
Sbjct: 551  FISSDGYLVLGGKDAQQNEMLYKKYMKKGDAFLHADIQGAATVVVRNDPRTPDAPIPPST 610

Query: 621  LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            L+QAG   V  S AWDSK   SAWW    Q+SK AP+G++L  GSF + GKKNFLPP  L
Sbjct: 611  LSQAGSLVVSCSVAWDSKAGMSAWWASATQISKAAPSGDFLPPGSFSVNGKKNFLPPSQL 670

Query: 681  IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDE---KP 737
            ++GFG++FR+ ESS   HL  R        + D  D   H     +E    DT E     
Sbjct: 671  LLGFGVIFRISESSKSKHLKHR--------VSDDRDQNRHS----VEEPNQDTPEIAESE 718

Query: 738  VAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLED 797
            VA   +VP       S    SN    E   E  T SN +  +       +A      L D
Sbjct: 719  VASESAVPEIDDGQDSDDGTSNASDAE-EEEQNTPSNPLQRQSTATEPKIAEVSNDDLTD 777

Query: 798  LIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGS 857
                              GIE  + D + +  H   TAT  D    S+++       Q +
Sbjct: 778  ------------------GIEALEIDDTPKIPH---TATPNDIDSNSESDD-DTDFNQTT 815

Query: 858  SVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNI 917
                P    +  +G  A+ +     +  KI                  KY DQDEE+R  
Sbjct: 816  GTRTPNTVADNRKGGPATKKRGKRGKAKKIAN----------------KYKDQDEEDRLA 859

Query: 918  RMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD 977
               L+ ++   +K   + +   A   +E + A           + +KA H  +  +E   
Sbjct: 860  AQQLIGASAGAEKARVEAE---AKAQREAELAFQ--------KERRKAQH--RRTRE--- 903

Query: 978  DSSHGVEDNPCVGLDETAEMDKVAME--EEDIHEIGEEEKGRLNDVDYLTGNPLPSDILL 1035
                           ETA  +K+  E  E    E+ E E+ ++  +D L G PL  D +L
Sbjct: 904  ---------------ETAAHEKLRREKMERGTDEVDEAEEEQMAAIDALVGTPLRGDEIL 948

Query: 1036 YVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLTPVFD 1085
              IP C P+SA+   KY+VK+ PGT KKGK I+      L+      V D
Sbjct: 949  EAIPFCAPWSAMAKTKYKVKLQPGTQKKGKAIKEIIGRWLIASQAKGVLD 998


>gi|378733722|gb|EHY60181.1| translation factor [Exophiala dermatitidis NIH/UT8656]
          Length = 1147

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 277/798 (34%), Positives = 425/798 (53%), Gaps = 110/798 (13%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R ++ DV   AAE+     L  +R SN+YDLS + ++FK        + G  E+  L
Sbjct: 1   MKQRFSSLDVKVIAAELAA--SLTSLRVSNIYDLSSRIFLFKF------AKPGRREQ--L 50

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           L++SG R H T+++R     PS F  +LRK++++RR+ +V Q+G DR+I   F  G   +
Sbjct: 51  LVDSGFRCHLTSFSRTAATAPSAFVSRLRKYLKSRRVTNVAQIGTDRVIEITFSEGQ--Y 108

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            + LE +A GNI++TD++  VL L R   + D+ V +    +Y  +  + F         
Sbjct: 109 RMFLEFFAAGNIIVTDADLNVLALQRQVSEGDEDVDVKLGGKYILDAKQNFHGI------ 162

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKE--NLGGQKGGKSFDLSKNSNKNSNDGARAK 236
           A +T         P++V E        +K+   +GG+K  ++        K  +D     
Sbjct: 163 APVT---------PERVKETLEKAVQRAKDAKEVGGKKAKRA--------KGGDD----- 200

Query: 237 QPTLKTVLGEALGYG-PALS----EHIILDTGLVPNMKLSEVNKLEDNAIQVLVL-AVAK 290
                  L +AL +G P  S    +H+  + G+    K  +V  L D  +   V+ A+ +
Sbjct: 201 -------LRKALSFGFPEFSAHLLDHVFNEIGIDAAAKAEDV--LNDGQLMEAVMKALNR 251

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHP--------PTESGSSTQIYDEFCPLLLNQ 342
            ++  + + +G    +GYI+ + K    + P        P+ SG    +Y++F P   +Q
Sbjct: 252 AKEIFESLGTGQ--SKGYIIAKIKSPSSEAPQEAEAQTQPS-SGRDNLLYEDFHPFRPSQ 308

Query: 343 FRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL 399
           F  +     ++F+ F+  +DEFYS IESQ+ E +   +E+AA  KL     +QE R+  L
Sbjct: 309 FEGKPDLRILEFDGFNRTVDEFYSSIESQKLESRLTEREEAARKKLQAAKEEQEKRLGAL 368

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           +   +  V+ A+ IE N   V+ A  AV   +   M W D+ ++++ E+K GN VA +I 
Sbjct: 369 QHVQELHVRKAQAIEANTHRVEEACAAVNGLIGQGMDWVDIGKLIENEQKRGNVVAQMIK 428

Query: 460 -KLYLERNCMSLLLSN-------------------NLDEMDDEEKTLPVE----KVEVDL 495
             L LE N ++LLL                     N DE   ++ T P       +++DL
Sbjct: 429 LPLKLEENTVTLLLDEPGFNEESEEDEPDETDEEENSDEDTRKKPTKPATDKRLAIDIDL 488

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHM 551
            LS  ANAR++YE KK    K+++T+ A + A K+AE+K     +  + QEK  A +   
Sbjct: 489 GLSPWANARQYYEQKKNAAVKEKRTLEAATMALKSAERKIEADLKRGLKQEK--AALRPA 546

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH- 610
           RK  WFEKF +FISS+ YLVI G+DAQQNE++ +RY+ +GDVYVHADL GASS ++KN+ 
Sbjct: 547 RKQFWFEKFLYFISSDGYLVIGGKDAQQNELLYRRYLKRGDVYVHADLQGASSVIVKNNP 606

Query: 611 -RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
             P+ P+PP TL+QAG  TVC S AWDSK V  AWWV   QVSKTAP+GEYLT G F+IR
Sbjct: 607 RTPDAPIPPSTLSQAGALTVCTSSAWDSKAVMGAWWVNAEQVSKTAPSGEYLTTGGFIIR 666

Query: 670 GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESE 729
           G KN LPP  L++GFG+L+ + E S  +H   R  R E     + E   +      +E +
Sbjct: 667 GHKNLLPPSQLLLGFGVLWLISEESKVNHGKHRLERTESMLPGEAEALANDARGLSLEEQ 726

Query: 730 KDDTDEKPVAE-SLSVPN 746
           + D    P++E S +VP+
Sbjct: 727 EQDL---PISEQSRAVPD 741



 Score = 86.7 bits (213), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 97/192 (50%), Gaps = 9/192 (4%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAI 950
            +++RG++ KLK+ ++KY DQDEE+R + M LL SA       G  + + A   +  + A 
Sbjct: 877  QLTRGKRTKLKRAQKKYADQDEEDRALAMQLLGSA------KGQERKQMAEAERAAREAK 930

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
            +  D  +   +  KA    +   E  + ++   + +   G    A++D    +++   E 
Sbjct: 931  AQADRERRKAQHAKAAEKERQRLERLEKAATAADGHEGTGAHAGADVD---ADDKLSREQ 987

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIF 1070
             E+E+  L D+D L   P P D LL  IPVC P+SA+   KY+VK+ PG  KKGK I+  
Sbjct: 988  LEQERRELLDIDRLVPMPEPGDELLAAIPVCAPWSALSRQKYKVKLQPGNVKKGKAIREI 1047

Query: 1071 YSLLLLMLSLTP 1082
                  + +  P
Sbjct: 1048 LGFWTSLATKGP 1059


>gi|225681027|gb|EEH19311.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis Pb03]
          Length = 1161

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 346/1153 (30%), Positives = 528/1153 (45%), Gaps = 198/1153 (17%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV    + L R L+G+R SN+YDLS +  +FKL       +        L++
Sbjct: 1    MKQRFSSLDVKVISQELSRALVGLRISNIYDLSSRICLFKLAKPDTRKQ--------LIV 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            + G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII     L     ++
Sbjct: 53   DIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRII--DIALSDGNFHL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLR-SHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
            +LE Y  GNI+LTD ++ ++ L R  H   ++            E  RV        L  
Sbjct: 111  LLEFYVGGNIILTDKDYKIVALHRIVHGGGER------------EEVRV-------GLQY 151

Query: 180  ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
             +T+ +  +   P  +      +  A  E   G+ G         SNK    G + +   
Sbjct: 152  DITNKQNYNGVPPLSIERLRETLQRA--EEAEGESGAVE---GPGSNKR---GKKRQTEA 203

Query: 240  LKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQD 297
            LK  +       PAL  +H     G   N++  +   LED+ + + L+L + + E+ +  
Sbjct: 204  LKRAISMGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTEAENVIAR 261

Query: 298  VISGDIVPEGYILMQNK-HLGKDHPPTESGS---STQIYDEFCPLLLNQFRS---REFVK 350
            + + +  P GYI+++ +   G+     ++ S      +Y +F P    QF +      + 
Sbjct: 262  LSTLEDTP-GYIILKGESKTGEAITEADTDSPKPKNMLYHDFHPFKPKQFENVPGMTILT 320

Query: 351  FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
            F TF+ A+DE++S +ESQ+ + +   +E+ A  KL     DQENRV  LK+  +  V+ A
Sbjct: 321  FNTFNKAVDEYFSSVESQKLKYRLTEREEVARRKLEAAQKDQENRVGALKEVQELHVRKA 380

Query: 411  ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
            + IE NL  V+ AI AV   +A  M W ++AR+++ E+ + NPVA +I   L L  N ++
Sbjct: 381  QAIEANLLRVEEAINAVNGLIAQGMDWVEIARLIEMEKSSQNPVAKVIKLPLKLYENTVT 440

Query: 470  LLLS---------------------------NNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
            LLL                            N +     ++    +  +++DL +S  AN
Sbjct: 441  LLLGEPTEDEEPADESDEEEEDSESGDEDGGNKVKLEGSKKAQQQLLSIDIDLGISPWAN 500

Query: 503  ARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFE 558
            AR++YE +K    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R   WFE
Sbjct: 501  ARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTRTPFWFE 558

Query: 559  KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPV 616
            KF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN    P+ P+
Sbjct: 559  KFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPDAPI 618

Query: 617  PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
            PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG+K+ LP
Sbjct: 619  PPGTLSQAGNLCVATSSAWDSKAVMGAWWVNAGQVSKTAPSGEFVGTGGFVIRGEKHQLP 678

Query: 677  PHPLIMGFGLLFRLDESSLGSHLNER----------------------RVRGEEEGMDDF 714
            P  L++GF ++F++ E S+ +H   R                      +   E  G D  
Sbjct: 679  PAQLLLGFAVMFQISEDSIKNHTKFRVQDEPSIVGIAKEVQANEVLHSKQDSEAPGADGN 738

Query: 715  EDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN---ASNVDSHEFPAEDKT 771
            ++     E  D   E+D+  + P    L +   + P  S  N    S+    + P++D  
Sbjct: 739  KEISLASEEHDSSDEQDEETDNP----LLIGMESEPDDSGGNENKGSDNGEEKLPSDDTD 794

Query: 772  ISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHV 831
                 D K ++   +V    T  LE   D  +    A +S  + GI   Q       KH 
Sbjct: 795  -----DEKEYN---SVVTKETVVLESGGDEPITQPEADVSEQQPGITKRQ-----ALKH- 840

Query: 832  ERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQ---------PESIV 882
                       +S  ERR+LKKG    V+   +E+   R  DA SQ         P    
Sbjct: 841  -----------LSARERRQLKKG----VL---IEQTSVRVADAESQSSSPTPSVAPSVTT 882

Query: 883  RKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK------VQKNDGDPQ 936
                       RG++GK KK+  KY  QDEE+R + + LL SA K        KN  + Q
Sbjct: 883  TTNTNTLNSNIRGKRGKSKKLATKYQHQDEEDRELALRLLGSAPKPDKLREAAKNKAERQ 942

Query: 937  NE-NASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETA 995
             E  A   + ++       A +  YK  +      D  E   D +    D  C       
Sbjct: 943  AELEAQKQRRREQHDRAAQAERERYKALQ--QQGGDGGETQFDDTDTAADLSC------- 993

Query: 996  EMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVK 1055
                                     +  L G P+  D +L  IPVC P++A+  YKYR K
Sbjct: 994  -------------------------LPSLVGTPVVGDEVLAAIPVCAPWAALGHYKYRAK 1028

Query: 1056 IIPGTAKKGKGIQ 1068
            + PG  KKGK ++
Sbjct: 1029 LQPGIVKKGKAVK 1041


>gi|119193306|ref|XP_001247259.1| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
 gi|392863500|gb|EAS35746.2| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
          Length = 1125

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 334/1123 (29%), Positives = 536/1123 (47%), Gaps = 172/1123 (15%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +        L++
Sbjct: 1    MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------LIV 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53   DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQVGTDRIVHIEFSDGY--YHL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
             LE +A GNI+LTD+E+ ++ LLR   + +    +    +Y  +  + +E     +  +L
Sbjct: 111  FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171  KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238  PTLKTVLGEALG---YGPALSEH----IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
              L+  L  +LG   Y P L EH    I  D+ L P+  L   +++ D      ++ V +
Sbjct: 198  EALRRAL--SLGFPEYPPVLLEHALHVIGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291  FEDWLQDVISGDIVPEGYILMQNKHLGKDHP--PTESGSSTQIYDEFCPLLLNQF---RS 345
              + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250  EVESISNELSTTEQTRGYIVARNENKPPENPSFSGEAKPDKSNYIDYHPFAPRQFVDGND 309

Query: 346  REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
               + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310  TSILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406  SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
              + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370  HTRRAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465  RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
             N +++LL                   +E D E K  P    V  V++DL L+  ANA +
Sbjct: 430  ENTVTVLLPEGQPDGEDDDSEESGEEDEENDGEAKKKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
            +Y+ KK    K++KTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490  YYDQKKTAAIKEDKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562  WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPP 618
            +FISS+ YLV+ G D +QNE++  R++ KGDVYVHAD+ GA   ++KN +P   + P+PP
Sbjct: 548  FFISSDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN-KPGASDAPIPP 606

Query: 619  LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
             TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +IRG KN L P 
Sbjct: 607  GTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPG 666

Query: 679  PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV 738
             LI+GF ++F++   S+ +H    R R EE    +      H+  +   SE +  +E P 
Sbjct: 667  QLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEPSEMEKLEESP- 722

Query: 739  AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
                           +T   N    +   E K   N  D     + ++    + PQ+++ 
Sbjct: 723  ---------------NTAVDNCSIGKVGMEQKPRENTTD---LPVEQSAQTGIAPQVKE- 763

Query: 799  IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQG-- 856
                   G A +S            L++ D   +  A      ++S  ERR +K+G G  
Sbjct: 764  -----PQGEAGLSREDKDA------LADPDLQQQLAAFGATTKHVSAQERRLMKRGAGLH 812

Query: 857  -SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-RGQKGKLKKMKE 905
             S++ +  ++ E E  ++  S P +          ++ T     ++  RG++GK KK+  
Sbjct: 813  ASALSELGLDEEDEDEEENQSTPSTFKPSGTQTLSIQSTSTSKSQLPVRGKRGKAKKLAS 872

Query: 906  KYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKA 965
            KY DQDEE+R + + LL SA K        ++  A    +K              + ++A
Sbjct: 873  KYKDQDEEDRELALRLLGSAPKTTTPKKTKEDREAEIQAQK--------------ERRRA 918

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLT 1025
             H          D +   E        +  E    A++  D  ++ E+    L+ +  L 
Sbjct: 919  QH----------DKAAQAERRRQENFQKRPEGQNQALDMADAEQVVED----LSSLPALV 964

Query: 1026 GNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            G P   D ++  IPVC P+SA+  YKYR K+ PG   KGK ++
Sbjct: 965  GTPALGDEIISAIPVCAPWSALGQYKYRAKLQPGPTGKGKIVK 1007


>gi|145351275|ref|XP_001420008.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580241|gb|ABO98301.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1069

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 272/722 (37%), Positives = 377/722 (52%), Gaps = 88/722 (12%)

Query: 23  GMRCSNVYDLSP----KTYIFKLMNSSGVTE--------SGESEKVLLLMESGVRLHTTA 70
           G   +N YD+      K ++ KL   SG           + ESEK+L+ +ESG R+HTT 
Sbjct: 24  GCWLANAYDVDATSGNKKFLLKLNKPSGAVARDARADATTAESEKILVFIESGTRVHTTR 83

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NAHYVILELYAQGN 129
           Y R K   P+ FT KLR   + +RL D RQLG DR I F FG G  N  ++I+ELY+QGN
Sbjct: 84  YERGKTTAPTAFTAKLRARAKGKRLTDARQLGRDRAIDFTFGGGGENECHLIVELYSQGN 143

Query: 130 ILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDA 189
           ++L D  +TV+ LLRS+RD    V I+  H+YP E  + F+    ++             
Sbjct: 144 VILCDGNYTVVALLRSYRDGGD-VNILPNHQYPLERLKGFQLGGYTR------------- 189

Query: 190 NEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG 249
              D V+     V    +E +GG                    AR    TL+  L  A G
Sbjct: 190 --EDVVSALARGVLATEEETMGGD-------------------ARRAPATLREALCRAFG 228

Query: 250 YGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV- 304
           Y PA+++H+ L      G   ++ LSE        +  L  AV   E W + V +GD+V 
Sbjct: 229 YSPAIADHVALTASIEHGSNASLPLSEA------CVDRLTAAVRDLESWFEGVTTGDVVA 282

Query: 305 -PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE---------TF 354
            P     M     G D          +I+D+F P  L Q   R   KFE          F
Sbjct: 283 VPNVCTKMDANADGTDE--------IEIFDDFSPFSLKQNEGRPTRKFELPKGLDPVCAF 334

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           D A+DE++  +E+Q      +  E  A  KL K   DQ++RV  L++E ++  + A LIE
Sbjct: 335 DHAVDEYFIALEAQSQILARRKAEAQALAKLEKSLKDQKSRVEQLEREREKEEQRAVLIE 394

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           YN E VD AI AV  ALA+ MSW +L  M+ EER+ GNPVAG+I  L L  N +++ L+N
Sbjct: 395 YNHEAVDTAIDAVNSALASGMSWPELEAMINEERRLGNPVAGMIKSLDLANNQITITLAN 454

Query: 475 NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           +LDE+D+ +      K   V VDL LSAHANA   +  KKK   K  KT+ A SKA  AA
Sbjct: 455 HLDEVDEVDAASGKRKRVAVGVDLGLSAHANASMRFAAKKKHAEKFSKTVDAQSKAVAAA 514

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E K +  + +    ++I+  R+  WFEKFNWFI+SEN LV+  +DA Q EM++ RYM  G
Sbjct: 515 EAKAKAAMEKAANGSSIARARQPLWFEKFNWFITSENCLVLQAKDATQAEMLITRYMLPG 574

Query: 592 DVYVHADLHGASSTVIKNHRPE----QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           D +VHA++  A  T++K   P     + VP  +L QAG   +C S AW+S+ V SAWW  
Sbjct: 575 DAFVHAEVPQAPVTLVKP--PPGVDVRAVPAYSLVQAGAAVMCRSSAWNSRAVKSAWWTS 632

Query: 648 PHQVSKTAPT-GEYLTVG-SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
             +VSK +P  G+ L  G + +    K FLP   L+MGFGL+F + E +  +H NER VR
Sbjct: 633 SERVSKISPVAGDALPPGVTHVAHADKQFLPHAQLVMGFGLMFVVSEKNAEAHKNERLVR 692

Query: 706 GE 707
            +
Sbjct: 693 SD 694



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 42/80 (52%)

Query: 1003 EEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAK 1062
            +E  I E  + +  RL  V+ +   P   D + Y +PVC P +A  + KYR+K+ PG+ K
Sbjct: 947  DEASIEERLKLDAERLEIVNRIVSAPFKDDDIEYCLPVCAPITATNALKYRMKVTPGSQK 1006

Query: 1063 KGKGIQIFYSLLLLMLSLTP 1082
            KGK  ++   +L      TP
Sbjct: 1007 KGKAAKLAMEILSRAPFATP 1026


>gi|449017191|dbj|BAM80593.1| unknown RNA-binding protein, conserved [Cyanidioschyzon merolae
           strain 10D]
          Length = 1371

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/731 (35%), Positives = 391/731 (53%), Gaps = 119/731 (16%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA------------------HYV 120
           PSGFTLKLRKH+RTRRL +V QLG DR++ F+F  G  +                  +++
Sbjct: 167 PSGFTLKLRKHLRTRRLAEVTQLGIDRVVDFRFVGGSQSASAYKASANGQPSRAALENHL 226

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEIC----RVFERTTASK 176
           I+EL++ GNI+LTD ++ +L +LR  R + + +A  +  R P        R+ ++     
Sbjct: 227 IVELHSGGNIILTDGDYQILAVLRVFRAEPRPLADSADQRDPPATGPGSRRMQQQDAVVG 286

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
               ++ +++      D+++E         + + G Q      DL +N            
Sbjct: 287 ARYDISLARQFAPLTYDRLHEIFQECYQKRQRSGGDQL----RDLQRN------------ 330

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                  LG ALG+GP L EH++L+ G      L E    E    +VL  A A   +  +
Sbjct: 331 -------LGRALGWGPELIEHVLLEVGAPSPDPLPE---YEQRLYRVLCEAAAFLSESPR 380

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPP-TESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
                    EGYIL++    G       +S   +  Y EF P LL Q +  E   F +FD
Sbjct: 381 ---------EGYILLRPVAEGASQASGADSEDVSDRYCEFTPRLLRQHQHLEPRMFPSFD 431

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DE+++++E  R  Q+ + ++  A   L ++  + E RV TLKQ+ +R ++ A LIE 
Sbjct: 432 EAVDEYFARMEELRYRQEIENRQRQAQGTLERMRRELETRVLTLKQQEERCLRKAALIET 491

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           NL DVD A+  +R ALA+ + W++L +M+  ER+ GNPVA LI  L L+ N M+L+L+++
Sbjct: 492 NLVDVDNALQVIRAALASGIDWKELDQMLVLERRRGNPVAQLIHSLQLQENQMTLMLADD 551

Query: 476 LDEMDD---------------EEKTLP--------------------------VEKVEVD 494
              +D+               E + L                           VE V+VD
Sbjct: 552 SGSVDNTDAETGSSSRQRRPAETRDLSNEDSASSVESASEDESGDSTSVCSSRVELVQVD 611

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN------- 547
           L+LSA ANARR+YE +KK   K  KT+ A ++A +AAEKK  L++L      N       
Sbjct: 612 LSLSAFANARRYYEQRKKAAEKGTKTMEASAQALRAAEKKA-LEVLAGTASKNKRKKATP 670

Query: 548 ---ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK--GDVYVHADLHGA 602
              +  +RK  WFEKF +FI+SENYLVI+G+D+QQNE +V+RY+ +  GD+Y+HAD+HGA
Sbjct: 671 LNTLKAIRKPLWFEKFRYFITSENYLVIAGKDSQQNEQLVRRYLEENTGDLYMHADVHGA 730

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
           +S +IK  +  +P PPL++ +A  F    S AWD+K+  +A+WVYP QVS+TAP+G YL 
Sbjct: 731 ASVIIKGKK-NRPAPPLSIQEAAIFAAACSSAWDAKVAVNAYWVYPEQVSRTAPSGMYLQ 789

Query: 663 VGSFMIRGKKNFLP-----PHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
            GSF+IRG +N++P       PL+MGFG LFRL   S+  H+ ER VR   + + + + +
Sbjct: 790 QGSFVIRGSRNYVPVTTSGSGPLVMGFGFLFRLAPESVWRHIGERPVRSGPDSLQEAQAA 849

Query: 718 GH-HKENSDIE 727
           G   K+   +E
Sbjct: 850 GAPQKQQQQVE 860



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/227 (26%), Positives = 89/227 (39%), Gaps = 52/227 (22%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLAS--------------------AGKVQKNDG 933
            RG+K KL+K++ KY DQ EEER   + LL +                    +  V++  G
Sbjct: 1064 RGKKSKLRKLRLKYSDQTEEEREAALRLLGTTRMRIMEARDREGAATEAPASSSVKQAAG 1123

Query: 934  DPQNENASTHK----EKKPAISPV------DAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
              Q  N +  +    E+ PA S V      D   +     +           P   + G 
Sbjct: 1124 VDQGTNTTVQRNVNAEEAPASSSVKQAAGVDQGTIASAQGQTSAQRNGTSAAPATHAQGH 1183

Query: 984  EDNPCVGLDETAEMDKVAMEEEDIH----------------------EIGEEEKGRLNDV 1021
                    +   E  + A EE                           + E E   L+++
Sbjct: 1184 ASTGSAASELPLETHRAAREETHFQWQKIDQVEVSKILQSASAALAEHLSEAELANLSEL 1243

Query: 1022 DYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            D  TG P P D+L Y +PVC PY  +  YKY+VK++PGT KKGK ++
Sbjct: 1244 DLFTGCPHPDDVLEYALPVCAPYQTLAKYKYKVKLVPGTLKKGKALK 1290



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 17/88 (19%)

Query: 3  KVRMNTADVAAEVKCLRRLIGM--RCSNVYDLSPKTYIFKL----MNSSGVTESGESEKV 56
          K + +  D+ AEV  L+  +G   R  NVY+L  KTY+ KL    +N+SG   + E+E+ 
Sbjct: 8  KTKFSLLDLRAEVSVLQERLGSGSRVLNVYNLGRKTYLLKLSVPPLNASGRIPATETEEA 67

Query: 57 -----------LLLMESGVRLHTTAYAR 73
                      LL+ESGVRLHTT + R
Sbjct: 68 WATGDSSWRREYLLIESGVRLHTTRFTR 95


>gi|395745874|ref|XP_002824790.2| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Pongo abelii]
          Length = 1061

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/515 (44%), Positives = 317/515 (61%), Gaps = 43/515 (8%)

Query: 231 DGARAKQPTLKT-VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
           D ARA +P L    L E +   P      +L   L P + L E  KLE   I+ +++++ 
Sbjct: 156 DHARAAEPLLTLERLTEIVASAPKGE---LLKRVLNP-LLLDE--KLETKDIEKVLVSLQ 209

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
           K ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     ++
Sbjct: 210 KAEDYMK--ATSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYI 266

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +     
Sbjct: 267 EFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLK 326

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
            ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++
Sbjct: 327 GELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVT 386

Query: 470 LLLSN--------------------NLDEMDDEEKTLPVEK------------VEVDLAL 497
           +LL N                    N  E    +K     K            V+VDL+L
Sbjct: 387 MLLRNPYLLSEEEDDDVDDDVNVEKNETEPPKGKKKKQKSKQLQKPQKNKPLLVDVDLSL 446

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWF 557
           SA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WF
Sbjct: 447 SAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWF 506

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           EKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+P
Sbjct: 507 EKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIP 565

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP
Sbjct: 566 PRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPP 625

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
             L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 626 SYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  139 bits (350), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|320040092|gb|EFW22026.1| conserved hypothetical protein [Coccidioides posadasii str. Silveira]
          Length = 1136

 Score =  404 bits (1039), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 335/1134 (29%), Positives = 536/1134 (47%), Gaps = 183/1134 (16%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +         ++
Sbjct: 1    MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------FIV 52

Query: 61   ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53   DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQIGTDRIVHIEFSDGY--YHL 110

Query: 121  ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
             LE +A GNI+LTD+E+ ++ LLR   + +    +    +Y  +  + +E     +  +L
Sbjct: 111  FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171  KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238  PTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
              L+  L  +LG   Y P L EH +     D+ L P+  L   +++ D      ++ V +
Sbjct: 198  EALRRAL--SLGFPEYPPVLLEHALHVTGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291  FEDWLQDVISGDIVPEGYILMQNKHLGKDHPP--TESGSSTQIYDEFCPLLLNQFRSR-- 346
              + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250  EVESVSNELSTTEQTRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPFAPRQFADGND 309

Query: 347  -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
               + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310  ISILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406  SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
              + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370  HTRKAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465  RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
             N +++LL                   +E D E KT P    V  V++DL L+  ANA +
Sbjct: 430  ENTVTVLLPEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506  WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
            +Y+ KK    K+EKTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490  YYDQKKTAAVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562  WFISSENYLVI-----------SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            +FISS+ YLV+           SG D +QNE++  R++ KGDVYVHAD+ GA   ++KN 
Sbjct: 548  FFISSDGYLVLGIDSVMLITRSSGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN- 606

Query: 611  RP---EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
            +P   + P+PP TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +
Sbjct: 607  KPGASDAPIPPGTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVV 666

Query: 668  IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE 727
            IRG KN L P  LI+GF ++F++   S+ +H    R R EE    +      H+  +   
Sbjct: 667  IRGGKNHLAPGQLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEP 723

Query: 728  SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV 787
            SE +  +E P                +T   N    +   E K   N  D     + ++ 
Sbjct: 724  SEMEKLEESP----------------NTAVDNCSIGKVGMEQKPRENTWD---LPVEQSA 764

Query: 788  AAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
               + PQ+++        G A +S            LS+ D   +  A      ++S  E
Sbjct: 765  QTGIAPQVKE------PQGEAGLSREDKDT------LSDPDLQQQLAAFGATTKHVSAQE 812

Query: 848  RRKLKKGQG---SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-R 894
            RR +K+G G   S++ +  ++ E E  ++  S P +          ++ T     ++  R
Sbjct: 813  RRLMKRGAGLHASALPELGLDEEDEDEEENQSTPSTFKPSGTPTLSIQSTSTSKSQLPVR 872

Query: 895  GQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVD 954
            G++GK KK+  KY DQDEE+R + + LL S  K        ++  A    +K        
Sbjct: 873  GKRGKAKKLASKYKDQDEEDRELALRLLGSTPKTTTPKKTKEDREAEIQAQK-------- 924

Query: 955  APKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEE 1014
                  + ++A H          D +   E        +  E    A++  D  ++ E+ 
Sbjct: 925  ------ERRRAQH----------DKAAQAERRRQESFQKRPEGQNQALDMADAEQVVED- 967

Query: 1015 KGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
               L+ +  L G P   D ++  IPVC P+SA+  YKYR K+ PG   KGK ++
Sbjct: 968  ---LSSLPALVGTPALGDEIISAIPVCAPWSALGQYKYRAKLQPGPTGKGKIVK 1018


>gi|327287378|ref|XP_003228406.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Anolis carolinensis]
          Length = 635

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 256/683 (37%), Positives = 374/683 (54%), Gaps = 100/683 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ + +  LR+ L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFNTVDIRSVIAELRQSLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH++TRRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +I               
Sbjct: 113 IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVREHYPVDIA-------------- 158

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                +P A  P             S E L         ++   S K            +
Sbjct: 159 -----KPAAPLP-------------SLERLT--------EIITTSPKTEQ---------I 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YG  L EH +++TG   N ++ +++  +   I+ L+ A+ K E++++  ++
Sbjct: 184 KRVLNPHLPYGATLIEHCLIETGFSGNTRIEQIDSKD---IERLLAALQKAEEYME--VT 238

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q +       P +       Y+EF P L +Q+    FV+F++F+ A+DE
Sbjct: 239 DNFDGKGYII-QKREKKPSLEPEKPAEEILTYEEFHPFLFSQYTKCPFVEFDSFNKAVDE 297

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNLE 418
           FYSK+E Q+ + +   +E  A  KL  +  D E+R+  L   QE+D+ VK  EL+E NLE
Sbjct: 298 FYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDK-VK-GELVEMNLE 355

Query: 419 DVDAAILAVRVALANRMSWED--------------LARMVKEERKAGNPVAGLIDKLYL- 463
            VD AI  VR ALAN++ W +              LA  +KE +   N +  L+   Y+ 
Sbjct: 356 MVDRAITVVRSALANQIDWTEIGALVKEAQAQGDPLASAIKELKLQTNHITMLLKNPYVF 415

Query: 464 ----------------ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
                                      N  +   + +      V++DL+LSA+ANA+++Y
Sbjct: 416 SEEEEEEEDGEVEEEVGEETKGKRKKKNKAKQPKKPQKNKPLLVDLDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRFAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYLVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN   + P+PP TL +AG  
Sbjct: 536 NYLVIAGRDQQQNEMIVKRYLRPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGAM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQ 650
            +C+S AWD++++TSAWWV+ HQ
Sbjct: 595 ALCYSAAWDARVITSAWWVHHHQ 617


>gi|380483775|emb|CCF40411.1| hypothetical protein CH063_10996 [Colletotrichum higginsianum]
          Length = 1087

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 278/794 (35%), Positives = 404/794 (50%), Gaps = 106/794 (13%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L  +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQESLTTLRLANVYDLSSKILLLKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PS F  +LRK ++TRRL  VRQ+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSAFVTRLRKFLKTRRLTSVRQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GN++LTD++  +LTLLR+  + +          Y  E  + +      T  ++
Sbjct: 111 FLEFFASGNVILTDADLKILTLLRNVSEGEGQEPQRVGMNYSLENRQNYNGVPDLTKERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
            AAL SS                 VS  S     G+K                D  R   
Sbjct: 171 RAALESS-----------------VSKTSVAATAGKK----------IKVKPGDELRR-- 201

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQ 296
            +L T + E     P L +H    TG    MK +++  LED + +  L+ A+ +    ++
Sbjct: 202 -SLATTITE---LPPILVDHSFQLTGFDGKMKPADI--LEDESLLDALLKALTQARSIVE 255

Query: 297 DVISGDIVPEGYILMQNKH--------LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
           D  S     +GYI  + +                 E+  S  +YD+F P L ++F +   
Sbjct: 256 DATSS-ATAKGYIFAKYRSKPDHAPEAAPPAAEDEETKRSNLLYDDFHPFLPSKFANDPT 314

Query: 349 VK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           VK   F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQE R+  L+     
Sbjct: 315 VKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRKLDAARSDQEKRIEGLRGAQSI 374

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
           +V+ A  IE N+E V  A+ AV   L   M W D++++++ E+K  NPVA +I   L L 
Sbjct: 375 NVQKATAIEANVERVQEAMDAVNGLLQQGMDWVDISKLIEREQKRRNPVAEIIKLPLNLA 434

Query: 465 RNCMSLLL----------SNNLDEMD------------DEEKTLPVEKVEVDLALSAHAN 502
            N ++LLL          SN   + D            +++K     ++EVD+ LS  AN
Sbjct: 435 ENKITLLLGEEEDIEDDESNYETDSDASDSENEESSNNNKQKNDKRLEIEVDITLSPWAN 494

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ----EKTVANISHMRKVHWFE 558
           +R ++E K+    K EKT+     A K AE+K + ++ +    EK V  +  +RK  WFE
Sbjct: 495 SRGYHEQKRSAAKKAEKTVQQSQMALKNAEQKIQAELKKGLKTEKAV--LQPIRKQSWFE 552

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPV 616
           KF WF+SS+ YLV+ G+DAQQNEM+ KRY+ KGDVYVHAD+HGA++ +IKN    P+ P+
Sbjct: 553 KFIWFVSSDGYLVLGGKDAQQNEMLYKRYLRKGDVYVHADMHGAATVIIKNSPSTPDAPI 612

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
           PP TL QAG   VC S AWDSK    AWWV  +QVSK+APTGEYL  GSFM+RG+KNFLP
Sbjct: 613 PPSTLAQAGTLAVCSSSAWDSKAGMGAWWVNANQVSKSAPTGEYLPTGSFMVRGQKNFLP 672

Query: 677 PHPLIMGFGLLFRLDESSLGSHLNERRVRG------------EEEGMDDFEDSGHHKEN- 723
           P  L++G G++F++ E S   H+  R   G            EE   D  +      ++ 
Sbjct: 673 PAQLLLGIGIMFKISEESKARHVKHRLYDGAGLQAPSADKGPEESAADAAQARDEDPDDV 732

Query: 724 SDIESEKDDTDEKP 737
           SDI SE +D DE P
Sbjct: 733 SDIGSENNDEDEDP 746



 Score = 73.6 bits (179), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 89/180 (49%), Gaps = 33/180 (18%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RGQK K KK+  KY DQDEE+R    AL  SA   Q+ + + Q++     +E + A    
Sbjct: 858  RGQKSKAKKLAAKYKDQDEEDRAAAEALYGSARGKQRAEAEAQSK---AEREAQLAFQK- 913

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKV--AMEEEDIHEIG 1011
                   + ++A H  +                      ETAE ++V   M EE +  + 
Sbjct: 914  -------ERRRAQHERQQ--------------------KETAEHEEVRRLMNEEGVEVLD 946

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFY 1071
             EE G++  +D L G PLP D +L  +PVC P++A+  +KY+ K+ PG  KKGK ++  +
Sbjct: 947  AEELGKMTLLDALVGTPLPGDEILEAVPVCAPWNAMGKFKYKAKLQPGAVKKGKAVKEVF 1006


>gi|449299546|gb|EMC95559.1| hypothetical protein BAUCODRAFT_71160 [Baudoinia compniacensis UAMH
           10762]
          Length = 1052

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 266/726 (36%), Positives = 383/726 (52%), Gaps = 94/726 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDLS + ++ K              +  LL++SG R H T +AR     PS
Sbjct: 21  LVTLRLANIYDLSTRIFLLKFAKPD--------HREQLLVDSGFRCHLTDFARATAAAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK +RTRR+  V Q+G DR+I  QF  G+  + + LE YA GN++LTDS+ T+L
Sbjct: 73  PFVARLRKFLRTRRVTKVEQIGTDRVIEIQFSEGL--YRLFLEFYAGGNVVLTDSDLTIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+       VA  + H                KL      S          + ++  
Sbjct: 131 ALLRT-------VAEGAEHEQ-------------YKLGLKYDLS----------LRQNYG 160

Query: 201 NVSNASKENL--GGQKG--GKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
            V   +KE +  G QK    +  +  K   K    G  A +  L     E   + P L +
Sbjct: 161 GVPPLTKERVRDGLQKAIQKQEAEAQKPGKKIKRKGGDALRKALAVTTTE---FPPILLD 217

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ--NK 314
           H +  TG     +  +V       +  LV ++ + ++ +Q++ S      GYIL +    
Sbjct: 218 HALHVTGYDREAQPEQVVA-SGELLNKLVESLQEAQNVVQEITSA-ATARGYILAKPGKS 275

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAE 371
              +D     +  +  +YD+F P    Q  S     F++ E F+   DEF+S +E Q+ E
Sbjct: 276 SAHQDANGLVNSDAGLLYDDFHPFKPAQLASDPSITFLEHEGFNKTCDEFFSSLEGQKLE 335

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +ED A  K+ +   +Q  R+  L+   + +V+ A+ IE N+E V+ A+ AV   +
Sbjct: 336 SRLQEREDNAKRKIEQARQEQAKRIDGLQHVQELNVRKAQAIEANVERVEEAVAAVNGLI 395

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN------NLDEMDD--- 481
           A  M W D+ R+++ E+   N VA +I   L L  N ++LLLS       + D+M D   
Sbjct: 396 AQGMDWMDIGRLIENEQSRHNAVAEMIKLPLKLYENTVTLLLSEYAGLEEDYDDMADETE 455

Query: 482 -------------------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                              EEK L V+   VDLALS  +NAR++Y+ K+    KQE+T  
Sbjct: 456 SEESEDEADTQAPRHTSKPEEKRLAVD---VDLALSPWSNARQYYDQKRTAAEKQERTAQ 512

Query: 523 AHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           A  KA K+ E+K     +  + QEK V  +  +RK  +FEKFN+FISS+ YLV++GRDAQ
Sbjct: 513 ASQKALKSTEQKVMADLKKGLKQEKDV--LRPVRKQMYFEKFNYFISSDGYLVLAGRDAQ 570

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           QNEM+ +RY+ KGDVY+HADLHGA+S ++KN    PE P+PP TL QAG   VC S AWD
Sbjct: 571 QNEMLYRRYLKKGDVYIHADLHGAASVIVKNDPQTPEAPIPPSTLGQAGNLAVCTSTAWD 630

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
           SK V SAWWV   QVSKTAPTGEYLT G F+IRGKKN+LPP  L++GF +LFR+ E S  
Sbjct: 631 SKAVMSAWWVGSEQVSKTAPTGEYLTTGGFVIRGKKNYLPPAQLLLGFAVLFRISEESKA 690

Query: 697 SHLNER 702
            HL  R
Sbjct: 691 RHLKHR 696



 Score = 73.6 bits (179), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 99/214 (46%), Gaps = 29/214 (13%)

Query: 855  QGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEE 914
            +G +   P         KD +   ES     K +     RG++GK KK  +KY +QDEE+
Sbjct: 782  EGQAAAQPSANGGVGSNKDPARDQESRTASAKPKATPQIRGKRGKAKKAAQKYAEQDEED 841

Query: 915  RNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKE 974
            R + M LL S    +K + +   + + T   ++             + ++     K  KE
Sbjct: 842  RELAMKLLGSRAAAEKREAEAALKASKTESTEEA------------RARRRAQHEKAQKE 889

Query: 975  HPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDIL 1034
                           GL E  E+ ++ + EE +  + +EE   L  +D L G PLP D +
Sbjct: 890  ---------------GL-EAEEIRRLNL-EEGVEAVDDEEAAHLTQLDSLVGTPLPGDEI 932

Query: 1035 LYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  IPVC P++A+  YKY+VK+ PG  KKGK ++
Sbjct: 933  LEAIPVCAPWAALGKYKYKVKMQPGQQKKGKAVR 966


>gi|402876104|ref|XP_003901819.1| PREDICTED: nuclear export mediator factor NEMF [Papio anubis]
          Length = 1048

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 210/464 (45%), Positives = 296/464 (63%), Gaps = 36/464 (7%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLL 340
           I+ +++++ K ED+++   + +   +GYI+ Q + +       +       Y+EF P L 
Sbjct: 193 IEKVLVSLQKAEDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLF 249

Query: 341 NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
           +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+
Sbjct: 250 SQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQ 309

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDK 460
           Q  +      ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +
Sbjct: 310 QAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKE 369

Query: 461 LYLERNCMSLLLSN--------------------NLDEMDDEEKTLPVEK---------- 490
           L L+ N ++++L N                    N  E    +K     K          
Sbjct: 370 LKLQTNHVTMMLRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKP 429

Query: 491 --VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
             V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I
Sbjct: 430 LLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSI 489

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
              RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIK
Sbjct: 490 QKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIK 549

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMI
Sbjct: 550 NPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMI 608

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           RGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 609 RGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 652



 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|310791286|gb|EFQ26815.1| hypothetical protein GLRG_02635 [Glomerella graminicola M1.001]
          Length = 1073

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 395/749 (52%), Gaps = 92/749 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L  +R +NVYDLS K  +FK              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQESLTTLRLANVYDLSSKILLFKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PSGF  +LRK+++TRRL  V+Q+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSGFVARLRKYLKTRRLTSVKQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GN++LTD++  +LTLLR+  + +         +Y  +  + +      T  ++
Sbjct: 111 FLEFFASGNVILTDTDLRILTLLRNVPEGEGQEPQRVGLKYSLDNRQNYNGVPDLTKERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
            AAL SS                      K++      GK   +     K  ++  R   
Sbjct: 171 RAALESS---------------------VKKSAATATAGKKIKV-----KPGDELRR--- 201

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQ 296
            +L T + E     P L +H    TG     K +E+  LED+++   L+ A+ +    ++
Sbjct: 202 -SLATTITE---LPPILVDHSFQITGFDGKTKPAEI--LEDDSLLDALLKALTRARSIVE 255

Query: 297 DVISGDIVPEGYILMQNKH-------LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
           D  S     +GYI  + +                E+  S  +YD+F P L  +F     V
Sbjct: 256 DATSS-ATSKGYIFAKYRSKADAASDAAPTAEGEETKRSDLLYDDFHPFLPKKFADDPTV 314

Query: 350 K---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           K   F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQE R+  L+     +
Sbjct: 315 KVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRKLDAARSDQEKRIEGLRGAQSIN 374

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
           V+ A  IE N+E V  A+ A+   L   M W D++++++ E+K  NPVA +I   L L  
Sbjct: 375 VQKATAIEANVERVQEAMDAMNGLLQQGMDWVDISKLIEREQKRHNPVAEIIKLPLNLAE 434

Query: 466 NCMSLLL-------------------SNNLDEMD---DEEKTLPVEKVEVDLALSAHANA 503
           N ++LLL                   S++ DE +   +++K+    +V+V++ALS  AN+
Sbjct: 435 NTITLLLGEEEDIEDDESNYETDSDASDSEDEDNGNSNKQKSDKRLEVDVNIALSPWANS 494

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ----EKTVANISHMRKVHWFEK 559
           R ++E K+    K EKT+     A K AE+K + ++ +    EK V  +  +RK  WFEK
Sbjct: 495 REYHEQKRSAAKKAEKTVQQSVIALKNAEQKIQAELKKGLKTEKAV--LQPIRKQIWFEK 552

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F WF+SS+ YLV+ G+DAQQNEM+ KRY+ KGDVYVHAD+HGA++ +IKN    P+ P+P
Sbjct: 553 FIWFVSSDGYLVLGGKDAQQNEMLYKRYLRKGDVYVHADMHGAATVIIKNSPSTPDAPIP 612

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL QAG   VC S AWDSK    AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP
Sbjct: 613 PSTLAQAGTLAVCSSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPTGSFMVRGQKNFLPP 672

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRG 706
             L++G G++F++ E S   H+  R   G
Sbjct: 673 AQLLLGIGIMFKISEESKARHVKHRLYDG 701



 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 51/81 (62%), Gaps = 2/81 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            ETAE +++   M EE +  +  +E G++  +D L G PLP D +L  IPVC P++A+  +
Sbjct: 913  ETAEHEEIRRLMNEEGVEVLDSDEMGKMTLLDSLVGTPLPGDEILEAIPVCAPWNAMGKF 972

Query: 1051 KYRVKIIPGTAKKGKGIQIFY 1071
            KY+ K+ PG  KKGK ++  +
Sbjct: 973  KYKAKLQPGAVKKGKAVKEVF 993


>gi|171684415|ref|XP_001907149.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942168|emb|CAP67820.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1070

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 259/731 (35%), Positives = 373/731 (51%), Gaps = 96/731 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDL+ K  +FK        +        LL+ESG R H T +AR     PS
Sbjct: 23  LVSLRLANIYDLNSKILLFKFAKPDNRQQ--------LLIESGFRCHLTDFARSTAPAPS 74

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII F+F  G  A+ + LE +A GN++LTD++ T++
Sbjct: 75  AFVARLRKFLKTRRVTSVSQIGTDRIIEFRFSDG--AYRLYLEFFASGNVILTDADLTII 132

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLHAALTSSKEPDANEPDKVNE 197
            LLR+  + +         +Y  E  + F      T  +L AAL ++ E           
Sbjct: 133 ALLRNVPEGEGQEPQRVGLKYTLENRQNFGGVPELTKERLRAALKTAAE----------- 181

Query: 198 DGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEH 257
                                  ++K + K   D  R    T  T L       P L +H
Sbjct: 182 ---------------------HAVTKKAKKKGADELRRGLATTITEL------PPVLVDH 214

Query: 258 IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG 317
           +   T      K  E+ + E   +  L  A+ K    L +V S      GYI+ +     
Sbjct: 215 VFRLTEFNSAAKPLEILESE-TLLDSLFRALEKARAVLDEVTSSPRA-TGYIIAKPNPRA 272

Query: 318 KDHPPTESGSSTQ-------IYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIES 367
            + PP E+   TQ       +Y++F P L  QF   +    + F+ ++  +DEF+S IE 
Sbjct: 273 VEQPPAETEGETQKEKPRGLLYEDFQPFLPKQFEDDQGLTTLSFDGYNKTVDEFFSSIEG 332

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
           Q+ E + + +E  A  KL+    DQ  R+  L      +++ A  IE N+E V  A+ AV
Sbjct: 333 QKLESKLQEREATAKRKLDAARQDQAKRIEGLVGFQTLNLRKAAAIEANIERVQEAMDAV 392

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNN----------- 475
              L   M W ++ ++V+ E+  GNPVA +I   + L  + ++LLL              
Sbjct: 393 NGLLEQGMDWVNINKLVEREQAQGNPVAEIIKLPVNLAESTITLLLGEEEEEEAGEDEDM 452

Query: 476 ----------LDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                     +D   + EK    +K   ++++L LS   NAR +YE K+    K++KT+ 
Sbjct: 453 EFNYDTDEEVVDAAPEPEKAKGPDKRLAIDINLKLSVWNNAREYYEQKRTAADKEKKTVA 512

Query: 523 AHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
               A K+AE+K     R  + QEK V  +  +RK  WFEKF WFISS+ YLV+ GRDAQ
Sbjct: 513 QSVIALKSAEQKITEDLRKGLKQEKPVLQL--IRKQMWFEKFVWFISSDGYLVLGGRDAQ 570

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           QNE++ KRY+ KGDVYVHAD+HGAS+ +IKN    P+ P+PP TL QAG  +VC S AWD
Sbjct: 571 QNEILYKRYLKKGDVYVHADMHGASTVIIKNSPKTPDAPIPPSTLAQAGSLSVCCSSAWD 630

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
           SK    AWWV   QVSK+APTGEYL  GSFM+RGKKN LPP  L++GFGL+FR+ E S  
Sbjct: 631 SKAAMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKKNPLPPALLMLGFGLMFRISEESKA 690

Query: 697 SHLNERRVRGE 707
            H+  R   G+
Sbjct: 691 KHVKHRLYDGD 701



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 43/70 (61%)

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
            M EE +  + E EK     ++ L G P+P D +L V+PVCGP+ A+   KY+VK+ PG  
Sbjct: 911  MLEEGVDILDENEKADAGPLESLVGTPMPGDEILEVVPVCGPWGALGKLKYKVKLQPGQV 970

Query: 1062 KKGKGIQIFY 1071
            KKGK ++  +
Sbjct: 971  KKGKAVKEIF 980


>gi|320169195|gb|EFW46094.1| serologically defined colon cancer antigen 1 [Capsaspora owczarzaki
           ATCC 30864]
          Length = 1151

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 214/519 (41%), Positives = 305/519 (58%), Gaps = 52/519 (10%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSE---VNKLEDNAIQVLVLAVAKFEDWLQ 296
           LK  L   L +GPA+ EH IL  GL P+  +S            I  L   +   +  L 
Sbjct: 183 LKKFLNSQLAFGPAVVEHCILKAGLKPDGSVSSQLPCTAEHSEPIDKLYAEILNTQQLLI 242

Query: 297 DVISGDIVPEGYILM----------QNKHLGKDHPPTESGSSTQ--------IYDEFCPL 338
           DV +   VP GYI+           +NK +G +     +  ++         ++DE+ P 
Sbjct: 243 DVGASSEVP-GYIIQRKESRATAANKNKGVGDEQAAVAAALASASGDASDIFVFDEYHPF 301

Query: 339 LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
           L  Q ++R  V F TFD A+DEFYS+IE QR + +H   E     KL K  ++QE ++  
Sbjct: 302 LFEQHKARPVVHFPTFDRAVDEFYSRIEGQRLDMKHIGDERNVLKKLEKFKLEQERKLVG 361

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           L+   +      +LI   L++   A+L +R ALA+ + W +++ MV+  ++  +PVA +I
Sbjct: 362 LRTTQEEEALRGQLI---LDNQTKALLVIRSALAHAVDWSEISDMVEAAKEQKDPVASII 418

Query: 459 DKLYLERNCMSLLLSN---------------NLDEMDDEEKTLPVE------------KV 491
            KL L+ N ++L+L++                 D+    +     +            KV
Sbjct: 419 HKLKLDSNIITLMLTSPDAVEEEEDDNSEDEGADQAVSSKGKGSAKGGKKGHHQQTRMKV 478

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
           ++D+  S HANA  ++  KK+  +K+++TI A SKA K+AE++T+ Q+ Q    A ++ +
Sbjct: 479 DIDITASVHANAESYFSRKKQAAAKEQRTIDASSKALKSAERQTKQQLKQVAVKATVNKV 538

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           RKV WFEKF WFI+SENYLVI GRD QQNE++VKR++  GD YVHADLHGASS ++KN  
Sbjct: 539 RKVLWFEKFLWFITSENYLVIGGRDMQQNELLVKRHLRNGDAYVHADLHGASSVIVKNPT 598

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+QPVP  +L +AG F VC+S AWD+K++TSAWWV  +QVSKTAPTGEYLT GSFMIRG+
Sbjct: 599 PDQPVPIRSLCEAGTFAVCYSSAWDAKVITSAWWVAANQVSKTAPTGEYLTTGSFMIRGR 658

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           KNFLPP PLI+GFG L+RLDES +  HL ER+V  E E 
Sbjct: 659 KNFLPPSPLILGFGFLYRLDESCIAKHLQERKVVSEGEA 697



 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 76/168 (45%), Positives = 110/168 (65%), Gaps = 10/168 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +  LR RLIG+R +NVYD++ KTY+FKL             K +LL+
Sbjct: 1   MKQRFSSLDIIASIALLRSRLIGLRVTNVYDINFKTYLFKLAKPGF--------KAILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT +   K N+PS F +KLRKH+RTRRL  +RQ+G DR+I  +FG G+ A++V
Sbjct: 53  ESGIRFHTTEFDWPKNNSPSNFAMKLRKHLRTRRLNSIRQVGADRVIDLEFGSGVAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICR 167
           I+ELY +GNI+LTD E+ +L+LLR    + D+ V      ++P    R
Sbjct: 113 IVELYDRGNIILTDFEYNILSLLRVRTVEGDEDVRFAVGEKFPEAAVR 160


>gi|156059014|ref|XP_001595430.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980]
 gi|154701306|gb|EDO01045.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 1063

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 261/732 (35%), Positives = 398/732 (54%), Gaps = 94/732 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDLS K ++ K              K  +L++SG R H T ++R     PS
Sbjct: 19  LVTLRVSNIYDLSSKIFLVKFAKPDN--------KQQILIDSGFRCHLTDFSRATAAAPS 70

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK+++TRR+  V Q+G DRII FQF  G    Y  LE YA GNI+LTD E  +L
Sbjct: 71  VFVQRLRKYLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY--LEFYAGGNIILTDKELNIL 128

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           TLLR     D G A                     +L   L  S E      ++ N  G 
Sbjct: 129 TLLRVV---DPGEA-------------------QEELRVGLKYSLE------NRQNYGG- 159

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSND-GARAKQP--TLKTVLGEALG-YGPALSE 256
            + + ++E L          L K ++K  +D G + K+P   L+  L  ++  + P L +
Sbjct: 160 -IPDLTRERLKEA-------LQKGADKGEDDSGKKKKKPGDALRKALAVSITEFAPMLVD 211

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK-- 314
           H +  T    ++K SEV + ED  +  L+ ++ + +  +Q++ S +   +GYI+ + K  
Sbjct: 212 HAMRITNFNHSLKPSEVLQSED-LLDHLMRSLQEAQRVVQEITSSE-TSKGYIIAKKKDS 269

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAE 371
            +  D    E      +YD+F P    QF    +  F++FE F+  +DEF+S IE Q+ E
Sbjct: 270 QVTSDDNQAEDRKGL-LYDDFHPFKPRQFEDDPTLVFLEFEGFNKTVDEFFSSIEGQKLE 328

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +E  A  K+     +Q  R+  L++    + + A  ++ N+E V  A  AV   +
Sbjct: 329 SRLEERELNAKKKIQAARNEQAKRLGGLQEIQALNERKASALQANVERVQEARDAVNGLI 388

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL----------------SN 474
           A  M W ++ R+++ E+K  NPVA +I   L L++N ++LLL                S+
Sbjct: 389 AQGMDWFEIGRLIELEQKRKNPVASMIKLPLKLDQNTVTLLLDEEVFNDDEDSSYETDSD 448

Query: 475 NLDEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
             D  D+E+   PVEK          ++++L+LS  ANAR +++ K+   SK++KT+ + 
Sbjct: 449 VSDSEDEEKAAKPVEKEEKATETRLAIDINLSLSPWANARNYFDQKRSAASKEDKTLQSS 508

Query: 525 SKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           SKA K+ E K     +  + QEKT+  +  +RK  WFEKF WFISS+ YLV++G+DAQQ+
Sbjct: 509 SKALKSTEAKIAQDLKKGLKQEKTI--LRPVRKQIWFEKFVWFISSDGYLVLAGKDAQQS 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           E++ KRY+ KGD+Y+HAD+ GA+S +++N+   P+ P+PP TL+QAG   V  S AWDSK
Sbjct: 567 EILYKRYLRKGDMYLHADISGAASVIVRNNPKTPDAPIPPQTLSQAGTLVVATSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
              SAWWV   QVSK APTGE+L  G F I+GKKNFLPP  L++GFG+LF++ + S   H
Sbjct: 627 AGMSAWWVNADQVSKAAPTGEFLPAGKFTIQGKKNFLPPAQLLLGFGILFQISDESKARH 686

Query: 699 LNERRVRGEEEG 710
           +  R   GE  G
Sbjct: 687 VKHRFQDGEPVG 698



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 2/95 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            ETAE +++   M E+ I  + + E  ++  +D   G PLP D +L  IPVC P++A+  Y
Sbjct: 898  ETAEHEQMRKLMLEDGIDTLEDNEAEKMTSLDTFVGLPLPGDEILEAIPVCAPWAAMGKY 957

Query: 1051 KYRVKIIPGTAKKGKGIQIFYSLLLLMLSLTPVFD 1085
            KY+ KI PG  KKGK ++      +   S   V D
Sbjct: 958  KYKAKIQPGAQKKGKAVREILGKWIAAASAKNVLD 992


>gi|47230001|emb|CAG10415.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 582

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 259/717 (36%), Positives = 363/717 (50%), Gaps = 169/717 (23%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIKAVIAEINANYMGMRVNNVYDIDNKTYLIRLQKPDS--------KAILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H+T +   K   PSGF +K RKH++TRRL  V+QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTRVQQLGNDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHA 179
           I+ELY +GNI+L D E+T+L LLR    +   V I  R RYP E  R  E   +  +L  
Sbjct: 113 IVELYDRGNIILADHEYTILNLLRFRTAEVDDVKIAVRERYPVESARPPEPLISLERLTE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L+++++ D                                                   
Sbjct: 173 ILSTAQQGD--------------------------------------------------Q 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWLQDV 298
           +K VL   L YG  L EH +++ GL  + K+     +   A ++L  L VA  E +++  
Sbjct: 183 VKRVLNPHLSYGATLIEHSLIEVGLPGSAKVDSQTDVAQVAPKILEALKVA--ETYMEK- 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFD 355
            S     +GYI+ +++      P    G   +    YDEF P L  Q     +++F++FD
Sbjct: 240 -SEHFTGKGYIIQKSE----KKPSVTPGKPCEELLTYDEFHPFLFAQHSKSPYLEFDSFD 294

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELI 413
            A+DEF+SK+ESQ+ + +    E  A  KL  +  D E R+  L   QE+DR +K  ELI
Sbjct: 295 KAVDEFFSKMESQKIDMKALQLEKHALKKLENVKKDHEQRLEALHQAQEIDR-IK-GELI 352

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL  V+ A+  V  ALAN++ W ++  +VKE + AG+PVA  I +L L+ N ++LLL 
Sbjct: 353 EMNLAIVERALQVVCGALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLLK 412

Query: 474 NNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYELKKKQ 513
           N     DDE++   +E+                    V+VDL+LSA+ANA++        
Sbjct: 413 NPYISEDDEQEDDVLEETGRKNKNKKNKKFHKNKPVLVDVDLSLSAYANAKK-------- 464

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVIS 573
                                                      FEKF WFIS+ENYLVI+
Sbjct: 465 -------------------------------------------FEKFLWFISAENYLVIA 481

Query: 574 GRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQ 633
           GRD QQNEMIVKRY+  G                      +P+PP TL +AG   VC+S 
Sbjct: 482 GRDQQQNEMIVKRYLRAG----------------------EPIPPRTLTEAGTMAVCYSA 519

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           AW++K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKN+LPP  LIMGFG LF++
Sbjct: 520 AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFKV 576


>gi|346976277|gb|EGY19729.1| DUF814 domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 1086

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 256/741 (34%), Positives = 374/741 (50%), Gaps = 117/741 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +NVYDLS K  + K              K  +L++SG R H T +AR     PS
Sbjct: 71  LVTLRLANVYDLSSKILLLKFAKPD--------NKKQILIDSGFRCHLTDFARTTAAAPS 122

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRRL  V Q+G DRII F F  G   + + LE +A GN++LTD+E  +L
Sbjct: 123 AFVARLRKFLKTRRLTAVSQVGTDRIIEFTFSDGQ--YRLFLEFFASGNVILTDAELRIL 180

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           TLLR+                                        E +  EP +V   G 
Sbjct: 181 TLLRN--------------------------------------VPEGEGQEPQRV---GL 199

Query: 201 NVSNASKENLGGQK--------------GGKSFDLSKNSNKNSNDGARAKQPTLKTVLGE 246
             S  +++N GG                  K+ +      K    G + ++  L T + E
Sbjct: 200 GYSLDNRQNFGGVPPLTRERLQDALRVMAAKAANAPTTGKKKVKPGDQLRK-GLATTITE 258

Query: 247 ALGYGPALSEHIILDTGLVPNMKLSEV---NKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
                P L +H    TG  P    +E+   + L D+ +  L +A    ED      +   
Sbjct: 259 ---LPPMLVDHAFQVTGFDPTKTPAELLDSDALLDSLLHALTVARKVVED-----ATSSA 310

Query: 304 VPEGYILMQNKHLGKD-HPPTESGSSTQ----IYDEFCPLLLNQFRSREFVKFETFDA-- 356
              GY++ + +   ++     + G+ T+    +YD+F P L  +F     VK  TFD   
Sbjct: 311 TTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDDFHPFLPQKFADDPSVKVLTFDGFN 370

Query: 357 -ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
             +DEF+S +E Q+ E +   +E AA  KL     D   R+  L++    + + A  IE 
Sbjct: 371 KTVDEFFSSLEGQKLESKLTEREAAAKKKLEATRQDHAQRIEGLQEAQSLNEQKAAAIEA 430

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL--YLERNCMSLLLS 473
           N+E V  A+ AV   +   M W ++ ++++ E+K  NPVA  I KL   L  N M+LLL 
Sbjct: 431 NVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRRNPVAETI-KLPRKLGENLMTLLLG 489

Query: 474 NNLDEMDDEEKTLPVE--------------------KVEVDLALSAHANARRWYELKKKQ 513
               E +DE      +                    ++E++L LS  ANAR +Y+ ++  
Sbjct: 490 TEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANAREYYDQRRTA 549

Query: 514 ESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
             K++KT+   + A + AEKK     +  + QEK V  +  +RK  WFEKF WFISS+ Y
Sbjct: 550 AVKEQKTVQHSTMALRNAEKKITEDLKKGLKQEKAV--LQPIRKQMWFEKFIWFISSDGY 607

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPPLTLNQAGCF 627
           LV+ G+DAQQNE + KRY+ KGDVY HAD+HGA++ ++KN +  P+ P+PP TL QAG  
Sbjct: 608 LVLGGKDAQQNETLYKRYLRKGDVYCHADMHGAATVIVKNRQDTPDAPIPPATLAQAGML 667

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +VC S AWDSK    AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP PL++G G++
Sbjct: 668 SVCSSSAWDSKAGMGAWWVRADQVSKSAPTGEYLPAGSFMVRGQKNFLPPAPLVLGLGIM 727

Query: 688 FRLDESSLGSHLNERRVRGEE 708
           FR+ E S   H+ + R+RG+E
Sbjct: 728 FRISEESKAKHV-KHRLRGDE 747


>gi|345804334|ref|XP_863447.2| PREDICTED: nuclear export mediator factor NEMF isoform 6 [Canis
           lupus familiaris]
          Length = 1056

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/508 (42%), Positives = 304/508 (59%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREV----KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E  LP  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAILAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|325185450|emb|CCA19934.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1061

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 261/741 (35%), Positives = 395/741 (53%), Gaps = 101/741 (13%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL-------------SPKTYIFKLMNSSG 46
           M K RM   D+ A +  +R+ ++ MR +N+Y+L             + +TYIFKL     
Sbjct: 1   MPKTRMLIDDIHAMMGSVRKNILNMRVTNIYNLQNEAEVEGIDNKSNQRTYIFKLHQPP- 59

Query: 47  VTESGESEKVLLLMESGVRLHTTAYARDKKNT---PSGFTLKLRKHIRTRRLEDVRQLGY 103
                   KV LL+ESGVR H++ YAR+  ++   P+ FT+KLRKHIR +RL  + QL  
Sbjct: 60  ------FPKVYLLIESGVRFHSSNYARNISSSSTLPNQFTMKLRKHIRGKRLMQLEQLKG 113

Query: 104 DRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           DR+I F FG   +  ++ILELYA GNI+LTD+++ +L+LLR+HR D+  V +  R  YP 
Sbjct: 114 DRVIDFTFGSDQSQCHLILELYASGNIILTDNQYNILSLLRTHRIDE-NVKVAVRQVYPI 172

Query: 164 EIC--RVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDL 221
           +I   R  E   + ++     S    D ++ D                            
Sbjct: 173 QILSNRALESQVSGQILRQRLSDWFSDQSDDD---------------------------- 204

Query: 222 SKNSNKNSNDGARAKQPTL-KTVLGEALGYGP---ALSEHIILDTGLVPNMKL---SEVN 274
              + KN+  G + K  TL + +L +++G+G    A+ EH I+ TG +PN K+    +V 
Sbjct: 205 ---TTKNTARGGKKKFQTLEQLLLTKSVGFGGLGRAIVEHCIVSTG-IPNSKIKSYQDVR 260

Query: 275 KLEDN----------AIQVLVLAVAKFEDWLQDVISGDIVPE-------GYILMQNKHLG 317
            LED+           I++L       E +++D  S +I+ E       GYI++ N    
Sbjct: 261 TLEDHLGKLAEELNKGIKLLQWLENNQEQYMKDEQSTEILSESEKKPKGGYIILGN---- 316

Query: 318 KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAK 377
                 ++G+ T  Y+ F P+L  Q R + +V F+TFD  +DE++S  E+++ +   +A 
Sbjct: 317 -----AQTGTKTDTYESFTPVLYAQHREKAYVSFDTFDQTVDEYFSYHEARKTQTGSQAA 371

Query: 378 EDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSW 437
           + AA  KL K+  +Q  ++  L    + ++K A+LIE +  D++  +  +R ALA+ M W
Sbjct: 372 QQAASSKLEKMRKNQIQQLDELHHSEEINLKHAQLIELHQLDIEKVLSVIRSALASGMDW 431

Query: 438 EDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLAL 497
           + L  +VK E+   NPVA +I +  L +N +S+LLS   D+   E+    V  + +DL+L
Sbjct: 432 KALKDLVKYEQTNANPVASMIHEFDLSKNRVSVLLS---DDPYFEDAEPAVHAIWLDLSL 488

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT-RLQILQEKTVANISHMRKVHW 556
           SA  NA   Y  KK    K +K   A  KA K A  KT +    Q      I   RK  W
Sbjct: 489 SALGNAAELYAKKKTSAEKAKKAEVATEKAIKLAASKTEKFMKTQLIKPTPIHQRRKTFW 548

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ-- 614
           FEKF+WF+SSEN LVISG+DAQQNE++V RY+ K DV+VH+DL GAS  +++        
Sbjct: 549 FEKFHWFLSSENILVISGKDAQQNELLVNRYVRKNDVFVHSDLQGASPCIVRVRAARTFD 608

Query: 615 ---PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
               +P  TL QA C  VC S AW ++++T A+WV    VSK+  +GE L  G+F+I GK
Sbjct: 609 QALSIPITTLEQAACMCVCRSNAWKNQVITGAYWVKAECVSKSTSSGELLPPGTFLILGK 668

Query: 672 KNFLPPHPLIMGFGLLFRLDE 692
           KNFL    L MG  +L+  +E
Sbjct: 669 KNFLQALRLEMGLAILYHTEE 689



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 96/207 (46%), Gaps = 52/207 (25%)

Query: 876  SQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDP 935
            S P+ +V        K +RG+KGKLKK+K+KY DQDEE+R +RM  L             
Sbjct: 812  STPQHLVDDATQVRSKSARGKKGKLKKIKQKYADQDEEDRLLRMEALG------------ 859

Query: 936  QNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETA 995
                      KK  IS     K+                 P D +  V  +       + 
Sbjct: 860  ---------HKKSVISEPTPLKLV----------------PSDGTAAVNTH-------SV 887

Query: 996  EMDKVAMEEEDIHEIGEEEKGRLNDVDY---LTGNPLPSDILLYVIPVCGPYSAVQSYKY 1052
            +MDK  + +     + EEE+  ++ +D+    TG+P P+  L+  IP+C PYSA+Q Y Y
Sbjct: 888  KMDKQKVYQGREQYLKEEEEF-VDALDFSVVFTGSPKPNSRLIAAIPMCAPYSALQKYTY 946

Query: 1053 RVKIIPGTAKKGKG----IQIFYSLLL 1075
            RVK++PG  K GK     I  F++L L
Sbjct: 947  RVKLVPGAQKLGKAARQIIAHFFTLNL 973


>gi|398396540|ref|XP_003851728.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
 gi|339471608|gb|EGP86704.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
          Length = 1060

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 265/739 (35%), Positives = 393/739 (53%), Gaps = 79/739 (10%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSNTLVSLRLSNVYDLSSRIFLLKFAKPD--------HREQLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T++AR     P+ F  +LRK ++TRR+  VRQ+G DR+I  +F  G  A+ +
Sbjct: 53  DSGFRCHLTSFARATAAAPTPFVARLRKFLKTRRVTAVRQVGTDRVIELEFSDG--AYRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LTD E T+L LLRS                      V E     +  A 
Sbjct: 111 YLEFYAGGNIVLTDKESTILALLRS----------------------VGEGAEHEQYRAG 148

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            T       N   + N DG  V + S E L  G +      L ++         +A    
Sbjct: 149 ATY------NLSLRQNFDG--VPDLSTERLRDGLQAAIQKQLIESQKPGKKIKKKAGDAL 200

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
            + +      + P L +H +  +G+  N++  +V  LE + +   VLA  +  + + D I
Sbjct: 201 RRALAITTTEFPPILLDHALHVSGIDRNVQPEQV--LESDELLDKVLAALQQANIVIDDI 258

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKFETFDA 356
           +   V  GYIL +     K+     +     +Y++F P    Q  + E   F +F  F+ 
Sbjct: 259 TQAEVATGYILAKRNGAVKESDGEATDERGLMYEDFHPFKPAQLTAEETIVFREFSGFNK 318

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
            +DEF+S IE Q+ E + + +ED A  ++ +   +Q  R+  L++  + +++ A+ IE N
Sbjct: 319 TVDEFFSSIEGQKLESKLQEREDHAKRRIEQAREEQAKRIDGLQEVQELNIRKAQAIEAN 378

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS-- 473
           +E V+ A  AV   +A  M W D+ ++++ E+K  N VA LI   L L  N ++LLLS  
Sbjct: 379 VERVEEATAAVNGLIAQGMDWVDIGKLIENEQKRHNAVAELIKLPLKLHENTVTLLLSEL 438

Query: 474 NNLDEMDDEEKTLPVEK---------------------VEVDLALSAHANARRWYELKKK 512
           +  D  DDE      E                      +++DLA S  ANAR++Y+ K+ 
Sbjct: 439 DAADGGDDEANETDSEPDDSDDEDAAPAAKGGEDKRLTIDIDLAASGWANARQYYDQKRS 498

Query: 513 QESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             +KQEKT  A  KA K+ E++     +  + QEK V  +  +RK  WFEKF +F+SS+ 
Sbjct: 499 AATKQEKTAQASQKALKSTEQRVMADLKKGLKQEKDV--LRPVRKQFWFEKFIYFLSSDG 556

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGC 626
           YLV++G+DAQQNE++ +RY+ KGDVYV+ADL GA+S +IKN+   PE P+PP TL+QAG 
Sbjct: 557 YLVLAGKDAQQNEILYRRYLKKGDVYVNADLQGAASVIIKNNPATPEAPIPPSTLSQAGN 616

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             VC S AW+SK V SAWWV   QVSKTAPTGEYLT G F+IRGKKN LPP  L++GFG+
Sbjct: 617 LAVCTSSAWESKAVMSAWWVNADQVSKTAPTGEYLTNGGFVIRGKKNHLPPAQLLLGFGV 676

Query: 687 LFRLDESSLGSHLNERRVR 705
           +F++ E S  +H+  R  R
Sbjct: 677 MFQISEESKANHVKHRLQR 695



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
               +D L G PLP D +L  IPVC P++A+   KY+ K+ PG  KKGK ++
Sbjct: 929  FTQLDALVGTPLPGDEILEAIPVCAPWAALARSKYKAKLQPGQQKKGKAVR 979


>gi|367042422|ref|XP_003651591.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
 gi|346998853|gb|AEO65255.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
          Length = 1094

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 260/724 (35%), Positives = 372/724 (51%), Gaps = 96/724 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDL+ K  + K        +        LL+ESG R H T +AR     PS
Sbjct: 21  LVSLRLSNIYDLNSKLLLLKFAKPDNRQQ--------LLIESGFRCHLTDFARAAAPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII FQF  G  A+ + LE +A GN++LTD++  +L
Sbjct: 73  QFVSRLRKFLKTRRVTGVSQIGTDRIIEFQFSNG--AYRLYLEFFASGNVILTDADLKIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+                      V +          LT + E      ++ N  G 
Sbjct: 131 ALLRN----------------------VPQGEGQEPQRVGLTYTLE------NRQNFGG- 161

Query: 201 NVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
            V   +KE L G  K       +K + +  +D  R    T  T L       P L +H+ 
Sbjct: 162 -VPALTKERLRGALKTASEQAATKKAKRKGSDELRRGLATTITELP------PVLVDHVF 214

Query: 260 LDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
             T   P  K +++  LE+ A+   L  ++ K    L +V S      GYI+ +      
Sbjct: 215 RLTSFDPTTKPADI--LENEALLDALFQSLEKARSILDEVTSSPSA-RGYIIAKRNPRAA 271

Query: 319 DH-----PPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRA 370
           D        T+  +   +Y++F P L  QF    + + + F+ F   +DEF+S +E Q+ 
Sbjct: 272 DQVADGEETTKEKAQNLLYEDFQPFLPKQFEDDPTCQVLSFDGFSKTVDEFFSSLEGQKL 331

Query: 371 EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
           E + + +E  A  KL     DQ  R+  L++    +++ A  IE N+E V  A+ AV   
Sbjct: 332 ESRLQEREATAKRKLEAARRDQAQRIEGLQEAQLLNLRKAAAIEANVERVQEAMDAVNGL 391

Query: 431 LANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN---------NLD--- 477
           L   M W D+ ++V+ E++  NPVA +I   + LE + ++LLL           N+D   
Sbjct: 392 LQQGMDWVDINKLVEREQRLHNPVAEIIKLPMRLEESIITLLLGEEEEEAEAEANMDFDY 451

Query: 478 -------------EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                        +    +K L    ++++L LS   NAR +YE K+    KQ+KTI   
Sbjct: 452 DTDEEAAEETAAGKAKGPDKRL---AIDINLKLSPWNNAREYYEQKRTAADKQQKTIQQS 508

Query: 525 SKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A + AEKK     +  + QEK V  +  +RK  WFEKF WFISS+ YLV+ GRDAQQN
Sbjct: 509 EIALRNAEKKISEDLKKGLKQEKPVLQL--IRKQMWFEKFLWFISSDGYLVLGGRDAQQN 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           E++ KRY+ KGDVYVHAD+HGA S +IKN+   P+ P+PP TL QAG  +VC S AWDSK
Sbjct: 567 EILYKRYLRKGDVYVHADMHGAPSVIIKNNPKTPDAPIPPSTLAQAGSLSVCCSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
            V  AWWV   QVSK+APTGEYL  GSFM+RGK+N LPP  L +GFGL+F++ E S   H
Sbjct: 627 AVMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKRNALPPALLTLGFGLMFKISEDSKSKH 686

Query: 699 LNER 702
           +  R
Sbjct: 687 VKHR 690



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%)

Query: 1022 DYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            D L G PLP D +L V+PVC P++A+   KY+ K+ PG  KKGK ++
Sbjct: 953  DALVGAPLPGDEILEVVPVCAPWNALGRVKYKAKLQPGHVKKGKAVK 999


>gi|50555916|ref|XP_505366.1| YALI0F13277p [Yarrowia lipolytica]
 gi|49651236|emb|CAG78173.1| YALI0F13277p [Yarrowia lipolytica CLIB122]
          Length = 1134

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 261/758 (34%), Positives = 399/758 (52%), Gaps = 98/758 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R +  D+      LR+ ++  R  N+YDL  S + ++ K      V ES    K L+
Sbjct: 1   MKQRFSQLDLKVIASELRKSILNYRLQNIYDLLSSSRHFLLKF----AVPES----KQLV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G R+HT+ + R    TPS F  KLRKH+RTRRL  + Q   DR+++  F  G   +
Sbjct: 53  VIDPGFRIHTSNFQRPTSQTPSNFVAKLRKHLRTRRLSAITQPVGDRVLVLTFSDGQ--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPTEIC------RVFE 170
           ++ILE +A GN++L D +F +L L R  S   +++ VA+   + +  E+       +V  
Sbjct: 111 HLILEFFAGGNLILVDQDFKILALQRVVSEGANNQRVAVGVIYEFDKELLNNTDPLQVSR 170

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               + L     ++  PD  E D+VN     V                     N  K   
Sbjct: 171 TEITADLLQQWVATVSPD--EDDEVNAISGGV---------------------NKKKTRR 207

Query: 231 DGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
              +AK P+LK +L   +    PAL E  +   G+  N+ + +V+   ++ +  +  AV 
Sbjct: 208 ---KAKLPSLKKLLYSNMSELSPALLEQYLEKEGVDGNLSIKDVD-FSESTVTSIAAAVK 263

Query: 290 KFEDWLQDVISGDIVPEGYILMQ-NKHLGK-DHPPTESGSSTQ------IYDEFCPLLLN 341
             ED +Q+++  D+V  GYI  + N +  K D   T    S        +Y+ F P  + 
Sbjct: 264 GCEDRVQELLDADLV-TGYIACEKNPNWKKPDEEKTYIPGSIDPSDIEYLYESFEPFEIT 322

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
                +   FE ++  +D ++S +ES R   +  A+E  A  +LN    + + RV  L+Q
Sbjct: 323 -VADGKVDTFEGYNLTVDRYFSTVESTRYSLRVNAQEQIAEKRLNAARNETKKRVDGLQQ 381

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL 461
             DRS+ M   ++     V+ AI AV+      M W+D+  ++  E+K GNPVA ++  +
Sbjct: 382 VQDRSILMGTALQTYAGRVEEAIAAVKQLQDQGMDWKDMEHLIDLEKKKGNPVAQMVSSM 441

Query: 462 YLERNCMSLLLSNNLDEM-----------------------------DDEEKTLPVEKVE 492
            LE+N ++L+L N   E                               +E KTL   KVE
Sbjct: 442 NLEKNRVTLILPNPDVEDESDSDSDSDMDETDSEGESEESGSESDSNKNESKTL---KVE 498

Query: 493 VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH-- 550
           V+L L+A+ANA  ++++KK    KQEKT    + A K+AE+K +L +  ++++A   H  
Sbjct: 499 VNLDLTAYANANNYFDIKKVAAQKQEKTEKNSATALKSAEQKVKLDL--KRSLAQEQHAL 556

Query: 551 --MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             MR  +WFEKF WF SS+ YLVI G+DAQQNEM+ KRY  KGD YVHA++ GAS+ ++K
Sbjct: 557 RPMRPSYWFEKFWWFFSSDGYLVIGGKDAQQNEMLYKRYFRKGDAYVHAEIQGASTVIVK 616

Query: 609 NHR-PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           NH  P  P+PP TL+QAG  ++C S+AWDSK++ SAWWV   QVSK+AP+GE+L  GSFM
Sbjct: 617 NHLGPTAPLPPSTLSQAGSLSICTSKAWDSKVLISAWWVEHGQVSKSAPSGEFLPTGSFM 676

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           IRGKKNFLPP  L +G  +L+  DE S   ++ +R  R
Sbjct: 677 IRGKKNFLPPTSLDVGLAILWIADEDSTAKYVKQRLER 714



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 14/38 (36%), Positives = 29/38 (76%)

Query: 1031 SDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +D+++  IP+  P++A+  +K++ K++PGT KKGK ++
Sbjct: 1023 NDVVVGAIPMFAPWAALSKFKFKAKMVPGTVKKGKAVK 1060


>gi|194379038|dbj|BAG58070.1| unnamed protein product [Homo sapiens]
          Length = 782

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 203/414 (49%), Positives = 272/414 (65%), Gaps = 34/414 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D
Sbjct: 41  YEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKD 100

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G
Sbjct: 101 HENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQG 160

Query: 452 NPVAGLIDKLYLERNCMSLLLSN--------------------NLDEMDDEEKTLPVEK- 490
           +PVA  I +L L+ N +++LL N                    N  E    +K     K 
Sbjct: 161 DPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQ 220

Query: 491 -----------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
                      V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  +
Sbjct: 221 LQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTL 280

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK-GDVYVHAD 598
            + +TV +I   RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++  GD+YVHAD
Sbjct: 281 KEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPVGDIYVHAD 340

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           LHGA+S VIKN   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTG
Sbjct: 341 LHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTG 399

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           EYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 400 EYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 453


>gi|452840445|gb|EME42383.1| hypothetical protein DOTSEDRAFT_73267 [Dothistroma septosporum
           NZE10]
          Length = 1122

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 255/726 (35%), Positives = 386/726 (53%), Gaps = 96/726 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +NVYDLS + ++ K              +  L+++SG R H T +AR     PS
Sbjct: 21  LVSLRLANVYDLSSRIFLLKFAKPE--------HREQLIVDSGFRCHLTDFARATAAAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK +RTRR   VRQ+G DRI+  QF  G  A+ + LE YA GNI        VL
Sbjct: 73  PFVARLRKFLRTRRCTAVRQIGTDRIVELQFSDG--AYRLFLEFYAGGNI--------VL 122

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           T        D  +  +   R  +E     +    SK   +L  + E              
Sbjct: 123 T--------DADLTTLGLLRSVSEGAEHEQYRLGSKYDLSLRQNYE-------------- 160

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG---------YG 251
            + + +K+ L         D  + + +     AR     +K   G+AL          + 
Sbjct: 161 GIPSLTKDRLR--------DGLRKAEERQQAEARKPGKKIKKKSGDALRKALAITTTEFP 212

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM 311
           P L +H +  TG+   ++L  V   ++   +VL  A+ +    + D+ S  +   GYIL 
Sbjct: 213 PVLIDHALHVTGVDRQIELEAVIGRDEELDKVLK-ALQEANRVIDDITSLPVA-RGYILA 270

Query: 312 QNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIES 367
           + K    D   T +  +  + Y++F P    Q     +  F++ E F+ A+D+F+S IE 
Sbjct: 271 KRKVPKADANTTATEDNQNVMYEDFHPFKPAQLEGDPANVFIEHEGFNKAVDDFFSSIEG 330

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
           Q+ E + + +E+ A  ++ +   +QE R+  L+Q  + +++ A+ IE N+E V+ A+ AV
Sbjct: 331 QKLESRLQEREENAKRRIEQARQEQEKRITGLQQVQELNIRKAQAIEANVERVEEAVAAV 390

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS------------- 473
              +A  M W D+ R+++ E+K  NPVA +I   L L  N  +LLLS             
Sbjct: 391 NGLIAQGMDWVDIGRLIENEQKRHNPVAEMIKLPLKLHENTATLLLSELADADDEDMDET 450

Query: 474 -NNLDEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   + +DE+    ++K          V++DLA S  +NAR++Y+ ++   +KQEKT  
Sbjct: 451 DSEPSDSEDEDHQANIKKSFVPEDERLTVDIDLAASGWSNARQYYDQRRTAATKQEKTAQ 510

Query: 523 AHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           A  KA K+ E+K     +  + QEK V  +  +RK  WFEKF +FISS+ YLV++G+DAQ
Sbjct: 511 AAQKALKSTEQKVMADLKKGLKQEKEV--LRPVRKQFWFEKFIYFISSDGYLVLAGKDAQ 568

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWD 636
           QNEM+ +R++ KGDVYVHAD+HGA+S +IKN+   P+ P+PP +L+QAG  +VC S AWD
Sbjct: 569 QNEMLYRRHLRKGDVYVHADMHGAASVIIKNNPATPQAPIPPSSLSQAGNLSVCTSSAWD 628

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
           SK V SAWWV   QVSKTAPTGEYLT G FM+RGKKNFLPP  L++GF L+F++ E S  
Sbjct: 629 SKAVMSAWWVNADQVSKTAPTGEYLTTGGFMVRGKKNFLPPAQLLLGFALVFQISEDSKA 688

Query: 697 SHLNER 702
            H   R
Sbjct: 689 KHAKHR 694


>gi|426376842|ref|XP_004055191.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Gorilla
           gorilla gorilla]
          Length = 1056

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/505 (42%), Positives = 303/505 (60%), Gaps = 59/505 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTM 575

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 576 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 635

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 636 FKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRADEADDVKFAVRERYPLDHARAAE 162


>gi|332842178|ref|XP_003314363.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           troglodytes]
          Length = 1055

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 217/508 (42%), Positives = 306/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPSLEADKPVEDIFT----YEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|403277934|ref|XP_003930597.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 1056

 Score =  391 bits (1005), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 215/508 (42%), Positives = 305/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G + N+K+ E  KLE   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFLGNVKVDE--KLETKDIEKILVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKRE----TKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHVRAAE 162


>gi|397523544|ref|XP_003831789.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           paniscus]
          Length = 1055

 Score =  391 bits (1005), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 215/505 (42%), Positives = 303/505 (60%), Gaps = 59/505 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTM 575

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 576 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 635

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 636 FKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|194388162|dbj|BAG65465.1| unnamed protein product [Homo sapiens]
          Length = 1055

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 217/508 (42%), Positives = 305/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPCLEADKPVED----ILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|390469065|ref|XP_003734045.1| PREDICTED: nuclear export mediator factor NEMF isoform 2
           [Callithrix jacchus]
          Length = 1056

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 217/513 (42%), Positives = 306/513 (59%), Gaps = 60/513 (11%)

Query: 233 ARA-KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           ARA K   LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K 
Sbjct: 175 ARAPKGELLKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKILVCLQKA 232

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     +++F
Sbjct: 233 EDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEF 289

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      E
Sbjct: 290 ESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGE 349

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++L
Sbjct: 350 LIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTML 409

Query: 472 LSN--------------------NLDEMDDEEKTLPVEK------------VEVDLALSA 499
           L N                    N  E    +K     K            V+VDL+LSA
Sbjct: 410 LRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSA 469

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEK 559
           +ANA+++Y+ K+    K +KT+ A  KAF++AEKKT+  + + +TV +I   RKV+WFEK
Sbjct: 470 YANAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEK 529

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WFISSENYL+I GRD QQNE+IVKRY++ G                      +P+PP 
Sbjct: 530 FLWFISSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPR 567

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  
Sbjct: 568 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 627

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 628 LMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|367021400|ref|XP_003659985.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
 gi|347007252|gb|AEO54740.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
          Length = 1085

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 270/768 (35%), Positives = 387/768 (50%), Gaps = 122/768 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDL+ K  + K    +   +        LL+ESG R H T +AR     PS
Sbjct: 21  LVSLRLSNIYDLNSKILLLKFAKPNSRQQ--------LLIESGFRCHLTDFARAAAPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII  QF  G  A+ + LE +A GNI+LTD+E  +L
Sbjct: 73  QFVSRLRKFLKTRRVTAVSQIGTDRIIEIQFSDG--AYRLYLEFFASGNIILTDAELKIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+                                        E +  EP +V   G 
Sbjct: 131 ALLRN--------------------------------------VPEGEGQEPQRV---GL 149

Query: 201 NVSNASKENLGG------------QKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
             +  +++N GG             +   +   SK + K ++D  R    T  T L    
Sbjct: 150 TYTLENRQNFGGVPPLTKERLRDALRTALAQAESKKAKKKTSDELRRGLVTTITELP--- 206

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEG 307
              P L +H        P +K +E+  LED ++   L  ++ +    L DVIS     +G
Sbjct: 207 ---PVLIDHAFRLANFDPAIKPAEI--LEDESLLDALFQSLERGRSILDDVISSSTT-KG 260

Query: 308 YILMQNKHLGKDHPPTESGSSTQI-------YDEFCPLLLNQFR---SREFVKFETFDAA 357
           YI+ +     ++  P   G   QI       Y++F P L  QF    S + + F+ ++  
Sbjct: 261 YIIAKPNPRAQE--PVAEGEDAQISRPRNLLYEDFQPFLPKQFEDDPSCQVLSFDGYNKT 318

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S +E Q+ E + + +E  A  KL     DQE R+  L++    +++ A  IE N+
Sbjct: 319 VDEFFSSLEGQKLESRLQEREAIAKRKLEAARRDQEQRIEGLQEAQMLNLRKAAAIEANI 378

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL 476
           E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   + L  N ++LLL    
Sbjct: 379 ERVQEAMDAVNGLLQQGMDWVDVNKLVEREQKLHNPVAEIIQLPMRLHENVITLLLGEEE 438

Query: 477 ------DEMD-----DEEKT---------LPVEKVEVD--LALSAHANARRWYELKKKQE 514
                 D++D     DEE            P +++ +D  L LS   NAR +YE K+   
Sbjct: 439 EEGEAEDKLDFDYDTDEEAADDGVPDKAKGPAKRLAIDINLKLSPWNNAREYYEQKRTAA 498

Query: 515 SKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            KQ+KT+     A K AE+K     +  + QEK V  +  +RK  WFEKF WFISS+ YL
Sbjct: 499 EKQQKTVQQSEIALKNAEQKIAEDLKKGLKQEKPV--LQPIRKQLWFEKFIWFISSDGYL 556

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFT 628
           V+ GRDAQQNE++ KRY+ KGDVYVHAD+HGA + ++KN+   P+ P+PP TL QAG  +
Sbjct: 557 VLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPTVIVKNNPKTPDAPIPPSTLAQAGSLS 616

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           VC S AWDSK    A+WV   QVSK+AP GEYL VGSFM+RGK+N LPP  L++GFGL+F
Sbjct: 617 VCCSNAWDSKAAMGAYWVNADQVSKSAPAGEYLPVGSFMVRGKRNPLPPALLMLGFGLMF 676

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEK 736
           ++ E S   H+  R          D   +G    +   E E D T EK
Sbjct: 677 KVSEESKARHVKHRLYDA------DVGTAGAAPVSVATEVEADATSEK 718


>gi|295673284|ref|XP_002797188.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226282560|gb|EEH38126.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1258

 Score =  388 bits (996), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 344/1120 (30%), Positives = 520/1120 (46%), Gaps = 206/1120 (18%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L+++ G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII   F  G N 
Sbjct: 243  LIVDIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRIIDIAFSDG-NF 301

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-----PTEICRVFERT 172
            H ++LE YA GNI+LT                DK   I++ HR        E  RV    
Sbjct: 302  H-LLLEFYAGGNIILT----------------DKDYKIVALHRIVHGGGEKEEVRV---- 340

Query: 173  TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
                L   +T+ +  +   P  +      +  A  E   G+ G         +NK    G
Sbjct: 341  ---GLQYDITNKQNYNGVPPLSIERLRETLQRA--EEAEGECGAVE---GPGTNKR---G 389

Query: 233  ARAKQPTLKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAK 290
             + +   LK  +       PAL  +H     G   N++  +   LED+ + + L+L + +
Sbjct: 390  KKKQAEALKRAISMGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTE 447

Query: 291  FEDWLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGS---STQIYDEFCPLLLNQFRS- 345
             E     + + +  P GYI+ +     G+     ++ S      +Y +F P    QF + 
Sbjct: 448  AESVNARLSTLEDTP-GYIISKAESKTGEAITEADTDSPKPKNMLYHDFHPFEPKQFENV 506

Query: 346  --REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
                 +KF+TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQENR+  LK+  
Sbjct: 507  PGMTILKFKTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRIGALKEVQ 566

Query: 404  DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLY 462
            +  V+ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L 
Sbjct: 567  ELHVRKAQAIEANLLRVEEAIKAVNGLIAQGMDWVEIARLIEMEKSRQNPVANVIKLPLK 626

Query: 463  LERNCMSLLLS--------------------------NNLDEMDDEEKTLPVEKVEVDLA 496
            L  N ++LLL                           N +     ++    +  +++DL 
Sbjct: 627  LYENTVTLLLGEPTEDEEPADESEEEEDSESDDEDGGNKVKLEGSKKAQQQLLSIDIDLG 686

Query: 497  LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMR 552
            +S  ANAR++YE K+    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R
Sbjct: 687  ISPWANARQYYEQKRVAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTR 744

Query: 553  KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH-- 610
               WFEKF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN   
Sbjct: 745  TPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPG 804

Query: 611  RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG
Sbjct: 805  TPDAPIPPGTLSQAGNLCVASSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRG 864

Query: 671  KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED-------------- 716
            +K+ LPP  L++GF ++F++ E S+ +H  + RV+ E   +D  +D              
Sbjct: 865  EKHQLPPAQLLLGFAVMFQISEDSIKNH-TKYRVQDEPSIVDIAKDIQWANEVLNSKQDS 923

Query: 717  ----SGHHKENSDIESEKDDTDEK--PVAESLSVPNSAHPAPSHTN---ASNVDSHEFPA 767
                +  +KE S    E D +DE+   +   L     + P  S  N     +    + P 
Sbjct: 924  EAPRADGNKEISPASEEHDSSDEQDEEIENPLLTGMESEPDDSGGNEDKGGDNGEEKLPN 983

Query: 768  EDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEE 827
            +D       D K ++   +V    T  LE  +D  +    A +S    GI   Q      
Sbjct: 984  DDTD-----DEKEYN---SVVTKETVVLESGVDEPITQSEADVSKQPTGITKRQ------ 1029

Query: 828  DKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES------- 880
                       D  +++  ERR+LKKG         +E+   R  DA SQ  S       
Sbjct: 1030 -----------DIKHLTARERRQLKKGV-------LIEQTSGRVGDAESQSSSPTPSVAP 1071

Query: 881  IVRKTKIEGGKIS--RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNE 938
             V  T      IS  RG++GK KK+  KY  QDEE+R + + LL SA K      D   E
Sbjct: 1072 SVTTTTNTNTVISNIRGKRGKSKKLATKYQHQDEEDRELALRLLGSAPK-----PDKLRE 1126

Query: 939  NASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMD 998
             A    E++   + ++A K   + ++A H                              D
Sbjct: 1127 AAKNKAERQ---AELEAQK---QRRRAQH------------------------------D 1150

Query: 999  KVAMEEEDIHEIGEEEKG-----RLNDVDY---------LTGNPLPSDILLYVIPVCGPY 1044
            + A  E + H+  +++ G     +L+D D          L G P+  D +L  IPVC P+
Sbjct: 1151 RAAQAERERHKALQQQGGDGGETQLDDADTVADLSCLPSLIGTPVVGDEVLAAIPVCAPW 1210

Query: 1045 SAVQSYKYRVKIIPGTAKKGKGI-QIFYSLLLLMLSLTPV 1083
            +A+  YKYR K+ PG  KKGK + +I    +L   + TPV
Sbjct: 1211 AALGHYKYRAKLQPGIVKKGKAVKEILGKWVLDATASTPV 1250


>gi|299471369|emb|CBN79324.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1380

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 196/384 (51%), Positives = 255/384 (66%), Gaps = 10/384 (2%)

Query: 323 TESGSSTQIYDEFCPLLLNQFRSREFV-KFETFDAALDEFYSKIESQRAEQQHKAKEDAA 381
           TE G    +Y+EF P LL Q      +  F +FD A+D F+ +I  Q+ +Q   A E A 
Sbjct: 440 TEEGGDHVVYEEFLPQLLAQHEGGAVIHSFASFDQAVDAFFGRIVEQKLKQTAMAAEAAV 499

Query: 382 FHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLA 441
             K+  I  DQE RV  L++  ++ ++ A+L E   ++V+ A++ VR ALAN M W+DL 
Sbjct: 500 ERKVAWIRNDQERRVLALEERQEKMLRHAQLAEAWADEVEKALMVVRSALANGMDWQDLE 559

Query: 442 RMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
            +VK E   GNP+A LI +L L+RN + L L    D  DD+        VEVD+ LSAHA
Sbjct: 560 DLVKAETANGNPIASLIHELRLDRNQVVLSLPTAEDGEDDQ-------LVEVDIMLSAHA 612

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NAR  YE KK   +K+ KT+TA  K  K AE++    + ++    ++   RKV+WFEKFN
Sbjct: 613 NARVMYENKKLARAKELKTLTASEKVLKIAEQQAERTLQRQAHKRSLQVARKVYWFEKFN 672

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--EQPVPPL 619
           WFISSENYLVISGR+AQQNE++VK+Y+  GD+YVHADLHGASS V++N  P  ++ V PL
Sbjct: 673 WFISSENYLVISGRNAQQNEVVVKKYLRPGDIYVHADLHGASSCVVRNKDPSGKRAVSPL 732

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
            L +AGC TVC S AW +KMVTSAWWVY  QVSKTAPTGEYL  GSFM+RG+K+FLPP  
Sbjct: 733 ALEEAGCMTVCRSGAWGAKMVTSAWWVYADQVSKTAPTGEYLVTGSFMVRGRKHFLPPRA 792

Query: 680 LIMGFGLLFRLDESSLGSHLNERR 703
           L MGF LLF+LD+S L +H  ERR
Sbjct: 793 LEMGFALLFKLDDSCLAAHAGERR 816



 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/134 (48%), Positives = 86/134 (64%), Gaps = 16/134 (11%)

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           +LL+ESGVR HTT +   K + PSGF++KLRKHIRT+RLEDVRQ+G DR++ F+FG G  
Sbjct: 1   MLLLESGVRFHTTKFTHTKSDMPSGFSMKLRKHIRTQRLEDVRQVGMDRVVDFKFGSGKA 60

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSH----------------RDDDKGVAIMSRHR 160
           +++VILELYA GNI+LTDS++ +L LLR+H                      V +  R  
Sbjct: 61  SNHVILELYASGNIILTDSKYEILDLLRTHIYEGQGGGAAGGSGATGGAGDNVRVAVRQI 120

Query: 161 YPTEICRVFERTTA 174
           YP E+    E TTA
Sbjct: 121 YPMELATTQEGTTA 134



 Score = 70.1 bits (170), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 22/117 (18%)

Query: 970  KDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKG------------R 1017
            +D     +D   G E+ P   L   A+  +   EEE++ ++ EEE               
Sbjct: 1195 RDAASRTEDQEAGGEEEP---LSRRAQKKR---EEEEVRKLLEEEGAAGEDFDGGGDGGG 1248

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            ++++D LTG P   D+LL+ +PVCGPY +++ YKY+VK+ PG  K+GK     I++F
Sbjct: 1249 VSELDRLTGKPRDEDVLLFAVPVCGPYMSLRDYKYKVKLTPGKQKRGKASKQAIEVF 1305


>gi|327348881|gb|EGE77738.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1166

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 317/1008 (31%), Positives = 480/1008 (47%), Gaps = 162/1008 (16%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAKKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYILMQNKHLG-KDHPPTESG---SSTQIYDEFCPLLLNQFRSRE---FVKFE 352
           + D  P GYI+ + +    +D   T +    S    Y +F P    QF ++     +KF+
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPFEPKQFENQADMAILKFD 319

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ A+ 
Sbjct: 320 TFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQA 379

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N ++LL
Sbjct: 380 IEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLL 439

Query: 472 LSNNL------------------------------DEMDDEEKTLPVEKVEVDLALSAHA 501
           L                                   +  +++    +  +++DL +S  A
Sbjct: 440 LGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWA 499

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHWFEK 559
           NAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   WFEK
Sbjct: 500 NARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEK 559

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ P+P
Sbjct: 560 FIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIP 619

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW SK V  AWWV   QVSKT P+GEYL  G F+IRG+KN LPP
Sbjct: 620 PGTLSQAGNLCVATSSAWHSKAVMGAWWVNADQVSKTTPSGEYLETGGFVIRGEKNQLPP 679

Query: 678 HPLIMGFGLLFRLDESSLGSHLNER------RVRG--EEEGMDDF--------------- 714
             L++GF ++F++   S+ +H   R         G  E +GM++                
Sbjct: 680 AQLLLGFAVMFQISSESIKNHTKHRVQDDSSTTTGVKETQGMEELPSRLDQQTPRESENK 739

Query: 715 -------EDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPA 767
                  ++    +EN +IE   DD    P           H     +++ + D      
Sbjct: 740 ETYHQPEQNDSSDEENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIG 790

Query: 768 EDKTIS-NGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSE 826
           ED+    +  D + +D A + A         + + ALG    S    + G E        
Sbjct: 791 EDRPQDVDAKDEREYDHAESKA---------VEEAALGGKETSSQEEQAGSEP------- 834

Query: 827 EDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTK 886
              H + +A  R    +S  E  +LKK  G S+         E+     + PES  R T 
Sbjct: 835 ---HTD-SAAARPAKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTP 877

Query: 887 IEGGKIS----RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
            E  + S    RG++GK KK+  KY  QDEE+R + + LL SA K  K
Sbjct: 878 NEPSRSSTPNIRGKRGKNKKIATKYQHQDEEDRELALRLLGSAPKPDK 925



 Score = 49.7 bits (117), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L G  +  D ++  IPVC P+ A+  YKYR K+ PG  KKGK ++
Sbjct: 1000 LIGTAVVGDEIVAAIPVCAPWMALGQYKYRAKLQPGPLKKGKAVK 1044


>gi|169612956|ref|XP_001799895.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
 gi|111061751|gb|EAT82871.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
          Length = 1132

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 335/1027 (32%), Positives = 478/1027 (46%), Gaps = 195/1027 (18%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +NVYDLS  T   ++     +       +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSKSLTSLRVTNVYDLSSLTLSQRIFL---IKFHKPDHREQLLI 57

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PS F  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 58  DSGFRCHLTEYARTTAAAPSTFVAKLRKYLKTRRVTSIAQIGTDRILEFQFSDGL--YRL 115

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LTD +  VL LLR+                      V E     +L   
Sbjct: 116 YLEFYAGGNIVLTDGDLKVLALLRN----------------------VDEGEEHERLRVG 153

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L      + N   + N  G       +   G QK      + +   + +  G +AK+   
Sbjct: 154 L------EYNLSMRQNYGGAPELTKDRIRKGLQKA-----VDRQQAQPAATGKKAKK-VG 201

Query: 241 KTVLGEALGYG-----PALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           K  L +AL        P L +H +     D+ L P   L+    LE       +L+V K 
Sbjct: 202 KDALRKALAVSITECPPLLVDHALHVAKYDSALKPEEILANDELLEK------LLSVLKD 255

Query: 292 EDWLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--F 348
              + D I+     +GYIL + N +   D    E   S  +YD+F P    QF   +  F
Sbjct: 256 ARKITDEINSQEQTKGYILAKPNPNATTDEEGAEK--SKHMYDDFHPFRPQQFEESDYTF 313

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F+ F+ A+DEF+S IE Q+ E +   +E  A  KL K   + E R+  L+Q  + + +
Sbjct: 314 LEFDGFNKAVDEFFSSIEGQKLESRLTEREQQAKKKLEKARREHEERLGGLQQVQEVNFR 373

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNC 467
            AE I  N+  V  A  AV   +   M W D+A +++ E+  GN VA  I   L L  N 
Sbjct: 374 KAEAILANVHRVAEATEAVNGLIRQGMDWGDIASLIEREQSHGNAVAETIKLPLKLHENT 433

Query: 468 MSLLLS----NNLDEMDDE-EKTLPVEK-------------------------VEVDLAL 497
           ++LLL     ++ +E DDE  +T  V +                         +++DLAL
Sbjct: 434 ITLLLDETDFDHAEEDDDEGNETSSVSEDSEDEDEGPKKKAAPAKPAARPKLAIDIDLAL 493

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
           S  AN+  +Y+ KK   SK+++T+ A +KA K+ EKK     +  + QEK +  +  +RK
Sbjct: 494 SPWANSTEYYDQKKTAASKEDRTLQASTKALKSHEKKVAEDLKKGLKQEKDI--LRPVRK 551

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
             WFEKF +FISS+ YLV+ G+DAQQNE+I +RY  KGDVYVHADL GA   +IKN    
Sbjct: 552 QQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRYFRKGDVYVHADLKGAVPMIIKNKPTT 611

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ P+PP TL+QAG  +VC S AW+SK V SAWWV   QVSKT  TGE+L  G F I+GK
Sbjct: 612 PDAPIPPSTLSQAGHLSVCSSDAWESKAVMSAWWVLADQVSKTGQTGEFLPPGLFNIKGK 671

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV----------------------RGEEE 709
           K +LPP  LI+G  ++F + E+S   H N+ RV                      +G +E
Sbjct: 672 KEYLPPAQLIVGLAVMFEISEASKARH-NKHRVLDGVNISAVEMAPDSEEQPKATQGSKE 730

Query: 710 GM----------------DDFEDSG-HHKENSDIESEKDDTDEKPVAESLSVPNSAHPAP 752
                             DDF D+   H E SD ESE         A   + P  +  A 
Sbjct: 731 DDSDDDEFPDAKLASDSDDDFPDAKMEHTEESDAESE---------AAGHANPLQSSKAD 781

Query: 753 SHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISS 812
           +H N+S+ D      ED    NG    +    R+ A+       D  D    LG      
Sbjct: 782 AHENSSDEDED----EDVKSVNGKSGHVMSGGRDGAS----HQGDAQDDTGSLGD----- 828

Query: 813 TKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE-- 869
                       SE+ K   R        ++S  ERR LKKGQ  +SV  P  +   +  
Sbjct: 829 ------------SEQTKGASRR-------HLSAKERRLLKKGQLPASVQVPSQKTPADGS 869

Query: 870 -RGKDASSQPESIVRKTKIEGGKIS-----------RGQKGKLKKMKEKYGDQDEEERNI 917
             G +++S  E   + TK  G   S           RG++ K KK+  KY  QDEE+R +
Sbjct: 870 VDGDESASAGEEAQQPTKPAGTVTSQASKATSSPLPRGKRSKQKKLAAKYAAQDEEDREL 929

Query: 918 RMALLAS 924
            M LL S
Sbjct: 930 AMRLLGS 936


>gi|389646873|ref|XP_003721068.1| nuclear export mediator factor [Magnaporthe oryzae 70-15]
 gi|351638460|gb|EHA46325.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           70-15]
          Length = 1074

 Score =  385 bits (988), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 268/768 (34%), Positives = 384/768 (50%), Gaps = 127/768 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D  A  + L  +L G+R SN+YDLS K  + K             +K  L++
Sbjct: 1   MKQRFSSVDCKAISQELHAQLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T +AR     PS F  +LRK ++TRRL  V Q+G DRII FQF  G   + +
Sbjct: 53  DSGFRCHLTDFARTTAPAPSPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+E  +L                                      A 
Sbjct: 111 FLEFFAGGNVILTDNELKIL--------------------------------------AI 132

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG---- 232
           L + KE +  EP ++   G + S  +++N GG     K      L+K + K +N      
Sbjct: 133 LRNVKEGEGQEPQRI---GLSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATR 189

Query: 233 -ARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLED------------ 278
            AR     L+  L   +    P + +H    +      + +++ + +D            
Sbjct: 190 KARKSGADLRRGLASTITELPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEA 249

Query: 279 --------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
                   +A Q+    VAK  D    V + D V EG ++          P     S   
Sbjct: 250 RKTLAGITSAAQITGYIVAKTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDL 300

Query: 331 IYDEFCPLLLNQFRS---REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +Y++F P L  QF S      ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+ 
Sbjct: 301 LYEDFQPFLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDA 360

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              +Q  R+  L++    + + A  IE N+E V  A+ AV   L N M W D+ ++V+ E
Sbjct: 361 AREEQAKRIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVERE 420

Query: 448 RKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------------------EMDDEEK 484
           +K  NPVA +I+  + L  N ++L +    +                      E D + +
Sbjct: 421 QKRNNPVAAIIELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQ 480

Query: 485 TLPVEKVEVD--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ-- 538
                ++EVD  L LS  +NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  
Sbjct: 481 QPSKRELEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKG 540

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + QEK V  I  +R   WFEK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHAD
Sbjct: 541 LKQEKPV--IQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHAD 598

Query: 599 LHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           L GA S +IKN+   PE P+PP TL+QAG  TVC S AWD K    A+WV   QVSK AP
Sbjct: 599 LKGAPSVIIKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAP 658

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           TGE+L  GSFMI+GKKN LPP  L++GFGLLFR+ E S   H  + RV
Sbjct: 659 TGEFLPAGSFMIKGKKNELPPATLVIGFGLLFRISEESKAKHAKQHRV 706



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            ++ L G PLP D +L  IP+C PY+A+   KY+VK+ PG  KKGK I+
Sbjct: 950  LETLVGTPLPGDEILEAIPICAPYAAMGKIKYKVKLQPGAQKKGKAIK 997


>gi|226292279|gb|EEH47699.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 1261

 Score =  384 bits (987), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 322/1094 (29%), Positives = 501/1094 (45%), Gaps = 185/1094 (16%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L+++ G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII     L    
Sbjct: 149  LIVDIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRII--DIALSDGN 206

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLR-SHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
             +++LE Y  GNI+LTD ++ ++ L R  H   ++            E  RV        
Sbjct: 207  FHLLLEFYVGGNIILTDKDYKIVALHRIVHGGGER------------EEVRV-------G 247

Query: 177  LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            L   +T+ +  +   P  +      +  A  E   G+ G         SNK    G + +
Sbjct: 248  LQYGITNKQNYNGVPPLSIERLRETLQRA--EEAEGESGAVE---GPGSNKR---GKKRQ 299

Query: 237  QPTLKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDW 294
               LK  +       PAL  +H     G   N++  +   LED+ + + L+L + + E+ 
Sbjct: 300  TEALKRAISRGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTEAENV 357

Query: 295  LQDVISGDIVPEGYILMQNK-HLGKDHPPTESGS---STQIYDEFCPLLLNQFRS---RE 347
            +  + + +  P GYI+++ +   G+     ++ S      +Y +F P    QF +     
Sbjct: 358  IARLSTLEDTP-GYIILKGESKTGEAITEADTDSPKPKNMLYHDFHPFKPKQFENVPGMT 416

Query: 348  FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             + F TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQENRV  LK+  +  V
Sbjct: 417  ILTFNTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRVGALKEVQELHV 476

Query: 408  KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERN 466
            + A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L L  N
Sbjct: 477  RKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLIEMEKSRQNPVAKVIKLPLKLYEN 536

Query: 467  CMSLLLS---------------------------NNLDEMDDEEKTLPVEKVEVDLALSA 499
             ++LLL                            N +     ++    +  +++DL +S 
Sbjct: 537  TVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKVKLERSKKAQQQLLSIDIDLGISP 596

Query: 500  HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVH 555
             ANAR++YE +K    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R   
Sbjct: 597  WANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTRTPF 654

Query: 556  WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPE 613
            WFEKF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN    P+
Sbjct: 655  WFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPD 714

Query: 614  QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
             P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG+K+
Sbjct: 715  APIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRGEKH 774

Query: 674  FLPPHPLIMGFGLLFRLDESSLGSHLNER----------------------RVRGEEEGM 711
             LPP  L++G+ ++F++ E S+ +H   R                      +   E  G 
Sbjct: 775  QLPPAQLLLGYAVMFQISEDSIKNHTKFRVQDEPSIVEIAKEVQANEVLHSKQDSEAPGA 834

Query: 712  DDFEDSGHHKENSDIESEKDDTDEKPVAESL-SVPNSAHPAPSHTNASNVDSHEFPAEDK 770
            D  ++     E  D   E+D+  + P+   + S P+ +    +     +    + P++D 
Sbjct: 835  DGNKEISLASEEHDSSDEQDEETDNPLLTGMESEPDDS--GGNENKGGDNGEEKLPSDDT 892

Query: 771  TISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKH 830
                  D K ++   +V    T  LE              S     I   + D+SE+   
Sbjct: 893  D-----DEKEYN---SVVTKETVVLE--------------SGGDEPITQPEADVSEQQPG 930

Query: 831  VERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQ---------PESI 881
            + +   ++   ++S  ERR+LKKG    V+   +E+   R  DA SQ         P   
Sbjct: 931  ITKRQAIK---HLSARERRQLKKG----VL---IEQTSVRVADAESQSSSPTPSVAPSVT 980

Query: 882  VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGK------VQKNDGDP 935
                        RG++GK KK+  KY  QDEE+R + + LL SA K        KN  + 
Sbjct: 981  TTTNTNTLNSNIRGKRGKSKKLATKYQHQDEEDRELALRLLGSAPKPDKLREAAKNKAER 1040

Query: 936  QNE-NASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDET 994
            Q E  A   + ++       A +  YK  +      D  E   D +    D  C      
Sbjct: 1041 QAELEAQKQRRREQHDRAAQAERERYKALQ--QQGGDGGETQFDDTDTAADLSC------ 1092

Query: 995  AEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRV 1054
                                      +  L G P+  D +L  IPVC P++A+  YKYR 
Sbjct: 1093 --------------------------LPSLVGTPVVGDEVLAAIPVCAPWAALGHYKYRA 1126

Query: 1055 KIIPGTAKKGKGIQ 1068
            K+ PG  KKGK ++
Sbjct: 1127 KLQPGIVKKGKAVK 1140


>gi|440634980|gb|ELR04899.1| hypothetical protein GMDG_00158 [Geomyces destructans 20631-21]
          Length = 1072

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 255/749 (34%), Positives = 384/749 (51%), Gaps = 104/749 (13%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +NVYDL+ K ++ +             +K  +++
Sbjct: 1   MKQRFSSLDVKVIAYELSNSLVTLRLANVYDLASKIFLLRFTKPD--------DKKQMII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T+++R    +PS F  KLRK ++TRR+  V Q+G DRII FQF  G    Y 
Sbjct: 53  DSGFRCHLTSFSRATTASPSVFVTKLRKFLKTRRVTAVSQIGTDRIIEFQFSEGQYRLY- 111

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS------HRDDDKGV--AIMSRHRYPTEICRVFERT 172
            LE YA GNI+LTD E  +LTLLR+        +   G+  ++ +R  Y           
Sbjct: 112 -LEFYAGGNIILTDKELNILTLLRTVPPGEGQEEQRIGLKYSLENRQNYLG-----IPPL 165

Query: 173 TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
           T  +L AAL  + E   N P                              K   KN  D 
Sbjct: 166 TKDRLQAALRKAAEQSENAP----------------------------AEKKQGKNGIDS 197

Query: 233 ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
            R     L   + E   + P L +H +  T   P +K +++ K  D  +  L+ ++ + +
Sbjct: 198 LRR---ALAVSITE---FPPLLVDHAMKVTDFDPTLKPADIAK-NDTLLDHLLRSLEEAD 250

Query: 293 DWLQDVISGDIVPEGYILMQ-----NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR- 346
             +++ I+G  V  GYI+ +     +K   +D    E+     +Y++F P    QF +  
Sbjct: 251 RVVKE-ITGSDVATGYIIAKKQERTDKVASRDE---ETERQALLYEDFHPFKPRQFENDP 306

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
              FV FE F+  +DEF+S IE QR E +   +E  A  KL     DQ+ R+  L++   
Sbjct: 307 ACTFVPFEGFNNTVDEFFSSIEGQRLESRLYEREVTAKKKLQAAKDDQQKRLGGLQEIQT 366

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
            + + A  IE N++ V  A  AV   +A  M W ++ +++  E+K GNPVA +I   L L
Sbjct: 367 LNERKAGAIETNVQRVQEATDAVNGLIAQGMDWIEIGKLIDIEQKRGNPVASIIKLPLKL 426

Query: 464 ERNCMSLLLSNNL-------------DEMDDEEKTLPVEK-----------VEVDLALSA 499
             N ++LLL   +              ++ D E   P+++           ++++L  S 
Sbjct: 427 HENTVTLLLDEEIFVEDLNDEAYETGSDVSDSEDEAPIKEAVKKVVDKRLAIDINLGASP 486

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAA----EKKTRLQILQEKTVANISHMRKVH 555
            +NAR +Y  ++    K++KT+ + +KA K+     E+  +  + QEK +  +  +RK  
Sbjct: 487 WSNAREYYGQRRSAAEKEKKTLESSTKALKSTSHKIEQDLKKGLKQEKAI--LRPVRKHM 544

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPE 613
           WFEKF WFISS+ YLV+ GRDAQQNE++ KRY+ KGDVYVHADL GA+S  I+NH  R +
Sbjct: 545 WFEKFMWFISSDGYLVLGGRDAQQNEILYKRYLRKGDVYVHADLDGATSVFIRNHESRVD 604

Query: 614 QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
            P+PP TL+QAG   V  S AW+SK    AWW    QVSK+APTG+Y   GSF +RGKKN
Sbjct: 605 APIPPSTLSQAGILAVSSSSAWESKAGMPAWWANADQVSKSAPTGDYFKPGSFDVRGKKN 664

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           FLPP PL++GFG++F +   S  +H   R
Sbjct: 665 FLPPAPLLLGFGVMFHVSNESKANHTKYR 693



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 996  EMDKVAMEE-EDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRV 1054
            E  K  ME  EDI +   +E  +   +D L G PLP D++L VIPVC P++AV  YKY+V
Sbjct: 917  EARKAMMEAGEDIVDEEADEAEKAVSLDTLVGTPLPGDVILDVIPVCAPWTAVGKYKYKV 976

Query: 1055 KIIPGTAKKGKGIQIFYS 1072
            K+ PG  KKGK ++   S
Sbjct: 977  KLQPGPMKKGKAVKEILS 994


>gi|86196391|gb|EAQ71029.1| hypothetical protein MGCH7_ch7g436 [Magnaporthe oryzae 70-15]
          Length = 1095

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 375/749 (50%), Gaps = 126/749 (16%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L G+R SN+YDLS K  + K             +K  L+++SG R H T +AR     P
Sbjct: 41  QLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLIIDSGFRCHLTDFARTTAPAP 92

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S F  +LRK ++TRRL  V Q+G DRII FQF  G   + + LE +A GN++LTD+E  +
Sbjct: 93  SPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRLFLEFFAGGNVILTDNELKI 150

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L                                      A L + KE +  EP ++   G
Sbjct: 151 L--------------------------------------AILRNVKEGEGQEPQRI---G 169

Query: 200 NNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG-----ARAKQPTLKTVLGEALG- 249
            + S  +++N GG     K      L+K + K +N       AR     L+  L   +  
Sbjct: 170 LSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATRKARKSGADLRRGLASTITE 229

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLED--------------------NAIQVLVLAVA 289
             P + +H    +      + +++ + +D                    +A Q+    VA
Sbjct: 230 LPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEARKTLAGITSAAQITGYIVA 289

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS---R 346
           K  D    V + D V EG ++          P     S   +Y++F P L  QF S    
Sbjct: 290 KTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDLLYEDFQPFLPKQFSSDPTN 340

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
             ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+    +Q  R+  L++    +
Sbjct: 341 VILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAKRIEGLEESQLLN 400

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            + A  IE N+E V  A+ AV   L N M W D+ ++V+ E+K  NPVA +I+  + L  
Sbjct: 401 FRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPVAAIIELPMDLAN 460

Query: 466 NCMSLLLSNNLD----------------------EMDDEEKTLPVEKVEVD--LALSAHA 501
           N ++L +    +                      E D + +     ++EVD  L LS  +
Sbjct: 461 NTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRELEVDIKLNLSPWS 520

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ--ILQEKTVANISHMRKVHWF 557
           NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  + QEK V  I  +R   WF
Sbjct: 521 NAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKPV--IQPIRHQVWF 578

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQP 615
           EK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHADL GA S +IKN+   PE P
Sbjct: 579 EKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVIIKNNPRTPEAP 638

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL+QAG  TVC S AWD K    A+WV   QVSK APTGE+L  GSFMI+GKKN L
Sbjct: 639 IPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAGSFMIKGKKNEL 698

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           PP  L++GFGLLFR+ E S   H  + RV
Sbjct: 699 PPATLVIGFGLLFRISEESKAKHAKQHRV 727



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            ++ L G PLP D +L  IP+C PY+A+   KY+VK+ PG  KKGK I+
Sbjct: 971  LETLVGTPLPGDEILEAIPICAPYAAMGKIKYKVKLQPGAQKKGKAIK 1018


>gi|350296215|gb|EGZ77192.1| hypothetical protein NEUTE2DRAFT_99766 [Neurospora tetrasperma FGSC
           2509]
          Length = 1095

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 273/778 (35%), Positives = 384/778 (49%), Gaps = 129/778 (16%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTDS+  +L L                                      
Sbjct: 111 YLEFFASGNIILTDSDLKILAL-------------------------------------- 132

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG-----------------QKGGKSFDLSK 223
           L +  E +  EP ++   G   +  +++N GG                 QK        K
Sbjct: 133 LRNVPEGEGQEPQRI---GLTYTLENRQNFGGVPALTKERLRDALQSTVQKAAADQAAGK 189

Query: 224 NSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
              K   D  R    T  T L       P L EH+   T   P  K +E+   +    ++
Sbjct: 190 KIKKKGADELRRGLATTITELP------PILVEHVFRLTSFDPATKPAEILDDDSLLDKL 243

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCP 337
                   E  + D ++   V  GYI+       ++  L  D PP E  + T +Y++F P
Sbjct: 244 FDTLQQARE--ILDEVTDSSVSNGYIIAKPRSGFEDTELDVDAPPAEK-AKTLLYEDFQP 300

Query: 338 LLLNQF---RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
            L  QF   ++   + F  ++  +DEF+S +E QR E +   +E AA  KL    MDQ  
Sbjct: 301 FLPKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAK 360

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L++    + + A  I+ N E V  A+ AV   L   M W D+ +++++E+K GNPV
Sbjct: 361 RIEGLQEMEMLNYRKAATIQANTERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPV 420

Query: 455 AGLID-KLYLERNCMSLLL---------------------SNNLDEMDDEEKT---LPVE 489
           A +I   + L+ N ++LLL                     S++ D+ D  E T    PV+
Sbjct: 421 AEIIKLPMKLKENTITLLLGEGVEEEDEGDQDKEDDEFDYSDSEDDADGAETTKHKAPVK 480

Query: 490 KVEVD--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
           ++EVD  L LS   NAR +Y+ K+    K +KT+     A K AE+K     R  + QEK
Sbjct: 481 RLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAEDLRKGLKQEK 540

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
            V  +  +RK  WFEKF WFISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+
Sbjct: 541 PV--LQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAA 598

Query: 604 STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           S +IKN+   P+ P+PP TL QAG  +VC S AWDSK    AWWV   QVSK+AP GEYL
Sbjct: 599 SVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPAGEYL 658

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN-------ERRVRGEEEGMD 712
            VGSFM+RGK+N LPP  L +GFGLLFR+ + S   H         ER+ +G  + +D
Sbjct: 659 PVGSFMVRGKRNLLPPALLTLGFGLLFRVSDDSKSKHTRHRVYDFVERKTKGRADSLD 716


>gi|440466993|gb|ELQ36234.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           Y34]
 gi|440486785|gb|ELQ66618.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           P131]
          Length = 1095

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 374/749 (49%), Gaps = 126/749 (16%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L G+R SN+YDLS K  + K             +K  L+++SG R H T +AR     P
Sbjct: 41  QLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLIIDSGFRCHLTDFARTTAPAP 92

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S F  +LRK ++TRRL  V Q+G DRII FQF  G   + + LE +A GN++LTD+E  +
Sbjct: 93  SPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRLFLEFFAGGNVILTDNELKI 150

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L                                      A L + KE +  EP ++   G
Sbjct: 151 L--------------------------------------AILRNVKEGEGQEPQRI---G 169

Query: 200 NNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG-----ARAKQPTLKTVLGEALG- 249
            + S  +++N GG     K      L+K + K +N       AR     L+  L   +  
Sbjct: 170 LSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATRKARKSGADLRRGLASTITE 229

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLED--------------------NAIQVLVLAVA 289
             P + +H    +      + +++ + +D                    +A Q+    VA
Sbjct: 230 LPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEARKTLAGITSAAQITGYIVA 289

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS---R 346
           K  D    V + D V EG ++          P     S   +Y++F P L  QF S    
Sbjct: 290 KTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDLLYEDFQPFLPKQFSSDPTN 340

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
             ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+    +Q  R+  L++    +
Sbjct: 341 VILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAKRIEGLEESQLLN 400

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            + A  IE N+E V  A+ AV   L N M W D+ ++V+ E+K  NPVA +I   + L  
Sbjct: 401 FRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPVAAIIKLPMDLAN 460

Query: 466 NCMSLLLSNNLD----------------------EMDDEEKTLPVEKVEVD--LALSAHA 501
           N ++L +    +                      E D + +     ++EVD  L LS  +
Sbjct: 461 NTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRELEVDIKLNLSPWS 520

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ--ILQEKTVANISHMRKVHWF 557
           NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  + QEK V  I  +R   WF
Sbjct: 521 NAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKPV--IQPIRHQVWF 578

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQP 615
           EK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHADL GA S +IKN+   PE P
Sbjct: 579 EKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVIIKNNPRTPEAP 638

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL+QAG  TVC S AWD K    A+WV   QVSK APTGE+L  GSFMI+GKKN L
Sbjct: 639 IPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAGSFMIKGKKNEL 698

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           PP  L++GFGLLFR+ E S   H  + RV
Sbjct: 699 PPATLVIGFGLLFRISEESKAKHAKQHRV 727



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            ++ L G PLP D +L  IP+C PY+A+   KY+VK+ PG  KKGK I+
Sbjct: 971  LETLVGTPLPGDEILEAIPICAPYAAMGKIKYKVKLQPGAQKKGKAIK 1018


>gi|336464133|gb|EGO52373.1| hypothetical protein NEUTE1DRAFT_71883 [Neurospora tetrasperma FGSC
           2508]
          Length = 1095

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 269/761 (35%), Positives = 378/761 (49%), Gaps = 122/761 (16%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTDS+  +L L                                      
Sbjct: 111 YLEFFASGNIILTDSDLKILAL-------------------------------------- 132

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG-----------------QKGGKSFDLSK 223
           L +  E +  EP ++   G   +  +++N GG                 QK        K
Sbjct: 133 LRNVPEGEGQEPQRI---GLTYTLENRQNFGGVPALTKERLRDALQSTVQKAAADQAAGK 189

Query: 224 NSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
              K   D  R    T  T L       P L EH+   T   P  K +E+   +    ++
Sbjct: 190 KIKKKGADELRRGLATTITELP------PILVEHVFRLTSFDPATKPAEILDDDSLLDKL 243

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCP 337
                   E  + D ++   V  GYI+       ++  L  D PP E  + T +Y++F P
Sbjct: 244 FDTLQQARE--ILDEVTDSSVSNGYIIAKPRSGFEDTELDVDAPPAEK-AKTLLYEDFQP 300

Query: 338 LLLNQF---RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
            L  QF   ++   + F  ++  +DEF+S +E QR + +   +E AA  KL    MDQ  
Sbjct: 301 FLPKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLKSKLSEREAAAKRKLEAARMDQAK 360

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L++    + + A  I+ N+E V  A+ AV   L   M W D+ +++++E+K GNPV
Sbjct: 361 RIEGLQEMEMLNYRKAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPV 420

Query: 455 AGLID-KLYLERNCMSLLL---------------------SNNLDEMDDEEKT---LPVE 489
           A +I   + L+ N ++LLL                     S++ D+ D  E T    PV+
Sbjct: 421 AEIIKLPMKLKENTITLLLGEGVEEEEEGDQDKEDDEFDYSDSEDDADGAETTKDKAPVK 480

Query: 490 KVEVD--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
           ++EVD  L LS   NAR +Y+ K+    K +KT+     A K AE+K     R  + QEK
Sbjct: 481 RLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAEDLRKGLKQEK 540

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
            V  +  +RK  WFEKF WFISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+
Sbjct: 541 PV--LQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAA 598

Query: 604 STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           S +IKN+   P+ P+PP TL QAG  +VC S AWDSK    AWWV   QVSK+AP GEYL
Sbjct: 599 SVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPAGEYL 658

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
            VGSFM+RGK+N LPP  L +GFGLLFR+ + S   H   R
Sbjct: 659 PVGSFMVRGKRNLLPPALLTLGFGLLFRVSDDSKSKHTRHR 699



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 86/175 (49%), Gaps = 27/175 (15%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RGQ+GK KK+  KY DQDEE+R +   L+      QK + +   +  +  +         
Sbjct: 856  RGQRGKQKKIAAKYKDQDEEDRALMEELMGVKAARQKAEAEAAAKAKAEAEAAA------ 909

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
                   + ++   + K+ +EH +     +E+   + LDE+    ++AME          
Sbjct: 910  ---ARERRRQQQERVKKEIREHEEVRRLMMEEGEDMPLDES----EMAME---------- 952

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
                +  ++ L GNPL  D +L V+P+C P+SA+  +KY+ K+ PG  KKGK ++
Sbjct: 953  ----MAPLETLVGNPLAGDEILEVVPICAPWSALNKFKYKTKLQPGNTKKGKAVK 1003


>gi|342879256|gb|EGU80511.1| hypothetical protein FOXB_08971 [Fusarium oxysporum Fo5176]
          Length = 1060

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 258/743 (34%), Positives = 377/743 (50%), Gaps = 107/743 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSAFVARLRKFLKTRRLTSVRQVGTDRVLEFEFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD++  +L L R                                    
Sbjct: 111 FLEFFASGNIILTDADLKILALAR------------------------------------ 134

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +  E +  EP +V   G   S  +++N GG        L++   +++   A  K  T 
Sbjct: 135 --TVSEGEGQEPQRV---GLQYSLENRQNFGGIP-----PLTRERVQDALRTAVEKAATA 184

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                +     P L +H +     DT + P+  L+    L D     LV ++ +    ++
Sbjct: 185 TASSKKQKELPPVLVDHWLHTNNFDTTIKPDEILANETLLAD-----LVKSLQEARQSVE 239

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLL---LNQFRSREFV 349
           ++ S +    GYI  + +   +    T+  S TQ    +Y++F P +   L +  + E +
Sbjct: 240 ELTSSEACT-GYIFAKRRERTEGAEATDE-SKTQRDNLLYEDFHPFVPYKLKKDPTIEVL 297

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F  ++  +DEF+S +E QR E +   +E AA  KL     +Q  R+  L++    + + 
Sbjct: 298 EFTGYNETVDEFFSSLEGQRLESRLSEREAAAKRKLEAARNEQSKRIEGLQEAQALNFRK 357

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N E V  A+ AV   L+  M W D+ ++V+ E+K  NPVA +I   L L  N +
Sbjct: 358 AAAIEANAERVQEAMDAVNGLLSQGMDWVDVGKLVEREKKRHNPVAEIIKLPLNLAENLI 417

Query: 469 S--------------------LLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARR 505
           +                       +   DE     K     K   VE++L LS  +NAR 
Sbjct: 418 TLELAEEEFEPEEDDPYETDDDDSALGDDEGTSAAKGKQANKALSVEINLGLSPWSNARE 477

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
           +++ +K    K+EKT    S+A K AE+K     +  + QEK +  +  +RK  WFEKF 
Sbjct: 478 YFDQRKTAAVKEEKTQQQASRALKNAEQKITEDLKKGLKQEKAL--LQPIRKPMWFEKFV 535

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
           WFISS+ YLVI G+DAQQNEMI K+Y+ KGDVY HADLHGASS +IKN+   P+ P+PP 
Sbjct: 536 WFISSDGYLVIGGKDAQQNEMIYKKYLRKGDVYCHADLHGASSVIIKNNPKTPDAPIPPA 595

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL+QAG   VC S AWDSK   SAWWV   QVSK+APTGE+L  GSFMIRGKKNFLPP  
Sbjct: 596 TLSQAGSLAVCSSNAWDSKAGMSAWWVNADQVSKSAPTGEFLPAGSFMIRGKKNFLPPAQ 655

Query: 680 LIMGFGLLFRLDESSLGSHLNER 702
           L++G G+ F++ E S   H+  R
Sbjct: 656 LLLGLGVAFKISEESKAKHVKHR 678



 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 89/180 (49%), Gaps = 31/180 (17%)

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENASTHKEKKP 948
            G   RGQKGK KK+  KY  QDEE+R    AL+ A+ G+ +          A   +E + 
Sbjct: 821  GPPKRGQKGKAKKIASKYKHQDEEDRAAVEALIGATVGQKKAE----AEAKAKVDRELEL 876

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
            A           + ++A H     +E  + + H              E+ +V M EE I 
Sbjct: 877  A--------AAKERRRAQH----QREQKETAEH-------------EEIRRVMM-EEGID 910

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             + E+E  ++  +D + G PLP D +L +IPVC P++A+  YKY+ K+ PG  KKGK ++
Sbjct: 911  ILDEDEASQMTVLDSIVGTPLPGDEILEIIPVCAPWNALGRYKYKAKLQPGATKKGKAVK 970


>gi|358398026|gb|EHK47384.1| hypothetical protein TRIATDRAFT_238226 [Trichoderma atroviride IMI
           206040]
          Length = 1068

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/748 (33%), Positives = 392/748 (52%), Gaps = 93/748 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELQASLVTLRLANVYDLSSKILLLKFAKPDN--------KQQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R H T +AR     PS F  +LRK+++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  ENGFRCHLTDFARTTAAAPSAFVARLRKYLKTRRLTAVTQVGTDRILEFQFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKL 177
            LE +A GNI+LTD++  +L + R+  + +   A     +Y  E  + +      T  ++
Sbjct: 111 FLEFFASGNIILTDADLKILAISRNVGEGEGQEAQQVGLQYSLENRQNYGGIPALTKERI 170

Query: 178 HAAL-TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
             AL T++++ +ANE                       G  +F   K   K+  D  +A 
Sbjct: 171 RDALKTAAEKAEANE-----------------------GANTFSGKKAKGKSGGDLRKAL 207

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             ++  +        P L E+I+         KL++V   E + +  LV  +++  D ++
Sbjct: 208 AVSITEL-------PPTLVENILQANSFDVTAKLADVIDNE-SLLDALVRYLSEARDIVE 259

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFR---SREF 348
           + I+      GYI  + K           G+++Q     +YD+F P + ++F+   S E 
Sbjct: 260 N-ITASATCTGYIFAKKK--ATSSSGLVEGNASQKREGLLYDDFHPFIPHKFKKDSSFEI 316

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++FE ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q  R+  L+     +++
Sbjct: 317 LEFEGYNRTVDEFFSSLEGQKLESRLTGREEAAKKKLEDARHEQGKRIQGLQDAQAMNLR 376

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNC 467
            A  IE N+E V  A+ AV   +A  M W D+ ++++ E+K  NPVA  I+  L L  N 
Sbjct: 377 KAAAIEANVERVQEAMDAVNGLIAQGMDWIDIGKLIEREKKRQNPVAETINLPLKLSENT 436

Query: 468 MSLLLSN----------------NLDEMDDEE---------KTLPVEKVEVDLAL--SAH 500
           ++LLL+                   DE D EE          T P + + VD+ L  S  
Sbjct: 437 ITLLLAEEEFDEDEDEAQEANPYETDESDSEEGLSEANATKDTKPAKLLTVDIVLNVSPW 496

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
           +NAR +YE ++    K+EKT    +KA K+ E K     +  + QEK +  +  +RK  W
Sbjct: 497 SNAREYYEQRRSAAIKEEKTQQQATKALKSTEHKIAEDLKKGLKQEKAL--LQPIRKQLW 554

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF WFISS+ YLV+ G+D QQ+E++ +RY+ KGD+Y HAD+ GA++ VIKN+   P+ 
Sbjct: 555 FEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDIYCHADIRGAANIVIKNNPNTPDA 614

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG  +VC S+AWDSK    AWWV   QVSK+A TGE +  G+F+I GKKN+
Sbjct: 615 PIPPATLSQAGSLSVCSSEAWDSKAGMGAWWVNTDQVSKSASTGEIMPAGNFIIEGKKNY 674

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHLNER 702
           LPP  L++G G  FR+ E S GSHL  R
Sbjct: 675 LPPTQLLLGLGFAFRISEQSKGSHLKHR 702



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            E AE +++  AM  E +  +  +E  +  ++D L G PL  D ++ VIPVC P++A+  +
Sbjct: 901  EVAEQEEIRRAMMNEGLDLLEPDEAEKATNLDTLVGTPLAGDEIIEVIPVCAPWNALVRF 960

Query: 1051 KYRVKIIPGTAKKGKGIQ 1068
            KY+VK+ PG+ KKGK ++
Sbjct: 961  KYKVKMQPGSVKKGKAVK 978


>gi|406864313|gb|EKD17358.1| serologically defined colon cancer antigen 1 [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 1052

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/726 (34%), Positives = 390/726 (53%), Gaps = 94/726 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDLS K ++ K              K  +L++SG R H T ++R     P+
Sbjct: 21  LVTLRVSNIYDLSSKIFLIKFAKPD--------HKQQILIDSGFRCHLTEFSRATAAAPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK+++TRR+  +  +G DRII FQF  G   + + LE YA GNI+LTD E  +L
Sbjct: 73  AFVTRLRKYLKTRRVTSIAPVGTDRIIEFQFSDGQ--YRLFLEFYAGGNIILTDKELNIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR                       V E     +L   L  S E      ++ N  G 
Sbjct: 131 ALLRI----------------------VGEGEGQEELRVGLKYSLE------NRQNYAG- 161

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA-KQPTLKT--VLGEALG-----YGP 252
            V   +KE L         D  + S    +DG  A KQP  K    L  AL      Y P
Sbjct: 162 -VPPLTKERLQ--------DALQKSVDRGDDGLVAGKQPKKKASDALRRALAVSITEYPP 212

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +H +  T    ++K ++V + +D  +  L+ ++ + +  +Q++ S ++  +GYI+ +
Sbjct: 213 MLVDHAMRVTDFDASLKPADVLQSQD-LLDHLMRSLQEAQSVVQEITSSEVA-KGYIIAK 270

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQR 369
            K   ++  P +      IY++F P    QF    +  F++F+ F+   D+F+S IE Q+
Sbjct: 271 KKEGYEEASPEDQARKFVIYEDFHPFRPRQFENDPATVFLEFQGFNKTADQFFSSIEGQK 330

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
            E + + +E  A  K+     DQ  R+  L++  + +++ A  ++ N E V  A+ AV  
Sbjct: 331 LESRLQEREQMAKRKIEAARQDQAKRLGGLQEVQELNIRKAGALQANAERVQEAMDAVNG 390

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL----------------- 471
            +A  M W ++ ++V+ E+K  NPVA +I   L L+ N +SLL                 
Sbjct: 391 LVAQGMDWVEIGKLVEIEQKRNNPVASIIKLPLKLQENTISLLLDEEEDADDDESNYETD 450

Query: 472 --LSNNLDEMDDEE-KTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
             +S++ DE   +E K   VEK   ++V+LALS  ANAR +Y+ K+    K++KT+ + +
Sbjct: 451 SDVSDSEDEAPKKEPKQKTVEKRLTIDVNLALSPWANAREYYDQKRTAAEKEQKTLQSST 510

Query: 526 KAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
           KA K+ E K     R  + QEK V  +  +R+  WFEKF WFISS+ YLV++G+D QQ E
Sbjct: 511 KALKSQEAKIAHDLRKGLKQEKAV--LRPVRRQMWFEKFTWFISSDGYLVLAGKDPQQKE 568

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
            + +RY+ KGDVYVHA++ GA+S VI+N+   P+ P+PP TL+QAG  ++  S AW++K 
Sbjct: 569 TLYRRYLKKGDVYVHAEVQGAASVVIRNNPKTPDAPIPPSTLSQAGTLSISCSSAWEAKA 628

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
             SAWWV   QVSK A TGE+L  GSF I+GKKNFLPP  L++GFG++F + E S  +H 
Sbjct: 629 GMSAWWVNADQVSKAASTGEFLPAGSFNIKGKKNFLPPAVLLLGFGVIFLISEESKVNH- 687

Query: 700 NERRVR 705
           N+ R++
Sbjct: 688 NKHRLQ 693



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            ++ L G P   D ++  IPVC P++A+ +YKY+ K+ PGT KKGK ++
Sbjct: 922  LEQLVGRPSKGDEIIEAIPVCAPWAAMGNYKYKAKLQPGTQKKGKAVK 969


>gi|317033383|ref|XP_001395552.2| hypothetical protein ANI_1_620104 [Aspergillus niger CBS 513.88]
          Length = 1108

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 290/973 (29%), Positives = 466/973 (47%), Gaps = 123/973 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 111 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G    +  K S K + D        L
Sbjct: 163 -----------PDITRERVKETVEKAK-ALFAQEG----NAPKKSKKKNAD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+ + V+ V +      D +
Sbjct: 200 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKL 257

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+ ++        P +           +Y++F P    QF  +  V   ++
Sbjct: 258 ATEKSHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+AA  KL+ +  +   R+  LK+  +  ++ A 
Sbjct: 318 PSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAG 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 378 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 437

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     +                       +++DL LS  ANA ++YE
Sbjct: 438 MLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYE 497

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 498 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 555

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRD  Q+E++ +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL 742
           GFG++F++ + SL +H   R         D+   +    E  + + E DD   KPV +  
Sbjct: 676 GFGVMFQVSKESLRNHKLHR--------FDEPVATEAPVEGQEADKEADD---KPVEQEA 724

Query: 743 SVPNSAHP--------APSHTNASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTP 793
            +  S  P          S +     D    PA +       + ++   IA N +    P
Sbjct: 725 QITKSERPAEAEQEQEQSSESEGEQEDDAVIPARNPLQRGSSEPTQTESIAANESQNAQP 784

Query: 794 QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT---VRDKPYISKAERRK 850
             +D  +      +   +      ++ Q + +E++K  + + T     D   +S  ERR 
Sbjct: 785 --DDAAEEEKEEEAEEPNGNNEDEQSAQEEPAEDEKDEDESGTSPQTYDDRQLSARERRM 842

Query: 851 LKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQ 910
            +KG+ S +  P       +  ++   P              +RG++GK KK  +KY DQ
Sbjct: 843 ARKGRASELDGPAANGTSAKSTNSKQAP--------------TRGKRGKAKKAAQKYADQ 888

Query: 911 DEEERNIRMALLA 923
           DEE+R + + LL 
Sbjct: 889 DEEDRELALRLLG 901



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P P D +L  IPVC P++A+  YKYR+K+ PGT KKGK ++
Sbjct: 971  IPALVGTPHPDDEILAAIPVCAPWAALGRYKYRIKLQPGTVKKGKAVK 1018


>gi|134080270|emb|CAK97173.1| unnamed protein product [Aspergillus niger]
          Length = 1180

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 285/953 (29%), Positives = 457/953 (47%), Gaps = 122/953 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           ++ +R SN+YDLS + ++FK+             +  L+++SG R H T Y+R   + P+
Sbjct: 93  IVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVVDSGFRCHVTQYSRATASAPT 144

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  ++RK +++RR+  + Q+G DRII F F  GM  +++ LE +A GNI++TD E+ +L
Sbjct: 145 PFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHMFLEFFAGGNIIITDREYNIL 202

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            L R               +  T +   +  T     H             PD   E   
Sbjct: 203 ALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI-----------PDITRERVK 243

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG-YGPALSEHII 259
                +K  L  Q+G    +  K S K + D        L+  L +    Y P L +H  
Sbjct: 244 ETVEKAKA-LFAQEG----NAPKKSKKKNAD-------VLRKALSQGFPEYPPLLLDHAF 291

Query: 260 LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKD 319
               L P   L EV  L+D A+ + V+ V +      D ++ +    GYI+ ++      
Sbjct: 292 AVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKLATEKSHPGYIVAKDDTRPSA 349

Query: 320 HPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KFETFDAALDEFYSKIESQRAE 371
             P +           +Y++F P    QF  +  V   ++ +F+A +DE++S IE+Q+ E
Sbjct: 350 DSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEYPSFNATVDEYFSSIETQKLE 409

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            +   +E+AA  KL+ +  +   R+  LK+  +  ++ A  IE N+  V  A+ AV   +
Sbjct: 410 SRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAGAIEDNVYRVQEAMDAVNGLI 469

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKTLPVEK 490
           A  M W ++AR+++ E+  GNPVA +I   L L  N ++L+L  + +E D+ E     + 
Sbjct: 470 AQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITLMLGESGEEQDEGEDLFSDDD 529

Query: 491 ----------------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
                                 +++DL LS  ANA ++YE KK    K++KT  + +KA 
Sbjct: 530 SESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYEQKKMAAVKEQKTTQSSTKAL 589

Query: 529 KAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           K+ EKK     +  + QEK V  +   RK  WFEKF +FISSE YLV+ GRD  Q+E++ 
Sbjct: 590 KSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFISSEGYLVLGGRDVMQSEILY 647

Query: 585 KRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+QAG   V  S AWDSK + S
Sbjct: 648 RRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLSQAGNLCVATSSAWDSKAIMS 707

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           A+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++GFG++F++ + SL +H   R
Sbjct: 708 AYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVLGFGVMFQVSKESLRNHKLHR 767

Query: 703 RVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHP--------APSH 754
                    D+   +    E  + + E DD   KPV +   +  S  P          S 
Sbjct: 768 --------FDEPVATEAPVEGQEADKEADD---KPVEQEAQITKSERPAEAEQEQEQSSE 816

Query: 755 TNASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
           +     D    PA +       + ++   IA N +    P  +D  +      +   +  
Sbjct: 817 SEGEQEDDAVIPARNPLQRGSSEPTQTESIAANESQNAQP--DDAAEEEKEEEAEEPNGN 874

Query: 814 KHGIETTQFDLSEEDKHVERTAT---VRDKPYISKAERRKLKKGQGSSVVDPKVEREKER 870
               ++ Q + +E++K  + + T     D   +S  ERR  +KG+ S +  P       +
Sbjct: 875 NEDEQSAQEEPAEDEKDEDESGTSPQTYDDRQLSARERRMARKGRASELDGPAANGTSAK 934

Query: 871 GKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
             ++   P              +RG++GK KK  +KY DQDEE+R + + LL 
Sbjct: 935 STNSKQAP--------------TRGKRGKAKKAAQKYADQDEEDRELALRLLG 973



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P P D +L  IPVC P++A+  YKYR+K+ PGT KKGK ++
Sbjct: 1043 IPALVGTPHPDDEILAAIPVCAPWAALGRYKYRIKLQPGTVKKGKAVK 1090


>gi|307109165|gb|EFN57403.1| hypothetical protein CHLNCDRAFT_57209 [Chlorella variabilis]
          Length = 1158

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 233/513 (45%), Positives = 295/513 (57%), Gaps = 88/513 (17%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK  L + + YGP  +EH +L  GL P  +      L       L+  V ++E WL   
Sbjct: 203 TLKGCLADLVPYGPLTAEHCVLLAGLEPQRQ-PAAAPLSALEAAALLGGVRQWEAWLDAC 261

Query: 299 ISGDIVPEGYILMQNKHL------------------GKDHPPTESGSSTQ-IYDEFCPLL 339
                 PEG+IL +                      G++        +   +YDEF PLL
Sbjct: 262 EDSATPPEGFILTKPAAAAAAAAVAAVAAAPPAPAAGQEDGGDGGAPAAAGVYDEFQPLL 321

Query: 340 LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL 399
           L                        I+ QR+  Q  AKE AA  KL  I  D E R+ +L
Sbjct: 322 L------------------------IQGQRSAHQQAAKEKAAVGKLEAIRRDHEKRLGSL 357

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
            QE + +   A LIEYNLE VDAA+ AVR ALA+ M W DLARMVKEER+AGNPVAGLID
Sbjct: 358 GQEAEAAELKAALIEYNLEAVDAALNAVREALASGMDWRDLARMVKEERRAGNPVAGLID 417

Query: 460 KLYLERNCMSLLL-------------------SNNLDEMDDEEK--TLPVEKVEVDLALS 498
            L LER+ ++LLL                    N LDE D +E+  T P  KVEVDL LS
Sbjct: 418 SLQLERSRVTLLLRRARVCAWGGGGVAGGVRGGNWLDEEDGDEEAATRPATKVEVDLGLS 477

Query: 499 AHANARRWYELKKKQES-----KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM-R 552
           AHANAR +Y+ ++K ++     KQ+KT+ A+ KA KAAEKK + Q+ Q ++ A    + R
Sbjct: 478 AHANARTYYDSRRKHQARGAGVKQQKTLDANQKALKAAEKKAQQQLKQVRSAAAAPAITR 537

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
           K  WFEKF WFISSENYLV+SGRDAQQNE++VKRY+ +GD YVHADLHGASST+++N  P
Sbjct: 538 KPFWFEKFFWFISSENYLVLSGRDAQQNELLVKRYLRRGDAYVHADLHGASSTIVRNSDP 597

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
             P+PPLTL+QAG   VC SQAWD+K+VTSAWWV+P QVSKTAP+GEYL           
Sbjct: 598 GAPIPPLTLSQAGQACVCRSQAWDAKIVTSAWWVHPEQVSKTAPSGEYL----------- 646

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
                 PL+MG+G +F L E S+ +H+ ER  R
Sbjct: 647 ------PLVMGYGYMFGLAEESIPAHMGERAPR 673



 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 104/187 (55%), Positives = 127/187 (67%), Gaps = 24/187 (12%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK R ++ADVAAEV CL+R +GMR +NVYD++PKTY+ KL  S    E GE  KVLLL+
Sbjct: 1   MVKQRFSSADVAAEVSCLQRCLGMRVANVYDINPKTYVLKLARSG---EDGE--KVLLLI 55

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HT     +K +TPS FTLKLRKHIRTRRLE V+QLG DRI+   FG G  + ++
Sbjct: 56  ESGVRFHTVQAMPEKADTPSNFTLKLRKHIRTRRLEAVKQLGVDRIVQLSFGSGPASCHL 115

Query: 121 ILELYAQ-------------------GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           +LE YAQ                   GN++L D +F VLTLLRSHRDD KGVAIM+RH Y
Sbjct: 116 LLEFYAQASGRRQGELCFGTCMHPCAGNVILADDKFEVLTLLRSHRDDAKGVAIMARHPY 175

Query: 162 PTEICRV 168
           P +  R+
Sbjct: 176 PIQTIRL 182



 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 51/74 (68%)

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
            ++EE +  +GEEE+ +L  +D LTG P   DILL+ +PVC PY  + S+K++VKIIPGT 
Sbjct: 1036 LKEEKLEALGEEERDKLTQLDQLTGVPRGEDILLFAVPVCAPYQVLASFKFKVKIIPGTL 1095

Query: 1062 KKGKGIQIFYSLLL 1075
            KKGK  +    LLL
Sbjct: 1096 KKGKAARQAAELLL 1109


>gi|358379255|gb|EHK16935.1| hypothetical protein TRIVIDRAFT_10609, partial [Trichoderma virens
           Gv29-8]
          Length = 1079

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 259/798 (32%), Positives = 395/798 (49%), Gaps = 138/798 (17%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKVIAHELQGSLVTLRLANVYDLSSKILLLKFAKPDN--------KQQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK+++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DNGFRCHLTDFARTTAAAPSAFVARLRKYLKTRRLTSVAQVGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS-------------------HRDDDKGVAIMSRHRY 161
            L+ +A GNI+LTD++  +L + R+                   +R +  G+  +++ R 
Sbjct: 111 FLKFFASGNIILTDADLKILAISRNVSEGEGQEPQGVGLQYSLENRQNFGGIPALTKERI 170

Query: 162 PTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDL 221
              +    E+  ASK+ A  + SK                          G+ GG   DL
Sbjct: 171 RDALKTAAEKAEASKVAATFSGSKAK------------------------GKSGG---DL 203

Query: 222 SKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI 281
            K                L   + E     PAL E+I+       + K ++V    DN +
Sbjct: 204 RK---------------ALAVSITE---LPPALVENILQANSFDVSAKPADVV---DNEL 242

Query: 282 QV--LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL---GKDHPPTESGSSTQIYDEFC 336
            +  LV  +++  D ++++I+     +GYI  + K     G D           +Y++F 
Sbjct: 243 LLDELVKHLSEARDIVENIIASATC-KGYIFAKKKTAPSSGPDETDQAQKHEGLLYEDFH 301

Query: 337 PLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           P +  +F+   S + ++FE ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q 
Sbjct: 302 PFVPQKFKNDPSIQVLEFEGYNRTVDEFFSSLEGQKLESRLSGREEAAKKKLEAARHEQA 361

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L+     +++ A  IE N+E V  A+ AV   LA  M W D+ ++++ E+K  NP
Sbjct: 362 KRIEGLQDAQAMNLRKAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNP 421

Query: 454 VAGLID-KLYLERNCMSLLLSN-----------NLDEMDDEEKTLPVEKVE--------- 492
           VA +I   L L  N ++LLL+            N  E DD +      +V          
Sbjct: 422 VAEIISLPLKLAENTITLLLAEEEFDEDEAAEDNPFETDDSDSEAEASEVTPTKDKKADK 481

Query: 493 ---VDLAL--SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
              VD+ L  S  +NAR +YE ++    K+EKT    +KA K+ E+K     +  + QEK
Sbjct: 482 LLTVDIVLNTSPWSNAREYYEERRSAAMKEEKTQLQANKALKSTEQKIAEDLKKGLKQEK 541

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
            +  +  +RK  WFEKF WFISS+ YLV+ G+D QQ+EM+ +RY+ KGDVY HAD+ GA+
Sbjct: 542 AL--LQPIRKQMWFEKFIWFISSDGYLVLGGKDPQQSEMLYRRYLRKGDVYCHADIRGAA 599

Query: 604 STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
             VIKN+   P+ P+PP TL+QAG  +VC S AWDSK     WWV   QVSK+ PTG+ L
Sbjct: 600 HIVIKNNPNTPDAPIPPATLSQAGSLSVCTSDAWDSKAGMGGWWVNADQVSKSTPTGDIL 659

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEE- 708
             G+F I+GKKN+LPP  L++G G  F++ E S G HL  R               GE+ 
Sbjct: 660 PAGNFTIQGKKNYLPPTQLLLGLGFTFKISEQSKGKHLKHRVHDERSSLATETATTGEDE 719

Query: 709 ----EGMDDFEDSGHHKE 722
               E +D+ EDSG   E
Sbjct: 720 LQNAEEVDNSEDSGDESE 737


>gi|297297786|ref|XP_002805097.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Macaca mulatta]
          Length = 856

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 240/686 (34%), Positives = 347/686 (50%), Gaps = 147/686 (21%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEIVASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 --------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWYE 508
                         N  E    +K     K            V+VDL+LSA+ANA+++  
Sbjct: 419 EEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKF-- 476

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
                                                       K  WF      ISSEN
Sbjct: 477 -------------------------------------------EKFLWF------ISSEN 487

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG   
Sbjct: 488 YLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMA 546

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKT 654
           +C+S AWD++++TSAWWVY HQ+ ++
Sbjct: 547 LCYSAAWDARVITSAWWVYHHQIIRS 572



 Score = 79.7 bits (195), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 72/178 (40%), Positives = 100/178 (56%), Gaps = 23/178 (12%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG  ++       E     K+ K    
Sbjct: 648  MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGSNRE-------EKGKKGKKGKTKDE 700

Query: 952  PVDAPKVCYKCKKAGHLSKDCK-EHP--DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
            PV   K   K +    +S + K E P  +  +H ++D     +D+  + DK   EE+D+ 
Sbjct: 701  PVK--KQPQKPRGGQRISDNIKKETPFLEVITHELQD---FAVDDPHD-DK---EEQDLD 751

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            + G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 752  QQGNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 805


>gi|259479735|tpe|CBF70228.1| TPA: DUF814 domain protein, putative (AFU_orthologue; AFUA_2G09170)
           [Aspergillus nidulans FGSC A4]
          Length = 1100

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 393/748 (52%), Gaps = 90/748 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    K L   L+G+R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRYSSLDVQVISKELASELVGLRVSNIYDLSTRIFLFKVAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R    TPSGF  +LRK++++RR+  V Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATAATPSGFVSRLRKYLKSRRITSVTQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GNI++TD ++T++ LLR               + P       E    +K+   
Sbjct: 111 LLEFFASGNIIITDRDYTIIALLR---------------QVPGG-----EGMEEAKVGLK 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T + + + +    +  D    +    + L  Q+     D  K S K S D        L
Sbjct: 151 YTVTNKQNYSGIPPITRDRIRETLEKAKALFAQEN----DAPKKSKKKSTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H        P M L +V  L D  +  +VL V +    +   +
Sbjct: 200 RRALSQGFPEYPPLLLDHAFATRAADPAMPLDQV--LGDAGLIDVVLGVLEEAQNVTKDL 257

Query: 300 SGDIVPEGYILMQNKHLGK-DHPPTESGSSTQ----IYDEFCPLLLNQFRSRE---FVKF 351
           S D    G+I+ +     K   P +E   S      +Y++F P    QF  ++    +++
Sbjct: 258 SADKAHPGFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPFKPRQFEGKDGFTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            + +A +DE++S IESQ+ E +   +E AA  KL+ +  + E R+  L+Q  +  ++ A 
Sbjct: 318 PSMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKAS 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N++ V  A+ AV   +A  M W ++AR+V+ E+K GNPVA LI   L L  N ++L
Sbjct: 378 AIQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITL 437

Query: 471 LLSNNLDEMDDEEKTL------------------PVEK-----VEVDLALSAHANARRWY 507
           LL    DE  + E+                    P +K     +++DL LS  ANA ++Y
Sbjct: 438 LLREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYY 497

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWF 563
           E KK    K EKT  + +KA K+ E+K     +  + QEK V  +   RK  WFEKF +F
Sbjct: 498 EQKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQV--LRPARKPFWFEKFLFF 555

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPPLT 620
           +SSE YLV+ GRD+ Q+EM+ +RY+ KGDV+VHADL GA+  ++KN +P      + P T
Sbjct: 556 VSSEGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKN-KPGALSSSISPTT 614

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L+QAG   V  S AWDSK + SA+WV   QVSKT+  G+ L VG F+++G+KNFL P  L
Sbjct: 615 LSQAGNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQL 674

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEE 708
           ++GF +++++   S GS +N +  R EE
Sbjct: 675 VLGFAVMWQI---SKGSLVNHKSFRSEE 699



 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 31/45 (68%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L G P   D +L  IP+C P+S++  YKYRVK+ PGT KKGK ++
Sbjct: 969  LVGTPHVDDEILAAIPICAPWSSLGRYKYRVKLQPGTVKKGKAVK 1013


>gi|67539818|ref|XP_663683.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
 gi|40738864|gb|EAA58054.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
          Length = 1588

 Score =  371 bits (953), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 393/748 (52%), Gaps = 90/748 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    K L   L+G+R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRYSSLDVQVISKELASELVGLRVSNIYDLSTRIFLFKVAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R    TPSGF  +LRK++++RR+  V Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATAATPSGFVSRLRKYLKSRRITSVTQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GNI++TD ++T++ LLR               + P       E    +K+   
Sbjct: 111 LLEFFASGNIIITDRDYTIIALLR---------------QVPGG-----EGMEEAKVGLK 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T + + + +    +  D    +    + L  Q+     D  K S K S D        L
Sbjct: 151 YTVTNKQNYSGIPPITRDRIRETLEKAKALFAQEN----DAPKKSKKKSTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H        P M L +V  L D  +  +VL V +    +   +
Sbjct: 200 RRALSQGFPEYPPLLLDHAFATRAADPAMPLDQV--LGDAGLIDVVLGVLEEAQNVTKDL 257

Query: 300 SGDIVPEGYILMQNKHLGK-DHPPTESGSSTQ----IYDEFCPLLLNQFRSRE---FVKF 351
           S D    G+I+ +     K   P +E   S      +Y++F P    QF  ++    +++
Sbjct: 258 SADKAHPGFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPFKPRQFEGKDGFTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            + +A +DE++S IESQ+ E +   +E AA  KL+ +  + E R+  L+Q  +  ++ A 
Sbjct: 318 PSMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKAS 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N++ V  A+ AV   +A  M W ++AR+V+ E+K GNPVA LI   L L  N ++L
Sbjct: 378 AIQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITL 437

Query: 471 LLSNNLDEMDDEEKTL------------------PVEK-----VEVDLALSAHANARRWY 507
           LL    DE  + E+                    P +K     +++DL LS  ANA ++Y
Sbjct: 438 LLREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYY 497

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWF 563
           E KK    K EKT  + +KA K+ E+K     +  + QEK V  +   RK  WFEKF +F
Sbjct: 498 EQKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQV--LRPARKPFWFEKFLFF 555

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPPLT 620
           +SSE YLV+ GRD+ Q+EM+ +RY+ KGDV+VHADL GA+  ++KN +P      + P T
Sbjct: 556 VSSEGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKN-KPGALSSSISPTT 614

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L+QAG   V  S AWDSK + SA+WV   QVSKT+  G+ L VG F+++G+KNFL P  L
Sbjct: 615 LSQAGNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQL 674

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEE 708
           ++GF +++++   S GS +N +  R EE
Sbjct: 675 VLGFAVMWQI---SKGSLVNHKSFRSEE 699



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 31/45 (68%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L G P   D +L  IP+C P+S++  YKYRVK+ PGT KKGK ++
Sbjct: 969  LVGTPHVDDEILAAIPICAPWSSLGRYKYRVKLQPGTVKKGKAVK 1013


>gi|315050252|ref|XP_003174500.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
 gi|311339815|gb|EFQ99017.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
          Length = 1093

 Score =  369 bits (947), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 317/1118 (28%), Positives = 519/1118 (46%), Gaps = 209/1118 (18%)

Query: 14   EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
            +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9    DVKVISRELSTNILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHI 60

Query: 69   TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
            T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y  LE +A G
Sbjct: 61   TESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAG 118

Query: 129  NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
            N++LTD+++ ++ LL              RH  P       +     KL +         
Sbjct: 119  NLILTDAKYGIVALL--------------RHVAPGSDIEEVKVGMTYKLES--------- 155

Query: 189  ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
                 K+N +G  +   + E L              S  + ++G++  + +L     E  
Sbjct: 156  -----KMNYNG--IPPLTVERL-------------KSALSKDNGSKVLKRSLYFGFPE-- 193

Query: 249  GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
             Y P L +H     G   + KL     L DN +   ++ V +  D + + +S D    GY
Sbjct: 194  -YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQGLMGVLQEADRINNTLSSDCQHPGY 250

Query: 309  ILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQFR---SREFVKFETFDAALDE 360
            I+ +N         ++ G STQ      + +F P   +Q +   +   ++FE+F++A+D+
Sbjct: 251  IIAKNIAPSA----SDGGDSTQQAPVTEFRDFHPFEPSQTKDLPNTTTLRFESFNSAVDK 306

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL  V
Sbjct: 307  YFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIETNLLQV 366

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL--- 476
            + A+ AV   +A  M W ++AR+++ E+   NPVA  I   L L  N +++LL+  +   
Sbjct: 367  EEAMTAVNGLVAQGMDWVEIARLIEMEQGKRNPVALSIKLPLKLYENTITVLLNEEVAEE 426

Query: 477  -------------------------------------DEMDDEEKTLPVEKVEVDLALSA 499
                                                  + + +EK      +++DL +S 
Sbjct: 427  EEEEESDESDEEEDEDDDDGYGDDEYERPKQKKRLVNPQREKKEKKDTRLSIDIDLGISP 486

Query: 500  HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKVH 555
             ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   R   
Sbjct: 487  WANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNPT 544

Query: 556  WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
            WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    +IKN      
Sbjct: 545  WFEKFFFFISSDGYLVIGGRDQQQDEILFQRYMKKGDIYVHTDLEGGVPLIIKNKPDTPD 604

Query: 616  VPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
             P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+G+KN
Sbjct: 605  DPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKN 664

Query: 674  FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
             +PP  +++GF +LF++   S+ +H   + +    EG    + + +   +S  ++ + D 
Sbjct: 665  HIPPGQIVLGFAVLFQISSQSIQNHA--KSLPATSEG----DVNNYQPISSAADTAQSDR 718

Query: 734  DEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTP 793
            DE       +VP+    A  H   S+ +  E   +DK +S  ++ K+  I          
Sbjct: 719  DE-------NVPSEQEDA--HEPGSDGEKEEL-NDDKAVS--LEEKVEFI---------- 756

Query: 794  QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKK 853
              ED +D      SA +  T+      Q  L  E++    ++T+ ++P  S     +   
Sbjct: 757  YFEDDLDP----DSAQVHETEK-----QEALQPEEQSAHGSSTIAEEPEDSNESEDE--- 804

Query: 854  GQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGK---LKKMKEKYGDQ 910
               S +  P   +E        S+P + +  +     K     +GK    KK+  KY DQ
Sbjct: 805  ---SQLTTPSAVQE--------SRPSTPLVISSAGTQKFRPPVRGKRGKAKKLAMKYKDQ 853

Query: 911  DEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSK 970
            DEE+R + + LL SA         P+ + A    E+       +A K   + +    L  
Sbjct: 854  DEEDRKLALRLLGSAAGTSTPANKPKTK-ADIEAER-------EAQKERRRAQHERALQA 905

Query: 971  DCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLP 1030
              ++    + + VED+                        GEE K   + +  L G P+ 
Sbjct: 906  VKRQQEAFTRNSVEDS-----------------------TGEEHKLDFSILPALVGTPVE 942

Query: 1031 SDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             D +   IPVC P++A+  YKYR K+ PG  KKGK ++
Sbjct: 943  GDEIEAAIPVCAPWTALGQYKYRAKLQPGKIKKGKAVK 980


>gi|63054438|ref|NP_588145.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe
           972h-]
 gi|48475020|sp|Q9USN8.2|YJY1_SCHPO RecName: Full=Uncharacterized protein C132.01c
 gi|157310510|emb|CAA22870.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe]
          Length = 1021

 Score =  369 bits (947), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 389/748 (52%), Gaps = 89/748 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  D+AA    LR +++G R +N YDL+ +T++ K           +  K  +++
Sbjct: 1   MKQRFSALDIAAIAAELREQVVGCRLNNFYDLNARTFLLKF--------GKQDAKYSIVI 52

Query: 61  ESGVRLHTTAYARDKKNTP-SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--- 116
           ESG R H T +  D++N P SGF  KLRKHI++RRL  V QLG DR+++F FG G N   
Sbjct: 53  ESGFRAHLTKF--DRENAPLSGFVTKLRKHIKSRRLTGVSQLGTDRVLVFTFGGGANDQD 110

Query: 117 ---AHYVILELYAQGNILLTDSEFTVLTLLRS-HRDDDKGVAIMSRHRYP-------TEI 165
               +Y++ E +A GN+LL D  + +L+LLR    D D+  A+  ++           + 
Sbjct: 111 PDWTYYLVCEFFAAGNVLLLDGHYKILSLLRVVTFDKDQVYAVGQKYNLDKNNLVNDNKS 170

Query: 166 CRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNS 225
                  TA +L+  L       A+ P  +NE                       L    
Sbjct: 171 QSTIPHMTAERLNILLDEISTAYAS-PTSINEP----------------------LPDQQ 207

Query: 226 NKNSNDGARAKQP-TLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
             +S    +  +P +L+  L   LG YG AL EH +  + L P     ++    D   + 
Sbjct: 208 LSSSTKPIKVPKPVSLRKALTIRLGEYGNALIEHCLRRSKLDPLFPACQL--CADETKKN 265

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQF 343
            +LA  +  D +   ++   V +GYI    + L     P      T +Y++F P    Q 
Sbjct: 266 DLLAAFQEADSILAAVNKPPV-KGYIFSLEQALTNAADPQHPEECTTLYEDFHPFQPLQL 324

Query: 344 --RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
              +R+ ++F T++  +DEF+S IE+Q+ +++   +   A  +L     DQ  ++ +L+ 
Sbjct: 325 VQANRKCMEFPTYNECVDEFFSSIEAQKLKKRAHDRLATAERRLESAKEDQARKLQSLQD 384

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
                   A+ IE N E V+A I  +   L   M W D+ ++++ +++  +PVA  I   
Sbjct: 385 AQATCALRAQAIEMNPELVEAIISYINSLLNQGMDWLDIEKLIQSQKRR-SPVAAAIQIP 443

Query: 461 LYLERNCMSLLLSN--NLDEMDDEEKTLPVEK--------------------VEVDLALS 498
           L L +N +++ L N  ++D  D+  +T   +                     VE+DL+L 
Sbjct: 444 LKLIKNAVTVFLPNPESVDNSDESSETSDDDLDDSDDDNKVKEGKVSSKFIAVELDLSLG 503

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM---RKVH 555
           A ANAR+ YEL+++   K+ KT  A SKA K+ ++K   Q L+  T A+   +   RK  
Sbjct: 504 AFANARKQYELRREALIKETKTAEAASKALKSTQRKIE-QDLKRSTTADTQRILLGRKTF 562

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           +FEKF+WFISSE YLV+ GRDAQQNE++ ++Y + GD++V ADL  +S  ++KN  P  P
Sbjct: 563 FFEKFHWFISSEGYLVLGGRDAQQNELLFQKYCNTGDIFVCADLPKSSIIIVKNKNPHDP 622

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL QAG   +  S+AWDSK V SAWWV   +VSK APTGE L  GSF IR KKN+L
Sbjct: 623 IPPNTLQQAGSLALASSKAWDSKTVISAWWVRIDEVSKLAPTGEILPTGSFAIRAKKNYL 682

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERR 703
           PP  LIMG+G+L++LDE S     +ERR
Sbjct: 683 PPTVLIMGYGILWQLDEKS-----SERR 705



 Score = 47.0 bits (110), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 28/46 (60%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            +D LT NP   D ++  +P   PY+A+  +  +VK++PGT K GK 
Sbjct: 922  IDSLTPNPQQQDTVINAVPTFAPYNAMTKFNQKVKVMPGTGKVGKA 967


>gi|350636898|gb|EHA25256.1| hypothetical protein ASPNIDRAFT_49657 [Aspergillus niger ATCC 1015]
          Length = 1515

 Score =  367 bits (943), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 240/736 (32%), Positives = 378/736 (51%), Gaps = 84/736 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 111 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G    +  K S K + D        L
Sbjct: 163 -----------PDITRERVKETVEKAKA-LFAQEG----NAPKKSKKKNAD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+ + V+ V +      D +
Sbjct: 200 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKL 257

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+ ++        P +           +Y++F P    QF  +  V   ++
Sbjct: 258 ATEKSHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+AA  KL+ +  +   R+  LK+  +  ++ A 
Sbjct: 318 PSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAG 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 378 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 437

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     +                       +++DL LS  ANA ++YE
Sbjct: 438 MLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYE 497

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 498 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 555

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRD  Q+E++ +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 683 GFGLLFRLDESSLGSH 698
           GFG++F++ + SL +H
Sbjct: 676 GFGVMFQVSKESLRNH 691



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P P D +L  IPVC P++A+  YKYR+K+ PGT KKGK ++
Sbjct: 910  IPALVGTPHPDDEILAAIPVCAPWAALGRYKYRIKLQPGTVKKGKAVK 957


>gi|396473834|ref|XP_003839430.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
            JN3]
 gi|312215999|emb|CBX95951.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
            JN3]
          Length = 1115

 Score =  367 bits (941), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 289/895 (32%), Positives = 427/895 (47%), Gaps = 142/895 (15%)

Query: 252  PALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEG 307
            P L +H +     D+ L P   L++ + LE      LV+ +       +++   + + +G
Sbjct: 213  PLLVDHALHNADFDSCLKPEQVLADESLLEK-----LVVVLKDARKIAEEITQPEQI-KG 266

Query: 308  YILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS--REFVKFETFDAALDEFYSKI 365
            YIL +            SG +  +Y++F P    QF +   +F++F+ F+ A+DEF+S I
Sbjct: 267  YILAKPNPAVASTEDASSGKAKFLYEDFHPFKSQQFENLDYQFLEFDGFNKAVDEFFSSI 326

Query: 366  ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
            E Q+ E +   +E  A  KL K   + E R+  L+Q  + + + AE I  N+  V  A  
Sbjct: 327  EGQKLESKLTEREQQAKKKLEKARKEHEERIGGLQQVQEMNFRKAEAILANVHRVTEATE 386

Query: 426  AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD--- 481
            AV   +   M W D++R+++ E+  GN VA  I   L L +N ++LLL+    + ++   
Sbjct: 387  AVNGLIRQGMDWVDISRLIEREQAQGNAVAQSIRLPLKLHQNTITLLLNETDWDHEEEEE 446

Query: 482  --------------------EEKTLPVE-------KVEVDLALSAHANARRWYELKKKQE 514
                                ++K  P +        +++DL LSA AN+  +Y+ KK   
Sbjct: 447  DEGNETSSVSEDSEEEEEGSKKKAAPTKVTQQPQLAIDIDLGLSAWANSTEYYDQKKTAA 506

Query: 515  SKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            SK+++T  A SKA K+ EKK     +  + QEK V  +  +RK  WFEK+ +FISS+ YL
Sbjct: 507  SKEDRTAAASSKALKSHEKKVTEDLKKGLKQEKEV--LRPVRKQQWFEKYIYFISSDGYL 564

Query: 571  VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFT 628
            V+ G+DAQQNE+I KR++ KGDVYVHADL GA   +IKN    P+ P+PP TL+QAG  +
Sbjct: 565  VLGGKDAQQNEIIYKRFLRKGDVYVHADLKGAVPMIIKNKPDTPDAPIPPSTLSQAGHLS 624

Query: 629  VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            VC S+AW+SK V SAWWV   QVSKT  TGE+L  G F I GKK FLPP  L++G  ++F
Sbjct: 625  VCTSEAWESKAVMSAWWVRSTQVSKTGQTGEFLPAGMFNITGKKEFLPPAQLVVGLAVMF 684

Query: 689  RLDESSLGSHLNER---------------------RVRGEEEGMDDFEDSGHHKENSDIE 727
             + ESS+ +H   R                     R   + E  D+F D+     + D E
Sbjct: 685  EISESSISNHQKHRIQATAVSAAEMTEDSTNAEEERNEADSEHDDEFPDAKLDSGSDDDE 744

Query: 728  --SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNG--IDSKIFDI 783
                K D  E   AES +     +P  SH     VD H+   ED T  N     ++  DI
Sbjct: 745  FPDAKIDDAEDSDAESEAGALRTNPLQSH---KMVDKHDSETEDDTSPNNKPAGTESHDI 801

Query: 784  ARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYI 843
                 AP      D  D A  +G    +S +H                           +
Sbjct: 802  RE---APAKESTVD--DGAESVGKTDPTSRRH---------------------------L 829

Query: 844  SKAERRKLKKGQ---GSSV-VDPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKG 898
            S  ERR L+KGQ   G+ +   P    E   G   A ++P + V     +   + RG++G
Sbjct: 830  SARERRLLRKGQQLDGADIATGPGSADESVHGDPSAFTKPPATVTSQSSKASALPRGKRG 889

Query: 899  KLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKV 958
            K KK+  KY  QDEE+R + M LL S        G    E A+  K KK   +  D  + 
Sbjct: 890  KAKKLATKYAAQDEEDRALAMRLLGS------QSGQQAAEAAAQEKRKKEEQAQADKQR- 942

Query: 959  CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRL 1018
                       +D       +    E+   V L+        A EE+D  E  E  +  L
Sbjct: 943  ----------RRDQHFRAQATGKAAEEARRVALEN-------AQEEDD--EGDEVLRTNL 983

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSL 1073
              ++  TG PLP D LL  IPVC P+SA+ +YKY+ KI PG+ K+GK ++   ++
Sbjct: 984  TKLNAFTGRPLPGDELLSAIPVCAPWSALSTYKYKAKIQPGSTKRGKAVKEILTI 1038



 Score =  106 bits (265), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 84/145 (57%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PS F  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + +
Sbjct: 53  DSGFRCHLTEYARTTAAAPSAFVAKLRKYLKTRRVTSVAQIGTDRILEFQFSDGL--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE YA GNI+LTD+   +L+LLR+
Sbjct: 111 YLEFYAGGNIVLTDANLHILSLLRN 135


>gi|46128721|ref|XP_388914.1| hypothetical protein FG08738.1 [Gibberella zeae PH-1]
          Length = 1077

 Score =  366 bits (939), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 275/816 (33%), Positives = 403/816 (49%), Gaps = 115/816 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSIFVARLRKFLKTRRLTAVRQVGTDRVLEFEFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD++  +L L R+  + +         + P  +   +           
Sbjct: 111 FLEFFASGNIILTDADLNILALARTVSEGE--------GQEPQRVGLQYSLENRQNYGEI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +KE   N      E     + +SK+    QKG    DL K               +L
Sbjct: 163 PALTKERVQNALKAAVEKAAADATSSKK----QKGKPGGDLRK---------------SL 203

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              + E     P L +H +     DT + P+  L+    L++     LV ++ +    ++
Sbjct: 204 AVSITE---LPPVLVDHWLHTNNFDTTVKPHEVLANETLLDE-----LVKSLQEARKIVE 255

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSR---EFVK 350
           ++ S +    GYI  + +   +     E   + +   +YD+F P +  + ++    E ++
Sbjct: 256 ELTSSETCT-GYIFAKRRERPEGTEVDEETKTKRDNLLYDDFHPFIPYKLKNDPAIEVLE 314

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ ++  +DEF+S +E QR E +   +E  A  KL     +Q  R+  L++    + + A
Sbjct: 315 FQGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             IE N+E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   L L  N ++
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVAEIIKLPLNLAENLIT 434

Query: 470 LLL---------------------------SNNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
           L L                           + +  +     K L    VE++L LS  +N
Sbjct: 435 LELAEEEFEPEEDDPYETDDDDDSALGDDEATSAAKGKQSNKAL---NVEINLGLSPWSN 491

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFE 558
           AR +++ +K    K+EKT    SKA K AE+K     +  + QEK +  +  +RK  WFE
Sbjct: 492 AREYFDQRKTAAVKEEKTQQQASKALKNAEQKITEDLKKGLKQEKAL--LQPIRKQMWFE 549

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPV 616
           KF WFISS+ YLVI G+DAQQNE I K+Y+ KGD+Y HADLHGASS +IKN+   P+ P+
Sbjct: 550 KFTWFISSDGYLVIGGKDAQQNETIYKKYLRKGDIYCHADLHGASSVIIKNNPKTPDAPI 609

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
           PP TL+QAG   VC S AWDSK    AWWV   QVSK+APTGE+L  GSFMIRGKKNFLP
Sbjct: 610 PPATLSQAGSLAVCSSNAWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLP 669

Query: 677 PHPLIMGFGLLFRLDESSLGSHLNER-----RVRGEEE----------GMDDFEDSGHHK 721
           P  L++G GL FR+ E S   H+  R        G+E           G  D  D+GH  
Sbjct: 670 PAQLLLGLGLAFRISEESKAKHVKHRLHDVDSAIGDEGSGAPQSVGMMGDADEPDAGH-- 727

Query: 722 ENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNA 757
             SD+ S+ +  DEKP  ES   P  A       NA
Sbjct: 728 --SDVPSDYETEDEKPDEESRDNPLQAFKKGEGRNA 761



 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            ETAE +++   M EE +  + E+E  ++  +D + G PLP D +L +IPVC P++A+  Y
Sbjct: 916  ETAEHEEIRRVMMEEGVEMLDEDEASQMTVLDAIVGTPLPGDEILEIIPVCAPWNALGRY 975

Query: 1051 KYRVKIIPGTAKKGKGIQ 1068
            KY+ K+ PG  KKGK ++
Sbjct: 976  KYKAKLQPGATKKGKAVK 993


>gi|358369883|dbj|GAA86496.1| DUF814 domain protein [Aspergillus kawachii IFO 4308]
          Length = 1157

 Score =  366 bits (939), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 240/736 (32%), Positives = 377/736 (51%), Gaps = 84/736 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 51  MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 102

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 103 DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 160

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 161 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 212

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G       K S K + D        L
Sbjct: 213 -----------PDITRERVQETVEKAK-ALFSQEGS----APKKSKKKNAD-------VL 249

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+   V+ V +      D +
Sbjct: 250 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLTKVVDVLEAAKVETDKL 307

Query: 300 SGDIVPEGYILM-QNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+  ++     D P      + +    +Y++F P    QF  +  V   ++
Sbjct: 308 ATEKSHPGYIVAKEDTRPSADSPAQGEEDAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 367

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+ A  KL  +  +   R+  LK+  +  ++ A 
Sbjct: 368 PSFNATVDEYFSSIETQKLESRLTEREETAKRKLEAVRQEHAKRIGALKEVQELHIRKAG 427

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 428 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 487

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     ++                      +++DL LS  ANA ++YE
Sbjct: 488 MLGESGEEQDEGEDLFSDDESESEDEQEEAAKAQKQSNNMLTIDIDLGLSPWANATQYYE 547

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 548 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 605

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRDA Q+E++ +RY+ KGDV+VHADL GA+  ++KN  +    P+PP TL+
Sbjct: 606 SSEGYLVLGGRDAMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSSNAPIPPSTLS 665

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 666 QAGNLCVATSSAWDSKAIMSAYWVTASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 725

Query: 683 GFGLLFRLDESSLGSH 698
           GFG++F++ + SL +H
Sbjct: 726 GFGVMFQVSKESLRNH 741



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P P D +L  IPVC P++A+  YKYR+K+ PGT KKGK ++
Sbjct: 1020 IPALVGTPHPEDDILAAIPVCAPWAALGRYKYRIKLQPGTVKKGKAVK 1067


>gi|169783790|ref|XP_001826357.1| hypothetical protein AOR_1_1306054 [Aspergillus oryzae RIB40]
 gi|83775101|dbj|BAE65224.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1103

 Score =  366 bits (939), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 259/833 (31%), Positives = 408/833 (48%), Gaps = 134/833 (16%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS---HRDDDKGVAIM-------SRHRYPTEICRVFE 170
            LE +A GNI++TD E  +L L R       ++  V I        + H  P EI     
Sbjct: 111 FLEFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYHGIP-EITLDRI 169

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           R T  K  A                 EDG                       K S K + 
Sbjct: 170 RETLEKAKALF-------------AREDG---------------------APKKSKKKNA 195

Query: 231 DGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
           D        L+  L +    Y P L +H  +   + P   L +V  L+D ++   V  V 
Sbjct: 196 D-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQEVNGVL 246

Query: 290 KFEDWLQDVISGDIVPEGYILMQ------NKHLGKDHPPTESGSSTQIYDEFCPLLLNQF 343
           +        +S      GYI+ +      ++   ++  P+E+G+   +Y++F P    QF
Sbjct: 247 QEAQNENTRLSTQESHPGYIVAKEDNRSVSQSANENEKPSETGNL--LYEDFHPFKPRQF 304

Query: 344 RSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
             +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E ++  LK
Sbjct: 305 EGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALK 364

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID- 459
           ++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I  
Sbjct: 365 EQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKL 424

Query: 460 KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVEVDLALS 498
            L L  N ++LLL    DE D+                    E +  P V  +++DL +S
Sbjct: 425 PLKLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGIS 484

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKVHW 556
             ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +   R+  W
Sbjct: 485 PWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFW 544

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQ 614
           FEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P  
Sbjct: 545 FEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTA 604

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F+++G+KNF
Sbjct: 605 PIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNF 664

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHLNE----------RRVRGEEEGMDDFEDSGHHKE-- 722
           L P  L++GFG+ F++ + SL +H              R  G E+  +  + S   +E  
Sbjct: 665 LAPSQLVLGFGVTFQISKDSLKNHKTHFVDEPEAPEATREGGHEQAGESTQRSEQQQETE 724

Query: 723 ----------------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN 759
                           +SD E+E+D+ D  P    L    S  P   HT A+ 
Sbjct: 725 EAHKPSLDPKEQAEEQSSDSENEQDNADSLPARNPLQRGPSESP---HTEAAQ 774



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 34/51 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +  L G P P D +L  IPVC P+SA+  Y+Y+VK+ PGT KKGK ++
Sbjct: 959  LEWIPALIGTPRPEDEILAAIPVCAPWSALSRYRYKVKLQPGTVKKGKAVK 1009


>gi|212529000|ref|XP_002144657.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
 gi|210074055|gb|EEA28142.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
          Length = 1117

 Score =  363 bits (931), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 241/744 (32%), Positives = 385/744 (51%), Gaps = 92/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   +IG+R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSIDVKIICQELSTSIIGLRVSNIYDLSSRIFLFKLAKPD--------HRKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   +TPSGF  +LRK ++TRR+  V+QLG DR+I   F  G+   ++
Sbjct: 53  DSGFRCHLTEYSRTTASTPSGFVSRLRKCLKTRRVTSVQQLGTDRVIDIVFSDGL--FHI 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD+E  +L L R+           +  +   +I   +    A   H  
Sbjct: 111 YLEFFAGGNIILTDAENKILALFRT--------VAAAGEQDEVKIGLTYAVEKAQYYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG--QKGGKSFDLSKNSNKNSNDGARAKQP 238
              S+E       ++      V++A +++ G   +K  K  D+ +               
Sbjct: 163 PPLSEE-------RLRTTIQKVADADQQSAGSAQKKSKKKVDVFR--------------- 200

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
             K +      + P L E     TG   ++ L +V  LED +     + V +     Q +
Sbjct: 201 --KAISSGFPEFPPLLLEDAFAATGFDSSVTLKQV--LEDESTFQKAMNVLRE---AQKI 253

Query: 299 ISG--DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFET 353
           I+G  +   +GYI+ + +   +D     +     ++++F P    QF  +     +++++
Sbjct: 254 IAGLSEGEKKGYIVAKERAKKEDQQVDSTSKENLLFEDFHPFRPRQFEGKPGYHILEYDS 313

Query: 354 FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           F+  +DE++S IESQ+ E +    E+ A  KL     D +NR   LKQ  +  ++ AE I
Sbjct: 314 FNKTVDEYFSSIESQKLESRLAEHEETAKRKLETARADHQNRAGALKQAQELHIRKAEAI 373

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL 472
           + N+  V  A  AV   +A  M W ++AR+++ E++  NPVA  I   L L  N ++LLL
Sbjct: 374 QANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQQRNNPVAQTIKLPLKLYENTITLLL 433

Query: 473 --------------------------SNNLDEMDDEEKTLPVE--KVEVDLALSAHANAR 504
                                     S N  E D+  K    E   +++DL+LS  +NA 
Sbjct: 434 SEENTEVEEEQEEFSESEPEVSEDSDSENEIEKDEGPKQKIAEPLAIDIDLSLSPWSNAT 493

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKF 560
           ++YE K+    K++KTI +  KA K+ EKK     +  + QEK V   S  RK  WFEK+
Sbjct: 494 QYYEQKRTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPS--RKPFWFEKY 551

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPP 618
            +FISSE YLV+ GRD+ Q E++ +RY+ KGDV+VHADL GA+  ++KN    P+ P+PP
Sbjct: 552 LYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTPDAPIPP 611

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG  +V  S+AW++K +  +WWV+ HQVS+T   GE L  G+FM++G+KN+L P 
Sbjct: 612 GTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLANGAFMVKGEKNYLAPG 671

Query: 679 PLIMGFGLLFRLDESSLGSHLNER 702
             I+GF +LF++ + S+ +H   R
Sbjct: 672 QPILGFAVLFQISKESVQNHRKHR 695



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 46/84 (54%), Gaps = 6/84 (7%)

Query: 995  AEMD--KVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKY 1052
            AE+D  K A  +E + +  E+    L+ +  L G PLP D +L  IPV  P+S V  +KY
Sbjct: 926  AEVDGEKEAYNDETVKQEAED----LSWLPALIGTPLPEDEVLAAIPVAAPWSVVARFKY 981

Query: 1053 RVKIIPGTAKKGKGIQIFYSLLLL 1076
            R K+  G  KKGK I+   S  ++
Sbjct: 982  RAKLQAGNIKKGKAIKEILSHWII 1005


>gi|119480773|ref|XP_001260415.1| hypothetical protein NFIA_084700 [Neosartorya fischeri NRRL 181]
 gi|119408569|gb|EAW18518.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 1116

 Score =  363 bits (931), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 246/736 (33%), Positives = 381/736 (51%), Gaps = 88/736 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRV-FERTTASK--L 177
            LE +A GNI++TD ++ +LTL R       GV          E  RV F+ T  +K   
Sbjct: 111 FLEFFAGGNIIITDRDYNILTLFRQV---PAGVG--------EEEMRVGFKYTVTNKQNY 159

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
           H        P+    D++ E       AS      Q+G       K S K + D      
Sbjct: 160 HGV------PEITL-DRIKETLEKAKEAS-----AQEG----TAPKKSKKKNVD------ 197

Query: 238 PTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             L+  L +    Y P L +H      + P   L +V  L D+A+   V  V K    + 
Sbjct: 198 -VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLEKV--LGDDALMEQVNGVLKEAQSVT 254

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSR---EFV 349
             +S      GYI+ +           ++G  +Q    +Y++F P    QF  +     +
Sbjct: 255 IKLSAKEDHPGYIIAKEDKRPTAESTADTGDPSQKAGLLYEDFHPFRPRQFEGKPEVTIL 314

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+  +  V+ 
Sbjct: 315 EFSTFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRK 374

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N+  V   + AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N +
Sbjct: 375 AAAIEDNVYRVQEVMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTI 434

Query: 469 SLLLSNNLDEMDDEE--------------------KTLPVEKVEVDLALSAHANARRWYE 508
           +L+L    +E D  +                    K   +  +++DL LS  ANA ++YE
Sbjct: 435 TLVLGEASEEQDAADDLFSDESEEESESEEQEAARKAPEMLTIDIDLGLSPWANATQYYE 494

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 495 QKKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFWFEKFLFFI 552

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLN 622
           SSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ P+PP TL+
Sbjct: 553 SSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLS 612

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F ++G+KNFL P  L++
Sbjct: 613 QAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEVKGEKNFLAPSQLVL 671

Query: 683 GFGLLFRLDESSLGSH 698
           GF ++F++ + SL +H
Sbjct: 672 GFAVMFQISKESLKNH 687



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L+ +  L G P P D +L  IP+C P++A+  YKYRVK+ PGT KKGK ++
Sbjct: 975  LSWIPALIGTPRPEDEILAAIPICAPWAALGRYKYRVKLQPGTVKKGKAVK 1025


>gi|338717943|ref|XP_001496390.3| PREDICTED: nuclear export mediator factor NEMF [Equus caballus]
          Length = 827

 Score =  362 bits (929), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 195/441 (44%), Positives = 270/441 (61%), Gaps = 61/441 (13%)

Query: 307 GYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
           GYI+ + +      P  E    TQ    Y+EF P L +Q     +++FE+FD A+DEFYS
Sbjct: 17  GYIIQKREM----KPSLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEFYS 72

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
           KIE Q+ + +   +E  A  KL+ +  D E+R+  L+Q  +      ELIE NL+ VD A
Sbjct: 73  KIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMNLQIVDRA 132

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----NLDEM 479
           I  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    + +E 
Sbjct: 133 IQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYLLSEEED 192

Query: 480 DDEEKTLPVEK----------------------------VEVDLALSAHANARRWYELKK 511
           DD +  + VEK                            V+VDL+LSA+ANA+++Y+ K+
Sbjct: 193 DDVDGDISVEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKKYYDHKR 252

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
               K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+
Sbjct: 253 YAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLI 312

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I GRD QQNE+IVKRY++ G                      +P+PP TL +AG   +C+
Sbjct: 313 IGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTMALCY 350

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++D
Sbjct: 351 SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 410

Query: 692 ESSLGSHLNERRVRGEEEGMD 712
           ES +  H  ER+VR ++E M+
Sbjct: 411 ESCVWRHRGERKVRVQDEDME 431


>gi|351702906|gb|EHB05825.1| Serologically defined colon cancer antigen 1, partial
           [Heterocephalus glaber]
          Length = 762

 Score =  361 bits (927), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 186/371 (50%), Positives = 247/371 (66%), Gaps = 33/371 (8%)

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ 
Sbjct: 1   KEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQVVDRAIQVVRSALANQID 60

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----NLDEMDDEEKTLPVEK-- 490
           W ++  +VKE +  G+PVA  I +L L+ N +++LL N    + +E DD +  + VEK  
Sbjct: 61  WTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDADGDVSVEKNE 120

Query: 491 --------------------------VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                                     V+VDL+LSA+ANA+++Y+ K+    K +KT+ A 
Sbjct: 121 TEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAA 180

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+I GRD QQNEMIV
Sbjct: 181 EKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIV 240

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           KRY++ GD+YVHADLHGA+S VIKN   E P+PP TL + G   +C+S AWD++++TSAW
Sbjct: 241 KRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEVGTMALCYSAAWDARVITSAW 299

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+V
Sbjct: 300 WVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKV 359

Query: 705 RGEEEGMDDFE 715
           R ++E ++  E
Sbjct: 360 RVQDEDVETLE 370


>gi|327303108|ref|XP_003236246.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
 gi|326461588|gb|EGD87041.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
          Length = 1098

 Score =  360 bits (924), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 309/1111 (27%), Positives = 507/1111 (45%), Gaps = 209/1111 (18%)

Query: 14   EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
            +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9    DVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69   TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
            T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   GM   Y  LE +A G
Sbjct: 61   TESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRIIEFEISDGMFRLY--LEFFAAG 118

Query: 129  NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
            N++LTD+++              G+  + R   P       +     +L + L       
Sbjct: 119  NLILTDAKY--------------GIVALLRQVAPGSDIEEVKIGMTYRLESKL------- 157

Query: 189  ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
                     + N +   + E L              S    ++G++  + +L     E  
Sbjct: 158  ---------NYNGIPPLTIERL-------------KSALEQDNGSKVLKRSLYFGFPE-- 193

Query: 249  GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
             Y P L +H     G   + KL     L DN +   ++ V +  D +   +S D    GY
Sbjct: 194  -YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALSSDTQQAGY 250

Query: 309  ILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQFR---SREFVKFETFDAALDE 360
            I+ +N         ++ G  TQ      + +F P   +Q +   +   ++F  F++A+D 
Sbjct: 251  IIAKNVAPAA----SDVGGGTQTAPMAEFRDFHPFEPSQSKEAPNTTILRFGNFNSAVDR 306

Query: 361  FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            ++S IE+Q+ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL  V
Sbjct: 307  YFSSIEAQKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIETNLPRV 366

Query: 421  DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEM 479
            + A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+    E 
Sbjct: 367  EEAMNAVNGLVAQSMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLNEGGTED 426

Query: 480  DDEEKTL----------------------PVEK-------------------VEVDLALS 498
            D+EE+                        P +K                   +++DL +S
Sbjct: 427  DEEEEEEEEPEEEEEEDDDDGYGDDEYERPSQKKHSAKPLKEKKEKKDTRLSIDIDLGIS 486

Query: 499  AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKV 554
              ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   R  
Sbjct: 487  PWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNP 544

Query: 555  HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
             WFEKF +FISS+ YLVI GRD QQ+E++ +RY+ KGD+YVH DL G    ++KN     
Sbjct: 545  TWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYLKKGDIYVHTDLDGGVPLIVKNKPDAP 604

Query: 615  PVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
              P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+G+K
Sbjct: 605  DDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEK 664

Query: 673  NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDD 732
            N +PP  +++GF +LF++   SL                       +H ++     E D 
Sbjct: 665  NHIPPGQIVLGFAVLFQISNRSL----------------------QNHTKSLPSAPEDDV 702

Query: 733  TDEKPVAESLSVPNSAHPAPSHTNASNVDSHE---FPAEDKTISNGIDSKIFDIARNVAA 789
            T+E+P++ +  +  S         A+  D  E      ED+      D+K  DI+    A
Sbjct: 703  TNEEPISSTADMDQS--------EANQSDQEEDVPLEQEDEHQVESEDAKK-DISDERVA 753

Query: 790  PVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSE-EDKHVERTATVRDKPYISKAE 847
            P+  QL+ + ++ +L   +A ++      E  +++ S+ E++ VE  +   ++   S   
Sbjct: 754  PLGEQLQSIHVEGSLDSNAAQVT------EADKYEASQAENQPVEGPSKNAEETEDSGES 807

Query: 848  RRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKY 907
              + +    S++ + +           + + +  VR           G++GK KK+  KY
Sbjct: 808  NDESRLATSSAIRESRSSTPSVISSSGTQKSKPPVR-----------GKRGKAKKLATKY 856

Query: 908  GDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGH 967
             DQDEE+RN+ + LL SA         P+ + A    E+       +A K   + +    
Sbjct: 857  KDQDEEDRNLALRLLGSAAGPSTPTTKPKTK-ADIEAER-------EAQKERRRAQHERA 908

Query: 968  LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGN 1027
            L    ++    + + VED                         GEE K   + +  L G 
Sbjct: 909  LQAVKRQQEAFTRNSVEDAS-----------------------GEEHKLDFSILPALVGT 945

Query: 1028 PLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
            P+  D +   IPVC P++A+  YKYR K+ P
Sbjct: 946  PVSGDEIEAAIPVCAPWTALGQYKYRAKLQP 976


>gi|238493615|ref|XP_002378044.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696538|gb|EED52880.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 1105

 Score =  360 bits (923), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 241/746 (32%), Positives = 381/746 (51%), Gaps = 105/746 (14%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLL 58
           +K R ++ DV    + L   ++ +R SN+YDLS   + ++FKL             +  L
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSVCRIFLFKLAKPD--------HRKQL 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM  +
Sbjct: 53  IVDSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRS---HRDDDKGVAIM-------SRHRYPTEICRV 168
           ++ LE +A GNI++TD E  +L L R       ++  V I        + H  P EI   
Sbjct: 111 HMFLEFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYHGIP-EITLD 169

Query: 169 FERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
             R T  K  A                 EDG                       K S K 
Sbjct: 170 RIRETLEKAKALF-------------AREDG---------------------APKKSKKK 195

Query: 229 SNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
           + D        L+  L +    Y P L +H  +   + P   L +V  L+D ++   V  
Sbjct: 196 NAD-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQEVNG 246

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQN------KHLGKDHPPTESGSSTQIYDEFCPLLLN 341
           V +        +S      GYI+ ++      +   ++  P+E+G+   +Y++F P    
Sbjct: 247 VLQEAQNENTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNL--LYEDFHPFKPR 304

Query: 342 QFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
           QF  +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E ++  
Sbjct: 305 QFEGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGA 364

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           LK++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I
Sbjct: 365 LKEQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARII 424

Query: 459 D-KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVEVDLA 496
              L L  N ++LLL    DE D+                    E +  P V  +++DL 
Sbjct: 425 KLPLKLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLG 484

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKV 554
           +S  ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +   R+ 
Sbjct: 485 ISPWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQP 544

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--P 612
            WFEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P
Sbjct: 545 FWFEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDP 604

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
             P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F+++G+K
Sbjct: 605 TAPIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEK 664

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSH 698
           NFL P  L++GFG+ F++ + SL +H
Sbjct: 665 NFLAPSQLVLGFGVTFQISKDSLKNH 690



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 34/51 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +  L G P P D +L  IPVC P+SA+  Y+Y+VK+ PGT KKGK ++
Sbjct: 961  LEWIPALIGTPRPEDEILAAIPVCAPWSALSRYRYKVKLQPGTVKKGKAVK 1011


>gi|159129335|gb|EDP54449.1| DUF814 domain protein, putative [Aspergillus fumigatus A1163]
          Length = 1116

 Score =  359 bits (922), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 242/741 (32%), Positives = 375/741 (50%), Gaps = 98/741 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPDNRKQ--------LVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR------SHRDDDKGV--AIMSRHRYPTEICRVFERT 172
            LE +A GNI++TD E+ +LTL R         +   G+   + ++  Y       FER 
Sbjct: 111 FLEFFAGGNIIITDREYNILTLFRQVPAGVGEEEMRVGLKYTVTNKQNYHGVPEITFER- 169

Query: 173 TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
               +   L  +KE  A E       G     + K+N+                      
Sbjct: 170 ----IKETLEKAKEASAQE-------GTAPKKSKKKNVD--------------------- 197

Query: 233 ARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
                  L+  L +    Y P L +H      + P   L   N L D+ +   V  V K 
Sbjct: 198 ------VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLE--NVLGDDTLMEQVNGVLKE 249

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSRE 347
              +   +S      GYI+ +           ++G  ++     Y++F P    QF    
Sbjct: 250 AQSVTIKLSAKEDHPGYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHPFRPRQFEGNP 309

Query: 348 FVK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
            VK   F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+  +
Sbjct: 310 EVKILEFSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQE 369

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
             V+ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L
Sbjct: 370 IHVRKAAAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKL 429

Query: 464 ERNCMSLLLSNNLDEMDDEE--------------------KTLPVEKVEVDLALSAHANA 503
             N ++L+L    +E D  +                    K   +  +++DL LS  ANA
Sbjct: 430 YENTITLVLGEASEEQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANA 489

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
            ++YE KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEK
Sbjct: 490 TQYYEQKKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFWFEK 547

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ P+P
Sbjct: 548 FLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIP 607

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F I+G+KNFL P
Sbjct: 608 PSTLSQAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAP 666

Query: 678 HPLIMGFGLLFRLDESSLGSH 698
             L++GF ++F++ ++SL +H
Sbjct: 667 SQLVLGFAVMFQISKNSLKNH 687



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L+ +  L G P P D +L  IP+C P++A+  YKYRVK+ PGT KKGK ++
Sbjct: 975  LSWIPALIGTPRPEDEILAAIPICAPWAALVRYKYRVKLQPGTVKKGKAVK 1025


>gi|71001140|ref|XP_755251.1| DUF814 domain protein [Aspergillus fumigatus Af293]
 gi|66852889|gb|EAL93213.1| DUF814 domain protein, putative [Aspergillus fumigatus Af293]
          Length = 1116

 Score =  359 bits (921), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 244/741 (32%), Positives = 375/741 (50%), Gaps = 98/741 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPDNRKQ--------LVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR------SHRDDDKGV--AIMSRHRYPTEICRVFERT 172
            LE +A GNI++TD E+ +LTL R         +   G+   + ++  Y       FER 
Sbjct: 111 FLEFFAGGNIIITDREYNILTLFRQVPAGVGEEEMRVGLKYTVTNKQNYHGVPEITFER- 169

Query: 173 TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
               +   L  +KE  A E       G     + K+N+                      
Sbjct: 170 ----IKETLEKAKEASAQE-------GTAPKKSKKKNVD--------------------- 197

Query: 233 ARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
                  L+  L +    Y P L +H      + P   L   N L D+ +   V  V K 
Sbjct: 198 ------VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLE--NVLGDDTLMEQVNGVLKE 249

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSRE 347
              +   +S      GYI+ +           ++G  ++     Y++F P    QF    
Sbjct: 250 AQSVTIKLSAKEDHPGYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHPFRPRQFEGNP 309

Query: 348 FVK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
            VK   F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+  +
Sbjct: 310 EVKILEFSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQE 369

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
             V+ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L
Sbjct: 370 IHVRKAAAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKL 429

Query: 464 ERNCMSLLL---SNNLDEMDD-----------------EEKTLPVEKVEVDLALSAHANA 503
             N ++L+L   S   D  DD                   K   +  +++DL LS  ANA
Sbjct: 430 YENTITLVLGEASREQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANA 489

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
            ++YE KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEK
Sbjct: 490 TQYYEQKKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFWFEK 547

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ P+P
Sbjct: 548 FLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIP 607

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F I+G+KNFL P
Sbjct: 608 PSTLSQAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAP 666

Query: 678 HPLIMGFGLLFRLDESSLGSH 698
             L++GF ++F++ ++SL +H
Sbjct: 667 SQLVLGFAVMFQISKNSLKNH 687



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L+ +  L G P P D +L  IP+C P++A+  YKYRVK+ PGT KKGK ++
Sbjct: 975  LSWIPALIGTPRPEDEILAAIPICAPWAALVRYKYRVKLQPGTVKKGKAVK 1025


>gi|302665563|ref|XP_003024391.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
 gi|291188443|gb|EFE43780.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
          Length = 1074

 Score =  358 bits (918), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 309/1097 (28%), Positives = 499/1097 (45%), Gaps = 209/1097 (19%)

Query: 35   KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
            +T++FKL        +    K  L++ +G   H T  +R   + PS    +LRK ++TRR
Sbjct: 12   RTFLFKL--------ALPDIKKQLIINAGFHCHLTESSRTTADAPSHLVSRLRKLLKTRR 63

Query: 95   LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR--SHRDDDKG 152
            +  VRQ+G DRII F+   G+   Y  LE +A GN++LTD+++ ++ LLR  +   D + 
Sbjct: 64   ITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAGNLILTDAKYGIVALLRQVAPGSDIEE 121

Query: 153  VAIMSRHRYPTEI-CRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLG 211
            V I   +R  +++        T  +L +AL                + +NVS A K +L 
Sbjct: 122  VKIGMTYRLESKLNYNGIPPLTIERLKSAL----------------EQDNVSKALKRSL- 164

Query: 212  GQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLS 271
                   F   +                          Y P L +H     G   + KL 
Sbjct: 165  ------YFGFPE--------------------------YPPTLLDHAFNVVGF--DSKLQ 190

Query: 272  EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
                L DN +   ++ V +  D +   +S D    GYI+ +N         ++ G  TQ 
Sbjct: 191  PAQILTDNNLVQKLMEVLQEADRVNTALSSDTQQAGYIIAKNVAPAA----SDVGGGTQT 246

Query: 332  -----YDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
                 + +F P   +Q +   +   ++FE F++A+D ++S IE+++ E +   KEDAA  
Sbjct: 247  APVTEFRDFHPFEPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARK 306

Query: 384  KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
            KL     + E RV+ LK++ +  V+ A  IE NL  V+ A+ AV   +A  M W ++AR+
Sbjct: 307  KLESTKREHEKRVNALKEKQEFHVRKARAIETNLPQVEEAMNAVNGLVAQGMDWVEIARL 366

Query: 444  VKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKTL---------------- 486
            ++ E+  GNPVA  I   L L  N +++LL+    E D+EE+                  
Sbjct: 367  IEMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEDESEEEEEDDDDDGYGD 426

Query: 487  -----PVEK-------------------VEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                 P +K                   +++DL +S  ANAR++Y+ KK    K+EKT+ 
Sbjct: 427  DEYERPSQKKHSAKPLKEKKGKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLK 486

Query: 523  AHSKAFKAAEKKTR----LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
            A +KA K+ E+K +    + + QEK V  +   R   WFEKF +FISS+ YLVI GRD Q
Sbjct: 487  ASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNPTWFEKFFFFISSDGYLVIGGRDHQ 544

Query: 579  QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL--TLNQAGCFTVCHSQAWD 636
            Q+E++ +RYM KGD+YVH DL G    ++KN       P    T++QA  +TV  S+AWD
Sbjct: 545  QDEILFQRYMKKGDIYVHTDLDGGVPLIVKNKPDAPDDPIPPNTISQASAYTVASSKAWD 604

Query: 637  SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
            +K     WWV+  QVSK   TG+ L  G FMI+G+KN +PP  +++GF +LF++   S+ 
Sbjct: 605  TKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNRSVQ 664

Query: 697  SHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN 756
            +H  + ++   E G+ + E      +    E+ + D +E        VP           
Sbjct: 665  NH-TKSQLSAPEGGVTNEEPISSTADMDQPEANQSDQEE-------DVP----------- 705

Query: 757  ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL-IDRALGLGSASISSTKH 815
                D H+  +ED            DI+    AP+  Q++ + +D +L   +A +     
Sbjct: 706  LEQEDEHQVESEDAKK---------DISDERVAPLGEQMQSIHVDDSLDSSAAQV----- 751

Query: 816  GIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDAS 875
                     +E DK  +  +   ++P    ++  +  +  G S  + ++       +  +
Sbjct: 752  ---------TEADK--DEASQAENQPVEGPSKNAEETEDSGESDDESRLATPSATQESRA 800

Query: 876  SQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDP 935
            S P  I      +     RG++GK KK+  KY DQDEE+R + + L  SA          
Sbjct: 801  STPLVISSSGTQKSKPPVRGKRGKAKKLATKYKDQDEEDRKLALRLPGSAA--------- 851

Query: 936  QNENASTHKEKKPAISPVDAPKVCYK-CKKAGH---LSKDCKEHPDDSSHGVEDNPCVGL 991
                 ST   K    + ++A +   K  ++A H   L    ++    + + VED      
Sbjct: 852  ---GPSTPTTKPKTKADIEAEREAQKERRRAQHERALQAVKRQQEAFTRNSVEDAS---- 904

Query: 992  DETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYK 1051
                               GEE K   + +  L G P+  D +   IPVC P++A+  YK
Sbjct: 905  -------------------GEEHKLDFSILPALVGTPVDGDEIEAAIPVCAPWAALGQYK 945

Query: 1052 YRVKIIPGTAKKGKGIQ 1068
            YR K+ PG  KKGK ++
Sbjct: 946  YRAKLQPGKIKKGKAVK 962


>gi|340059520|emb|CCC53907.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1048

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 238/754 (31%), Positives = 379/754 (50%), Gaps = 125/754 (16%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  N+YD+ PK ++FK  +       GE +K LLL
Sbjct: 1   MVKQRMTALDVRATVEEMRTELLGLRLMNIYDIPPKIFLFKFGH-------GEKKKTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            E+G+RLH T + R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ENGLRLHLTQFVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L  LRSHRD+  GV I  R  YP              + 
Sbjct: 113 HIIVELFSKGNVILTDHEYRILLPLRSHRDE--GVNIFVRELYP--------------VT 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            +   ++  D  E + + E                       L +  +   + GA  +  
Sbjct: 157 PSFDQNRLRDMQESECIEE-----------------------LRREWSVVFSRGADYE-- 191

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T K++L     +GP+L++H+++ TG V N+K S +    D   + L+  +   E W    
Sbjct: 192 TTKSMLSGTHHFGPSLADHVLVVTG-VKNVKKSSMTCSGDELFEALLPGL--LEAWR--- 245

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----------------------YDEFC 336
           I+   +  G  L++N   GK    ++SG++ +                       YD+F 
Sbjct: 246 IAISPLSSGGFLIKNCKSGKPRCDSQSGTAGEQENSAVDTVSASGPGKRNLQGEGYDDFT 305

Query: 337 PLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
           P+LL Q+      K    +F +  D F+   E  + EQQ + K  A   K  +   D + 
Sbjct: 306 PVLLAQYDGENVTKSYLPSFGSVCDTFFLHTEEGKIEQQKEKKTVAVMSKKERCERDHQR 365

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L++    + +  EL+  N E +DAAI  +  ALA+ + W+ L R++K+    G+PV
Sbjct: 366 RIEALERMELENARKGELLIQNAEKIDAAIGLINGALASGIQWDALRRLLKQRHAEGHPV 425

Query: 455 AGLIDKLYLERNCMSLLL-SNNLDEMDDEEKTLPVEK-----------VEVDLALSAHAN 502
           A ++ +L+L+RN MS+L+ +N+ D+  DE  ++  E            +EVDL+ +AHAN
Sbjct: 426 AYMVHELFLDRNNMSVLVETNDDDDCIDEGGSVSYESKVDDCNKPPWVIEVDLSKTAHAN 485

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           A  ++  KK   +K ++T+ A ++A + AEKK      + +TV +I+  R   W+EKFNW
Sbjct: 486 AAAYFSQKKANRAKLDRTVAATAQAMRGAEKKGERMAARHQTVKDIATERHRCWWEKFNW 545

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ-------- 614
           F +S   LV+ G D Q  E++V+R M  GD++VH D+ GA   ++++ R           
Sbjct: 546 FRTSCGDLVLLGHDVQSTELLVRRVMCLGDLFVHCDVDGALPCILRSGRSVWCAAASGSQ 605

Query: 615 ------------------PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
                              V   +L +A  + V  S AW+ K    AWWVY  Q+     
Sbjct: 606 CVDNWMEKNIGSTRSDMLAVHVTSLREAAAWCVSRSSAWEGKFNVGAWWVYASQIIGGTA 665

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           TG YL        G+K+ + P PL +G GLLFR+
Sbjct: 666 TGCYL------FSGEKHHVLPQPLALGCGLLFRV 693



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 95/221 (42%), Gaps = 52/221 (23%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS--AGKVQKNDGDPQNENASTHKEKKP 948
            ++++ Q+ KLKK+++KY DQDEE+R +  ALL      K+Q+   + +   A   K   P
Sbjct: 806  QLTKHQRKKLKKIQQKYKDQDEEDR-LYGALLNGNHLSKIQQEMLEVERARAKIDKRAGP 864

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
                   P V     ++GH+ +             E+ P   +D   E+D  A  E ++ 
Sbjct: 865  H------PCVTTSSDESGHVHE-------------EECPDTAMDCGREIDSNAGSECELE 905

Query: 1009 EIGEE---EKGRLND----------VD-----------------YLTGNPLPSDILLYVI 1038
             +  E   EKG  +D          VD                 + T  P  SD + Y +
Sbjct: 906  RVLPEHGLEKGNQSDATTQPLTTTGVDLELARKSRNTEFIQEWAHFTSRPQASDTVQYAV 965

Query: 1039 PVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLS 1079
             VC P  +V SYKYR+++  G+AKKG+      S    M S
Sbjct: 966  AVCAPIGSVISYKYRMELSLGSAKKGQVANSIISYFTSMAS 1006


>gi|258574555|ref|XP_002541459.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237901725|gb|EEP76126.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 1070

 Score =  356 bits (913), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 325/1087 (29%), Positives = 504/1087 (46%), Gaps = 195/1087 (17%)

Query: 35   KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
            + Y+FKL       +        ++++SG R H T Y R     PS F  +LR+ +++RR
Sbjct: 12   RIYLFKLQKPDVRKQ--------IVIDSGFRCHLTEYTRATAPAPSHFVSRLRQFLKSRR 63

Query: 95   LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR----SHRDDD 150
            +  V Q+G DRII  +F  G    +++LE +A GNI+LTD+EF +++LLR        D+
Sbjct: 64   VTAVSQVGTDRIIHIEFSDGQ--FHLLLEFFASGNIILTDNEFKIVSLLRIVPEGEEQDE 121

Query: 151  KGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENL 210
              + ++ R         V    +  +L  AL   KE DA++P+                 
Sbjct: 122  IRIGLIYRLDNKQNYGGV-PPLSVDRLRTALERGKERDASQPEAT--------------- 165

Query: 211  GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII----LDTGLVP 266
                       +K + K  ++  R     L     E   Y P L EH +     D+ L P
Sbjct: 166  -----------TKRAKKKQDEALRR---ALSLGFPE---YPPLLLEHALHVTGFDSTLRP 208

Query: 267  NMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP----EGYILMQNKHLGKDHPP 322
            N  L E + + D  + VL  A           +SG++       GYI+ +N++   + P 
Sbjct: 209  NQIL-EASDMIDELMHVLEEA---------QRVSGELSTAEQTRGYIITRNENKPSEPPT 258

Query: 323  --TESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAK 377
              TE+      Y ++ P    QF        +  E+F+ A+DE+YS +E+Q+ E +   +
Sbjct: 259  QGTETKPDKSSYIDYHPFEPKQFADNPDTRILPLESFNKAVDEYYSSVEAQKLESRLTDR 318

Query: 378  EDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSW 437
            E+    KL     D E RV  LK+    +V+ A+ IE NL  V+ AI A    +A  M W
Sbjct: 319  EETMKRKLEATKRDHEKRVGALKEVQQLNVRKAQAIEANLSKVEEAINAANSLIAQGMDW 378

Query: 438  EDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL-------------------- 476
             ++AR+++ E+   NP+A +I   L L  N +++LL + +                    
Sbjct: 379  VEIARLIEMEQSRRNPIAKMIKLPLKLYENTITILLPDGMPVDDESESESEDEDEEDESG 438

Query: 477  DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT- 535
            DE + + +   V  +++DLAL+  ANA ++Y+ KK    K++KTI A  KA K+AEKK  
Sbjct: 439  DEPEKKSREPEVLSIDIDLALTPWANASQYYDQKKTAAMKEDKTIKASKKALKSAEKKVT 498

Query: 536  ---RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
               +  + QEK V  +   R   WFEKF +FISS+ YLV+ G+DA+Q+E++  R++ KGD
Sbjct: 499  ADLKQGLKQEKPV--LRPARTPFWFEKFFFFISSDGYLVLGGQDARQDEILYHRHLQKGD 556

Query: 593  VYVHADLHGASSTVIKNHRP---EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            VYVH D  GA   +IKN +P   + P+PP TL QAG FTV  S+AWD+K +  AWWV   
Sbjct: 557  VYVHTDTEGAMPMIIKN-KPGAFDDPIPPGTLAQAGTFTVATSRAWDTKALLGAWWVKAE 615

Query: 650  QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
            QVS+T  TGEYL   S +I G+KN L P  LI+GF +LF++   S+ +H   RR R EE 
Sbjct: 616  QVSRTTATGEYLPT-SVVISGEKNHLAPGQLILGFAVLFQISPESVANH---RRHRLEES 671

Query: 710  GMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAED 769
            G               +ESE D  D +P +E                   V  H+   ED
Sbjct: 672  GSPQIA----------VESE-DGKDPQPPSE-----------------REVLEHD---ED 700

Query: 770  KTISNGIDSKIFDIARNVAAPVTPQLE---DLIDRALGLGSASISSTKHGIETTQFDLSE 826
            K    G + +        A+ + PQ +   DL D      S  + +   G    + D S 
Sbjct: 701  K----GGELEEKGEPSEAASSLHPQNDEHGDLND------STPLMNEPQG----EVDQSS 746

Query: 827  EDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPE-----SI 881
            ED++       + +P  S    +     +  S+      RE+     ++SQP      SI
Sbjct: 747  EDEYDSADPAYQQQPEASDTATKDFSHARSPSI------REEGESVPSTSQPSRTSTPSI 800

Query: 882  VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENAS 941
               +  +  +  RG++GK KK+  KY DQDEE+R + + LL SA K        ++  A 
Sbjct: 801  QSSSTPKSQQQVRGKRGKAKKLASKYKDQDEEDRELALRLLGSAPKADAPKKTRESREAE 860

Query: 942  THKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVA 1001
               +K+             + +   H +   +    ++ H  +     GLD T    K+ 
Sbjct: 861  LQAQKE-------------RRRAQHHKAAQAERQRQENFHRRQQE---GLD-TGYAGKIV 903

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
             +              L+ +  L G P+  D ++  IPVC P++AV   KYR K+ PG  
Sbjct: 904  ND--------------LSVLPTLVGAPVVGDEIISAIPVCAPWTAVGQCKYRAKLQPGPT 949

Query: 1062 KKGKGIQ 1068
             KGK ++
Sbjct: 950  GKGKVVR 956


>gi|242764776|ref|XP_002340841.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
 gi|218724037|gb|EED23454.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
          Length = 1111

 Score =  355 bits (912), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 255/797 (31%), Positives = 402/797 (50%), Gaps = 108/797 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   +IG+R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSIDVKIICQELNTSIIGLRVSNIYDLSSRIFLFKLAKPDYRKQ--------LII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   NTPSGF  +LRK ++TRR+  V+QLG DRII      G+   ++
Sbjct: 53  DSGFRCHLTEYSRTTANTPSGFVSRLRKCLKTRRVTAVKQLGTDRIIDIVISDGL--FHI 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD+E  +L L R+     +   +     Y  E  + +           
Sbjct: 111 YLEFFAGGNIILTDAENKILALFRTVAAAGEQDEVKIGLTYAVEKAQYY----------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                              N +   S+E L      K+ D  ++   N+    + K    
Sbjct: 160 -------------------NGIPPVSEERLRATI-QKAIDAEQSPGGNAQRKPKKKVDVF 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK-FEDWLQDV 298
           +  +      + P L E     TG   ++ L EV  LED +I    +AV +  E  +  +
Sbjct: 200 RRAVSSGFPEFPPLLLEDAFAATGFDSSITLKEV--LEDESIFQKAMAVLREAEKIVAGL 257

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSR---EFVKF 351
             G+   +GYI+ + +   KD    +S  S      ++++F P    QF  +     +++
Sbjct: 258 SEGET--KGYIVAKER-AKKDTDFDQSNDSASKENLLFEDFHPFRPRQFEGKPGYHILEY 314

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           + F+  +DE++S IESQ+ E +    E+ A  KL     D  +R   LKQ  +  ++ AE
Sbjct: 315 DNFNKTVDEYFSSIESQKLESRLAEHEETAKRKLEAARADHLDRAGALKQAQELHIRKAE 374

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N+  V  A  AV   +A  M W ++AR+++ E++  NPVA  I   L L  N ++L
Sbjct: 375 AIQANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQERNNPVAKTIKLPLKLFENTITL 434

Query: 471 LL---------------------SNNLDEMDDEEKTLPVEK------VEVDLALSAHANA 503
           LL                     S++  E + E+   P  K      +++DL+LS  +NA
Sbjct: 435 LLSEESAKGEGDKEEFSESEPEGSDSNSESEFEKDGGPKRKNAEPLAIDIDLSLSPWSNA 494

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
            ++YE KK    K++KTI +  KA K+ EKK     +  + QEK V   S  RK  WFEK
Sbjct: 495 TQYYEQKKTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPS--RKPFWFEK 552

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVP 617
           + +FISSE YLV+ GRD+ Q E++ +RY+ KGDV+VHADL GA+  ++KN       P+P
Sbjct: 553 YLYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTSNAPIP 612

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL QAG  +V  S+AW++K +  +WWV+ HQVS+T   GE L  G FM++G+KN+L P
Sbjct: 613 PGTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLASGGFMVKGEKNYLAP 672

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE-----DSGHHKENSDIESE--- 729
              ++GF +LF++ + S+ +H   R+ R EE    D +     ++   + +SD++S    
Sbjct: 673 GQPVLGFAVLFQISKESVHNH---RKHRIEEYSELDTKETVSAETSAQEASSDVKSTVKE 729

Query: 730 -----KDDTDEKPVAES 741
                 DDT E+P  E+
Sbjct: 730 DVLAVADDTVEQPETET 746



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 76/168 (45%), Gaps = 17/168 (10%)

Query: 910  QDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLS 969
            QDEE+R + + LL +  KV K       E+A+  K K+ A       +   + ++A    
Sbjct: 853  QDEEDRELALRLLGANTKVNKT-----AESAAEIKAKREAELEAQKQRRRAQHERAAEAE 907

Query: 970  KDCKEH-PDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNP 1028
            +  +E    +   G   +   G DET   + +  E ED           L+ +  L G P
Sbjct: 908  RKRQEQFLKNRREGEGADVANGEDETYNDETIKAEAED-----------LSWLPALVGTP 956

Query: 1029 LPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLL 1076
            LP D +L  IPV  P+S V  +KYR K+  G+ KKGK I+      +L
Sbjct: 957  LPEDEVLAAIPVAAPWSVVARFKYRAKLQAGSVKKGKAIKEILGQWIL 1004


>gi|225563152|gb|EEH11431.1| DUF814 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 1158

 Score =  355 bits (912), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 325/1095 (29%), Positives = 508/1095 (46%), Gaps = 194/1095 (17%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 33   LIVDTGFRCHLTGYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 91

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 92   H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 132

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
               LT+ +  +   P  +    + +  A       +  GK            N  A+ KQ
Sbjct: 133  QYVLTNKQNYNGVPPLSIERLRDALEKAKDLTGPAEAAGK------------NKRAKKKQ 180

Query: 238  P-TLKTVLGEALG---YGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFE 292
               L+  +  +LG   Y P L EH    TG   ++K  ++  LED  + + L++A+   E
Sbjct: 181  AEALRRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPEQL--LEDPKLAEKLMVALVVAE 236

Query: 293  DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESG----SSTQIYDEFCPLLLNQFRSR-- 346
            +    + + +  P GYI+ + +    +    +S     SS   Y +F P    QF S   
Sbjct: 237  NVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKSSNVAYIDFHPFEPKQFESEPG 295

Query: 347  -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
               ++F+TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+  + 
Sbjct: 296  TSILRFDTFNKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAKTDQDKRVGVLKEAQEL 355

Query: 406  SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
             ++ A+ IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L 
Sbjct: 356  HIRKAQAIEANLLRVEEAVNAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLPLKLY 415

Query: 465  RNCMSLLLSNNLDEMD-------------------------------DEEKTLPVEKVEV 493
             N ++LLL    +  +                                ++   P+  +++
Sbjct: 416  ENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSQKTRQPLLSIDI 475

Query: 494  DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANIS 549
            DL +S  ANAR++YE KK    K+EKT+ +  KA K+ EKK     +  + QEK V   +
Sbjct: 476  DLGISPWANARQYYEQKKAAAVKEEKTLNSTKKAIKSTEKKVAADLKQALKQEKPVLRPT 535

Query: 550  HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                                    GRD QQ E++ +R++ +GDV+VHAD+ GA   ++KN
Sbjct: 536  RT-------------------PFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKN 576

Query: 610  H--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
                P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKT P GEYL  G F+
Sbjct: 577  KPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFV 636

Query: 668  IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEEE---GMD 712
            I G+KN LPP  L++GF ++F++   S+ +H   R             + G EE   G+D
Sbjct: 637  ICGEKNQLPPAQLLLGFAVMFQISGESIKNHTKHRVPDEAPTSESAKDILGTEELPSGLD 696

Query: 713  DFEDSGHHKEN-SDIESEKDDTDEKPVAESLSVPNSAHPAP-----SHTNASNVDSHEFP 766
              E   + K N +D + ++ D+ ++   E   + ++    P     + +N S  +S E P
Sbjct: 697  -LETPKNSKRNETDHQHQESDSTDQENGEIEQIADNKRTNPLLNDGAESNRSGSESEE-P 754

Query: 767  AEDKTISNGIDS---KIFDIAR--NVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQ 821
               +  S  +D+   K +D +R   V  P   Q+E+L                     ++
Sbjct: 755  NIGENGSQDVDARYDKGYDNSRFEAVEVPKLGQMENL---------------------SK 793

Query: 822  FDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI 881
             + S E +    TA     P++   ERR LK G         +E+   R  D +S   + 
Sbjct: 794  EEASSEPQTDSITAQPAKHPFVR--ERRLLKNG--------FIEQVPARLTDPASHSATN 843

Query: 882  V--RKTKIEGGKIS-----RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGD 934
            V  R +    G  +     RG++GK KK+  KY  QDEE+R + + LL S  K      D
Sbjct: 844  VPSRSSTPSIGASTATPNIRGKRGKNKKIATKYQHQDEEDRELALRLLGSDSKP-----D 898

Query: 935  PQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDET 994
               E A   K K   ++ ++A K   + ++A H          D +   E      L + 
Sbjct: 899  KLREAA---KRKADRLAELEAQK---QRRRAQH----------DRAAQAERERQKALQQQ 942

Query: 995  AEMDKVAMEEEDIH-EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYR 1053
            AE      + +    ++  +    L+ +  L G P+  D ++  IPVC P++A+  YKYR
Sbjct: 943  AETQAGGDDADGGDTQLDADTAADLSCLPSLIGTPVAGDEIVAAIPVCAPWTALSQYKYR 1002

Query: 1054 VKIIPGTAKKGKGIQ 1068
             K+ PGT KKGK ++
Sbjct: 1003 AKLQPGTVKKGKVVK 1017


>gi|255941192|ref|XP_002561365.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585988|emb|CAP93725.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1160

 Score =  355 bits (912), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 240/732 (32%), Positives = 379/732 (51%), Gaps = 90/732 (12%)

Query: 4   VRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           V++ T ++A+E  C    + +R SN+YDLS + ++FKL             +  L+++SG
Sbjct: 63  VKVITQELASE--C----VNLRVSNIYDLSSRIFLFKLAKPD--------HRRQLIIDSG 108

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R H T Y+R    TPS F  +LRK++++RR+  + Q+G DRII F F  G  A+++ LE
Sbjct: 109 FRTHVTQYSRTAATTPSPFVTRLRKYLKSRRITGISQIGTDRIIDFSFSDG--AYHIFLE 166

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
            +A GNI+LTD E+ +L + R          I    +Y   +C                 
Sbjct: 167 FFAGGNIILTDREYNILAVFRQVAAGVGQEEIKVGLKY--TVC----------------- 207

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
                    +K N DG     A +     +K    F    N+ K S    +     L+  
Sbjct: 208 ---------NKQNYDGVPDITADRVLQTLEKAQALFAQEGNAPKKSK---KKGTDVLRKA 255

Query: 244 LGEALG-YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           L +    Y P L +H+      DT    +  L   +KL+  A++ ++    +  +     
Sbjct: 256 LSQGFPEYPPLLLDHVFAIKEFDTTTPLDQVLGSQDKLQ--AVKEVLEESRRISNTFD-- 311

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFD 355
            SGD  P GYI+ +          T S +   +Y++F P    QF ++   + ++FE F+
Sbjct: 312 -SGDSHP-GYIVAKEDTRPVPEGETASKAPALLYEDFHPFKPRQFENKPGTKILEFERFN 369

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           A +DE++S +ESQR E +   +E+AA  KL  +  + + R+  LK   +  ++ A+ I+ 
Sbjct: 370 ATVDEYFSSLESQRLESRLTEREEAAKKKLESVRSEHKKRIDELKNVQEIHIRKADAIQD 429

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN 474
           N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N ++L+L  
Sbjct: 430 NVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQGRGNPVAQTIKLPLKLYENTVTLVLGE 489

Query: 475 -----------------NLDEMDDEEKTLPVEK------VEVDLALSAHANARRWYELKK 511
                            +  E + E++T   E+      +++DL LS  ANA ++Y+ KK
Sbjct: 490 AGDDEDEDEEFSSSDEESDSENEAEQETARAERESKLLTIDIDLGLSPWANASQYYDQKK 549

Query: 512 KQESKQEKTITAHSKAFKAAEKK--TRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           +   K+++T  + +KA K+ EKK  T L+   +K    +   R   WFEKF +FISSE Y
Sbjct: 550 QASEKEQRTTQSSAKALKSHEKKVTTDLKRGLKKEKQVLRQARTPFWFEKFIFFISSEGY 609

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCF 627
           LVI  RDA Q+E++ +RY+SKGD++VHADL GA+  V+KN     + P+ P TL+QAG  
Sbjct: 610 LVIGARDAMQSELLYRRYLSKGDIFVHADLEGATPIVVKNRAGSADAPISPSTLSQAGNL 669

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V  S AWDSK V SAWW + HQVSK A  G   +  G F I+G+KNFL P  L++GFG+
Sbjct: 670 CVATSSAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGFGI 729

Query: 687 LFRLDESSLGSH 698
           +F++ + S+ +H
Sbjct: 730 MFQISQESVRNH 741


>gi|167395586|ref|XP_001741648.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165893772|gb|EDR21907.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 960

 Score =  354 bits (908), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 234/692 (33%), Positives = 355/692 (51%), Gaps = 100/692 (14%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L+    + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLLNFNINTVYDINRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT KLRK++  ++L  + Q+G DR+I   FG     + ++++LY+ GNI L D E+ +
Sbjct: 79  NNFTSKLRKYLNKKKLIKINQIGNDRVIELVFGNVTERYSLVVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDAN---EPDKVN 196
           L  LRS   D  G  +    +YP              LH         DAN   E  K+ 
Sbjct: 139 LLTLRSFTFDKTGDKVAVGEKYP--------------LHLL------SDANGIDELKKII 178

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           ++ N +  +  E++ G                          TLK ++     +G  LS+
Sbjct: 179 KEYNTIFTS--ESMKGW-------------------------TLKQLINYTSDFGQQLSD 211

Query: 257 HIILDTG------LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYIL 310
           H     G              E   L  N   +L  A+ ++E     + SG+   +GYI 
Sbjct: 212 HCCSQFGKESSKTKKFEEFNEEEKSLMKN---ILEEAITRYEK----IDSGNC--KGYIF 262

Query: 311 MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRA 370
               H  K             Y+E    + NQ   R++++FE+F+ A+DEF+S IE Q  
Sbjct: 263 YHETHQKK------------YYEEVSCDIFNQDSKRKYIEFESFEKAMDEFHSHIEKQEY 310

Query: 371 EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
           E + + KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V 
Sbjct: 311 EAEVEKKEMIMKKKVQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVF 370

Query: 431 LANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSL-LLSNNLDEMDDEEKTLP 487
           L  +M WE +  ++ EE K  +P  +A  I +   +   + L L   N D++ D      
Sbjct: 371 LKEKMKWEQIEGII-EELKENDPTSIAKYIKRFDFKNEVVVLELRHTNEDKIID------ 423

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVA 546
              VE+ L  +   N R +YE++K   +K EKTI +   A K AE K+ R+   ++ T+ 
Sbjct: 424 ---VEIALNKNGFENVRNFYEMRKNILAKAEKTIESKDLAIKQAENKQERVAKEKKITLV 480

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           ++  MRK  WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   D+YVHAD+HGA+S +
Sbjct: 481 DVKKMRKRFWFEKFHWFLSSENFIIISGKDALQNDIIYRRYMKNTDIYVHADIHGAASCI 540

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           IK   P + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSF
Sbjct: 541 IKG-IPGKTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSF 599

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           MIRGKKN+LPP PL+ G G++F +++    +H
Sbjct: 600 MIRGKKNYLPPVPLVFGIGIMFVVEKEDKENH 631



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 32/164 (19%)

Query: 907  YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKK 964
            Y +QDEE+R           K+++  G   N      +E+KP   I  V  P  C+ C  
Sbjct: 763  YEEQDEEDRK----------KMEERIGHKFN----VKEEEKPKEDIKKV-VPVQCFFCGS 807

Query: 965  AGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYL 1024
              HL+KDC +  ++     E+     ++E   +D   M  +D   +GE  +G        
Sbjct: 808  TEHLAKDCPKRKEELKKKQEEKIKERMEEEEGIDDEEMSIDDTIFVGELVEGMS------ 861

Query: 1025 TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
                     + + +PVCGPY  +  YKY +K+ PG  K GK I+
Sbjct: 862  ---------VKFAVPVCGPYECISKYKYHIKLTPGNTKAGKAIK 896


>gi|407406699|gb|EKF30889.1| hypothetical protein MOQ_005283 [Trypanosoma cruzi marinkellei]
          Length = 1098

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 251/779 (32%), Positives = 394/779 (50%), Gaps = 106/779 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESGVR+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G +A Y
Sbjct: 54  -ESGVRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEDASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S    +  E ++  +         + N   Q+    F               A   
Sbjct: 169 TH--SEGGKEEEEKEQEEQQQQQQRQVRRTNALRQEWHTVF------------ARHADYE 214

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +    +L+  +   + W    
Sbjct: 215 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGELTSDAETLFTLLLPGM--LQAW---E 268

Query: 299 ISGDIVPEGYILMQNKH---------------LGKDHPPTESGSSTQI------------ 331
           I+   +P G  L+ N                 +G+D P TE   S  +            
Sbjct: 269 IAFSPLPGGGYLISNHRQRKEFRKGGKDVSSKIGEDKPQTEEEKSVNVNVADRSQQQMQT 328

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 329 VQYDDFSPVLLAQYSSEGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 388

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++TL+ E   + +  E I  +   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 389 FERDHQRRLNTLEMEEQENQRKGECIIQHAVKIDEAIGLINGALAAGIQWDALRSLLKRR 448

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK--TLPVEKVEVDLALSAHANARR 505
              G+PVA ++ +L+LERN +S+L+ +N  E + EE     P+  +EV+L+ +A+ANA  
Sbjct: 449 HAEGHPVAYMVHELFLERNSISVLVESNEQEDEGEEDCDVTPM-VIEVELSKTAYANATT 507

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           ++   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +
Sbjct: 508 YFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRT 567

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV---------------IKNH 610
           S    V+ G+D Q  E++V+R M  GDV++H D+ GA   V               +K H
Sbjct: 568 SCGDFVLQGKDLQTTEILVRRVMQLGDVFLHCDVDGALPCVLRPIGSAWTTAFVEDVKGH 627

Query: 611 RPEQP------VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
           R E        +   +L++AG + V  S AW+ K   +AWWV+  Q++    +G YL   
Sbjct: 628 RQEGSQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFTVAAWWVHASQITGGTASGCYL--- 684

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL--------DESSLGSHLNE---RRVRGEEEGMD 712
                G+K++L P P+    GLLFR+        D   L + ++E   R    EEEG D
Sbjct: 685 ---FDGEKHYLRPQPITFACGLLFRVPTRRIDPNDRDELPNFISEGERRPQHAEEEGED 740



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 1023 YLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            Y T  P P+D + Y + VC P S V SYKYR +++ G AKKG   Q+  SL    L++T
Sbjct: 1006 YFTSQPQPTDNIEYALAVCAPMSCVISYKYRAELLFGNAKKG---QVTTSLQGHFLAMT 1061


>gi|67468480|ref|XP_650274.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466879|gb|EAL44894.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449704977|gb|EMD45123.1| zinc knuckle domain containing protein [Entamoeba histolytica KU27]
          Length = 959

 Score =  353 bits (905), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 231/685 (33%), Positives = 358/685 (52%), Gaps = 86/685 (12%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L     + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLQNFNINTVYDVNRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  ++L  + Q+G DR+I   FG     + +I++LY+ GNI L D E+ +
Sbjct: 79  NNFTSRLRKYLNKKKLIKINQIGNDRVIELVFGNATERYSLIVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L +LRS   D  G  +    +YP              LH         DAN  D++    
Sbjct: 139 LLILRSFTFDKTGDKVAVGEKYP--------------LHLL------SDANGIDEL---- 174

Query: 200 NNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
                        +K  K +D    S          K  TLK ++     +G  LS+H  
Sbjct: 175 -------------KKIIKEYDTIFTSE-------SMKGWTLKQLINYTSDFGQQLSDHCC 214

Query: 260 LDTGL--VPNMKLSEVNKLEDNAIQ-VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              G       +L E N+ E + ++ +L  A+ ++E     + SG    +GYI       
Sbjct: 215 SQFGKESSKTKRLEEFNEEEKSLMKKILEEAITRYEK----IDSGKC--KGYIFYH---- 264

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
                     +  + Y+E    +  Q   R++++FE+F+ A+DEF+S IE Q  E + + 
Sbjct: 265 --------ETNKKKYYEEVSCDIFYQDSKRKYIEFESFEKAMDEFHSHIEKQEYEAEVEK 316

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V L  +M 
Sbjct: 317 KEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVFLKEKMK 376

Query: 437 WEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVD 494
           WE +  ++ E  K  +P  +A  I +   +   + L L +      +E+K +   +VE+ 
Sbjct: 377 WEQIEGII-ESLKENDPTSIAKYIKRFDFKNEVVVLELKHT-----NEDKII---EVEIA 427

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRK 553
           L  +   N R +YE++K   +K EKT+ +   A K AE K+ R+   ++ T+ ++  MRK
Sbjct: 428 LNKNGFENIRNFYEMRKNILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRK 487

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   DVYVHAD+HGA+S +IK   P 
Sbjct: 488 RFWFEKFHWFLSSENFIIISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGI-PG 546

Query: 614 QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
           + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSFMIRGKKN
Sbjct: 547 KTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSH 698
           +LPP PL+ G G++F +++    +H
Sbjct: 607 YLPPVPLVFGIGIMFAVEKEDKENH 631



 Score = 63.2 bits (152), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 29/175 (16%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG+ GK+KK+K +Y DQDEE+R           K+++  G     N    ++ K  I  V
Sbjct: 750  RGKAGKMKKLK-RYEDQDEEDRK----------KMEERIG--HKFNVKEEEQPKEDIKKV 796

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
              P  C+ C    HL+KDC              P    +   + ++   E  +  E  ++
Sbjct: 797  -VPIQCFFCGSTEHLAKDC--------------PKRKEELKKKQEEKIKERMEEEEEIDD 841

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            E+  +ND  ++ G  +    + + +PVCGPY  V  YKY +K+ PG  K GK I+
Sbjct: 842  EEMSVNDTIFV-GELVEGMNVKFAVPVCGPYDCVSKYKYHIKLTPGNTKAGKAIK 895


>gi|296813237|ref|XP_002846956.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
 gi|238842212|gb|EEQ31874.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
          Length = 1103

 Score =  353 bits (905), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 242/761 (31%), Positives = 381/761 (50%), Gaps = 136/761 (17%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++G+R +N+YD+SP+T++FKL        +    K  L++
Sbjct: 1   MKQRYSSLDVKVISRELSANILGLRIANIYDISPRTFLFKL--------ALPDIKKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G   H T  +R   + PS F  +LRK ++TRR+  VRQ+G DRI+ F+   G+   Y 
Sbjct: 53  NAGFHCHLTESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRILEFEISDGLFRLY- 111

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+++              G+  + RH  P                  
Sbjct: 112 -LEFFAAGNLILTDAKY--------------GIVALLRHVAP------------------ 138

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN-----SNDGARA 235
                             G++V           K G S+ L    N N     + D  +A
Sbjct: 139 ------------------GSDVEEV--------KVGMSYKLESKMNYNGIPPLTIDRLKA 172

Query: 236 --KQPTLKTVLGEALGYG-----PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             ++ T   VL  +L +G     P L +H     G   + KL     L DN +   ++ V
Sbjct: 173 TLEKDTGSKVLKRSLYFGFPEYPPTLLDHAFHIIGF--DSKLQPAQILTDNNLIHGLMGV 230

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQFR--- 344
            +  D + + +S D    GYIL +N   G       + S+  I + +F P   +Q +   
Sbjct: 231 LQEADRVNNALSSDRQTPGYILAKNIVPGTADGAEGTQSAPTIEFRDFHPFEPSQSKDLP 290

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +   ++F+TF++A+D+++S IE+++ E +   +EDAA  KL     D E RV+ LK++ +
Sbjct: 291 NTTMLRFDTFNSAVDKYFSSIEARKLESRLTEREDAARKKLEATKRDHEKRVNALKEKQE 350

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
             V+ A  IE NL  V+ AI AV   +A  M W ++AR+++ E+  GNPVA  I   L L
Sbjct: 351 FHVRKAHAIEANLPQVEDAINAVNGLVAQGMDWVEIARLIEMEQAKGNPVALCIKLPLKL 410

Query: 464 ERNCMSLLLSNN-----------------------------LDEMDDEEKTLPVEK---- 490
             N +++LL+                                +    ++ T   +K    
Sbjct: 411 YENTITILLTEETAETEDEDEESDESEGDDEDEDNDYGDDEYERPKHKKMTAKTQKEKKE 470

Query: 491 -------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQI 539
                  +++DL +S  ANAR++Y+ KK    K+EKT+ A +KA K+ EKK +    L +
Sbjct: 471 RKDNRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTEKKVKADLKLAL 530

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            QEK V  +   R   WFEKF +FISS+ YLVI GRD QQ+E++ +RY+ KGD+YVH DL
Sbjct: 531 KQEKPV--LRRARNPAWFEKFFFFISSDGYLVIGGRDQQQDEILFQRYLKKGDIYVHTDL 588

Query: 600 HGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
            G    ++KN    P+ P+PP T++QA  ++V  S+AWD+K     WWV+  QVSK   T
Sbjct: 589 EGGVPLIVKNKPEFPDDPIPPNTISQASAYSVASSKAWDTKAAMGGWWVHASQVSKVTST 648

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           G+ L  G FMI+G+KN LPP  +++GF +LF+L   S+ +H
Sbjct: 649 GDILKAGHFMIKGEKNHLPPGQIVLGFAVLFQLSPQSVQNH 689



 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 32/165 (19%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG++GK KK+  KY DQDEE+R + + LL SA         P +   +  K K    +  
Sbjct: 845  RGKRGKAKKLATKYKDQDEEDRKLALRLLGSA---------PGSTTVNKTKTKADIEAER 895

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
            +A K   + +    L    ++    + H VED                         GEE
Sbjct: 896  EAQKERRRAQHERALQAVKRQQEAFTRHSVEDAS-----------------------GEE 932

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
             K   + +  L G P+  D +   IPVC P++A+  YKYR K+ P
Sbjct: 933  HKLDFSMLPALVGTPVEGDEIEAAIPVCAPWTALGQYKYRAKLQP 977


>gi|408392777|gb|EKJ72097.1| hypothetical protein FPSE_07722 [Fusarium pseudograminearum CS3096]
          Length = 1078

 Score =  352 bits (904), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 267/794 (33%), Positives = 396/794 (49%), Gaps = 107/794 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSIFVARLRKFLKTRRLTAVRQVGTDRVLEFEFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD++   L +L   R   +G       + P  +   +           
Sbjct: 111 FLEFFASGNIILTDAD---LNILALARTVSEG-----EGQEPQRVGLQYSLENRQNYGGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +K+   N      E     + +SK+    QKG    DL K               +L
Sbjct: 163 PPLTKQRVQNALKAAVEKAAADATSSKK----QKGKPGGDLRK---------------SL 203

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              + E     P L +H +     DT + P+  L+    L++     LV ++ +    ++
Sbjct: 204 AVSITE---LPPVLVDHWLHTNNFDTTVKPHEVLANEILLDE-----LVKSLQEARKIVE 255

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSR---EFVK 350
           ++ S +    GYI  + +   +     E   + +   +YD+F P +  + ++    E ++
Sbjct: 256 ELTSSETC-TGYIFAKRRERPEGTEVDEETKTKRDNLLYDDFHPFIPYKLKNDPAIEVLE 314

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE ++  +DEF+S +E QR E +   +E  A  KL     +Q  R+  L++    + + A
Sbjct: 315 FEGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             IE N+E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   L L  N ++
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVADIIKLPLNLAENLIT 434

Query: 470 LLLSNNLDEMDDE--------------------------EKTLPVEKVEVDLALSAHANA 503
           L L+    E +++                          ++T     VE++L  S  +NA
Sbjct: 435 LELAEEEFEPEEDDPYETDDDDDDDSALGDDEGTSAAKGKQTNKALNVEINLGFSPWSNA 494

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
           R +++ +K    K+EKT    S+A K AE+K     +  + QEK +  +  +RK  WFEK
Sbjct: 495 REYFDQRKTAAVKEEKTQQQASRALKNAEQKITEDLKKGLKQEKAL--LQPIRKQMWFEK 552

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F WFISS+ YLVI G+DAQQNE I K+Y+ KGD+Y HADLHGASS +IKN+   P+ P+P
Sbjct: 553 FTWFISSDGYLVIGGKDAQQNETIYKKYLRKGDIYCHADLHGASSVIIKNNPKTPDAPIP 612

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   VC S AWDSK    AWWV   QVSK+APTGE+L  GSFMIRGKKNFLPP
Sbjct: 613 PATLSQAGSLAVCSSNAWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLPP 672

Query: 678 HPLIMGFGLLFRLDESSLGSHLNER-----RVRGEEE----------GMDDFEDSGHHKE 722
             L++G GL FR+ E S   H+  R        G+E           G  D  D+GH   
Sbjct: 673 AQLLLGLGLAFRISEESKAKHVKHRLHDVDSAIGDEGSGAPQSAGMMGDADEPDAGHSDV 732

Query: 723 NSDIESEKDDTDEK 736
            SD E E +  DE+
Sbjct: 733 PSDYEIEDEKHDEE 746



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 51/78 (65%), Gaps = 2/78 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            ETAE +++   M EE +  + E+E  ++  +D + G PLP D +L +IPVC P++A+  Y
Sbjct: 917  ETAEHEEIRRVMMEEGVEMLDEDEASQMTVLDAIVGTPLPGDEILEIIPVCAPWNALGRY 976

Query: 1051 KYRVKIIPGTAKKGKGIQ 1068
            KY+ K+ PG  KKGK ++
Sbjct: 977  KYKAKLQPGATKKGKAVK 994


>gi|407039370|gb|EKE39608.1| zinc knuckle domain containing protein [Entamoeba nuttalli P19]
          Length = 959

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 245/738 (33%), Positives = 382/738 (51%), Gaps = 93/738 (12%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L     + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLQNFNINTVYDVNRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  ++L  + Q+G DR+I   FG     + +I++LY+ GNI L D E+ +
Sbjct: 79  NNFTSRLRKYLNKKKLIKINQIGNDRVIELVFGNATERYSLIVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L  LR+   D  G  +    +YP              LH         DAN    +NE  
Sbjct: 139 LLTLRNFTFDKTGDKVAVGEKYP--------------LHLL------SDAN---GINELK 175

Query: 200 NNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
           N +              K +D    S          K  TLK ++     +G  LS+H  
Sbjct: 176 NII--------------KEYDTIFTSE-------SMKGWTLKQLINYTSDFGQQLSDHCC 214

Query: 260 LDTGL--VPNMKLSEVNKLEDNAIQ-VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              G       +L E N+ E + ++ +L  A+ ++E     + SG    +GYI       
Sbjct: 215 SQFGKESSKTKRLEEFNEEEKSLMKKILEEAITRYEK----IDSGKC--KGYIFYH---- 264

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
                     +  + Y+E    +  Q   R++++FE+F+ A+DEF+S IE Q  E + + 
Sbjct: 265 --------ETNKKKYYEEVSCDIFYQDSKRKYIEFESFEKAMDEFHSHIEKQEYEAEVEK 316

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V L  +M 
Sbjct: 317 KEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVFLKEKMK 376

Query: 437 WEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVD 494
           WE +  ++ E  K  +P  +A  I +   +   + L L +      +E+K +   +VEV 
Sbjct: 377 WEQIEGII-ESLKENDPTSIAKYIKRFDFKNEVVVLELKHT-----NEDKII---EVEVA 427

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRK 553
           L  +   N R +YE++K   +K EKT+ +   A K AE K+ R+   ++ T+ ++  MRK
Sbjct: 428 LNKNGFENIRNFYEMRKNILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRK 487

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   DVYVHAD+HGA+S +IK   P 
Sbjct: 488 RFWFEKFHWFLSSENFIIISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGI-PG 546

Query: 614 QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
           + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSFMIRGKKN
Sbjct: 547 KTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
           +LPP PL+ G G++F +++    +H  E  ++ E + +   E+     + S+ E +K+  
Sbjct: 607 YLPPVPLVFGIGIMFAVEKEDKENH--EEVIQQETKEVQQKENVESVIKISEQERDKEQK 664

Query: 734 DEK----PV-AESLSVPN 746
           +EK    PV  E ++V N
Sbjct: 665 EEKQEVVPVQVEKVNVKN 682



 Score = 63.2 bits (152), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 29/175 (16%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG+ GK+KK+K +Y DQDEE+R           K+++  G     N    ++ K  I  V
Sbjct: 750  RGKAGKMKKLK-RYEDQDEEDRK----------KMEERIG--HKFNVKEEEQPKEDIKKV 796

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
              P  C+ C    HL+KDC              P    +   + ++   E  +  E  ++
Sbjct: 797  -VPIQCFFCGSTEHLAKDC--------------PKRKEELKKKQEEKIKERMEEEEEIDD 841

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            E+  +ND  ++ G  +    + + +PVCGPY  V  YKY +K+ PG  K GK I+
Sbjct: 842  EEMSVNDTIFV-GELVEGMNVKFAVPVCGPYDCVSKYKYHIKLTPGNTKAGKAIK 895


>gi|154281559|ref|XP_001541592.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150411771|gb|EDN07159.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 1177

 Score =  350 bits (899), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 255/752 (33%), Positives = 390/752 (51%), Gaps = 106/752 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R ++ DV A       L+G+R SN+YDLS + ++FKL             +  L+++
Sbjct: 16  MKQRFSSLDVKA-------LVGLRISNIYDLSSRIFLFKLAKPD--------TRRQLIVD 60

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRI+  +   G N H V+
Sbjct: 61  AGFRCHLTEYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIVDIELSDG-NFH-VL 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE YA GNI+LTD ++ +L L   HR   +G           E  RV        L   L
Sbjct: 119 LEFYAAGNIILTDKDYKILAL---HRIVPEG--------SDQEEVRV-------GLQYVL 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP-TL 240
           T+ +  +   P  +    + +  A       +  GK            N  A+ KQ   L
Sbjct: 161 TNKQNYNGVPPLSIERLRDALKKAKGVTGPAEAAGK------------NKRAKKKQAEAL 208

Query: 241 KTVLGEALG---YGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQ 296
           +  +  +LG   Y P L EH    TG   ++K  ++  LED  + + L++A+   E+   
Sbjct: 209 RRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPEQL--LEDPKLAEKLMVALVVAENVNS 264

Query: 297 DVISGDIVPEGYILMQNK-HLGKD---HPPTESGSSTQIYDEFCPLLLNQFRSR---EFV 349
            + + +  P GYI+ + +   G+D        S SS   Y +F P    QF S      +
Sbjct: 265 SLSTAEETP-GYIVSKTEGKAGEDASVDSTVPSKSSNVAYIDFHPFEPKQFESEPGTSIL 323

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F+TF+ A+DE++S  ESQ+ E +   +E+ A  KL     DQ+ RV  LK+  +  ++ 
Sbjct: 324 RFDTFNKAVDEYFSSAESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKEAQELHIRK 383

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L L  N +
Sbjct: 384 AQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQGRQNPVANVIKLPLKLYENAV 443

Query: 469 SLLL---SNNLDEMD----------------------------DEEKTLPVEKVEVDLAL 497
           +LLL   + N + MD                             ++   P+  +++DL +
Sbjct: 444 TLLLGEPTENEEPMDESEDEAEVEEEEEQESSEDEDSGKKPGVSKKPRQPLLSIDIDLGI 503

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
           S  ANAR++YE KK    K++KT+ +  +A K+ +KK     +  + QEK V  +   R 
Sbjct: 504 SPWANARQYYEQKKVAAVKEKKTLNSTKEAIKSTKKKVAADLKQALKQEKPV--LRPTRT 561

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP- 612
             WFEKF +F+SS+ YLV+ GRD QQ E++ +R++ +GDV+VHAD+ GA   ++KN +P 
Sbjct: 562 PFWFEKFIFFLSSDGYLVLGGRDVQQTEILYRRHLKRGDVFVHADVQGAIPVIVKN-KPG 620

Query: 613 --EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             + P+PP TL+QAG   V  S AWDSK V  AWW   +QVSKT P GEYL  G F+I G
Sbjct: 621 TLDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWANANQVSKTTPLGEYLVTGGFVICG 680

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           +KN LPP  L++GF ++F++   S+ +H   R
Sbjct: 681 EKNQLPPAQLLLGFAVMFQISGESIKNHTKHR 712


>gi|261195108|ref|XP_002623958.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239587830|gb|EEQ70473.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 1150

 Score =  346 bits (888), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 308/995 (30%), Positives = 467/995 (46%), Gaps = 171/995 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAEKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYILMQNKHLG-KDHPPTESG---SSTQIYDEFCPLLLNQFRSRE---FVKFE 352
           + D  P GYI+ + +    +D   T +    S    Y +F P    QF ++     +KF+
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPFEPKQFENQADMAILKFD 319

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ A+ 
Sbjct: 320 TFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQA 379

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N ++LL
Sbjct: 380 IEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLL 439

Query: 472 LSNNL------------------------------DEMDDEEKTLPVEKVEVDLALSAHA 501
           L                                   +  +++    +  +++DL +S  A
Sbjct: 440 LGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWA 499

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHWFEK 559
           NAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   WFEK
Sbjct: 500 NARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEK 559

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ P+P
Sbjct: 560 FIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIP 619

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW SK V                 GEYL  G F+IRG+KN LPP
Sbjct: 620 PGTLSQAGNLCVATSSAWHSKAV----------------MGEYLETGGFVIRGEKNQLPP 663

Query: 678 HPLIMGFGLLFRLDESS-------------LGSHLNERRVRGEEEGMDDF----EDSGHH 720
             L++GF      D+SS             L S L+++  R E E  + +    ++    
Sbjct: 664 AQLLLGFA-----DDSSTTTGVKETQGMEELPSRLDQQTPR-ESENKETYHQPEQNDSSD 717

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS-NGIDSK 779
           +EN +IE   DD    P           H     +++ + D      ED+    +  D +
Sbjct: 718 EENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIGEDRPQDVDAKDER 768

Query: 780 IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRD 839
            +D A + A         + + ALG    S    + G E           H + +A  R 
Sbjct: 769 EYDHAESKA---------VEEAALGGKETSSQEEQAGSEP----------HTD-SAAARP 808

Query: 840 KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS----RG 895
              +S  E  +LKK  G S+         E+     + PES  R T  E  + S    RG
Sbjct: 809 AKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTPNEPSRSSTPNIRG 855

Query: 896 QKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           ++GK KK+  KY  QDEE+R + + LL SA K  K
Sbjct: 856 KRGKNKKIATKYQHQDEEDRELALRLLGSAPKPDK 890



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L G  +  D ++  IPVC P+ A+  YKYR K+ PG  KKGK ++
Sbjct: 965  LIGTAVVGDEIVAAIPVCAPWMALGQYKYRAKLQPGPLKKGKAVK 1009


>gi|209875685|ref|XP_002139285.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209554891|gb|EEA04936.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 1427

 Score =  346 bits (888), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 255/844 (30%), Positives = 422/844 (50%), Gaps = 124/844 (14%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   D+ A V  + + L G +  N+YD++ +TY+ K          G   K+ LL
Sbjct: 1   MVKSRMTAIDICAMVHSIAKDLKGQKLVNIYDINHRTYLLKF---------GGEGKLFLL 51

Query: 60  MESGVRLHTTAYARDKKNTP--------SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
           +E+G+R HTT + R  + T         S F  KLR+++R R+L D+ Q+  DRI+   F
Sbjct: 52  IEAGIRFHTTHWKRGSQQTMNSSSVVSISYFNNKLRRYLRGRKLVDMAQMDLDRIVKLTF 111

Query: 112 GLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           G G N  ++ILE +  GNI+LTD+ + +L +LR    D+  ++I  R+ +   I    + 
Sbjct: 112 GFGENIFHLILEFFVAGNIILTDNNYNILVILR----DNGNLSIGKRYNWENSI----DI 163

Query: 172 TTASKLHAALTSSKEPDAN---EPDKVN-------EDGNNVSNASKENLGGQKGGKSFDL 221
             +  +  ++  S  PD +    P  +        ED  N+    KE   G +  K   +
Sbjct: 164 DCSHAVFPSILRSPAPDIDVDQAPWMIQWLDESYLEDQLNI--MIKEAEAGSEE-KQLQI 220

Query: 222 SKNS----NKNSNDGARAKQP---TLKTVLGEALGYG-PALSEHIILDTGL-----VPNM 268
           S+ S    +K  ND   + QP   T + +LG+ L +  P + + ++   GL     V + 
Sbjct: 221 SRGSTNKRSKQGNDTIPSNQPSGITSQVLLGKILRFCHPIMLQQLLEKYGLDKDQLVTSS 280

Query: 269 KLSEVNK-----LEDNAIQVLVLAVAKFEDWLQDVI-SGDIVPEGYILMQNKHLGKDHPP 322
            + +++K     ++D    + +L  ++    +   + S D + EG +++++    + H  
Sbjct: 281 SIRDISKKFIKCIKDAKYLLGILCNSEVLGIMTLCLTSRDQMKEGDLILRDLQQVETHVS 340

Query: 323 TESGSSTQ------IYDEFCP------------------LLLNQFRSREFVKFETFDAAL 358
           +E  +  +      +Y  F P                  L++N+F S+       F   +
Sbjct: 341 SECKAKAEQDKTEPLYISFSPYVKDHEWIYSVQALPKDGLIVNRFTSK-------FSDCV 393

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYS I+  +  ++ + +E A   K++K+ +DQE R+  L +E +  +K A  +E    
Sbjct: 394 DEFYSSIDINKETKEIQQEEKAINSKIDKLRIDQERRLKELVEEKEACIKRANFMECCEL 453

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--- 475
            ++  +L  R  +A    W+D+   V+++RK G+P+A  I  L LE + + +    +   
Sbjct: 454 LLEKILLLTRHLIATGAQWKDICNEVRQQRKIGHPIAKYIKSLDLEHDRVVVYFGADEFP 513

Query: 476 ---------LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
                      E + + K+    ++ ++++ S  AN R  YE  K   +K E+T +A+ +
Sbjct: 514 EDFDYSRYGYGESNSKLKSQEGIEIYLNISKSMQANIRSEYEESKHISAKLERTKSAYKR 573

Query: 527 AFKAAEKKT-----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
           A     K       +L       V  I  +R+ +WFEKF+WFISS+ +LVI G D+ QNE
Sbjct: 574 ALNKVTKTVNRNTEKLTGPLNTGVNRIHKIRQSYWFEKFHWFISSDGFLVIGGNDSSQNE 633

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT 641
           ++ +RY+ K D Y+HAD HGA++ ++KN +    +P  TL +AG  ++C+S++W +K V 
Sbjct: 634 LLYRRYLEKNDRYIHADTHGATTCIVKNPKNLADIPMNTLCEAGQMSICYSRSWANKTVI 693

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE 701
           SAWWVYP QVSKTAP+GEYLT GSF+IRGKKNFLPP  L MG  L+F            +
Sbjct: 694 SAWWVYPDQVSKTAPSGEYLTTGSFVIRGKKNFLPPLKLEMGIALVFV-----------K 742

Query: 702 RRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVD 761
            + + E+E + D ED     E+S   SE  DT+ K         NS     SH N+ N  
Sbjct: 743 TKKQAEKEELSDLEDISSKFEDSTY-SETVDTEIKVNL------NSNISDKSHVNSDNDL 795

Query: 762 SHEF 765
           S +F
Sbjct: 796 SSKF 799



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/213 (32%), Positives = 95/213 (44%), Gaps = 51/213 (23%)

Query: 871  GKDASSQPESIVRKTKIEGG-------KISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            GK     P    R   IE G       K+SR +K KLKKM  KYG+QDE+ER +RM L  
Sbjct: 1191 GKIELKMPNISSRGRSIESGNNQSTNQKLSRRKKFKLKKMALKYGEQDEQERKLRMVLTG 1250

Query: 924  SAG-KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCY------KCKKAGHLSKDCKEHP 976
            S   K+  +   P      T + K+P+ S +D PK  +      K K+   L +  KE  
Sbjct: 1251 SKDMKLAYSSKSP------TVESKEPS-SSIDIPKPLHITQQEKKKKEQERLERIYKER- 1302

Query: 977  DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLY 1036
                          +D T E      E E+I E     K    D+D       P+  L+ 
Sbjct: 1303 -------------NVDNTIE----NREFENIRECL--LKSNRVDID-------PN--LIA 1334

Query: 1037 VIPVCGPYSAVQSYKYRVKIIP-GTAKKGKGIQ 1068
            +IP+C PYS V+ Y+Y VK+ P G  K+ K  Q
Sbjct: 1335 IIPICAPYSCVRDYEYIVKLTPGGNLKRSKAAQ 1367


>gi|239610682|gb|EEQ87669.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 1131

 Score =  346 bits (888), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 308/995 (30%), Positives = 467/995 (46%), Gaps = 171/995 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAKKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYILMQNKHLG-KDHPPTESG---SSTQIYDEFCPLLLNQFRSRE---FVKFE 352
           + D  P GYI+ + +    +D   T +    S    Y +F P    QF ++     +KF+
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPFEPKQFENQADMAILKFD 319

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ A+ 
Sbjct: 320 TFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQA 379

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N ++LL
Sbjct: 380 IEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLL 439

Query: 472 LSNNL------------------------------DEMDDEEKTLPVEKVEVDLALSAHA 501
           L                                   +  +++    +  +++DL +S  A
Sbjct: 440 LGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWA 499

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHWFEK 559
           NAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   WFEK
Sbjct: 500 NARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEK 559

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ P+P
Sbjct: 560 FIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIP 619

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW SK V                 GEYL  G F+IRG+KN LPP
Sbjct: 620 PGTLSQAGNLCVATSSAWHSKAV----------------MGEYLETGGFVIRGEKNQLPP 663

Query: 678 HPLIMGFGLLFRLDESS-------------LGSHLNERRVRGEEEGMDDF----EDSGHH 720
             L++GF      D+SS             L S L+++  R E E  + +    ++    
Sbjct: 664 AQLLLGFA-----DDSSTTTGVKETQGMEELPSRLDQQTPR-ESENKETYHQPEQNDSSD 717

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS-NGIDSK 779
           +EN +IE   DD    P           H     +++ + D      ED+    +  D +
Sbjct: 718 EENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIGEDRPQDVDAKDER 768

Query: 780 IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRD 839
            +D A + A         + + ALG    S    + G E           H + +A  R 
Sbjct: 769 EYDHAESKA---------VEEAALGGKETSSQEEQAGSEP----------HTD-SAAARP 808

Query: 840 KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS----RG 895
              +S  E  +LKK  G S+         E+     + PES  R T  E  + S    RG
Sbjct: 809 AKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTPNEPSRSSTPNIRG 855

Query: 896 QKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           ++GK KK+  KY  QDEE+R + + LL SA K  K
Sbjct: 856 KRGKNKKIATKYQHQDEEDRELALRLLGSAPKPDK 890



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L G  +  D ++  IPVC P+ A+  YKYR K+ PG  KKGK ++
Sbjct: 965  LIGTAVVGDEIVAAIPVCAPWMALGQYKYRAKLQPGPLKKGKAVK 1009


>gi|240275734|gb|EER39247.1| DUF814 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 1183

 Score =  346 bits (887), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 325/1103 (29%), Positives = 492/1103 (44%), Gaps = 210/1103 (19%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 58   LIVDTGFRCHLTRYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 116

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 117  H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 157

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLG-GQKGGKSFDLSKNSNKNSNDGARAK 236
               LT+ +  +   P  + E   +    SK+  G  +  GK            N  A+ K
Sbjct: 158  QYVLTNKQNYNGVPPLSI-ERLRDALEKSKDVTGPAEAAGK------------NKRAKKK 204

Query: 237  QP-TLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
            Q   L+  +  +LG   Y P L EH       DT L P  +L E  KL +  +  LV+A 
Sbjct: 205  QAEALRRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPE-QLVEDPKLAEKLMVALVVA- 260

Query: 289  AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----YDEFCPLLLNQFR 344
               E+    + + +  P GYI+ + +    +    +S   +++    Y +F P    QF 
Sbjct: 261  ---ENVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPFEPKQFE 316

Query: 345  SR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
            S      ++F+TF  A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+
Sbjct: 317  SEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKE 376

Query: 402  EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
              +  ++ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   
Sbjct: 377  AQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLP 436

Query: 461  LYLERNCMSLLLSNNLDEMD-------------------------------DEEKTLPVE 489
            L L  N ++LLL    +  +                                ++   P+ 
Sbjct: 437  LKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKTRQPLL 496

Query: 490  KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTV 545
             +++DL +S  ANAR++YE KK    K+EKT+ +   A K+ EKK     +  + QEK V
Sbjct: 497  SIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALKQEKPV 556

Query: 546  ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
               +                        GRD QQ E++ +R++ +GDV+VHAD+ GA   
Sbjct: 557  LRPTRT-------------------PFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPI 597

Query: 606  VIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
            ++KN    P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKT P GEYL  
Sbjct: 598  IVKNKPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVT 657

Query: 664  GSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEEE-- 709
            G F+I G+KN L P  L++GF ++F++   S+ +H   R               G EE  
Sbjct: 658  GGFVICGEKNHLSPAQLLLGFAVMFQISGESIKNHTKHRVPDETPISESAKDTLGTEELP 717

Query: 710  -GMD---------------DFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPS 753
             G+D                 E  G  +EN +IE   D+    P+    +  N +     
Sbjct: 718  SGLDLETPKYSKINETDHQHQESDGTDQENGEIEQIADNKRTNPLLNDGAESNRSGSESE 777

Query: 754  HTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
              N     S +    D     G D+  F+    V  P   Q+E+L               
Sbjct: 778  EPNIGGNGSQDV---DARYDKGYDNSRFEA---VEVPKLGQMENL--------------- 816

Query: 814  KHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKD 873
                   + + S E +    T      P++   ERR LK G         +E+   R  D
Sbjct: 817  ------PKEEASSEPQTDSITVQPAKHPFVR--ERRLLKNG--------IIEQVPARLTD 860

Query: 874  ASSQPESIV--RKTKIEGGKIS-----RGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
             +S   + V  R +    G  +     RG++GK KK+  KY  QDEE+R + + LL S  
Sbjct: 861  PASHSATNVPSRSSTPSIGASTATPNIRGKRGKNKKIATKYQHQDEEDRELALRLLGSDS 920

Query: 927  KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDN 986
            K      D   E A   K K   ++ ++A K   + ++A H          D +   E  
Sbjct: 921  K-----PDKLREAA---KRKADRLAELEAQK---QRRRAQH----------DRAAQAERE 959

Query: 987  PCVGLDETAEMDKVAMEEEDIH-EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYS 1045
                L + AE      + +    ++  +    L+ +  L G P+  D ++  IPVC P++
Sbjct: 960  RQKALQQQAETQAGGDDADGGDTQLDADTAADLSCLPSLIGTPVAGDEIVAAIPVCAPWT 1019

Query: 1046 AVQSYKYRVKIIPGTAKKGKGIQ 1068
            A+  YKYR K+ PGT KKGK ++
Sbjct: 1020 ALSQYKYRAKLQPGTVKKGKAVK 1042


>gi|452000540|gb|EMD93001.1| hypothetical protein COCHEDRAFT_1172752 [Cochliobolus heterostrophus
            C5]
          Length = 1128

 Score =  346 bits (887), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 292/857 (34%), Positives = 427/857 (49%), Gaps = 126/857 (14%)

Query: 288  VAKFEDWLQDV--ISGDIVP----EGYILMQ-NKHLGKDHPPTESGSSTQ-IYDEFCPLL 339
            V K  D LQD   I+ +I      +GYIL + N    K  P  ES    + +YD+F P  
Sbjct: 241  VEKLVDVLQDARKITDEITKTDRIKGYILAKPNPSASK--PDDESSDKPRFLYDDFHPFR 298

Query: 340  LNQFRSRE--FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVH 397
              QF + +  F++F+ F+ A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+ 
Sbjct: 299  PQQFENTDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIG 358

Query: 398  TLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGL 457
             L+Q  + + + AE I  N+  V  A  AV   +   M W D+ R+++ E+ +GN VA L
Sbjct: 359  GLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQL 418

Query: 458  ID-KLYLERNCMSLLL---------------------SNNLDEMDDE-EKTLPVEKV--- 491
            I   L L  N ++LLL                     S + D+ DD   KT P + V   
Sbjct: 419  IRLPLKLHENTITLLLNETNWEEGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARP 478

Query: 492  ----EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
                ++DL LSA AN+  +++ KK    K+ +T+ A +KA K+ EKK     +  + QEK
Sbjct: 479  QLAIDIDLGLSAWANSTEYFDQKKTAADKEGRTLQASTKALKSHEKKVAEDLKKGLKQEK 538

Query: 544  TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
             V  +  +RK HWFEKF +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA 
Sbjct: 539  EV--LRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAM 596

Query: 604  STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
              +IKN    P+ P+PP TL+QAG  ++C S AWDSK V SAWWV   QVSKT  TGE+L
Sbjct: 597  PMIIKNKPDTPDAPIPPSTLSQAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFL 656

Query: 662  TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH----LNERRVRGEEEGMDDFEDS 717
              G F I+GKK FLPP  L++G  ++F + +SS  +H    + E  V   E  M D +  
Sbjct: 657  PAGMFNIKGKKEFLPPAQLVVGLAVMFEISDSSKANHHKHRVQETAVSAAE--MTD-QPG 713

Query: 718  GHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAP--SHTNASNVDSHEFPAEDKTISNG 775
               KE +  ++++ + DE P A+  S      P     HT  S+ +S          SN 
Sbjct: 714  NESKEAAATKTDESNDDEFPDAKFDSDSEDDFPDAKMEHTEESDAESEAAAPR----SNP 769

Query: 776  IDSKIFDIARNVAAPVTPQLEDLIDRALGLGSA--SISSTKHGI----ETTQFDLSEEDK 829
            + S      RN A   + + ++L+   +G G A  +    K+G+    E  + + S  D 
Sbjct: 770  LQSST----RN-AKEDSGEEDELV---VGKGDAEHAKPGEKNGVVAKKEPPEDEGSIADT 821

Query: 830  HVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPE-------SIV 882
                 +T R K  +S  ERR  +KGQ   +  P+V  +     D + Q E       S  
Sbjct: 822  EPISKSTGRGK--LSARERRLARKGQLPEL--PQVPSDTVPAVDGADQDEGDSAEGGSAK 877

Query: 883  RKTKIEG-----------GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKN 931
              TK++G             + RG++ K KK   KY  QDEE+R + M LL S       
Sbjct: 878  APTKVDGTVTSQMNKQKNAPLPRGKRAKAKKQAAKYAAQDEEDRELAMRLLGS------K 931

Query: 932  DGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGL 991
             G    E A+  K +K   +  D  +     ++  HL          +    E+     L
Sbjct: 932  TGQQAAEAAAQEKRQKEEQAQADKQR-----RREQHLRAQA------AGKAAEEARLRAL 980

Query: 992  DETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYK 1051
            +            ED  E  E  K  L ++D  TG PLP+D L+  IPVC P+SA+ +YK
Sbjct: 981  ENA----------EDDDEGDEVLKTNLQNLDAFTGRPLPNDELISAIPVCAPWSALSTYK 1030

Query: 1052 YRVKIIPGTAKKGKGIQ 1068
            Y+ K+ PG+ K+GK ++
Sbjct: 1031 YKAKMQPGSTKRGKAVK 1047



 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/145 (40%), Positives = 86/145 (59%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PSGF  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 53  DSGFRCHLTEYARTTAAAPSGFVAKLRKYLKTRRVTSISQIGTDRILEFQFSDGL--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE YA GNI+LTD++  VL+LLR+
Sbjct: 111 YLEFYAGGNIILTDADLNVLSLLRN 135


>gi|213403135|ref|XP_002172340.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
 gi|212000387|gb|EEB06047.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
          Length = 1013

 Score =  344 bits (882), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 249/739 (33%), Positives = 387/739 (52%), Gaps = 85/739 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  DV+A    L+ RL+G R +N+YDL+ +T++ K     G  +  ES    +++
Sbjct: 1   MKQRFSALDVSAITAELKDRLLGCRLNNIYDLNARTFLLKF----GKQDVKES----VII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN---- 116
           ESG R+H T + R+     SGF  KLRKH+++RRL ++ QL  DR+++F FG G N    
Sbjct: 53  ESGARVHATKFQRNPAPL-SGFVTKLRKHLKSRRLTNLYQLRSDRVVVFTFGGGENDSDP 111

Query: 117 --AHYVILELYAQGNILLTDSEFTVLTLLRS-HRDDDKGVAIMSRHRYPTEICRVFERTT 173
              +Y++ E +A GNILL D  F +L+LLR    D ++  A+  R+              
Sbjct: 112 AWTYYLVCEFFAAGNILLLDGSFKILSLLRVVTFDKNQFYAVGQRY-------------- 157

Query: 174 ASKLHAALTSSKEPDANEP-----DKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
              L+ ALT ++   + E      D++ E    V++ S  N                 K 
Sbjct: 158 --DLNDALTEAQRTISMESLSLLLDQITEQEKAVADVSPTNE-----------EVKDTKK 204

Query: 229 SNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
           SN   + K  TL+  L   LG YG AL EH I  + L P M  S+    E+   ++L  A
Sbjct: 205 SNKSKKPKVTTLRKALTIRLGRYGNALIEHCIRLSQLDPLMLASDFKNDEEKKKELLE-A 263

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQ 342
             + +  + D     I  +GYI    + + K    T +  + Q+     +  F PL L Q
Sbjct: 264 FHEADKIMNDATKPPI--KGYIFGLQQDIIKSGEETGAQKTEQVLMYEDFHPFKPLQLLQ 321

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
             +R  ++F +++  +DEF+S +ESQ+ E+Q+  +      ++     D EN++  L++ 
Sbjct: 322 NNNRTCIEFPSYNECVDEFFSSLESQKIEKQNHDRLKTFAKRIENAKRDVENKLKELQKA 381

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
            + S K A+ IE N + V+ AI  V   +   M W D+ +++  +++  +  A +I   L
Sbjct: 382 QELSEKKAQAIELNPQLVEGAIEYVNSLVGQAMDWLDIEKLITVQQRRQHAFASVIRLPL 441

Query: 462 YLERNCMSLLLSN-NLDEMDDEEK--------------TLPVEK---------VEVDLAL 497
            L++N ++L+L + N   +D+E +                PV++         VEVDLAL
Sbjct: 442 QLKKNLITLVLPDPNPLAVDEESEQSESESDSEPESTIITPVQRRLIQPKGLAVEVDLAL 501

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKVH 555
            A ANAR  Y  ++    K+EKTI + SKA K  +K+    L+    +    ++  R+  
Sbjct: 502 GAFANARVHYNNRRLAALKEEKTIESSSKAIKNTQKRAEADLKTAAAEAKQALTASRRTF 561

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           +FEKF+WFISS+ YLV+ GRD QQ E++ ++Y +KGDVYV ADL  +SS +IKN     P
Sbjct: 562 FFEKFHWFISSDGYLVLGGRDNQQRELLYEKYCNKGDVYVSADLPNSSSVIIKNRNENDP 621

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL QAG   +  S+AWD+K V SAWWV  H VSK   T + L  G F I  +KN+L
Sbjct: 622 IPPNTLQQAGALALATSKAWDTKTVISAWWVPIHAVSKVDQTKQILPTGHFWINEEKNYL 681

Query: 676 PPHPLIMGFGLLFRLDESS 694
           PP  L+MG+G+L+ LDE S
Sbjct: 682 PPTNLVMGYGILWFLDEVS 700



 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 1/85 (1%)

Query: 982  GVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDV-DYLTGNPLPSDILLYVIPV 1040
            G E+ P    ++ AE  +V ++      +  E+   +N+  DYL+      D +LY +P+
Sbjct: 870  GKEELPAQQHEKQAERTRVLVDMPTQTFLSAEQLAEVNEARDYLSPELSEKDKVLYAVPI 929

Query: 1041 CGPYSAVQSYKYRVKIIPGTAKKGK 1065
              PYS +  + Y++KI PG+AK GK
Sbjct: 930  FMPYSGMNKFTYKIKIQPGSAKVGK 954


>gi|391869409|gb|EIT78607.1| putative RNA-binding protein [Aspergillus oryzae 3.042]
          Length = 1103

 Score =  342 bits (876), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 230/750 (30%), Positives = 373/750 (49%), Gaps = 115/750 (15%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM   ++
Sbjct: 53  DSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGMYHMFL 112

Query: 121 ----------------ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
                           IL LY Q ++   +     +    +++ +  G+  ++  R    
Sbjct: 113 EFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYYGIPEITLDRI--- 169

Query: 165 ICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKN 224
                 R T  K  A                 EDG                       K 
Sbjct: 170 ------RETLEKAKALF-------------AREDG---------------------APKK 189

Query: 225 SNKNSNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
           S K + D        L+  L +    Y P L +H  +   + P   L +V  L+D ++  
Sbjct: 190 SKKKNAD-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQ 240

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYILMQN------KHLGKDHPPTESGSSTQIYDEFCP 337
            V  V +        +S      GYI+ ++      +   ++  P+E+G+   +Y++F P
Sbjct: 241 EVNGVLQEAQNENTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNL--LYEDFHP 298

Query: 338 LLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
               QF  +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E 
Sbjct: 299 FKPRQFEGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEK 358

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           ++  LK++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPV
Sbjct: 359 KIGALKEQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPV 418

Query: 455 AGLID-KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVE 492
           A +I   L L  N ++LLL    DE D+                    E +  P V  ++
Sbjct: 419 ARIIKLPLKLHENTITLLLGEAGDEQDEGDELFSSDESEKSEDEQDNGESQQPPSVLTID 478

Query: 493 VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISH 550
           +DL +S  ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +  
Sbjct: 479 IDLGISPWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQ 538

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            R+  WFEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN 
Sbjct: 539 TRQPFWFEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNR 598

Query: 611 R--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
              P  P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F++
Sbjct: 599 SKDPTAPIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLV 658

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           +G+KNFL P  L++GFG+ F++ + SL +H
Sbjct: 659 KGEKNFLAPSQLVLGFGVTFQISKDSLKNH 688



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 34/51 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +  L G P P D +L  IPVC P+SA+  Y+Y+VK+ PGT KKGK ++
Sbjct: 959  LEWIPALIGTPRPEDEILAAIPVCAPWSALSRYRYKVKLQPGTVKKGKAVK 1009


>gi|71411706|ref|XP_808091.1| hypothetical protein Tc00.1047053507483.60 [Trypanosoma cruzi
           strain CL Brener]
 gi|70872222|gb|EAN86240.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1081

 Score =  341 bits (875), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 242/746 (32%), Positives = 379/746 (50%), Gaps = 107/746 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESGVR+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGVRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +  +               + N   Q+    F               A   
Sbjct: 169 AQSESGKEKEEEQ--------------RRTNALRQEWHTVF------------ARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKH---------------LGKDHPPTES----------GSSTQI-- 331
           I+   +P G  L+ N                 +G+D    E           GS  Q+  
Sbjct: 257 ITFSPLPGGGYLISNHRQRKDSRKGGQEASSKIGEDKSQAEEEKSVNANVADGSQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VQYDDFSPVLLAQYSSDGVVMSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D   R++ L+ E   + +  E I  N+  +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHLRRLNALEMEEQENQRKGECIIQNVVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK--TLPVEKVEVDLALSAHANARR 505
              G+PVA ++  L+LERN +S+L+ +N  E + EE     P+  +EV+L+ +A+ANA  
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPM-VIEVELSKTAYANATT 495

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           ++   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +
Sbjct: 496 YFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRT 555

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHR 611
           S    V+ G+D Q  E++V+R M  GDV+VH D+ GA   +++                 
Sbjct: 556 SCGDFVLQGKDLQTTEILVRRVMQLGDVFVHCDVDGALPCLLRPIGSAWATAFVEDVEGD 615

Query: 612 PEQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
           P++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL   
Sbjct: 616 PQEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL--- 672

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL 690
                G+K++L P P+    GLLFR+
Sbjct: 673 ---FDGEKHYLRPQPITFACGLLFRV 695



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 1023 YLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            Y T  P P D + Y + VC P S V  YKYR ++  G AKKG   Q+  SL    L++T
Sbjct: 989  YFTSQPKPMDNIEYALAVCAPMSCVIPYKYRAELSFGNAKKG---QVTTSLQGHFLAMT 1044


>gi|118350963|ref|XP_001008760.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89290527|gb|EAR88515.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 1213

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 204/532 (38%), Positives = 307/532 (57%), Gaps = 60/532 (11%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI--SGDIVPEG 307
           + P + +HII   GL PN K++    + D AI      + +  D  +D+I      V +G
Sbjct: 272 HNPVI-DHIISSNGLNPNQKVT----VADVAI------IKQMADKCKDLILDFQKTVHQG 320

Query: 308 YILMQNKHLGKDHPPTESGSSTQ-----------------------IYDEFCPLLLNQFR 344
           Y+++ +K   K  P  +     +                        Y +F PL L    
Sbjct: 321 YLIVSDKKEVKHRPNKQEQQQIEGAQNNDEIPTEKAKEEKKEEEKEKYFDFSPLYLTCHE 380

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
            ++F++  +F+A++D+++ ++ +Q+ +++    E  A+ K   I  DQ NR+  LK E +
Sbjct: 381 GKKFIENNSFNASVDKYF-QVMAQKIQEEQNDVESIAWKKYENIKNDQLNRIQKLKNEQE 439

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             V  A+LIE N++ VDA I  ++   ++  SW+ + +M+ E +K G+P+A LI  L  E
Sbjct: 440 EYVVKAQLIEMNIDYVDAIINIIKTLKSSGESWDKITKMINEGKKNGDPMAYLIHSLDFE 499

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
            N +S+LL +  D+M  EE T+    V +D+A SAH NAR +YE KKK   K++KT+ A 
Sbjct: 500 NNEISVLLGDPCDDM--EEYTV----VAIDIAYSAHQNARNYYENKKKNIVKEKKTLDAS 553

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
             A K AEK    +I   K   N+ + RK +WFEKF WFISSENYLVISGRD QQNE+IV
Sbjct: 554 KLALKQAEKTALKEIENLKLKNNVVNTRKQYWFEKFYWFISSENYLVISGRDMQQNEIIV 613

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           K+YM KGD+Y+HAD HGA+ST+IKN   + PV   T+ +A   T+C S+AW++K++ SAW
Sbjct: 614 KKYMRKGDIYMHADFHGAASTIIKNPFKDIPVSQQTIEEAAIATICRSKAWEAKIIASAW 673

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WVY HQVSK A TGEYL  GSFMIRGKKNF+ P  + MG  LL++LD+  +  HLN+RR 
Sbjct: 674 WVYDHQVSKRAETGEYLPSGSFMIRGKKNFIYPARMEMGCTLLYKLDDQFVEKHLNDRRR 733

Query: 705 RGEE-----------EGMDDFEDSGHH-KENSDIESEKDD-----TDEKPVA 739
           + ++           +  +DF+++    + N  +ES++ D      +E P A
Sbjct: 734 KDKDDNTTTVSGVQIDNQNDFDETNFEIRPNMQLESQQSDQGVSIVNEDPFA 785


>gi|344257308|gb|EGW13412.1| Serologically defined colon cancer antigen 1-like [Cricetulus
           griseus]
          Length = 554

 Score =  341 bits (874), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 168/327 (51%), Positives = 225/327 (68%), Gaps = 31/327 (9%)

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N 
Sbjct: 2   NLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNP 61

Query: 476 --LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
             L E +D++    VE                             V+VDL+LSA+ANA++
Sbjct: 62  YLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 121

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 122 YYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 181

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 182 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 240

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 241 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 300

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E ++
Sbjct: 301 FLFKVDESCIWRHRGERKVRAQDEDIE 327


>gi|346325475|gb|EGX95072.1| serologically defined colon cancer antigen 1 [Cordyceps militaris
           CM01]
          Length = 1048

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 254/798 (31%), Positives = 392/798 (49%), Gaps = 99/798 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +N+YDLS K  +FK    +         K  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELNQSLTSLRVANIYDLSTKILLFKFAKPN--------TKKQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G R HTT YAR     PS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DIGFRCHTTEYARATAGIPSVFVARLRKVLKTRRLTSVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+   +L + R+  + D         + P ++           L  +
Sbjct: 111 FLEFFASGNVILTDANLKILAIFRNVLEGD--------GQEPQKVG----------LQYS 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L  S++     P+   E       A+ E +   KG  S    K  N        A +  L
Sbjct: 153 L-ESRQNFLGIPELSQERVRTALTAAVETVSATKGHHSKPAPKQGN--------ALRKCL 203

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              + E     P + +H++     DT L P   L + + L       LV  + K  + L 
Sbjct: 204 AVSITE---LPPIIVDHVLQANDFDTSLKPETILEDASLLSS-----LVENLRKARE-LV 254

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSR---EFV 349
             I+      G+I  + K   + + PTE  SS      +YD+F P +  +F++    E +
Sbjct: 255 GAITSSPSCTGFIFAK-KPAQEQNLPTEDTSSEAKAGLLYDDFHPFVPQKFQNNSKIEIL 313

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +FE F+  +D+F+S +E Q+ + +   +E AA  KL+    DQENR+  L+     + + 
Sbjct: 314 RFEGFNRTVDDFFSSLEGQKLQSRVVEREAAAQRKLDAAKQDQENRLKGLQTSQSDNFRK 373

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N+E V  A+ ++   LA  M W D+ ++V  E+K  N VA LI   L L  N +
Sbjct: 374 AAAIEANIERVQEAMDSINGLLAQGMDWVDIGKLVAREQKKNNAVANLICLPLSLADNVI 433

Query: 469 SLLLSNNLD---------EMDDE----EKTLPVEK-----------VEVDLALSAHANAR 504
           S+ LS   D         E DD     E  L   K           VE+ L LS  +NAR
Sbjct: 434 SIRLSEEDDAGSEVEDPFETDDSDADSETDLNAAKSVQNYSDKTIIVELTLTLSPWSNAR 493

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKVHWFEKF 560
            +Y+ +K    K+EKT     +A K+ E+K +      + QEK +  +  +R + WFEKF
Sbjct: 494 EYYDQRKTAVVKEEKTQLQADRAIKSTEQKIKHDLKRALKQEKAL--LQPIRNLMWFEKF 551

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP--VPP 618
            WFISS+ YLV+  +D  Q E++ +R++  GD++ HAD + A+  ++KN+   +   + P
Sbjct: 552 YWFISSDGYLVVGAKDKSQAEILYRRHLGSGDIFCHADANNAAIVIVKNNSNTEDAHIAP 611

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG  ++C S+AWDSK    AWWV   QVSK+ PTG+ L  G+F I G+KNFLPP 
Sbjct: 612 ATLAQAGQLSICSSEAWDSKAGIGAWWVNSSQVSKSTPTGDILQPGNFNISGEKNFLPPG 671

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEE--GMDDFEDSGHHKENSDI----ESEKDD 732
            LI+G  ++F++ E S   H N+ R++  +E  G    E   + K+++ I    +   D+
Sbjct: 672 QLILGLSIMFKISEES-EIHHNKHRIQDGDETAGAPGRETETNSKQDTSIMDMNQESSDE 730

Query: 733 TDEKPVAESLSVPNSAHP 750
            DE    +    P  A+P
Sbjct: 731 EDEGDYKDGDKQPTRANP 748



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 30/48 (62%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            ++ L G P P D +L  + +C P++A+   KY+ K+ PG  KKGK ++
Sbjct: 918  INLLVGTPRPGDEILEAVVICAPWAALSRSKYKFKLQPGATKKGKAVK 965


>gi|71413048|ref|XP_808681.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70872935|gb|EAN86830.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1082

 Score =  339 bits (870), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 237/746 (31%), Positives = 377/746 (50%), Gaps = 107/746 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESG+R+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGIRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +                        Q+  K+         ++     A   
Sbjct: 169 AQSESGKEKEEE----------------------QRRTKAL----QQEWHTVFARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--------------------------- 331
           I+   +P G  L+ N    K+       +S++I                           
Sbjct: 257 ITFSPLPGGGYLISNHRQRKESRKGGQEASSKIEEDKSQAEEEKSMNVNVADESQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VKYDDFSPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++ L+ E   + +  E I  N   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK--TLPVEKVEVDLALSAHANARR 505
              G+PVA ++  L+LERN +S+L+ +N  E + EE     P+  +EV+L+ +A+ANA  
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPM-VIEVELSKTAYANATT 495

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           ++   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +
Sbjct: 496 YFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRT 555

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHR 611
           S    V+ G+D Q  E++++R M  GDV+VH D+ GA   V++                 
Sbjct: 556 SCGDFVLQGKDLQTTEILIRRVMQLGDVFVHCDVDGALPCVLRPIGSAWTTAFVEDVEGD 615

Query: 612 PEQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
           P++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL   
Sbjct: 616 PQEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL--- 672

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL 690
                G+K++L P P+    GLLFR+
Sbjct: 673 ---FDGEKHYLRPQPITFACGLLFRV 695



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 1023 YLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            Y T  P P D + Y + VC P S V SYKYR ++  G AKKG   Q+  SL    L++T
Sbjct: 990  YFTSQPKPMDNIEYALAVCAPMSCVISYKYRAELSFGNAKKG---QVTTSLQGHFLAMT 1045


>gi|32565397|ref|NP_497411.2| Protein Y82E9BR.18 [Caenorhabditis elegans]
 gi|373220360|emb|CCD73050.1| Protein Y82E9BR.18 [Caenorhabditis elegans]
          Length = 921

 Score =  339 bits (869), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 177/411 (43%), Positives = 255/411 (62%), Gaps = 12/411 (2%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           QIY +F P+ + +F ++   +  +F  A+DEFYS+IE+Q+ EQ+    E  A  KL  + 
Sbjct: 271 QIYQDFNPISM-EFTAKLSKELSSFCEAVDEFYSRIETQKQEQKAVNMEKQALKKLENVE 329

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
            DQ++R+  L+    +  +MA  I  N E V+ A+L +R ALAN+ SW+ +  M K    
Sbjct: 330 KDQKDRIEALQLTQSQREQMANRIILNTELVEKALLLIRSALANQFSWQTIEEMRKTAAG 389

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
            G+PVA  ID    E N   + L+   D  DDE + L   KV +D++L+A  NA+R +  
Sbjct: 390 NGDPVAKSIDSFKFENNEFMMSLA---DPYDDEAEVL---KVPIDISLNASKNAQRHFVD 443

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           KK    K +KT+ +  KA K A++K +  + Q K V  +   RK  WFEKF WFISSE +
Sbjct: 444 KKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIVVEVKKSRKSMWFEKFRWFISSEGF 503

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           +V++GRDAQQNE++VK+Y+   D+Y+HAD+ GASS VI+N   +  +PP TL +A    V
Sbjct: 504 IVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVVIRNKSFDAEIPPKTLTEAAQMAV 563

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AW++ +  SAWWV+P QVS+TAPTGEYL  GSFMIRGKKNF+PP  L+MG G+LFR
Sbjct: 564 CYSNAWEATVTASAWWVHPDQVSRTAPTGEYLPSGSFMIRGKKNFMPPSQLVMGLGILFR 623

Query: 690 LDESSLGSHLNERRVRGEEEGMDD---FEDSGHHKENSDIESEKDDTDEKP 737
           +DE S+  H+   + + EE+  +D    EDS   K+ + I     + DE P
Sbjct: 624 MDEESIERHVALEKSKAEEKSEEDGEKMEDSP--KKTAKIPENPAENDEFP 672



 Score =  132 bits (333), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 67/160 (41%), Positives = 94/160 (58%), Gaps = 8/160 (5%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLEGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHDWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELTFGTEDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           +ELY +GN++LTD E T+L +LR   D D  V    R ++
Sbjct: 113 VELYDRGNVVLTDQELTILNILRVRTDKDTSVRWAVREKF 152



 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 37/57 (64%), Gaps = 4/57 (7%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            L+ +  LT  PL  D LL+ +PV  PYSA+ +YKYRVKI PG  K+GK     I++F
Sbjct: 826  LSILTTLTAQPLDEDTLLFAVPVVAPYSALSTYKYRVKITPGIGKRGKATKSAIELF 882


>gi|407846065|gb|EKG02413.1| hypothetical protein TCSYLVIO_006562 [Trypanosoma cruzi]
          Length = 1080

 Score =  338 bits (868), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 238/746 (31%), Positives = 377/746 (50%), Gaps = 107/746 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESG+R+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGIRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +                        Q+  K+         ++     A   
Sbjct: 169 AQSESGKEKEEE----------------------QRRTKAL----RQEWHTVFARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--------------------------- 331
           I+   +P G  L+ N    K+       +S++I                           
Sbjct: 257 ITFSPLPGGGYLISNHRQRKESRKGGQEASSKIEEDKSQAEVEKSVNVNVAEESQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VQYDDFTPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++ L+ E   + +  E I  N   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK--TLPVEKVEVDLALSAHANARR 505
              G+PVA ++  L+LERN +S+L+ +N  E + EE     P+  +EV+L+ +A+ANA  
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPM-VIEVELSKTAYANATT 495

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           ++   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +
Sbjct: 496 YFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRT 555

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHR 611
           S    V+ G+D Q  E++V+R M  GDV+VH D+ GA   V++                 
Sbjct: 556 SCGDFVLQGKDLQTTEILVRRVMQLGDVFVHCDVDGALPCVLRPIGSAWTTAFVEDVEGD 615

Query: 612 PEQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
           P++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL   
Sbjct: 616 PQEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL--- 672

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL 690
                G+K++L P P+    GLLFR+
Sbjct: 673 ---FDGEKHYLRPQPVTFACGLLFRV 695



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 94/218 (43%), Gaps = 31/218 (14%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS--AGKVQKNDGDPQNENASTHKEKKP 948
            ++++ Q+ KLKK++EKY DQDEE+R +  ALL      KVQ      Q +    H+   P
Sbjct: 830  QLTKHQRRKLKKIQEKYKDQDEEDR-LYGALLNGNQMSKVQLGVLALQRKKEKRHELFPP 888

Query: 949  AI---SPVDAPKVCYKCKKAGHLSKD----CKEHPDDSSHGVEDNPCVGLDETAEMDKVA 1001
                    D  +     +    + +D       H + S   + +N   G +   E ++  
Sbjct: 889  KTFEEKNFDEKQEEEVEEVTEFIDEDKSGETNSHNESSISLLPNNSVDGKEGQKEEEEEE 948

Query: 1002 MEEEDIHEIGE------------EEKGRLNDVD------YLTGNPLPSDILLYVIPVCGP 1043
              E + H  G+            EE    ND +      Y T  P P D + Y + VC P
Sbjct: 949  EVENEKHNAGQPQSKTRAVATSVEESCIANDEELRREWQYFTSQPKPMDNIEYALAVCAP 1008

Query: 1044 YSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
             S V SYKYR ++  G AKKG   Q+  SL    L++T
Sbjct: 1009 MSCVISYKYRAELSFGNAKKG---QVTTSLQGHFLTMT 1043


>gi|116193227|ref|XP_001222426.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
 gi|88182244|gb|EAQ89712.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
          Length = 1115

 Score =  338 bits (866), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 247/727 (33%), Positives = 366/727 (50%), Gaps = 99/727 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDL+ K  + K        +        +L+ESG R H T +AR     PS
Sbjct: 26  LVSLRLANIYDLNSKILLLKFAKPDNRQQ--------VLIESGFRCHLTDFARAAAPAPS 77

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII F+F  G  A+ + LE +A GN++LTD++  +L
Sbjct: 78  AFVARLRKFLKTRRVTGVSQIGTDRIIEFRFSDG--AYRLYLEFFAGGNVILTDADLKIL 135

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR        +    + + P  +   +       L      +KE       ++ +   
Sbjct: 136 ALLR--------IVPEGKGQEPQRVGLTYSLENRQNLGGVPPLTKE-------RLRDALT 180

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIIL 260
            V+  +      +K G             +DG R    T  T L       P L +H+  
Sbjct: 181 TVTAQAATEKAKKKKG-------------SDGLRRGIVTTITELP------PVLIDHVFR 221

Query: 261 DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH 320
             G  P    +EV  L D ++   +    +    + D ++     +GYI+       K +
Sbjct: 222 LRGFNPTTTPTEV--LNDESLFNALFGSLEEARSISDEVTSSPTAKGYII------AKPN 273

Query: 321 PPT-------------ESGSSTQIYDEFCPLLLNQF---RSREFVKFETFDAALDEFYSK 364
           P T             +  +   +Y++F P L  QF   R  E + F+ ++  +D F+S 
Sbjct: 274 PRTAELLKEGEEEEGQKEKARNLLYEDFQPFLPKQFEDIRDCEILSFDGYNKTVDNFFSS 333

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +E Q+ E + + +E  A  KL     DQ  R+  L+     +++ A  +E N+E V  A+
Sbjct: 334 LEGQKLESRLQEREITAKRKLEAARRDQAQRIEGLQDVQMLNLRKAAAVEANIERVQEAM 393

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLID---KLY---------------LERN 466
            AV   +   M W D+ ++V+ E+K  NPVA +I    KL+                   
Sbjct: 394 DAVNGLIQQGMDWVDINKLVEREQKQHNPVAEMIKLPMKLHESVITLLLGEEEEEGKVEE 453

Query: 467 CMSLLLSNNLDEMDD--EEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTI 521
            M      + +  DD  EEK+   +K   ++++L LS   NAR +Y+ K+    KQEKT+
Sbjct: 454 EMDFDYDTDEETADDAAEEKSKGPDKRLAIDINLKLSPRNNARYYYDQKRTAADKQEKTV 513

Query: 522 TAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
                A K AE+K     +  + QEK +  +  +RK  WFEKF WF+SS+ YLV+ GRDA
Sbjct: 514 QRSEIALKNAEQKIAEDLKKGLKQEKPI--LQPIRKQMWFEKFTWFVSSDGYLVLGGRDA 571

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAW 635
           QQNE++ KRY+ KGDVYVHAD+HGASS VIKN+   P+ P+PP TL QAG  +VC S AW
Sbjct: 572 QQNEILYKRYLRKGDVYVHADMHGASSVVIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAW 631

Query: 636 DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           DSK    AWWV   QVSK+AP+GEYL VGSFM+RGK+N LPP  L++GFGLLF++ E S 
Sbjct: 632 DSKAAMGAWWVNADQVSKSAPSGEYLPVGSFMVRGKRNLLPPSLLMLGFGLLFKISEESK 691

Query: 696 GSHLNER 702
             H   R
Sbjct: 692 SRHGKHR 698



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 34/51 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +D L G PLP D +L V+PVC P++A+   KY+ K+ PG  KKGK ++
Sbjct: 951  LAPLDALVGTPLPGDEILEVVPVCAPWNALARLKYKAKLQPGHVKKGKAVK 1001


>gi|156083749|ref|XP_001609358.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796609|gb|EDO05790.1| conserved hypothetical protein [Babesia bovis]
          Length = 1006

 Score =  337 bits (865), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 302/1107 (27%), Positives = 500/1107 (45%), Gaps = 192/1107 (17%)

Query: 1    MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
            MV+ R+N  DVAA V  LR +++     N+YD++ + Y+ K         S   +K  +L
Sbjct: 1    MVRERLNAVDVAAVVGNLRSQILDYNLVNIYDVTSRVYVLKF--------SRNEDKRFVL 52

Query: 60   MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
             E G R+HTT + R     PS F +KLRKH+RTR+L  + Q+  DR++ F F  G  A++
Sbjct: 53   FEIGHRIHTTQFLRTTDKLPSNFNVKLRKHLRTRKLRGIYQIAQDRVVDFTFSSGEYAYH 112

Query: 120  VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
            +I++L+  GN+ LTD  + VLT+LR     D    +   +  P          + + +  
Sbjct: 113  LIVQLFLPGNVYLTDYSYKVLTVLRPQNAGDSFFRVGETYGIPEASVPWNIPVSPAVIDG 172

Query: 180  ALTSSKEPDANEPDKVNEDGN-NVSNASKENLGGQKGGKSFDLSKNSNKNSND------- 231
             L+                GN + SN+ K+    +   ++ D SK S  N +D       
Sbjct: 173  ILSGMGH------------GNVDASNSQKKVTNSRGKPETGDSSKQSIVNGSDQGDYLDI 220

Query: 232  GARAKQPTLKTVLGEALGYGPALSEHII---LDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
            G+  K  ++  +L       P+++  ++   L   +  ++  S+V+ +E + I   V A+
Sbjct: 221  GSEFKDRSVSMLLKLIF---PSVTLRMMRYALVKAIGADICDSDVSAVESSTIYTAVEAL 277

Query: 289  AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
                D L + ++ ++   GY+  +          TE       Y++F          R  
Sbjct: 278  RSTLDSLSNPVNLNL---GYLYKKG---------TE-------YEDFGCFDYGDGWER-- 316

Query: 349  VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
              F+ F+ ALD +++K E ++ E++ + K+     KL KI  DQ  R    ++EV R   
Sbjct: 317  --FDDFNMALDAYFTKSELRKIERKEQPKKPI---KLQKIKDDQNRRELEREREVHRLGV 371

Query: 409  MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
               L+E + +  D  +  +R  +A+  SW+++   +  +R +G+ +A  I  + +    +
Sbjct: 372  SIALVEGHRDTFDTVLDLMRSLVASGASWQEITDQLSRQRDSGHLLARHIRSVNIPDRRV 431

Query: 469  SLLLSN-------NLDEMDDEEKTLPVEK-----------VEVDLALSAHANARRWYELK 510
             + L N       N+  M D+      +K           V +D  L+   N    Y  K
Sbjct: 432  DVCLPNDDPGYYTNVTSMGDKRNKRGSKKSQSSDQFDDTSVTLDYGLTCFQNLEIMYSQK 491

Query: 511  KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS--HMRKVHWFEKFNWFISSEN 568
            K+   K E+T   H  A K  +++   Q+ + +   N+S   +RK  WFEKF+WFI+S+ 
Sbjct: 492  KRMAEKLERTRAGHQFALKRVDREKEKQV-KSRGDRNVSLVKVRKRMWFEKFHWFITSDG 550

Query: 569  YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
            +LV+ GRD+ QNE++VKRY++KGD+Y HAD+HGA+S ++KN        P T+++A CF+
Sbjct: 551  FLVLGGRDSTQNELLVKRYLTKGDLYFHADVHGAASCILKNPSGNAESFPNTIDEAACFS 610

Query: 629  VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            +C S AW  KMV  AWWV+ HQVS++AP+GEYL  GSFMIRGKKN++ P  L M  G++F
Sbjct: 611  LCLSSAWSQKMVVPAWWVHHHQVSRSAPSGEYLPHGSFMIRGKKNYVQPQRLEMAIGVVF 670

Query: 689  RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTD----------EKPV 738
             ++       ++E  V  E     D ED+       D+ES++ D            E+PV
Sbjct: 671  HIEVPD----IDEEEV--EAPAGPDTEDAPQ-----DVESDESDASLTVDDLIGHGEEPV 719

Query: 739  AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTP-QLED 797
                 V +   P+       N     F  ++ T           + +    P T    +D
Sbjct: 720  VNDDVVMSDESPSSDDDMLENKRVVRFNLDNDTEPKERVGNFHLLRKGTGYPCTGFNPDD 779

Query: 798  LIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGS 857
            L ++   LG      T       +F   E DK +          +I +A  R L+K   +
Sbjct: 780  LAEKLSALGLIDPDDTDSPESHVRF--IEPDKPI----------HIPEAVER-LRKRLPT 826

Query: 858  SVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNI 917
             ++ PK    K RG                     SR  + K  K ++KYGD DEE + +
Sbjct: 827  GIIAPK----KPRGP--------------------SRLARVKAAKARKKYGDDDEEIQQL 862

Query: 918  RMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD 977
            R  L  S  ++ K+  D             P + PV                      P+
Sbjct: 863  RCQLTGS--RLLKSGID------------TPVVEPV----------------------PE 886

Query: 978  DSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYV 1037
            +S           L       + A++  D  E+       +  +  L+ +P   D++L  
Sbjct: 887  ES-----------LQPKPVFQRQAIQPLDDRELSSH----MRQLRALSKSPSEGDVILSA 931

Query: 1038 IPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
            IP+C PY A++S+ Y +K++PG  KKG
Sbjct: 932  IPMCAPYGALKSHPYHLKLVPGNNKKG 958


>gi|340975808|gb|EGS22923.1| hypothetical protein CTHT_0014010 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 1116

 Score =  336 bits (861), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 242/735 (32%), Positives = 360/735 (48%), Gaps = 129/735 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SN+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSEVLVSLRLSNIYDLNSKILLLKFAKPDCRRQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T +AR     PS F  +LRK ++TRR+  + Q+G DRII FQF  G  A+ +
Sbjct: 53  ESGFRCHLTDFARTAAPAPSAFVARLRKFLKTRRVTRISQIGTDRIIEFQFSDG--AYRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR---------------SHRDDDK----GVAIMSRHRY 161
            LE +A GN++LTD++  +L LLR               ++R D++    GV  ++R R 
Sbjct: 111 YLEFFASGNVILTDADLKILALLRNVPEGEGQEPQRVGLTYRLDNRQNYGGVPALTRER- 169

Query: 162 PTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDL 221
                          L  AL ++ E    +P                             
Sbjct: 170 ---------------LRTALQTAVEQAVKKP----------------------------- 185

Query: 222 SKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI 281
              S K + D  R    T  T L       P L +H+         +K  EV K ED   
Sbjct: 186 ---SKKKAADELRRGLATTITELP------PVLVDHVFQLNKFDSTVKPLEVLKNED-LF 235

Query: 282 QVLVLAVAKFEDWLQDVISGDIVPEGYILMQ-NKHL------GKDHPPTESGSSTQIYDE 334
           + L  A+ +    L ++ S  ++ +GYI+ + N H       G + P     +S+ +Y++
Sbjct: 236 ESLFKALEQGRAILDEITSSPVL-KGYIIAKPNPHAQEQASEGGEAP--NGKASSLLYED 292

Query: 335 FCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           F P L  QF    + E + F+ F+  +DEF+S +E Q+ + + + +E  A  KL     D
Sbjct: 293 FQPFLPKQFEEDPNLEVLTFDGFNKTVDEFFSSLEGQKLQSRLQEREATAKKKLEAARQD 352

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q  R+  L++    +++ A  IE N+E V  A+ AV   L   M W D+ ++V+ E+K  
Sbjct: 353 QAKRIEGLQEAQVLNLRKAAAIEANIERVQEAMDAVNGLLQQGMDWVDINKLVEREQKLH 412

Query: 452 NPVAGLID-KLYLERNCMSLLLSNNLD------------EMDDEEKTLPVEK-------- 490
           NPVA +I   + L  N ++LLL    +            + D+E    P  +        
Sbjct: 413 NPVAEIIKLPMRLHENIITLLLGEEEEEGPEDEEMDFEYDTDEEAANDPQPEKAKGPDKR 472

Query: 491 --VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKT 544
             V+++L LS   NAR +YE K+    K +KTI     A K AE K     +  + QEK 
Sbjct: 473 LAVDINLKLSPWNNAREYYEQKRSAADKAQKTIQQAEIALKNAEMKIAKDLKKDLKQEKP 532

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           +  +  +R+  WFEKF WFISS+ YLV+ GRDAQQNE++ KRY  KGDV+VH+D+ GA++
Sbjct: 533 I--LQPIRQQLWFEKFIWFISSDGYLVLGGRDAQQNEILYKRYFKKGDVFVHSDVKGAAT 590

Query: 605 TVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
            +IKN    P+ P+PP TL QAGC +VC S AWDSK    AWWV   +VSK  PTG+ + 
Sbjct: 591 VIIKNDPKTPDAPIPPATLTQAGCLSVCCSSAWDSKAAMGAWWVTADKVSKLGPTGDPMP 650

Query: 663 VGSFMIRGKKNFLPP 677
            G+FMI G++N L P
Sbjct: 651 EGTFMINGERNPLEP 665



 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 78/175 (44%), Gaps = 27/175 (15%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RGQ+GK KK+  KY  QDEE+R    AL+     V       + E A+  K +    + +
Sbjct: 873  RGQRGKAKKIAAKYRHQDEEDR----ALMEELLGVAAAKAKREAEAAAKAKREAELAAAL 928

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
            +  K   +  +     +   EH                   A   K+  E  D  + G +
Sbjct: 929  ERKKAAQERAR-----RQIAEH------------------EARRQKILRENIDNEDDGAD 965

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
                L  ++ L G PLP D +L V+PVC P+ A+   KY+ KI PG AKKGK ++
Sbjct: 966  AVMDLRVLESLVGTPLPGDEILEVVPVCAPWQALGKVKYKAKIQPGMAKKGKAVK 1020


>gi|425773025|gb|EKV11400.1| hypothetical protein PDIG_50370 [Penicillium digitatum PHI26]
 gi|425782195|gb|EKV20118.1| hypothetical protein PDIP_19610 [Penicillium digitatum Pd1]
          Length = 1107

 Score =  335 bits (859), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 241/748 (32%), Positives = 374/748 (50%), Gaps = 105/748 (14%)

Query: 4   VRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           V++ T ++A+E  C    + +R SN+YDLS + ++FKL             +  L+++SG
Sbjct: 10  VKVITQELASE--C----VNLRVSNIYDLSSRIFLFKLAKPD--------HRRQLIIDSG 55

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R H T Y R    TPS F  +LRK++++RR+  + Q+G DRII   F  G  A+++ LE
Sbjct: 56  FRTHVTQYTRTTATTPSPFVTRLRKYLKSRRITGISQIGTDRIIEISFSDG--AYHIFLE 113

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL-- 181
            +A GNI+LTD E+ +L   R                      +V       ++ A L  
Sbjct: 114 FFAGGNIILTDREYNILAFFR----------------------QVAAGVGQEEIKAGLKY 151

Query: 182 -TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             S+K+     PD +  D    +    + L  Q+G    +  K   K   D        L
Sbjct: 152 TVSNKQNYDGVPD-ITADRVLQTLEKAQGLSAQEG----NAPKKFKKKGTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H+           L +V   +D  +Q +   + +         
Sbjct: 200 RKALSQGFPEYPPLLLDHVFAIKEFDTTTPLDQVIGSQD-LLQAVKEVLEESRRVSNTFD 258

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVK---FETFDA 356
           SG   P GYI+ +          T S ++  +Y++F P    QF ++  +K   FE F+A
Sbjct: 259 SGASHP-GYIVAKEDTRPIPEGETSSKAAGLLYEDFHPFKPRQFENKPGIKILEFERFNA 317

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
            +DE++S +ESQR E +   +E+AA  KL  +  + + R+  LK   +  ++ A  I+ N
Sbjct: 318 TVDEYFSSLESQRLESRLTEREEAAKKKLESVRFEHKKRIDELKNVQELHIRKANAIQDN 377

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN- 474
           +  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N +SLLL   
Sbjct: 378 VYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQDRGNPVAQIIKLPLKLYENTVSLLLGEA 437

Query: 475 -------------------NLDEMDDE----EKTLPVEKVEVDLALSAHANARRWYELKK 511
                              + +E D E    E+   +  +++DL LS  ANA ++Y+ KK
Sbjct: 438 GDDEDEEEEFSSSDESDSDSENEADQETSSAERESKLLTIDIDLGLSPWANASQYYDQKK 497

Query: 512 KQESKQEKTITAHSKAFKAAEKK--TRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           +   K+++T  + +KA K+ EKK  T L+   +K    +   R   WFEKF +FISSE Y
Sbjct: 498 QASEKEQRTTQSSTKALKSHEKKVTTELKRGLKKEKQVLRQARTPFWFEKFVFFISSEGY 557

Query: 570 LVI----------------SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
           LVI                S RDA Q+E++ +RY+SKGD++VHADL GA+  V+KN    
Sbjct: 558 LVIGYVIPLNTVLRHTNPSSARDAMQSELLYRRYLSKGDIFVHADLEGATPIVVKNRAGS 617

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRG 670
            + P+ P TL+QAG   V  S AWDSK V SAWW + HQVSK A  G   +  G F I+G
Sbjct: 618 ADAPISPSTLSQAGNLCVATSTAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKG 677

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           +KNFL P  L++GFG++F++ + S+ +H
Sbjct: 678 EKNFLAPSQLVLGFGIMFQVSQESVRNH 705



 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/230 (25%), Positives = 97/230 (42%), Gaps = 47/230 (20%)

Query: 839  DKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKG 898
            ++P ++  ERR L+  QG S+  P        G++ S+ P              +RG++ 
Sbjct: 835  EEPNLNARERRTLR--QGKSLDRP--------GEEESAAPRIAP----------TRGKRA 874

Query: 899  KLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKV 958
            K K+   KY +QDE+ER + + L+ +                   +E        +A + 
Sbjct: 875  KDKRAAAKYANQDEDERELALRLVGANKGKAAKAAKAAEAKEQRERE-------AEAQRQ 927

Query: 959  CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRL 1018
              + +       + K     + +G +D      +ETA     A E  D           L
Sbjct: 928  RRRAQHERAAEAERKRQAQFTENGTDDYS----EETA-----AAEASD-----------L 967

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
              +  L G P   D ++  IPVC P++A+  YKY+VK+ PGT KKGK ++
Sbjct: 968  TWIPALVGTPTTDDEIIAAIPVCAPWAALGRYKYKVKLQPGTVKKGKAVK 1017


>gi|429858117|gb|ELA32948.1| duf814 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 1040

 Score =  335 bits (858), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 192/488 (39%), Positives = 286/488 (58%), Gaps = 38/488 (7%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM 311
           P L +H    TG   + K +E+   E   +  L++A+ +    ++D  S     +GYI  
Sbjct: 212 PILVDHSFKTTGFDGSKKPAEILDNE-TLLDDLLVALTEARSIVKDATSS-ATAKGYIFA 269

Query: 312 QNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVK---FETFDAALDEFYSKI 365
           + ++   D  P E G + +   +Y++F P L N+F +   +K   F+ F+  +DEF+S +
Sbjct: 270 KYRN-QPDETPAEEGQTKRSDLLYEDFHPFLPNKFANDPTIKVLEFDGFNKTVDEFFSSL 328

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           E Q+ E +   +E AA  KL     DQ  R+  L++    +V+ A  IE N+E V  A+ 
Sbjct: 329 EGQKLESKLSEREAAAKRKLEAARNDQAKRIEGLQEVQSLNVQKATAIEANVERVQEAMD 388

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL------------ 472
           AV   L   M W D++++++ E+K GNPVA +I   L L  N ++LLL            
Sbjct: 389 AVNGLLQQGMDWIDISKLIEREQKRGNPVAEIIKLPLNLADNTITLLLGEEEDIEDEDSN 448

Query: 473 -------SNNLDEM-DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                  S++ DE   +++KT    +V+V++ L+ +ANAR +YE K+    K+EKT+   
Sbjct: 449 YETDSDASDSEDEAASNKQKTAKHLEVDVNIGLTPYANAREYYEQKRSAAKKEEKTVQQT 508

Query: 525 SKAFKAAEKKTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A K AE+K + ++     QEK V  ++ +RK  WFEKF WFIS++ YLV+ G+DAQQN
Sbjct: 509 EIALKNAEQKIQAELRKGLKQEKAV--LAPIRKQIWFEKFIWFISTDGYLVLGGKDAQQN 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           EM+ KRY+ KGDVY+HAD+HGA++ +IKN    P+ P+PP TL QAG   VC S AWDSK
Sbjct: 567 EMLYKRYLRKGDVYIHADIHGAATVIIKNTPSDPDAPIPPSTLAQAGTLAVCSSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
               AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP  L++GFG+++++ E S   H
Sbjct: 627 AGMGAWWVKADQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGFGIMWKISEESKARH 686

Query: 699 LNERRVRG 706
           +  R   G
Sbjct: 687 VKHRLYDG 694



 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 82/145 (56%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQENLVSLRLANVYDLSSKILLLKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSAFVTRLRKFLKTRRLTKVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GN++LTD++  +LTLLR+
Sbjct: 111 FLEFFASGNVILTDADLKILTLLRN 135



 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 50/81 (61%), Gaps = 2/81 (2%)

Query: 993  ETAEMDKV--AMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSY 1050
            E AE ++V   M EE I  +  EE G++  +D L G PLP D +L  IPVC P++A+  +
Sbjct: 881  EIAEHEEVRRLMNEEGIEVLDAEEMGKMTLLDNLVGTPLPGDEILEAIPVCAPWNAMGKF 940

Query: 1051 KYRVKIIPGTAKKGKGIQIFY 1071
            KY+ K+ PG  KKGK ++  +
Sbjct: 941  KYKAKLQPGAVKKGKAVKEVF 961


>gi|325093107|gb|EGC46417.1| DUF814 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 1136

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 261/816 (31%), Positives = 395/816 (48%), Gaps = 136/816 (16%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 66  LIVDTGFRCHLTRYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 124

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 125 H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 165

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLG-GQKGGKSFDLSKNSNKNSNDGARAK 236
              LT+ +  +   P  + E   +    SK+  G  +  GK+    K + K   +  R  
Sbjct: 166 QYVLTNKQNYNGVPPLSI-ERLRDALEKSKDVTGPAEAAGKN----KRAKKKQAEALRR- 219

Query: 237 QPTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                     +LG   Y P L EH       DT L P  +L E  KL +  +  LV+A  
Sbjct: 220 --------AVSLGFPEYPPLLLEHAFHITGFDTSLKPE-QLVEDPKLAEKLMVALVVA-- 268

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----YDEFCPLLLNQFRS 345
             E+    + + +  P GYI+ + +    +    +S   +++    Y +F P    QF S
Sbjct: 269 --ENVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPFEPKQFES 325

Query: 346 R---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
                 ++F+TF  A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+ 
Sbjct: 326 EPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKEA 385

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
            +  ++ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L
Sbjct: 386 QELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLPL 445

Query: 462 YLERNCMSLLLSNNLDEMD-------------------------------DEEKTLPVEK 490
            L  N ++LLL    +  +                                ++   P+  
Sbjct: 446 KLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKTRQPLLS 505

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVA 546
           +++DL +S  ANAR++YE KK    K+EKT+ +   A K+ EKK     +  + QEK V 
Sbjct: 506 IDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALKQEKPV- 564

Query: 547 NISHMRKVHWFEKFNWFISSENYLVI---------------------SGRDAQQNEMIVK 585
            +   R   WFEKF +F+SS+ YLV+                     SGRD QQ E++ +
Sbjct: 565 -LRPTRTPFWFEKFIFFLSSDGYLVLGLVTVLMSCGFLLCFIANCVSSGRDVQQTEILYR 623

Query: 586 RYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           R++ +GDV+VHAD+ GA   ++KN    P+ P+PP TL+QAG   V  S AWDSK V  A
Sbjct: 624 RHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGA 683

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER- 702
           WWV   QVSKT P GEYL  G F+I G+KN L P  L++GF ++F++   S+ +H   R 
Sbjct: 684 WWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSPAQLLLGFAVMFQISGESIKNHTKHRV 743

Query: 703 -----------RVRGEEE---GMDDFEDSGHHKEN-SDIESEKDDTDEKP-VAESLSVPN 746
                         G EE   G+ D E   + K N +D + ++ D  E P + +  ++P 
Sbjct: 744 QDETPISESAKDTLGTEELPSGL-DLETPKYSKINETDHQHQESDAVEVPKLGQMENLPK 802

Query: 747 SAHPAPSHTNASNVD--SHEFPAEDKTISNGIDSKI 780
               +   T++  V    H F  E + + NGI  ++
Sbjct: 803 EEASSEPQTDSITVQPAKHPFVRERRLLKNGIIEQV 838



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 75/175 (42%), Gaps = 51/175 (29%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG++GK KK+  KY  QDEE+R + + LL S  K      D   E A   K K   ++ +
Sbjct: 872  RGKRGKNKKIATKYQHQDEEDRELALRLLGSDSK-----PDKLREAA---KRKADRLAEL 923

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
            +A K   + ++A H                              D+ A  E       E 
Sbjct: 924  EAQK---QRRRAQH------------------------------DRAAQAER------ER 944

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +K      +   G     D ++  IPVC P++A+  YKYR K+ PGT KKGK ++
Sbjct: 945  QKALQQQAETQAGG----DEIVAAIPVCAPWTALSQYKYRAKLQPGTVKKGKAVK 995


>gi|340505619|gb|EGR31934.1| hypothetical protein IMG5_099620 [Ichthyophthirius multifiliis]
          Length = 1423

 Score =  328 bits (841), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 165/380 (43%), Positives = 249/380 (65%), Gaps = 8/380 (2%)

Query: 332  YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
            Y EF PL+LN ++ ++  + E+F+  +++++ K+  +  E+Q +  E  A+ K   I  D
Sbjct: 727  YFEFSPLILNSYQGKQIEQMESFNDCINKYFQKMSKKIEEEQKEDVESIAWKKYLNIKTD 786

Query: 392  QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            QENR+  LK E +  +  A+LIE N +DV+A    ++   ++ ++W+ + +M+ E +K G
Sbjct: 787  QENRIKKLKDEQEEFITKAQLIEENYQDVEAITNILKTMKSSGLAWDKIIKMINEGKKQG 846

Query: 452  NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
            +P+A LI ++  E N +S+ L    D+M +    +PV    VD+  SAH NAR +YE K+
Sbjct: 847  DPLANLIHQIDFENNEVSIYLGFIDDQMSE---LIPVS---VDIYQSAHQNARNYYENKR 900

Query: 512  KQESKQEKTITAHSKAFKAAEKKTRLQI-LQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            K   K++KT+ A   A K AEK    +I  Q+     + ++RK +WFEKF WFI+SENYL
Sbjct: 901  KNVLKEKKTLDATKTALKQAEKTALKEIETQKHKTMQLVNVRKQYWFEKFYWFITSENYL 960

Query: 571  VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN-HRPEQPVPPLTLNQAGCFTV 629
            VISGRD+QQNE++VK+YM KGD+Y+HAD HGA+ST+IKN H+    +   T+ +A   T+
Sbjct: 961  VISGRDSQQNEILVKKYMKKGDIYMHADYHGAASTLIKNPHKDSSFISQQTIEEAAVATI 1020

Query: 630  CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
            C S+AW++K++ SAWWV  HQVSK A TGEYL  GSFMIRGKKNF+ P  + M   +LF+
Sbjct: 1021 CRSKAWEAKIIASAWWVDSHQVSKRAETGEYLPSGSFMIRGKKNFVYPSRMEMACTILFK 1080

Query: 690  LDESSLGSHLNERRVRGEEE 709
            L++ SL  HLN+R+ +  EE
Sbjct: 1081 LNDDSLERHLNDRKRKVNEE 1100


>gi|390356696|ref|XP_001200483.2| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 334

 Score =  326 bits (835), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 161/330 (48%), Positives = 228/330 (69%), Gaps = 5/330 (1%)

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +ESQ+ + +   +E  A  KL+ +  D E R+ +L+Q  + + K   LIE NL  V+ A+
Sbjct: 1   MESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVEQAL 60

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD--- 481
             VR A+AN++ W+++  ++KE +  G+PVA  I  L L+ N   +LL +   + DD   
Sbjct: 61  RVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIRSLRLDTNHFQMLLRDPYKQYDDADE 120

Query: 482 -EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
            EE       V++D+A SA+ANAR+++  KK  + K++KT+ + SKA K+AEKKT   + 
Sbjct: 121 GEEDGARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIKSAEKKTMQALK 180

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
              TVA+I+  RK +WFEK+ W ISSENY++I+GRD QQNE++VK+Y+S GD+YVHAD+H
Sbjct: 181 DVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVKKYLSPGDIYVHADIH 240

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASS +IKN +   PVPP TL +AG   VC+S AWD+K++TSAWWV   QVSKTAPTGE+
Sbjct: 241 GASSVIIKNPKGG-PVPPKTLQEAGTMAVCYSVAWDAKVITSAWWVRHDQVSKTAPTGEF 299

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           LT GSFM+RGKKNFLPP  L+MGFG L ++
Sbjct: 300 LTTGSFMVRGKKNFLPPTQLVMGFGFLMKV 329


>gi|345565416|gb|EGX48366.1| hypothetical protein AOL_s00080g336 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1207

 Score =  325 bits (833), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 244/816 (29%), Positives = 381/816 (46%), Gaps = 145/816 (17%)

Query: 28  NVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLR 87
           N++DLS +T+ FK  +S+  T      K +L+++SG R H T +AR+   +PSGF  KLR
Sbjct: 28  NIHDLSSRTFQFKFTSSATQT------KHILIVDSGFRCHLTNFARNVAASPSGFVEKLR 81

Query: 88  KHIRTRRLEDVRQLGYDRIILFQFGL---------------------------------- 113
           K ++TRR+  +RQ+G DRI+  QFG+                                  
Sbjct: 82  KCLKTRRVTGIRQVGSDRIVELQFGIVGDNAAATTSATTATGGGVGGGEGGAEGGVEIKG 141

Query: 114 --GMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-----PTEIC 166
              +  + +  E +A GNI+LTD+ F ++TLLR   +      I     Y      T   
Sbjct: 142 IPHVGGYRLFFEFFAGGNIILTDASFKIITLLRIVPEGPNQPKIARGETYTISSASTTFG 201

Query: 167 RVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSN 226
            ++  T+ +++  AL S  E   NE  K  +D                  K +   K   
Sbjct: 202 SLYTNTSNAQIKKALKSHLEKRENEEKKGIDDL-----------------KDWQKKKLKK 244

Query: 227 KNSNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLV 285
              +DG       L  VLG  +  +   L EH +L  G+ P++K  EV  + D+AI   V
Sbjct: 245 TRKDDG-------LNRVLGAVMTEFSSTLIEHCLLTVGVDPDLKAGEV--VGDDAIIDKV 295

Query: 286 LAVAKF-EDWLQDVISGDIVPEGYILMQNK------------------------------ 314
               K  E  ++D++    V  G+I+ +                                
Sbjct: 296 AEGFKLAETMVKDIVENKEVI-GWIIAKKPSPKTEKADTEDNGTKSKKNKKKKVAFGDAG 354

Query: 315 ------------HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALD 359
                        L +D  P ++ +S  IYD+F P L  QF+ +     +   T++  +D
Sbjct: 355 IKEAEDELEAMLELDEDITP-QTDASGYIYDDFHPFLPTQFKDKPNVHTIPITTYNKTVD 413

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            F+S IESQ+ EQ+   K+  A  +L     + +N++ +LK   +  V+ A+ IE N+E 
Sbjct: 414 SFFSSIESQKLEQKTAEKKSLAAKRLANARNEHKNKIESLKSAQEVHVRKAQAIEANVER 473

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL------- 472
           V+  I AV   +A  M W ++  +V+ E+ AGN VA +I  +    N + + L       
Sbjct: 474 VEEVIDAVNGLIAQGMDWTEIRSLVEREKSAGNGVAEMIRDVKFMENTVVVRLYEEEEED 533

Query: 473 -------SNNLDEMDDEEKTLPVE-KVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                   +  ++ + EEK       +E+DLAL+ +ANAR +YE K+    K+ KT+ + 
Sbjct: 534 DSDDDDDESGSEDGNGEEKEGRSHLDIEIDLALTGYANARIYYEQKRSAAVKETKTLQSS 593

Query: 525 SKAFKAAEKKTRLQILQEKTV--ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           +KA K+ EKK +  + Q        +  +R+  W+EKF WF SSE YLV+  +D  Q +M
Sbjct: 594 AKALKSTEKKIQKDLKQAYKAEKMELRTLRRQGWWEKFYWFRSSEGYLVLGAKDPTQADM 653

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           + K+Y  KGDV+VHA++ G+   V+KN   +   P+PP TL+QAG   V  S AW+ KMV
Sbjct: 654 LYKKYFKKGDVWVHAEVPGSCHVVVKNKVEDVNSPIPPGTLSQAGSLAVASSDAWEKKMV 713

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH-- 698
            SAWW    QV K    G  L  G F+++G+K +LPP  L+MGF + + L +   G    
Sbjct: 714 ISAWWAGYEQVGKIGAGGIVLGTGEFVVKGEKKWLPPAMLVMGFAVGWLLADGEGGEDED 773

Query: 699 -LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
            L E R    E    + E+    KE+SD + E  DT
Sbjct: 774 ILEEERTNLPEVSNSE-EEKVEQKEDSDDDEEFPDT 808


>gi|326471330|gb|EGD95339.1| hypothetical protein TESG_02825 [Trichophyton tonsurans CBS 112818]
          Length = 1099

 Score =  325 bits (832), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 237/753 (31%), Positives = 376/753 (49%), Gaps = 138/753 (18%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLVKTRRITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPTEI-CRVFERTTASKLHAALTSSK 185
           N++LTD+++ ++ LLR  +   D + V I   +R  +++        T  +L +AL    
Sbjct: 119 NLILTDAKYEIVALLRHVAAGSDIEEVKIGMTYRLESKLNYNGIPPLTIERLKSAL---- 174

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
                       D +N S   K +L        F            G     PTL     
Sbjct: 175 ------------DQDNGSKVLKRSL-------YF------------GFPEYPPTLLDHAF 203

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
             +G+          D+ L P   L+     ++N +Q L + V +  D +   +S D   
Sbjct: 204 NVVGF----------DSKLQPAQILT-----DNNLVQKL-MEVLQEADRVNTALSSDSQQ 247

Query: 306 EGYILMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETF 354
            GYI+ +N       +G D    P TE       + +F P   +Q +   +   ++FE F
Sbjct: 248 AGYIIAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTILRFENF 300

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           ++A+D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE
Sbjct: 301 NSAVDRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIE 360

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS 473
            NL  V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+
Sbjct: 361 INLPRVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLN 420

Query: 474 NNLDEM--------------------------------DDEEKTLPVEK----------V 491
               E                                   ++ T P+++          +
Sbjct: 421 EEGTEDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKDTRLSI 480

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVAN 547
           ++DL +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K     ++ + QEK V  
Sbjct: 481 DIDLGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV-- 538

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +   R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    ++
Sbjct: 539 LRRTRNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGGVPLIV 598

Query: 608 KNHRPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           KN       P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G 
Sbjct: 599 KNKPDTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGH 658

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           FMI+G+KN +PP  +++GF +LF++   S+ +H
Sbjct: 659 FMIKGEKNHIPPGQIVLGFAVLFQISNRSVQNH 691



 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 259/888 (29%), Positives = 416/888 (46%), Gaps = 163/888 (18%)

Query: 250  YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYI 309
            Y P L +H     G   + KL     L DN +   ++ V +  D +   +S D    GYI
Sbjct: 194  YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALSSDSQQAGYI 251

Query: 310  LMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAAL 358
            + +N       +G D    P TE       + +F P   +Q +   +   ++FE F++A+
Sbjct: 252  IAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTILRFENFNSAV 304

Query: 359  DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
            D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL 
Sbjct: 305  DRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIEINLP 364

Query: 419  DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD 477
             V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+    
Sbjct: 365  RVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGT 424

Query: 478  EM--------------------------------DDEEKTLPVEK----------VEVDL 495
            E                                   ++ T P+++          +++DL
Sbjct: 425  EDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDL 484

Query: 496  ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHM 551
             +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   
Sbjct: 485  GISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRT 542

Query: 552  RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
            R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    ++KN  
Sbjct: 543  RNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGGVPLIVKNKP 602

Query: 612  PEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
                 P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+
Sbjct: 603  DTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIK 662

Query: 670  GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESE 729
            G+KN +PP  +++GF +LF+         ++ R V+  E+ +    + G           
Sbjct: 663  GEKNHIPPGQIVLGFAVLFQ---------ISNRSVQNHEKCLPSAPEDGV---------- 703

Query: 730  KDDTDEKPVAES--LSVPNSAHPAPSH-TNASNVDSHEFPAEDKTISNGIDSKIFDIARN 786
               T+++P++ +  +  P +    P         D H+   ED +  + ID ++      
Sbjct: 704  ---TNDEPISSTGDMDQPEANQSDPEEDVPLEQEDEHQEEPED-SKKDIIDERV------ 753

Query: 787  VAAPVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSE-EDKHVERTATVRDKPYIS 844
              AP+  QL+ + ++ +L    A +       E  + + S+ E++ VE  +   + P  S
Sbjct: 754  --APLGEQLKSMHVEDSLDSNPAQVH------EADKEEASKGENQPVEGPSKNAEGPEDS 805

Query: 845  KAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMK 904
            +      +    S +  P   +E       +S P +I      +     RG++GK KK+ 
Sbjct: 806  E------QSDDESILATPSATQESR-----ASTPSAISSSGTQKSKPPVRGKRGKAKKLA 854

Query: 905  EKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYK-CK 963
             KY DQDEE+R + + LL SA             +A T+K K  A   +DA +   K  +
Sbjct: 855  TKYKDQDEEDRKLALRLLGSAA----------GPSAPTNKPKTKA--DIDAEREAQKERR 902

Query: 964  KAGH---LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLND 1020
            +A H   L    ++    + + VED                         GEE K   + 
Sbjct: 903  RAQHERALQAVKRQQEAFTRNSVEDAS-----------------------GEEHKLDFSI 939

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P+  D +   IPVC P++A+  YKYR K+ PG  KKGK ++
Sbjct: 940  LPALVGTPVEGDEIEAAIPVCAPWAALGQYKYRAKLQPGKIKKGKAVK 987


>gi|326479424|gb|EGE03434.1| DUF814 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 979

 Score =  323 bits (829), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 235/758 (31%), Positives = 377/758 (49%), Gaps = 128/758 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++G+R +N+YD+S +T++FKL        +    K  L++
Sbjct: 1   MKQRYSSLDVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G   H T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y 
Sbjct: 53  NAGFHCHLTESSRTTADAPSHFVSRLRKLVKTRRITGVRQIGTDRIIEFEISDGLFRLY- 111

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+++ +             VA++ RH        V   +   ++   
Sbjct: 112 -LEFFAAGNLILTDAKYEI-------------VALL-RH--------VAAGSDIEEVKIG 148

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           +T   E   N         N +   + E L              S  + ++G++  + +L
Sbjct: 149 MTYRLESKLNY--------NGIPPLTIERL-------------KSALDQDNGSKVLKRSL 187

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
                E   Y P L +H     G   + KL     L DN +   ++ V +  D +   +S
Sbjct: 188 YFGFPE---YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALS 242

Query: 301 GDIVPEGYILMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFV 349
            D    GYI+ +N       +G D    P TE       + +F P   +Q +   +   +
Sbjct: 243 SDSQQAGYIIAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTIL 295

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +FE F++A+D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ 
Sbjct: 296 RFENFNSAVDRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRK 355

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE NL  V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +
Sbjct: 356 ARAIEINLPRVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTI 415

Query: 469 SLLLSNNLDEM--------------------------------DDEEKTLPVEK------ 490
           ++LL+    E                                   ++ T P+++      
Sbjct: 416 TVLLNEEGTEDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKD 475

Query: 491 ----VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQE 542
               +++DL +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QE
Sbjct: 476 TRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQE 535

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K V  +   R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G 
Sbjct: 536 KPV--LRRTRNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGG 593

Query: 603 SSTVIKNHRPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
              ++KN       P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ 
Sbjct: 594 VPLIVKNKPDTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDI 653

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           L  G FMI+G+KN +PP  +++GF +LF++   S+ +H
Sbjct: 654 LKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNRSVQNH 691



 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 24/33 (72%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
           RG++GK KK+  KY DQDEE+R + + LL SA 
Sbjct: 844 RGKRGKAKKLATKYKDQDEEDRKLALRLLGSAA 876


>gi|74025594|ref|XP_829363.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834749|gb|EAN80251.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1100

 Score =  322 bits (826), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 239/784 (30%), Positives = 377/784 (48%), Gaps = 146/784 (18%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L G+R +NVYD+ P+T++FK  NS         +K  LL
Sbjct: 1   MVKQRMTALDVRASVEEMRTELQGLRLTNVYDIPPRTFLFKFGNSE--------KKRTLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+GVRLH T   R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+   A Y
Sbjct: 53  LENGVRLHLTQLVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVAEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GNI+LTD E+ ++ LLR+H+DD  GV +  R  YP  + + FE+    +  
Sbjct: 113 HIIVELFSKGNIVLTDHEYRIMLLLRAHKDD--GVNMFVRELYP--VTKSFEQQQEEECQ 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                ++  +A                       ++ G  F               A+  
Sbjct: 169 QLTEGAQRVEALR---------------------REWGAVFT------------RHAEYE 195

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T ++ L     +GP+L++HI+  TG V ++K + +    D   + L+  +   E W    
Sbjct: 196 TTRSTLSATHHFGPSLADHILTVTG-VKSVKKANMTCSGDEMFEKLLPGM--LEAWR--- 249

Query: 299 ISGDIVPEGYILMQ---------NKHLGKDHPPTESGSSTQI------------------ 331
            +   +P G  L+           +  GK  P  ++G  T                    
Sbjct: 250 FAFSPLPTGGYLISKTAATKGRGTQERGKAPPHVDAGVGTTADGGEAGSGVEKQPRPHLQ 309

Query: 332 ---YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLN 386
              Y++F P+LL Q+R          +F +  D F+   E ++ EQ +         K  
Sbjct: 310 GVQYEDFSPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKE 369

Query: 387 KIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
           K   D   R+  L++  + + +  ELI  N E +D AI  +  ALA  + WE L R++K+
Sbjct: 370 KFERDHNRRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQ 429

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM---DDEEKTLPV--------------E 489
               G+PVA ++ +L+L+RN +S+L+  N +++   +DEE  + V              E
Sbjct: 430 RHAEGHPVAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGE 489

Query: 490 K-------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           K             +EVDL+ +A+ANA  ++  KK   +K EKTI A +KA   AEKK  
Sbjct: 490 KKDRVEGCSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGE 549

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
               +++T   I+  R   W+EKFNWF +S   LV+ G D Q  E++V+R M  GDV+VH
Sbjct: 550 RLAAKKQTKKAIATERHRCWWEKFNWFRTSCGDLVLQGHDTQSTELLVRRIMRLGDVFVH 609

Query: 597 ADLHGA-------------SSTVIKNHRPEQP------------VPPLTLNQAGCFTVCH 631
           +D+ G              +ST       E+             +  ++L++A  + VC 
Sbjct: 610 SDVEGGLPCILRAAGSAWDASTAFGEGESEENSIQVGESTKGWLIHMISLDEAAAWCVCR 669

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           S AW+SK    AWWV+  Q+      G YL      + G+KN+L P PL++G GLLFR+ 
Sbjct: 670 SSAWESKFSVGAWWVHASQIVGGTAAGCYL------LSGEKNYLRPRPLMLGCGLLFRIS 723

Query: 692 ESSL 695
             ++
Sbjct: 724 SRAI 727



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 93/223 (41%), Gaps = 52/223 (23%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEER------------NIRMALLASAGKVQKNDGDPQNE 938
            ++++ Q+ KLKK+++KY DQD+E+R             +++ LLAS    Q N+      
Sbjct: 847  QLTKHQRKKLKKIQQKYKDQDDEDRLTGALLNGNQLSKVQLELLASERAKQTNE------ 900

Query: 939  NASTHKEKKPAISPVDAPKVCYKC-------KKAGHLSKDCKEHPDDSSHGVEDNPCVGL 991
                     PA S   A +   +C       +  G +         D+ H +  +P  G 
Sbjct: 901  ----IVRTSPAGSSSAAGEAGERCGGEAWGEECVGEVRGRAPAKGGDAGHLLAASPSCGS 956

Query: 992  DETAEMDKVAMEEEDIHEIGEEEKGRL--------------NDVDY------LTGNPLPS 1031
            D  A+ ++   E+ +      + + R               ND ++       T  P P 
Sbjct: 957  DGPADNERTPREDNEPSTGEPQPRSRAIDSTAASLEATRAANDAEFNREWIHFTAKPQPG 1016

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            D + Y + VC P  +V SYKYR ++  G AKKG   Q+  SL+
Sbjct: 1017 DCVEYAVAVCAPMGSVISYKYRAELSCGNAKKG---QVALSLI 1056


>gi|400593352|gb|EJP61303.1| DUF814 domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 1062

 Score =  319 bits (818), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 240/796 (30%), Positives = 378/796 (47%), Gaps = 100/796 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +N+YDLS + ++FK              K  LL+
Sbjct: 1   MKQRFSSLDVKVVAHELSQSLTSLRVANIYDLSTRIFLFKFAKPG--------TKKQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G R HTT + R    TPS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DIGFRCHTTEFVRTTAGTPSAFVCRLRKALKTRRLTSVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN +LTD +   L +L  +R+  +G    S+     ++  ++   +       
Sbjct: 111 FLEFFASGNAILTDVD---LRILALYRNVSEGEGQESQ-----KVGLLYSLKSRQNFFGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   +       A+ E +   K         +SN+    G      TL
Sbjct: 163 -----------PDLSQDRVRTALAAAIEKVSTTKAA-------SSNRTPKQG-----DTL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDV 298
           +  L  ++    P L +H +       +++L  +  L D ++   L   + +  ++L D 
Sbjct: 200 RKCLAVSITELPPILLDHTLQSNHFDSSLELKAI--LNDASLLSSLTENLREAREFL-DS 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI---YDEFCPLLLNQFRSR---EFVKFE 352
           I+      G+I  +     +     + GS  ++   YD+F P +  +F      E ++FE
Sbjct: 257 ITSHSRCTGFIFAKKPVQDQSLQEQDGGSKAKLRLLYDDFHPFVPTKFEKNDDIEILRFE 316

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
            ++  +DEF+S +E QR E +   +E AA  K++    DQENR+  L+     + + A  
Sbjct: 317 GYNRTVDEFFSSLEGQRLESRLMEREAAAQRKIDAARQDQENRIRGLQTAQLDNFRKAAA 376

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE N+E V  A+ ++   L   M W D+ ++V  E+K  NPVA LI   L L  N +S+ 
Sbjct: 377 IEANIERVQEAMDSINGLLNQGMDWVDIGKLVAREQKKNNPVATLICLPLNLVDNVISVR 436

Query: 472 LSNNLDEMDDEEKTLPVEK------------------------VEVDLALSAHANARRWY 507
           LS   D   ++E+    +                         VE+ L LS  +NAR +Y
Sbjct: 437 LSEEDDVASEDEEPYETDDSDVRFEDDLDTTESGLKNSDKTIVVELTLNLSPWSNARGYY 496

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKVHWFEKFNWFIS 565
           + +K    K+EKT     KA K+ E K +  L+ + ++  A +  +R   WFEKF WFIS
Sbjct: 497 DQRKNAVVKEEKTQLQADKAIKSTEHKVKQDLKKVLKQEKALLQPIRNPMWFEKFYWFIS 556

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPPLTLNQ 623
           S+ YLV+  +D  Q E++ ++++  GD + HAD   A+  V+KN+    + P+ P TL Q
Sbjct: 557 SDGYLVLGAKDKSQAELLYRQHLRSGDAFCHADASNAAIVVVKNNSKTADVPIAPATLAQ 616

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           AG  ++C S+AWDSK    AWWV  +QVSK+  TG+ L  G+F I G+KNFLPP  L++G
Sbjct: 617 AGQLSICSSEAWDSKAGIGAWWVNSNQVSKSTSTGDILQPGNFNISGEKNFLPPGQLVLG 676

Query: 684 FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLS 743
             +LF++ E S   H N+ R+  E    D        KE              P +E  +
Sbjct: 677 LSVLFKISEES-KIHHNKHRIPDEPAVSD-----APRKETY------------PNSEQEA 718

Query: 744 VPNSAHPAPSHTNASN 759
             N   PA S  N SN
Sbjct: 719 TTNDIQPAASTANGSN 734



 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 53/181 (29%), Positives = 81/181 (44%), Gaps = 30/181 (16%)

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
            E  K+ RGQ+GK KK+  KY DQDEE+R    AL  S     K + + Q +    H  ++
Sbjct: 830  EPNKLKRGQRGKAKKIAAKYRDQDEEDRAAAEALTGSTAGKHKAEAEVQAKLKREHDMEQ 889

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
                         +  +     K+  EH ++    + D    GLD               
Sbjct: 890  AKAR---------RHARHERRQKEVAEH-EEKRRAIYD----GLDPE------------- 922

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
               G+E + +   +D L G P P D +L  + VC P++A+   KY+ K+ PGT KKGK +
Sbjct: 923  ---GDEAEEQWAPIDLLVGTPRPGDEILEAVTVCAPWAALSRSKYKFKLQPGTVKKGKAV 979

Query: 1068 Q 1068
            +
Sbjct: 980  K 980


>gi|89130574|gb|AAI14230.1| Zgc:153813 protein [Danio rerio]
          Length = 556

 Score =  319 bits (817), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 155/281 (55%), Positives = 202/281 (71%), Gaps = 11/281 (3%)

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-------NLDEMDDEEKTLP 487
           + W ++ RMV E + AG+PVA  I +L L+ N ++LLL N          E+   +K+  
Sbjct: 99  VDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEACPEGGAAELQSGKKSRS 158

Query: 488 VEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
            EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KAFK+AEKKT+  +   +T
Sbjct: 159 REKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQT 218

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           V +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY+  GD+YVHADLHGA+S
Sbjct: 219 VTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATS 278

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV   QVSKTAP+GEYLT G
Sbjct: 279 CVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTG 337

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           SFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++
Sbjct: 338 SFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMK 378



 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 65/107 (60%), Gaps = 9/107 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIV 99


>gi|401416565|ref|XP_003872777.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489002|emb|CBZ24251.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 1189

 Score =  315 bits (807), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 235/753 (31%), Positives = 373/753 (49%), Gaps = 117/753 (15%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+YD+  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYDIGSKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--A 117
            ESG RLH T  AR+K   PS FTLKLRKH+R  RL+ V QL +DR I   FG+      
Sbjct: 54  -ESGTRLHLTELAREKPKVPSQFTLKLRKHVRAWRLDSVAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            ++I+EL+++GN++LT+  +T++ LLR+HRDD+ G+ +M    YP          TA  +
Sbjct: 113 FHIIVELFSKGNVILTNYAYTIMMLLRTHRDDE-GLKLMVNQVYPV---------TAPFV 162

Query: 178 HAALTSSKE-PDANEPDKVNEDGN---------NVSNASKENLGGQKGGKSFDLSKNSNK 227
            A    S+E P    P  V+  G+         +++ A ++    +      D     ++
Sbjct: 163 AAVAAESEESPMFLYPPHVDASGHLHLQRTADADLTLAQRQLKEERTRLMKVDWEVGLSR 222

Query: 228 NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
            SND     +  ++T++     +GP L++H++  TG VPN       +  DN    L+  
Sbjct: 223 -SND-----RTVVQTLVAGIQHFGPDLAQHVLTVTG-VPNAPRKSWTQSTDNVFVTLLPG 275

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI---------------- 331
           +   E +  D+   D+   G  L++        P  + GS+                   
Sbjct: 276 L--LEAF--DLAKVDLTSAGGYLIK--------PKAKPGSTVHAPAPPAPGAPAGAADLV 323

Query: 332 -----YDEFCPLLLNQFRSR--EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHK 384
                Y+ F P+LL Q+ +   E +   +F    DEF+   E++R +  +  +++ A  K
Sbjct: 324 AVAEQYESFTPILLAQYTNDGVEALYRSSFGRVCDEFFLITETERIDASNAKRKNTAKSK 383

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
            +K   D   R++ L+ ++  +    E +  N + VD AI  +  ALA  +SW+ L  ++
Sbjct: 384 EDKFATDHARRINALEADIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLL 443

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANA 503
           K     G+PVA +I  L+LERN +S+LL   LDE   EE   +P   VEV L+ +AHANA
Sbjct: 444 KRRHAEGHPVAYMIHDLFLERNSISVLLEAVLDEEKGEEDCDVPPLVVEVALSKTAHANA 503

Query: 504 RRWYELKKKQESKQEKTITAHSKAF---------KAAEKKTRLQILQEKTVANISHMRKV 554
             ++  +K   SK E+T+ A +KA          KAA +K R  I++E         R+ 
Sbjct: 504 ADYFSKQKHHRSKLERTVAATAKAAAGAALKGARKAAAQKERKVIVKE---------RQR 554

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--- 611
            W+EKF WF ++   LV+ G+D Q  E++V+R M  GD+++H ++ GA   +++      
Sbjct: 555 QWWEKFLWFRTTAGDLVLRGKDVQSTELLVRRVMRLGDLFIHCEVDGALPCLLRPMNDVW 614

Query: 612 ----------------PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
                             QPV   ++ +AG + V  S AW+ K  T +WWVY  QV+   
Sbjct: 615 QELGGNNAGGDFTAAPATQPVALHSVCEAGAWCVAFSGAWERKQTTGSWWVYASQVTGGT 674

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            TG YL        G+++ LPP  + +G  LLF
Sbjct: 675 ATGAYLYA------GERHHLPPQSMSLGCALLF 701


>gi|430813962|emb|CCJ28739.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 631

 Score =  313 bits (802), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 221/655 (33%), Positives = 338/655 (51%), Gaps = 70/655 (10%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
           L++LIG+R  N+YD+S +T+ FK         SG  E   LL+ESG R+H T Y R+   
Sbjct: 8   LQKLIGLRLQNIYDISERTFQFKF------ATSGHKEH--LLVESGSRIHLTCYVRETAA 59

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA-------HYVILELYAQGNI 130
            PS F  KLRKH++++RL  ++Q+  DR++   FG G          +Y+I E YA GNI
Sbjct: 60  LPSQFCAKLRKHLKSKRLVSLKQINSDRVVYLGFGCGSETVESFKPQYYLIFEFYAAGNI 119

Query: 131 LLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTEICRVFERTTASKLHAALTSSKEP 187
           LLTDS+  +L+LLR  R             Y   PT   +  E+ T   L + + + K+ 
Sbjct: 120 LLTDSDMKILSLLRLVRPGGMHQQFSVGQLYQITPTPQNKQVEKMTEDVLRSLIKTLKDK 179

Query: 188 DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEA 247
             +  ++      N+S + K+    +K  +                    P  K V  E 
Sbjct: 180 YLSPKEEPLPKQMNLSTSFKKTSKKEKKPREL------------------PLKKLVSWEL 221

Query: 248 LGYGPALSEHIILDTGLVPNMKLSEV-NKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPE 306
             YG AL EHII D  + P+MK+ E  + +E   +Q L+L+  + +D ++    G +   
Sbjct: 222 SNYGNALIEHIIRDANIDPDMKIDEFYHNIESINLQHLLLSFQRADDLIKKCEEGSVT-- 279

Query: 307 GYILMQNKHLGKDHPPTESGSST----QIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           GYI+ + +   + +    +  ST    +IY +F P +  Q+ +       TFD      Y
Sbjct: 280 GYIVEKIESKTRINLNDITLESTPDPVKIYVDFNPFIPKQYSNNPNYSVITFDDG----Y 335

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           +K  SQ+ + + K ++D A+ +L     + + ++  L++  +  +K A+ IE N E VD 
Sbjct: 336 NK--SQKFDMKLKNQKDIAYRRLQITKEEHQKKIDDLQKFQNICIKKAKAIEENQEIVDE 393

Query: 423 AILAVRVALANRMSWEDLARMVKEERKAGN-------PVAGLIDKLYLERNCMSLLLSNN 475
            I AV   +   M WED+A++VK E++  +       P   L D +Y   +   L    N
Sbjct: 394 TIKAVNTCVLRSMDWEDIAKLVKTEKEYESNTITIQLPCPHLDDNIYENDSTTGLFNGQN 453

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
                D+ +TL    +++ L+L+A  NAR +YE KK    K+EKTI A SKA K AE+K 
Sbjct: 454 -----DKTETL---NIDIKLSLNAWTNARDYYEKKKAASVKEEKTIAASSKALKNAERKI 505

Query: 536 ----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
               +    QEK    +  MR + WFEKF WFISS+ YLV++G D  QN+++++ + SK 
Sbjct: 506 NSDLKRNTAQEK--KKLVPMRNLQWFEKFLWFISSDGYLVLAGHDLLQNKILIQNHFSKN 563

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           D+YVHADL  A+  +IKN      VPP TLNQAG F++  S AW SK+VTSAW +
Sbjct: 564 DIYVHADLKDAAVVIIKNMIDSSFVPPNTLNQAGAFSIAKSNAWTSKIVTSAWCI 618


>gi|154332902|ref|XP_001562713.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059716|emb|CAM41838.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 1198

 Score =  312 bits (800), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 231/754 (30%), Positives = 371/754 (49%), Gaps = 93/754 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE +K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRANLIGLRLLNIYNMDSKMFLFKFGH-------GEHKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--A 117
            ESGVR H T   R+K   PS FTLKLRKH+R  RL+ + QL +DR I   FG+  +   
Sbjct: 54  -ESGVRFHLTELEREKPKVPSQFTLKLRKHVRAWRLDSISQLQHDRTIDLCFGVSSSEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER------ 171
            ++I+EL+++GN++LTD  + ++ LLR+HRDD+ G  +M    YP  +   F        
Sbjct: 113 FHIIVELFSKGNVILTDYTYKMMMLLRTHRDDE-GHNLMVNQVYP--VTAPFVAAVAVES 169

Query: 172 ----------TTASKLHAALTSSKEPDAN-EPDKVNEDGN----NVSNASKENLGGQKGG 216
                     T +S    A+++++ P     P  V+  G+     +++A       Q   
Sbjct: 170 ASAQEADTATTVSSVTRTAVSAAEVPHIFLYPPHVDASGHLHVQRIADADLTLAQQQVKE 229

Query: 217 KSFDLSKNSNK----NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE 272
           +   L K   +     SND     +  ++T++     +GP L++H++  TG+    + S 
Sbjct: 230 ERTRLMKAEWEVGLTRSND-----RTVVQTLVAGIQHFGPDLAQHVLAITGVSNAPRKSW 284

Query: 273 VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK-----HLGKDHPPTESGS 327
               +D    +L   +  F     D+   D+   G  L+++K           PP    S
Sbjct: 285 KQSTDDIFATLLPGLLEAF-----DLAKVDLASAGGYLIKSKAGPGSRANAAEPPAPDAS 339

Query: 328 ST-----------QIYDEFCPLLLNQFRSREFVKF--ETFDAALDEFYSKIESQRAEQQH 374
           +            + Y+ F P+LL Q+     V F   +F    DEF+   E+ R +  +
Sbjct: 340 TAAAGVADLVAVAEKYESFTPILLAQYTEDGVVSFYRASFGRVCDEFFLITETARIDASN 399

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
           + +++ + +K +K   D   R++ L+ ++  +    + +  N + VD AI  +  ALA  
Sbjct: 400 EKRKNTSKNKEDKFAADHARRINALETDIAANQLKGQQLILNADRVDEAIQLINGALATG 459

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEV 493
           +SWE L  ++K     G+PVA +I  L+LERN +S+LL   LDE   EE   +P   VEV
Sbjct: 460 ISWEALRILLKRRHAEGHPVAYMIHDLFLERNSISVLLETVLDEEAGEEDCDVPPMVVEV 519

Query: 494 DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRK 553
            L+ +AHANA  ++  +K+  SK E+TI A  +A   A +K   +  ++K    I   R+
Sbjct: 520 ALSKTAHANAADYFGRQKQHRSKLERTIAATDRAAAGAARKGERKAAEQKERKVIVKERQ 579

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK----- 608
             W+EKF WF +S   LV+ G+D Q  E++V+R M  GD+++H D+ GA   +++     
Sbjct: 580 RSWWEKFFWFRTSAGDLVLRGKDVQSTELLVRRVMRLGDLFIHCDVDGALPCLLRPMNDV 639

Query: 609 -----NHRP---------EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
                 H            QPV   +  +AG + V  S AW+ K  T +WWVY  QV+  
Sbjct: 640 WQELGGHNAGGNAVVSPRTQPVAMHSACEAGAWCVAFSGAWERKQTTGSWWVYASQVTGG 699

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
             TG YL        G+++ LPP  + +G  LLF
Sbjct: 700 TATGTYLYT------GERHHLPPQSMSLGCALLF 727


>gi|402074990|gb|EJT70461.1| serologically defined colon cancer antigen 1 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 1086

 Score =  309 bits (791), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 196/522 (37%), Positives = 285/522 (54%), Gaps = 51/522 (9%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYIL 310
           P L +H   +    P  K +++  LED  +   L  A+ +    + D+ S D V +GYI+
Sbjct: 212 PILVDHAFKENNFDPKAKPADI--LEDEGVFDALFTALERARGIIDDITSSDTV-KGYIV 268

Query: 311 MQN---------KHLGKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAAL 358
            +N                P     S   +Y++F P L  QF    S   + FE F+  +
Sbjct: 269 ARNPDVADAGAAAEGAVVKPFAPELSKGLLYEDFSPFLPQQFAGDPSNVVLTFEGFNKTV 328

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEF+S +E Q+ E +   +E  A  KL+    + E R+  L++    +++ A  IE N+E
Sbjct: 329 DEFFSSLEGQKLESRLTEREAGAKRKLDAAKREHEKRIEGLQEYQLLNLRKAAAIEANVE 388

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD 477
            V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I+  + L  N ++L+++   D
Sbjct: 389 RVQEAMDAVIGLLEQGMDWVDVGKLVEREQKRHNPVAEIIELPMDLANNTITLVIAEQDD 448

Query: 478 EMDDEEKTLPVE---------------------KVEVDLALSAHANARRWYELKKKQESK 516
             DD E     E                     +V++ L+L+   NA  +Y+ K+    K
Sbjct: 449 VDDDSEDGYETESSASDDDDDAAAVQTGKAKTLEVDIKLSLTPWGNAGEYYDQKRSAAVK 508

Query: 517 QEKTITAHSKAFKAAEKKTR--LQ--ILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           QEKT+   S A K+A++K    LQ  + +EK V  ++  R+  WFEKF+WFISS+ YLV+
Sbjct: 509 QEKTVQQSSIALKSAQEKIAKDLQKGLKKEKPVMQLA--RRQMWFEKFHWFISSDGYLVL 566

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVC 630
            GRDAQQNE++ +RY+ +GDVYVHADLHGA S +IKN+   P+ PVPP TL+QAG   VC
Sbjct: 567 GGRDAQQNEILYRRYLKRGDVYVHADLHGAPSVIIKNNPRTPDAPVPPSTLSQAGQLAVC 626

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            S AW+SK    A+WV   QVSK+APTGE+L  GSFM+RGK+N LPP PLI+GFG++FR+
Sbjct: 627 ASSAWESKAGMGAYWVGADQVSKSAPTGEFLPTGSFMVRGKRNELPPAPLIVGFGVMFRI 686

Query: 691 DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDD 732
            + S   H    RV    EG    E S   K +   E+  DD
Sbjct: 687 SDESKAKH-TRHRVYESAEG----EPSTAPKPSPGTEAAADD 723



 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 84/148 (56%), Gaps = 11/148 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ DV A    L++ L+ +R SN+YDLS K ++ +              K  L++
Sbjct: 1   MKQRLSSLDVRAIAHELQQSLVTLRLSNIYDLSSKIFLLRFAKPD--------LKKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  ++RK +RTRR   V Q+G DRII  QF  G  +  +
Sbjct: 53  DSGFRCHLTDFSRPTAPAPSQFVARVRKFLRTRRCTAVSQVGTDRIIELQFSDG--SLRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD 148
             E +A GNI+LTD+   +L LLR+ ++
Sbjct: 111 FFEFFASGNIILTDANLNILALLRNVKE 138



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 33/51 (64%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L+ +D L G P   D +L VIP+C PY+A+   KY+ K+ PG  KKGK ++
Sbjct: 954  LSTLDSLVGTPQAGDEILEVIPICAPYAAMARVKYKAKLQPGMQKKGKALK 1004


>gi|224014996|ref|XP_002297159.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
 gi|220968134|gb|EED86484.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
          Length = 968

 Score =  308 bits (790), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 215/605 (35%), Positives = 319/605 (52%), Gaps = 53/605 (8%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVN---KLEDNAIQVLVLAV-AKFEDWLQDVISGDIVP 305
           YGP+L EH I   G+ P +KL+  N    L + +   LV ++  +    ++++ SG+   
Sbjct: 192 YGPSLIEHCITTAGVDPMVKLTHDNIEYTLPEASWNDLVSSLCGEGAKVIENLSSGE--S 249

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
            GYIL + K         +     +   EF P LL+Q +++  + + TF  A DEF+S +
Sbjct: 250 GGYILYKPKQ------TDDKNDYNKTLLEFQPHLLHQHKNQHALSYTTFATATDEFFSHL 303

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
            SQR  Q+  A E AA  +L+KI +DQ+ RV  L  E ++S   A L+E + EDVD  + 
Sbjct: 304 SSQRIAQRADAAEAAARERLSKIQLDQQRRVDGLVAEQEKSRDCARLVEMHAEDVDRVLG 363

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL-LLSNNLDEMDDEEK 484
            +  AL + M+W+ L ++V  E+   NP+A LI KL L ++ + L L   +  +  D ++
Sbjct: 364 VINSALESGMNWDALEQLVLVEQGNENPIALLIFKLELCKDQVVLALPDIDDWDDSDPDR 423

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH-------SKAFKAAEKKTRL 537
              +  V V +  SAH NAR  +   K+ ++ +  T            +  +A +KK R+
Sbjct: 424 PPKLHYVTVSIKESAHGNARNMFATIKQSKTLEASTTALKAAEAKAKQQLAEAQKKKQRI 483

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           Q++           RK +WFEKF WFI+S+NYLV++G+DAQQNE +VK+Y+  GD Y+HA
Sbjct: 484 QVMPN---------RKTYWFEKFAWFITSDNYLVVAGQDAQQNEQLVKKYLRPGDAYLHA 534

Query: 598 DLHGASSTVIKNHRPEQ--------PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           ++HGA++ +++  R  +        P+    L +AG FT C S AW SKMV SA+WV  H
Sbjct: 535 EVHGAATCILRAKRRRRSDGKTQVIPLSDQALREAGTFTTCRSSAWSSKMVCSAYWVESH 594

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERRVRGEE 708
           QVSKTAPTGEYLTVGSFMIRG+KNFLPP  L MG G+LFRL D++S+  H NERR     
Sbjct: 595 QVSKTAPTGEYLTVGSFMIRGRKNFLPPSSLEMGMGVLFRLGDDASVARHANERRDFALM 654

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAE 768
           E  + F      +E + +  E +D  E    +S    +       HTNA +         
Sbjct: 655 EHEEIFARQDALREKNKVSVEVEDESEPIPLDSYEKEHDDVCPTGHTNAID--------- 705

Query: 769 DKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEED 828
                N  D  I D   NV   VTP  E+  ++      +S      G E    D  ++ 
Sbjct: 706 ----GNAGDEAIEDTENNV--EVTPDAEESTEQPNSDNESSDGKQSDGDEVPTADTKKKQ 759

Query: 829 KHVER 833
           K + R
Sbjct: 760 KELSR 764



 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 87/133 (65%), Gaps = 12/133 (9%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAY----ARD 74
           R ++G + +NVYD S       +M ++   ++ ++++ +LL+ESGVR H T +    +  
Sbjct: 7   RSMLGFKLANVYDGSA----LGIMPAA---DAEQAKRAMLLIESGVRFHPTTHYSQSSSS 59

Query: 75  KKNTPSGFTLKLRKHIRTRRLEDVRQLG-YDRIILFQFGLGMNAHYVILELYAQGNILLT 133
             + PS F +KLRKH+R  RLE+V QLG  DR++ F+FG G   H+++LELY+ GN++L 
Sbjct: 60  SSSMPSAFAMKLRKHLRNLRLENVTQLGNLDRVVDFRFGSGSLTHHLLLELYSLGNLILC 119

Query: 134 DSEFTVLTLLRSH 146
           D ++ +L LLR+H
Sbjct: 120 DGQYRILGLLRTH 132



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/52 (51%), Positives = 33/52 (63%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            LTG P P D+LL  IPVC PY  +  YKYRVK+ PG+ K+GK  +    L L
Sbjct: 856  LTGKPSPDDVLLCAIPVCAPYQVLNQYKYRVKLTPGSVKRGKASKQCVELFL 907


>gi|355718192|gb|AES06188.1| serologically defined colon cancer antigen 1 [Mustela putorius
           furo]
          Length = 547

 Score =  308 bits (790), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 139/222 (62%), Positives = 177/222 (79%), Gaps = 1/222 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I  
Sbjct: 43  VDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQK 102

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV+WFEKF WFISSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN 
Sbjct: 103 ARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNP 162

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 163 TGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 221

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 222 KKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 263


>gi|302419577|ref|XP_003007619.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261353270|gb|EEY15698.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 1107

 Score =  308 bits (789), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 232/741 (31%), Positives = 339/741 (45%), Gaps = 158/741 (21%)

Query: 1    MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
            ++K R ++ DV      L   L+ +R +NVYDLS K  + K              K  +L
Sbjct: 418  IMKQRFSSLDVKVIAHELHESLVTLRLANVYDLSSKILLLKFAKPD--------NKKQIL 469

Query: 60   MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ++SG R H T +AR     PS F  +LRK ++TRRL  V Q+G DRII F F  G   + 
Sbjct: 470  IDSGFRCHLTDFARTTAAAPSAFVARLRKFLKTRRLTAVSQVGTDRIIEFTFSDGQ--YR 527

Query: 120  VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
            + LE +A GN++LTD+E  +LTLLR+                                  
Sbjct: 528  LFLEFFASGNVILTDAELRILTLLRN---------------------------------- 553

Query: 180  ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQK--------------GGKSFDLSKNS 225
                  E +  EP +V   G + S  +++N GG                  K+ +     
Sbjct: 554  ----VPEGEGQEPQRV---GLSYSLDNRQNFGGVPPLTRERLQNALRVMAAKAANAPTTG 606

Query: 226  NKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV---NKLEDNAIQ 282
             K    G + ++  L T + E     P L +H    TG  P    +E+   + L D+ + 
Sbjct: 607  KKKIKPGDQLRK-GLATTITE---LPPMLVDHAFQVTGFDPTKTPAELLDSDALLDSLLH 662

Query: 283  VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKD-HPPTESGSSTQ----IYDEFCP 337
             L +A    ED      +      GY++ + +   ++     + G+ T+    +YD+F P
Sbjct: 663  ALTVARKVVED-----ATSSATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDDFHP 717

Query: 338  LLLNQFRSREFVKFETFDA---ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
             L  +F     VK  TFD     +DEF+         +     E                
Sbjct: 718  FLPQKFADDPSVKVLTFDGFNKTVDEFFFLARGPETREAQSLNE---------------- 761

Query: 395  RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
                         + A  IE N+E V  A+ AV   +   M W ++ ++++ E+K  NP 
Sbjct: 762  -------------QKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRHNP- 807

Query: 455  AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE--------------------KVEVD 494
                       N M+LLL     E +DE      +                    ++E++
Sbjct: 808  -----------NLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEIN 856

Query: 495  LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISH 550
            L LS  ANAR +Y+ ++    K+ KT+   + A K AEKK     +  + QEK V  +  
Sbjct: 857  LGLSPWANAREYYDQRRTAAVKELKTVQHSTMALKNAEKKITEDLKKGLKQEKAV--LQP 914

Query: 551  MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            +RK  WFEKF WF+SS+ YLV+ G+DAQQNE + KRY+ KGDVY HAD+HGA++ ++KN 
Sbjct: 915  IRKQMWFEKFIWFLSSDGYLVLGGKDAQQNETLYKRYLRKGDVYCHADMHGAATVIVKNK 974

Query: 611  R--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
            +  P+ P+PP TL QAG  +VC S AWDSK    AWWV   QVSK+APTGEYL   +FM 
Sbjct: 975  QDTPDAPIPPSTLAQAGMLSVCSSSAWDSKAGMGAWWVRADQVSKSAPTGEYLPAAAFMG 1034

Query: 669  RGK-KNFLPP-HPLIMG-FGL 686
             G  +NFLPP  PL  G FG+
Sbjct: 1035 AGPGRNFLPPGRPLGAGAFGI 1055


>gi|302917991|ref|XP_003052561.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
 gi|256733501|gb|EEU46848.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
          Length = 1072

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 185/457 (40%), Positives = 261/457 (57%), Gaps = 52/457 (11%)

Query: 331 IYDEFCPLL---LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +Y++F P +   L++  + E ++F+ ++  +DEF+S +E Q+ E +   +E AA  KL+ 
Sbjct: 290 LYEDFHPFVPQKLSKDPTIEVLEFKGYNETVDEFFSSLEGQKLESRLTEREAAAKRKLDA 349

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              +Q  R+  L++  + + + A  IE N+E V  A+ AV   L+  M W D+ ++V+ E
Sbjct: 350 AKQEQAKRIEGLQEAQNLNFRKAAAIEANVERVQEAMDAVNGLLSQGMDWVDVGKLVERE 409

Query: 448 RKAGNPVAGLID-KLYLERNCMSLL-------------LSNNLDEMDDEEKTLPVE---- 489
           +K  NPVA +I   L L  N ++L                 + DE  DEE + P +    
Sbjct: 410 KKRHNPVAEIIKLPLNLAENLITLELAEEEFEPEEDDPYETDDDESADEEDSTPTKGKHA 469

Query: 490 ----KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQ 541
                VE++L LS  +NAR +++ +K    K+EKT    S+A K AE+K     +  + Q
Sbjct: 470 SKALSVEINLGLSPWSNAREYFDQRKSAAVKKEKTEQQASRALKNAEQKITQDLKKGLKQ 529

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK +  +  +RK  WFEKF WFISS+ YLVI G+DAQQNEMI KRY+ KGD+Y HADLHG
Sbjct: 530 EKAL--LQPIRKQLWFEKFIWFISSDGYLVIGGKDAQQNEMIYKRYLRKGDIYCHADLHG 587

Query: 602 ASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           ASS +IKN+   P+ P+PP TL+QAG   VC S AWDSK   SAWWV   QVSK+APTGE
Sbjct: 588 ASSVIIKNNPKTPDAPIPPATLSQAGSIAVCSSDAWDSKAGMSAWWVNADQVSKSAPTGE 647

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR-------------- 705
           +L  GSFM+RGKKNFLPP  L++G GL+FR+ E S   H+  R                 
Sbjct: 648 FLPTGSFMVRGKKNFLPPAQLLLGLGLVFRISEESKAKHVKHRLYDVDSAIGDSVSGITT 707

Query: 706 -----GEEEGMDDFEDSGHHKENSDIESEKDDTDEKP 737
                G+     +  ++ H    SD ESE D  DEKP
Sbjct: 708 PQVEVGQGSAEAEQSEAAHSDHVSDDESEDDQPDEKP 744



 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 83/145 (57%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKVIAHELQQRLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  V Q+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTEFARTTAAAPSAFVARLRKFLKTRRLTSVSQVGTDRVLEFEFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GNI+LTD++  +LTL R+
Sbjct: 111 FLEFFASGNIILTDADLKILTLART 135



 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 85/181 (46%), Gaps = 41/181 (22%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALL-ASAGK-----VQKNDGDPQNENASTHKEKK 947
            RGQKGK KK+  KY DQDE++R    AL+ A+ G+       K   D + E A+  + ++
Sbjct: 838  RGQKGKAKKIAAKYRDQDEDDRAAAEALIGATVGQKKAEAEAKAKADREAELAAAKERRR 897

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
                            +     K+  EH                    E+ +V M +E I
Sbjct: 898  ---------------AQHQRQQKETAEH-------------------EEIRRVMM-DEGI 922

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              +  +E   + ++D L G PLP D +L  IPVC P++A+   KY+ K+ PGT KKGK +
Sbjct: 923  DMLDVDEASHMTELDALVGTPLPGDEILEAIPVCAPWNALGRVKYKAKLQPGTTKKGKAV 982

Query: 1068 Q 1068
            +
Sbjct: 983  K 983


>gi|148704666|gb|EDL36613.1| mCG3169, isoform CRA_b [Mus musculus]
          Length = 658

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 137/222 (61%), Positives = 177/222 (79%), Gaps = 1/222 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V+VDL+LSA+ANA+++Y+ K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I  
Sbjct: 53  VDVDLSLSAYANAKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQK 112

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN 
Sbjct: 113 ARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNP 172

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 173 TGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 231

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 232 KKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 273


>gi|340516439|gb|EGR46688.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1078

 Score =  301 bits (771), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 189/550 (34%), Positives = 292/550 (53%), Gaps = 76/550 (13%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCP 337
           +  LV  +++  D ++++I+     +GYI  + +      P     +      +Y++F P
Sbjct: 243 LDALVNHLSEARDVVENIIASSTC-KGYIFAKRRTTPSSAPDDAEQAQKHEGLLYEDFHP 301

Query: 338 LLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
            +  +F+   S + ++F+ ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q  
Sbjct: 302 FVPQKFKNDPSIQVLEFDGYNRTVDEFFSSLEGQKLESRLTGREEAARKKLEAARQEQAK 361

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L+     + + A  IE N+E V  A+ AV   LA  M W D+ ++++ E+K  NPV
Sbjct: 362 RIQGLQDAQAMNYRKAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNPV 421

Query: 455 AGLID-KLYLERNCMSLLLSN----------------NLDEMDDEE-----------KTL 486
           A +I   L L  N ++LLL+                   D+ D EE           KT 
Sbjct: 422 AEIISLPLKLADNTITLLLAEEAFDEDEAEEEEDNPFETDDSDSEEDQGGKATSKDKKTD 481

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQE 542
            +  V++ L +S  +NAR +YE ++    KQEKT    +KA K+ E+K     +  + QE
Sbjct: 482 KLLTVDIVLNMSPWSNAREYYEERRSAAMKQEKTQQQATKALKSTEQKIAEDLKKGLKQE 541

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K +  +  +RK  WFEKF WFISS+ YLV+ G+D QQ+E++ +RY+ KGDVY HAD+ GA
Sbjct: 542 KAL--LQPIRKQMWFEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDVYCHADIRGA 599

Query: 603 SSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           ++ VIKN+   P+ P+PP TL+QAG  +VC S+AWDSK    AWWV   QVSKT P+G+ 
Sbjct: 600 ANIVIKNNPNMPDAPIPPATLSQAGSLSVCTSEAWDSKAGMGAWWVNADQVSKTTPSGDI 659

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----------RVRGEE- 708
           L  G+F I+GKKN+LPP  L++G G  F++ E S G+HL  R              G+E 
Sbjct: 660 LPAGTFTIQGKKNYLPPTQLLLGLGFAFKISEQSKGNHLKHRVHDGRSSTATEAATGDEG 719

Query: 709 -----EGMDDFEDS---------GHHKENSDIESE------KDDTDEKPVAESLS-VPNS 747
                EG+DD EDS         GH +  + ++S        DD  +K  A  +S  P +
Sbjct: 720 EAQNTEGIDDQEDSDSEPEDNQPGHEERANPLQSSGIGEETADDAADKLSAVKISDQPGN 779

Query: 748 AHPAPSHTNA 757
             P P   +A
Sbjct: 780 DEPTPPSEDA 789



 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 59/177 (33%), Positives = 83/177 (46%), Gaps = 33/177 (18%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENASTHKEKKPAISP 952
            RGQKGK KK+ +KY DQDEE+R    AL+ A+ G+ +               E       
Sbjct: 848  RGQKGKAKKIAQKYKDQDEEDRATAEALIGATVGRQRAEAEAAAKAQRQAELE------- 900

Query: 953  VDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEE-DIHEIG 1011
                    K ++     +  KE                + E  E+ +  + E  D+ E  
Sbjct: 901  ------AMKERRRAQHERKQKE----------------VAEQEELRRAMLNEGLDVQEPD 938

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            E E  R  ++D L G PL  D +L VIPVC P+SA+  YKY+VK+ PG+ KKGK I+
Sbjct: 939  EAE--RATNLDTLVGTPLAGDEILEVIPVCAPWSALVRYKYKVKLQPGSVKKGKAIK 993


>gi|414878086|tpg|DAA55217.1| TPA: hypothetical protein ZEAMMB73_507954 [Zea mays]
          Length = 522

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 212/481 (44%), Positives = 279/481 (58%), Gaps = 85/481 (17%)

Query: 667  MIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDI 726
            MIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRVRGE+E + + E +   K+ S+ 
Sbjct: 1    MIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRVRGEDEALHEME-AESRKKQSNP 59

Query: 727  ESEKDDTDEKPVAES-----------------LSVPNSAHPAPSHTNASNVDSHEFPAE- 768
            ES++D   E    E+                 L +P+ +      +N    +S E   E 
Sbjct: 60   ESDEDIGSEGANKETHEDESNGQTTNIQQNNDLELPDLS------SNIGTANSSELLPEI 113

Query: 769  --DKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSE 826
              ++T+ NG  S I      + A V+ QL+DL+D+ L LG A +S     + +    L+E
Sbjct: 114  QAEETLDNG--SSILK-EETIEASVSSQLDDLLDKTLCLGPAKVSGKSSLLTSIPSSLAE 170

Query: 827  EDKHVE-RTATVRDKPYISKAERRKLKKGQ-----------GSSVVDPKVEREKERGK-- 872
            +D  +E +  T+RDKPYISKAERRKLKKGQ           G +V  P   ++ E+GK  
Sbjct: 171  DDDDLEVKRPTIRDKPYISKAERRKLKKGQVNDETATDSQNGEAVETPGTSKQ-EKGKAE 229

Query: 873  -----DASSQPESIVRK-----TKIEG----------------------GKISRGQKGKL 900
                   +SQP++  ++     TK  G                       K+SRGQKGKL
Sbjct: 230  TKATDSKASQPDTSQQEKGKANTKATGSKLSQPGNSQQEKGKGSTHAGNAKVSRGQKGKL 289

Query: 901  KKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCY 960
            KK+KEKY +QDEEER IRMALLAS+GK  + D   Q+   ++ KE KP+    D+ K+CY
Sbjct: 290  KKIKEKYAEQDEEEREIRMALLASSGKALRKDKPSQDVEETSVKESKPSAGEDDSSKICY 349

Query: 961  KCKKAGHLSKDCKEHP------DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEE 1014
            KCKKAGHLS+DC E        D S     D+   G +         M+E+D+ EIG+EE
Sbjct: 350  KCKKAGHLSRDCPESTSEVDRNDGSISRSRDD--TGTNTAPAGGNSPMDEDDVQEIGDEE 407

Query: 1015 KGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
            K +L D+DYLTGNPLP+DILLY +PVC PY+A+Q+YKYRVKI PGTAKKGK  +   SL 
Sbjct: 408  KEKLIDLDYLTGNPLPNDILLYAVPVCAPYNALQTYKYRVKITPGTAKKGKAAKTAMSLF 467

Query: 1075 L 1075
            L
Sbjct: 468  L 468


>gi|326434920|gb|EGD80490.1| hypothetical protein PTSG_13144 [Salpingoeca sp. ATCC 50818]
          Length = 947

 Score =  298 bits (764), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 174/478 (36%), Positives = 272/478 (56%), Gaps = 16/478 (3%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           L+  L   +  GPA  EH +L+ G  PN +++E   +  +  +VL  A+ + E  L   +
Sbjct: 17  LRKHLTRIMDCGPAFIEHCLLEAGFPPNARVNEGCNVATDLPRVLA-ALQQAEHLLFTKL 75

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
               V +GYIL++  H   D     +     ++++  P  L QF  R F +F++FD A+D
Sbjct: 76  EQGQV-KGYILLK-AHAKADARKDAAKEEVVVFEDVMPFPLKQFEGRTFKEFDSFDVAVD 133

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            ++S+IES + E +   +E AA  KL +      +RV   K+        A LIE N E 
Sbjct: 134 TYFSEIESHKLEMRALQQERAARQKLEQARRSHHDRVKGYKEARLEDEYKATLIELNHEL 193

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           V+ AI  +   + N + W ++  +V+E R  G+PVA  I KL L++N + + L+      
Sbjct: 194 VNEAIDVINKMVGNHLDWREIEELVQESRVRGDPVANAISKLKLKKNAIVMHLTEPSMGG 253

Query: 475 -------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  + DE +DE+       VE+DLA +AH NAR+ +E KK   SK+EK + +  +A
Sbjct: 254 ADDDSWSDEDEDEDEDDNTKGALVEIDLAETAHGNARKLHERKKTIRSKEEKALASTEQA 313

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
            ++ EK+   ++ + +  A IS  R   WFEKFNWFISSENYLV++GRD  QNE +V+++
Sbjct: 314 LRSVEKRAMDRLKKTQITATISKSRAPLWFEKFNWFISSENYLVLAGRDRLQNEALVRKH 373

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +++ D+YVHAD++GASS V+KN    + +PP TL++A  F V HS AW++     AWWV+
Sbjct: 374 LTQHDLYVHADMNGASSVVVKNSNTGE-IPPKTLSEAATFAVAHSPAWENNQQADAWWVH 432

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
            +QV KT+  G+ L  GSF I G K+F+P   L + + +LF++D+ S   H  ERR +
Sbjct: 433 ANQVEKTSSEGKPLGAGSFRITGAKHFIPIRQLALAYAILFKVDDESAKRHEGERRCK 490



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 85/198 (42%), Gaps = 22/198 (11%)

Query: 889  GGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKP 948
            GGK  +  K K +K+   +   DE+ER    A+L+     Q+ D  P +      + K+ 
Sbjct: 701  GGKQKKLSKTKQRKINRFHAKFDEDER----AMLSQ----QRPDNKPLSRQEKRRRRKEM 752

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV-------EDNPCVGLDETAEMDKVA 1001
             I     PK     + A     +  E    ++  V       E    +    + + D  A
Sbjct: 753  GIRGSRQPKQQRGAQGAELPPAEVLEKLAATTEKVLASAQQAEGGDVIDAGPSGDADATA 812

Query: 1002 MEEEDIHEIGEEEKG---RLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIP 1058
               + + +  EE +     L+++  LT  P P D +LY +PVC PYSA Q Y  R K++P
Sbjct: 813  AALDAMDDEDEESEALTQSLSNLHSLTAQPTPEDTVLYALPVCAPYSATQGYALRAKLVP 872

Query: 1059 GTAKKGKGI----QIFYS 1072
            G  KKG+ I    Q F S
Sbjct: 873  GNTKKGRAIRGVVQTFVS 890


>gi|260803888|ref|XP_002596821.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
 gi|229282081|gb|EEN52833.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
          Length = 834

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 169/428 (39%), Positives = 248/428 (57%), Gaps = 45/428 (10%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            LK +L   L YGPA+ +H +L+ G     K+     +  +  Q L+ A+ + E +L+  
Sbjct: 24  ALKRILNSKLVYGPAVLDHCLLNAGFPEGAKVGRDFDVSQDLPQ-LMAALVEAEKFLE-- 80

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVKFETFDA 356
            SG    +GYI+ + +      P  + G + ++  Y EF P    Q      V+F +F+ 
Sbjct: 81  ASGSQPCQGYIVQKREK----KPKQDGGPAEELLTYAEFHPFQFKQHEKSPCVEFPSFNK 136

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S++ESQR + +   +E  A  KL  +  D E R+ TL++  D     A+LIE N
Sbjct: 137 AVDEFFSQLESQRLDLKALQQEKVAIKKLENVKKDHERRLETLQKVQDEDKHKAQLIELN 196

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
           L+ VD AIL VR A+AN++ W ++  +VKE +  G+PVA  I  L L+ N ++L+L N  
Sbjct: 197 LDLVDKAILVVRSAIANQIDWTEIWDIVKEAQAQGDPVASTIKSLKLDSNHITLVLRNPF 256

Query: 477 ------DEMDDEEKTLPVE-------KVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
                  E DD++  +  E       K+++DLALSA+ANA+++Y+ K+    K++KTI A
Sbjct: 257 SGYESDSEGDDDKAGVGREASSDRPMKIDIDLALSAYANAKKYYDQKRHAAKKEQKTIDA 316

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             K                       H   V  FEKF WFI+SENYLVI+GRD+QQNE+I
Sbjct: 317 SEKC----------------------HEFVVERFEKFLWFITSENYLVIAGRDSQQNELI 354

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           VKR++  GD+YVHADLHGA+S VI+NH     VPP +LN+AG F +CHS AWD+K+VTSA
Sbjct: 355 VKRHLKPGDLYVHADLHGATSCVIQNHS-SNSVPPKSLNEAGTFAICHSAAWDAKVVTSA 413

Query: 644 WWVYPHQV 651
           W+V+  Q 
Sbjct: 414 WYVHHDQT 421



 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/175 (40%), Positives = 101/175 (57%), Gaps = 7/175 (4%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RGQK K KKM++KY DQDEEER +RM LL S G   K+D   + +     ++++      
Sbjct: 610  RGQKAKQKKMRKKYKDQDEEERQMRMELLRSEGNPDKDDK--KKKGKKNKQKEQQQRPQS 667

Query: 954  DAPKVCYKCKKAGHLSKDCKE-HPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGE 1012
               +   K  +A H  KD    H DD++  ++ +   G  E A+ +  + EE+D    GE
Sbjct: 668  AQQRKQGKGGQASHAFKDSMVIHEDDATVPIQAHVQEG--EVAKEEPESDEEKDAVLAGE 725

Query: 1013 EEK--GRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK 1065
              K     + +D LTG P P DILL+ IPVC PYSA+ ++KY+VK++PG+ KKGK
Sbjct: 726  NIKLVEASSVLDTLTGCPHPEDILLFAIPVCAPYSAMNNFKYKVKLVPGSNKKGK 780


>gi|221059774|ref|XP_002260532.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193810606|emb|CAQ42504.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 2040

 Score =  290 bits (741), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 175/454 (38%), Positives = 262/454 (57%), Gaps = 40/454 (8%)

Query: 317 GKDHPPTESGSSTQI-YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQ 368
           GK     E  S  +I + EF P++LN  +++      E + F+ F+  +D ++S++E S+
Sbjct: 439 GKGVVKEEEKSGEEITFTEFSPIILNNHKNKVEENKLEIIHFDDFNKCVDSYFSRMELSK 498

Query: 369 RAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVR 428
             +QQ   K   +  K++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R
Sbjct: 499 YDKQQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLRKKISLIQMNDELVEQAIQLMR 558

Query: 429 VALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKT 485
            A+A   +WE +   +K  +K  +P+A  I  +      M LLL +   N +  DD  + 
Sbjct: 559 AAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDGEENEEGSDDSSRE 618

Query: 486 LPVEK------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
              E             V ++L  S + N   + +++KK E K  KT  + + A K  EK
Sbjct: 619 ADEESPKRATGRESKLAVTINLNNSVYGNVEDYQKMRKKAEEKIRKTKISTNFAVKKVEK 678

Query: 534 KTRLQIL-----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           K + +         KTV  I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY 
Sbjct: 679 KKKEKENKQKGKHNKTVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYF 738

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
            K DVYVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWWV+ 
Sbjct: 739 QKNDVYVHADIHGASTCIIKNPYKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWWVHY 798

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           HQVSK+APTGEYL  GSF+IRGKKN+LP   L MG  ++F++D +++ +         EE
Sbjct: 799 HQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAAVEND--------EE 850

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL 742
             +DD + S    EN D E +  D D++ V ++L
Sbjct: 851 NNLDDTQKSF---ENDD-EKKNSDGDQEVVEDAL 880



 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   ++G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCKNIIVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKSN 139



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 47/73 (64%), Gaps = 1/73 (1%)

Query: 1006 DIHEIGEEE-KGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
            +  EI EEE K +++++  L   P   D L++ IP+C PYSA+Q+ KY+VK++PG AKKG
Sbjct: 1917 NFEEINEEEMKDKMSELKKLVCTPKEGDNLVFAIPMCAPYSAIQNQKYKVKLVPGNAKKG 1976

Query: 1065 KGIQIFYSLLLLM 1077
            K  +   S  L M
Sbjct: 1977 KVAESCISYFLKM 1989


>gi|320581674|gb|EFW95893.1| hypothetical protein HPODL_2176 [Ogataea parapolymorpha DL-1]
          Length = 940

 Score =  289 bits (739), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 222/745 (29%), Positives = 386/745 (51%), Gaps = 98/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   VK +   I G R  NVY+L  +P++++ K         S    K  L
Sbjct: 1   MKQRVSAFDIRVLVKEIEHAIKGHRLQNVYNLVANPRSFLLKF--------SVPDSKANL 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESG +++ T + R     PS F +KLRKH+++RRL +++Q+G DR+++ +FG GM  +
Sbjct: 53  VIESGFKVYLTEFQRPTAPEPSNFVVKLRKHLKSRRLSNIKQVGNDRVVVLEFGDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR---SHRDDDKGVAIMSRHRYPTEICR-VFERTTA 174
           Y++LE ++ GNI+L DS+  +L+L R    H ++D         RY   +   +F+R+  
Sbjct: 111 YLVLEFFSAGNIILLDSDRKILSLFRLVEEHENND---------RYAVGVTYGMFDRSLF 161

Query: 175 SKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGAR 234
            + H  L         EP                        + +D +K++++N+     
Sbjct: 162 EE-HGQL---------EPRHYT------------------SAEIYDWAKSASENT----- 188

Query: 235 AKQPTL-KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
           +K P++ K V   A      L +  +   G+ P    S V  ++D  +   V A     +
Sbjct: 189 SKVPSIAKLVFLNAAYLSSDLIQIQLSKNGIDPAS--SGVKIVQDEELLAKVTAAVNSCE 246

Query: 294 W----LQDVISGDIVPEGYILMQNKHLGKDHP---PTESGSSTQ---IYDEFCPL--LLN 341
                L ++ +G++   GYI+      GK +P   P E  S      +YDEF P   +  
Sbjct: 247 QEFYRLTNLPAGEL--SGYII------GKHNPFFKPEEDASYDNLEYVYDEFHPFEPVHK 298

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
           +  +    + + ++  LD+F+S +ES +A  + + ++  A  +L  +  +   ++  L++
Sbjct: 299 KKENTRVEEVKGYNRTLDKFFSTLESSKAVLKIQQQQANAAKRLQTVKDEHMTKLQRLEE 358

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
           +   + +  ELI ++ E ++    +V+  L  +M W ++ ++V  E+K  NP+A +I   
Sbjct: 359 QQAINYRKGELITFHSEQIEQCKQSVQALLDQQMDWTNIEKLVAMEQKRRNPIANMIKLP 418

Query: 461 LYLERNCMSLLLSN--------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
           L L +N +++LL +        +  + +++ K+ PV  V +DL+LSA+ANA R+++  + 
Sbjct: 419 LNLAKNEITVLLPDIEEQSDSDSDSDSEEKRKSGPVA-VAIDLSLSAYANATRYFDAMRA 477

Query: 513 QESKQEKTITAHSKAFKAAEKKTR--LQILQEKTV--ANISHMRKVHWFEKFNWFISSEN 568
              KQ KT  + S A K  E+  +  L+ +Q+K+   + +  +R   WFEKF WFI+S+N
Sbjct: 478 ALDKQNKTKNSASIAIKNTERTIQQDLKRMQKKSQEPSGLKQIRAKFWFEKFWWFITSDN 537

Query: 569 YLVISGRDAQQNEMIVKRYMSK-GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           +L I+GRD  Q ++I  RY  K  DV V  DL G    ++KN    + +PP TL QAG F
Sbjct: 538 HLCIAGRDDTQVDLIYYRYFDKNNDVLVSNDLDGL-KVIVKNPFKNKDIPPSTLLQAGIF 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           ++  S+AWD+KMVTS W V   QVSK    G  +  G   I+G+K FLPP  L+MGFGLL
Sbjct: 597 SLSASKAWDNKMVTSPWMVKGTQVSKKDFDGSIVPAGMLNIQGEKTFLPPCQLVMGFGLL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           +  DE +   + +  + R +E G++
Sbjct: 657 WLGDEETTRKYRDSAKSRIQEVGLE 681



 Score = 43.5 bits (101), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 63/161 (39%), Gaps = 44/161 (27%)

Query: 909  DQDEEERNIRMALLAS--AGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
            DQDEEER +RM +L +    K Q    + Q EN    ++ +             K ++  
Sbjct: 758  DQDEEERRLRMEVLGTLKQKKEQPEKSETQPENKGLDRKTR-------------KKQQDI 804

Query: 967  HLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTG 1026
             L K      +DS+   +  P                    +EI          +  L  
Sbjct: 805  RLLKKLVGELEDSAEETDTTPY-------------------NEI----------ISGLIP 835

Query: 1027 NPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
             P  SD ++  I V  PYSA+  Y Y+VK+ PG  KKGK +
Sbjct: 836  APKESDSIVNCILVFAPYSALSKYTYKVKVQPGPLKKGKAL 876


>gi|156101618|ref|XP_001616502.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805376|gb|EDL46775.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2067

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 179/484 (36%), Positives = 262/484 (54%), Gaps = 63/484 (13%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++LN  +++      E V F+ F+  +D ++S++E S+  +QQ   K   +  K
Sbjct: 441 FTEFSPIILNNHKNKVEENKLEVVHFDDFNKCVDTYFSRMELSKYDKQQEVIKIKKSLTK 500

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R A+A   +WE +   +
Sbjct: 501 MDKIKLDHERRIDQLEKEVSTLRKKISLIQMNDELVEQAIQLMRAAVATNANWEKIWEHI 560

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLL----SNNL------------DEMDDEEKTLPV 488
           K  +K  +P+A  I  +      M LLL     N L            D+   E    P 
Sbjct: 561 KLFKKQNHPIALRISSVNFNNCEMELLLDDGEENGLGSDDSSEANGRSDDPSSEANEQPS 620

Query: 489 EK---------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF----- 528
           +                V ++L  S + N   + +L+KK E K  KT  + + A      
Sbjct: 621 KGKKSSNKKAATNNRFAVTINLNNSVYGNVEDYQKLRKKAEEKIRKTKISTNFAVKKVEK 680

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           K  EK+ + +    KTV  I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY 
Sbjct: 681 KKKEKENKQKGKHNKTVGQIQKIRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYF 740

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
            K DVYVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWWV+ 
Sbjct: 741 QKNDVYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWWVHY 800

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           HQVSK+APTGEYL  GSF+IRGKKN+LP   L MG  ++F++D ++L ++        EE
Sbjct: 801 HQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAALDNN--------EE 852

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN-----VDSH 763
             +DD + S  +    D E    D D+  V     V   A  A  H  A N     ++  
Sbjct: 853 NNLDDTQKSFEN----DGERRSSDGDQAVVG---GVTIDACTAEGHIQAGNPYTGPMEGT 905

Query: 764 EFPA 767
            FPA
Sbjct: 906 SFPA 909



 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +   R +I G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCRNIIVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+++
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKTN 139


>gi|221482059|gb|EEE20420.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1859

 Score =  283 bits (725), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSIVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 104/241 (43%), Gaps = 59/241 (24%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
                + RG++GKL KMK+KYG  D++E                             +EK+
Sbjct: 1657 ----VPRGKRGKLAKMKKKYG--DQDE-----------------------------EEKQ 1681

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
              +S + A ++    K+ G  +      P  ++  +         E     K  +EEE  
Sbjct: 1682 FKMSLIGAEEI----KRGGPTATANAAAPACAAKKLPGRKAAQQREERRELKEVLEEEGD 1737

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              + E+     + +D LT +PLP D LL V+PV  PYSA+  YK++ K++PG+ KKG   
Sbjct: 1738 ERLTEQ----CSQIDLLTASPLPEDALLCVVPVTAPYSAMSKYKFKAKLVPGSMKKGNAG 1793

Query: 1068 Q 1068
            Q
Sbjct: 1794 Q 1794


>gi|221502557|gb|EEE28284.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1859

 Score =  283 bits (725), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSVVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 101/241 (41%), Gaps = 59/241 (24%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
                + RG++GKL KMK+KY                         GD   E      EK+
Sbjct: 1657 ----VPRGKRGKLAKMKKKY-------------------------GDQDEE------EKQ 1681

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
              +S + A ++    K+ G  +      P  ++  +         E     K  +EEE  
Sbjct: 1682 FKMSLIGAEEI----KRGGPTATANAAAPACAAKKLPGRKAAQQREERRELKEVLEEEGD 1737

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              + E+     + +D LT +PLP D LL V+PV  PYSA+  YK++ K++PG+ KKG   
Sbjct: 1738 ERLTEQ----CSQIDLLTASPLPEDALLCVVPVTAPYSAMSKYKFKAKLVPGSMKKGNAG 1793

Query: 1068 Q 1068
            Q
Sbjct: 1794 Q 1794


>gi|308808798|ref|XP_003081709.1| zinc knuckle (CCHC-type) family protein (ISS) [Ostreococcus tauri]
 gi|116060174|emb|CAL56233.1| zinc knuckle (CCHC-type) family protein (ISS), partial
           [Ostreococcus tauri]
          Length = 1090

 Score =  283 bits (724), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 193/588 (32%), Positives = 296/588 (50%), Gaps = 63/588 (10%)

Query: 1   MVKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGES----EK 55
           M K +    DVAA    +RRL +G   +N  D+  +     +M  +  +  G+      +
Sbjct: 120 MPKRKYTAFDVAASTAAIRRLALGCALANARDVDGEGGDAVMMTFNRPSRDGDGVESRAR 179

Query: 56  VLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           V ++++   R H T+YAR +  TPS F + +R+  R ++L D RQLG DR +   FG G 
Sbjct: 180 VRVVIDPSSRAHVTSYARARDGTPSAFVMAVRRAARGKKLRDARQLGRDRAMDLTFGAGD 239

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
            A +VI+EL+ +GN+++TD+ +TV   LR+ RDDD    + +   Y       +     +
Sbjct: 240 GACHVIVELFGRGNVIVTDANYTVARALRTRRDDDVKTRVEANQPYSLARFHAWRPYGKA 299

Query: 176 KLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
            + +AL +++         V  DG          LG +      D++            A
Sbjct: 300 DVVSALATAR--------VVAGDGE---------LGVE------DVT---------AVDA 327

Query: 236 KQP-TLKTVLGEALGYGPALSEHIILDTGLV--PNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           ++P TL+  L  A GY P ++EH+    G++   N  L   + + +  +  L  A+   E
Sbjct: 328 RRPATLREALCRAFGYSPPIAEHVARAAGVLDGSNAALPFADDVRERYVDGLTRAIEDIE 387

Query: 293 DWLQDVISGDIV---PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
            W + V +G  V   P  Y  M  K  G D          ++ D+F P  L Q   R   
Sbjct: 388 SWFEGVTTGKRVADAPRVYTKMDAKADGTDE--------IEVVDDFAPFELKQNEGRRTK 439

Query: 350 KFE---------TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
            +E          FD  +DE++++++SQ    Q +  E  A  +L K   DQ+NRV  L+
Sbjct: 440 TYELPKGLDPALAFDHYVDEYFNELDSQSVILQRRKAEAQAIARLEKTLRDQKNRVEQLE 499

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDK 460
           +E +   + A LIEYN E VD AI AV  ALA+ MSW++L  M+KEER+ GNPVAG+I  
Sbjct: 500 RERELEEQRAVLIEYNHEAVDVAIEAVNSALASGMSWDELEAMIKEERRLGNPVAGMIKS 559

Query: 461 LYLERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQ 517
           + L  N +++ L N+LDE+ ++E  L  +K   V VDL LSAHANA   +  KKK   K 
Sbjct: 560 MDLANNEITITLENHLDELGEDEDALGKKKRVAVSVDLGLSAHANASVRFAAKKKNADKF 619

Query: 518 EKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EKT+ A +KA  AAE K +  + +   V   +  R+  WFEKF+WFI+
Sbjct: 620 EKTLNAQNKAVAAAESKMKSAMERAANVVVATRARQPLWFEKFHWFIT 667



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 82/180 (45%), Gaps = 38/180 (21%)

Query: 909  DQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEK-----------KPAI--SPVDA 955
            DQDEE+R + M LL + G+ +K+ G    + A+  KEK           KP +  +P  A
Sbjct: 921  DQDEEDRELAMKLLGAEGR-KKSAG--MTKKAARMKEKAANDFEERKLTKPQVPSAPEPA 977

Query: 956  PKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEK 1015
            P    + + A  +S D                   +D      KV M  E+  +I  E  
Sbjct: 978  PPKWKRNESAADMSAD-------------------VDVGEAQPKVEMPLEERLKIESE-- 1016

Query: 1016 GRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
             RL+ ++ +   PL  D + Y +PVC P +A    KYR+K+ PG  KKGK  ++   +LL
Sbjct: 1017 -RLSIINRIVAFPLRHDEIEYCLPVCAPIAATNGLKYRMKVTPGAQKKGKAAKLAMDILL 1075


>gi|237842889|ref|XP_002370742.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
 gi|211968406|gb|EEB03602.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
          Length = 1859

 Score =  283 bits (724), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSIVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 101/241 (41%), Gaps = 59/241 (24%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
                + RG++GKL KMK+KY                         GD   E      EK+
Sbjct: 1657 ----VPRGKRGKLAKMKKKY-------------------------GDQDEE------EKQ 1681

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
              +S + A ++    K+ G  +      P  ++  +         E     K  +EEE  
Sbjct: 1682 FKMSLIGAEEI----KRGGPTATANAAAPACAAKKLPGRKAAQQREERRELKEVLEEEGD 1737

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              + E+     + +D LT +PLP D LL V+PV  PYSA+  YK++ K++PG+ KKG   
Sbjct: 1738 ERLTEQ----CSQIDLLTASPLPEDALLCVVPVTAPYSAMSKYKFKAKLVPGSMKKGNAG 1793

Query: 1068 Q 1068
            Q
Sbjct: 1794 Q 1794


>gi|328774280|gb|EGF84317.1| hypothetical protein BATDEDRAFT_8510 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 695

 Score =  281 bits (720), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 163/461 (35%), Positives = 254/461 (55%), Gaps = 49/461 (10%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM 311
           PA+++  I D     ++ L  ++  + ++   L+ A+ + +D L   I  D   +GYI+ 
Sbjct: 145 PAVNDANIADVQDSTSLDLYRIST-DSSSFLALLNALKQGDDILSSSI--DTPQQGYIVT 201

Query: 312 QNKHLGKDHPPTESGSST-----QIYDEFCPLLLNQF-------------RSREFVKFET 353
            +  + +    +++  S+       Y EF P    QF             +   F++F +
Sbjct: 202 SDSMVSQQLASSDTAQSSPTTTFTTYQEFHPYRFEQFNQDRSTSLSAELPKQTRFMEFVS 261

Query: 354 FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           FD A+DE++SK+ESQR E +    E AA  KL  +    + ++   +  V+ + + A+ I
Sbjct: 262 FDKAVDEYFSKMESQRLEIRAHQAELAAVKKLENVKKSHQAQIQNFQSNVESNEQYAQAI 321

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E  LED+D+ +  V+  LA+ M W+DL  +VKEE   GN +A +I    L          
Sbjct: 322 ESRLEDIDSVLRTVQSFLASGMDWKDLEDLVKEETNNGNALAKMIIGFKLN--------- 372

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
                       +   K+++D+  +A+ANARR+Y  KK   +KQ KT+   +K  K AE 
Sbjct: 373 ------------MEFFKIDLDIYSTAYANARRYYGAKKVAITKQSKTMEQSAKVVKMAEM 420

Query: 534 KTRLQILQ-EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
           K    +   +KT  +I+ +RK +WFEKF WF+SSEN+LV+ G+DA Q+ M+V RY+ KGD
Sbjct: 421 KIFQHLASVQKTAVSITKIRKPYWFEKFLWFVSSENFLVVGGKDATQSNMLVTRYLKKGD 480

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
            YVH+DL GA+S ++K       +       AG  +VC S+AWD+K++TSA+W   HQVS
Sbjct: 481 AYVHSDLPGAASVIVK------CMQSCVGTDAGTMSVCQSRAWDAKIITSAYWAEAHQVS 534

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           KT  TG+ L +G+FMIRGKKN+LPP  LI G  +LF+ D S
Sbjct: 535 KTTSTGDTLPLGTFMIRGKKNWLPPVQLIYGMAMLFQTDHS 575



 Score =  149 bits (377), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 73/145 (50%), Positives = 102/145 (70%), Gaps = 9/145 (6%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  DV+A V  L+ RL+G+R  NVYD++ KTY+FK         S    K LLL+
Sbjct: 1   MKQRFSALDVSASVVELKTRLVGLRLQNVYDINSKTYLFKF--------SRNETKELLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT ++RDK   PSGF +KLRKH+RTRRL ++RQLG DRI+  QF  G  A ++
Sbjct: 53  ESGIRMHTTQFSRDKSQMPSGFCMKLRKHLRTRRLVNLRQLGADRIMDMQFSEGEYAFHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
           I+E Y+ GNI+LTD E+ +L++LR+
Sbjct: 113 IVEFYSSGNIILTDHEYRILSVLRT 137



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 44/66 (66%)

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFY 1071
            +E+   ++ +D LTG P  +D LLY IPVC P++A+Q YKY+VK++PG  K+GK  +   
Sbjct: 577  DEQAIDMSFLDLLTGQPHETDNLLYAIPVCAPWTALQKYKYKVKLLPGALKRGKAAKSIT 636

Query: 1072 SLLLLM 1077
            +  L M
Sbjct: 637  ASFLSM 642


>gi|347828082|emb|CCD43779.1| similar to serologically defined colon cancer antigen 1
           [Botryotinia fuckeliana]
          Length = 674

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 211/647 (32%), Positives = 335/647 (51%), Gaps = 95/647 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS K ++ K              K  +L+
Sbjct: 78  MKQRFSSIDVKVIAHELSNALVTLRVSNVYDLSSKIFLIKFAKPDN--------KQQILI 129

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  +LRK ++TRR+  V Q+G DRII FQF  G    Y 
Sbjct: 130 DSGFRCHLTDFSRATAAAPSVFVQRLRKFLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY- 188

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LTD E  +LTLLR     D G A                     +L   
Sbjct: 189 -LEFYAGGNIILTDKELNILTLLRVV---DPGEA-------------------QEELRVG 225

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND-GARAKQP- 238
           L  S +      ++ N  G  + + +KE L          L K  +K  +D G + K+P 
Sbjct: 226 LKYSLD------NRQNYGG--IPDLTKERLQEA-------LQKGVDKGEDDSGKKKKKPG 270

Query: 239 -TLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             L+  L  ++  + P L +H +  T    ++K +EV + ED  ++ L+ ++ + +  +Q
Sbjct: 271 DALRKALAVSITEFPPMLVDHAMRITNFNSSLKPAEVLQSED-LLEHLMKSLQEAQRVVQ 329

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFR---SREFVK 350
           ++ S +   +GYI+ + K      P  E+ +  +   +YD+F P    QF+   S  F++
Sbjct: 330 EITSSE-TAKGYIVAKKKD--PQTPSDENETDIRKGLLYDDFHPFKPKQFQDDPSLVFLE 386

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+  +DEF+S IE Q+ E + + +E  A  K+     +Q  R+  L++    + + A
Sbjct: 387 FEGFNKTVDEFFSSIEGQKLESKLEEREKQAQKKIQAARNEQAKRLGGLQEIQALNERKA 446

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             ++ N+E V  A  AV   +A  M W ++ R+++ E+K  NPVA +I   L LE N ++
Sbjct: 447 SALQANVERVQEATDAVNGLIAQGMDWFEIGRLIEREQKFNNPVASMIKLPLKLEENTVT 506

Query: 470 LLLS---------------NNLDEMDDEEKTLPVEK-----------VEVDLALSAHANA 503
           +LL                +++ E +DE+ T    K           +++DLALS  ANA
Sbjct: 507 ILLDEEAFDEEEDSTYETDSDVSESEDEDDTAKTNKKKEKVADTRIPIDIDLALSPWANA 566

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
           R +++ K+   SK++KT+ + SKA K+ E K     +  + QEK +  +  +RK  WFEK
Sbjct: 567 RNYFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKAI--LRPVRKQMWFEK 624

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           F WFISS+ YLV++G+DAQQ+E++ KRY+ KGDVY+HAD+ GA+S +
Sbjct: 625 FIWFISSDGYLVLAGKDAQQSEILYKRYLKKGDVYLHADIRGAASVI 671


>gi|401410580|ref|XP_003884738.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
 gi|325119156|emb|CBZ54708.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
          Length = 1853

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 152/372 (40%), Positives = 229/372 (61%), Gaps = 18/372 (4%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q+ E+        A  ++ KI  DQ  R+  L++E  
Sbjct: 435 TRVLLHFRDINVCVDEYFSSVDVQKGERAEALARHEALSRVEKIRSDQAQRMQQLEEEAA 494

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K++ K G+P+A  + +L LE
Sbjct: 495 SLLEEAQAVEANVGLVEQIIQLLRAALATGVDWDELGRQMKQQAKEGHPLAVHVQELRLE 554

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
           +    LLL     E   EE +     V +D+ LSAH NA+  +   K+ ++K  KT +A 
Sbjct: 555 KQRALLLL-----EAPGEEASGATTVVSIDITLSAHGNAQLLHSQVKQLKAKTLKTSSAT 609

Query: 525 SKAFKAAEKKTRLQILQEKTVANIS-----HMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           + A  AA++K +  + Q++     +      +RK  WFEKF+WFISS++YLV++GRDAQQ
Sbjct: 610 AAALAAADRKAQRTLKQKEQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAGRDAQQ 669

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKN-------HRPEQPVPPLTLNQAGCFTVCHS 632
           NE++ +RY+   DVYVHAD+HGA++ +IKN          E PVP  TL Q G F VC S
Sbjct: 670 NEILFRRYLRANDVYVHADVHGAATCIIKNTGETDPGKTEEPPVPLATLQQCGEFAVCRS 729

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-D 691
            AW++K   +AWWVY HQVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLLFRL D
Sbjct: 730 SAWNTKTPAAAWWVYGHQVSKSAPSGLYLSTGSFMIRGRRNFIQIHRLEMGFGLLFRLAD 789

Query: 692 ESSLGSHLNERR 703
           E+S+  H+  R+
Sbjct: 790 EASVARHVAARK 801



 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/146 (40%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  V  +G+  K+ L +
Sbjct: 4   TKQRVGALDVRALVASIRPAVLGLRVTNVYDFSSGGGRGAGSSSYIVKLAGKDSKIFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR++L  FG G N   +
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCMRLRKSLRGKKLEDIHQHGADRVVLLTFGKGENTLRL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           I+ELY  GNI+LTD    +L +LR H
Sbjct: 124 IVELYVSGNIVLTDHTNLILAVLRRH 149



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 35/57 (61%)

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            + +D LT +P P D LL V+PV  PYSA+  YK++ K++PG+ KKG   Q      L
Sbjct: 1739 SQIDLLTASPFPEDALLCVVPVTAPYSAMSKYKFKAKLVPGSMKKGNAGQAVLRHFL 1795


>gi|297736765|emb|CBI25966.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score =  275 bits (703), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 128/160 (80%), Positives = 142/160 (88%), Gaps = 2/160 (1%)

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           +Y+HADLHGASSTVI+NH+PE PVPPLTL+QAGCFTVCHSQAWDSK+VTSAWWVYPHQVS
Sbjct: 104 LYIHADLHGASSTVIENHKPEHPVPPLTLSQAGCFTVCHSQAWDSKIVTSAWWVYPHQVS 163

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KTAPT EYLTVGSFMIRGKKNFLPPHPL+MGFGLL  LDESSLGSHLNERRVRGEEEG  
Sbjct: 164 KTAPTVEYLTVGSFMIRGKKNFLPPHPLMMGFGLLLCLDESSLGSHLNERRVRGEEEGAQ 223

Query: 713 DFEDSGHHKENSDIESEKDDTDEKPVAESLSV--PNSAHP 750
           DFE++   K NSD ESEK++TDEK  AES S+  P++  P
Sbjct: 224 DFEENESLKGNSDSESEKEETDEKRTAESKSIMDPSTHQP 263


>gi|428179079|gb|EKX47951.1| hypothetical protein GUITHDRAFT_106038 [Guillardia theta CCMP2712]
          Length = 841

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 229/766 (29%), Positives = 378/766 (49%), Gaps = 116/766 (15%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTES------------ 50
           K+RM++ D++ E + LR LIG R +N+YD++ +T   +L  S  + ES            
Sbjct: 91  KMRMSSLDLSVETRILRNLIGTRVANIYDINARTLEIRLGASCALKESQTLPMSADALHV 150

Query: 51  -GESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
            G S+++ +++ESG RLHT+ + R   + PS F  K+RKHIR + L DVRQ+G DR++  
Sbjct: 151 NGSSQRISVVIESGSRLHTSRFHRATASRPSNFATKIRKHIRGQFLNDVRQVGKDRVLQM 210

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FGLG  ++++ILE YA GNI+L D E T+L+LLRS+   D G  +  + +Y  +    F
Sbjct: 211 TFGLGNRSNHLILEFYAAGNIILCDHEMTILSLLRSYETPD-GRHVEVKSKYLIDDGGGF 269

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           +  +  +L  A+  S+               ++    +E+ G     K            
Sbjct: 270 QPMSCDRLVKAIERSR---------------SICRGLRESTGSSLTRKD----------- 303

Query: 230 NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                 K+  L  +L     Y   L EH++L  G+ P++   EV    D  +Q L+ A  
Sbjct: 304 -----KKKTALMKLLATECQYPGQLIEHVLLCAGIQPDIPADEVRN--DIDLQRLLQAFK 356

Query: 290 KFEDWLQ-------DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQ 342
           + +              S     +GY+++           ++S + T +  EF P+LL Q
Sbjct: 357 EIDHLFMLGHSQQLATPSSSAALKGYVILDR--------ISDSSNQTLVISEFSPILLKQ 408

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
              +  +++ + D A+DEF+S I+  R ++      +    K+NK   D ++    LKQE
Sbjct: 409 QEDKMVLEYPSIDVAMDEFFSTIDFNRDQKDANEAVETVSKKVNKAKKDIKSHTEGLKQE 468

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K A L+E N   +D A+  +R              +VK  R A N    ++ +++
Sbjct: 469 ELLNHKKATLLELNSFHIDEALDKIR-------------GLVKIHRNAAN----VLHEIH 511

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVE--VDLALSAHANARRWYELKKKQESKQEKT 520
              +  S  L        +  K+     +   +D ++S+ ANAR +++ KKK  +KQ++ 
Sbjct: 512 EMNSTASFRLPQEGIVESEAVKSRGATDITLVLDYSISSLANARNFFQKKKKVAAKQQRA 571

Query: 521 -----ITAHSKAFKAAEKK----TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
                I+  +   KA+++K    ++     + +   IS +R+  WFEKF WFISS+  LV
Sbjct: 572 EEMADISLKNTQIKASQRKNTKASKNDFQSKSSSIGISSVRRKFWFEKFFWFISSDQILV 631

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGCFTVC 630
           I+G+DAQQNE++VKR                      N   E+ V P  T+ QA  F VC
Sbjct: 632 IAGKDAQQNELLVKR----------------------NELKERKVLPENTILQAAEFAVC 669

Query: 631 HSQAWDSKMV--TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            S AW SK    T+A+WVYP QVSK   +GEYL+ G F+IRGKKNF+    L MGFG+ F
Sbjct: 670 RSSAWKSKTASGTAAYWVYPDQVSKAPQSGEYLSKGGFVIRGKKNFVSISTLCMGFGIFF 729

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTD 734
               ++  ++ +E  +R E+E ++   ++      ++I+++K +TD
Sbjct: 730 YSPRANDLTY-DENLMRKEQEDVEIVTETMSQTSFTEIDADKRNTD 774



 Score = 47.4 bits (111), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 18/64 (28%), Positives = 34/64 (53%)

Query: 1004 EEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKK 1063
            E    ++ E E  +  D  +   NP     ++Y +PV  P+SA++ Y++R  +IPG  ++
Sbjct: 777  ENSTSQVEEAEAFQATDFRHFITNPATKQEIVYALPVVAPFSAIRDYRFRGMLIPGLMRR 836

Query: 1064 GKGI 1067
             K +
Sbjct: 837  YKAL 840


>gi|68475252|ref|XP_718344.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
 gi|68475451|ref|XP_718248.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440007|gb|EAK99318.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440107|gb|EAK99417.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
          Length = 1018

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 215/751 (28%), Positives = 359/751 (47%), Gaps = 108/751 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K ++ A+Q +V A+   ED   D+I+G+I  EGYI+ +
Sbjct: 216 ELIQKWFFESGIDPSQSCLSFEKNQE-ALQRVVNALGVCEDKYIDLINGEIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 FSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEEITESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           IMGFG    LD+ S   +   R  R  E G 
Sbjct: 689 IMGFGFYCLLDDESTKRYGEIRTKRELEHGF 719



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 83/177 (46%), Gaps = 27/177 (15%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAI 950
            ++ RG++ KLKK+  KY +QDEEER +RM  L +  +V++     Q EN+    EK   +
Sbjct: 791  QLPRGKRSKLKKIAAKYRNQDEEERKLRMDALGTLKQVEERLSKTQIENS----EKSELV 846

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
                  +V  + KK     +  K    D ++  E N    +    E+             
Sbjct: 847  KKQQQKEVILERKKKQKERELQKYLLGDDNNDEETNNESHIVNYLEI------------- 893

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
                      +D  T  P   D ++ ++PV  P+SA+Q +KY+VKI PG+ KKGK I
Sbjct: 894  ----------LDSFTAKPSTKDTIVGLVPVFAPWSALQKFKYKVKIQPGSGKKGKCI 940


>gi|389585510|dbj|GAB68240.1| hypothetical protein PCYB_131140 [Plasmodium cynomolgi strain B]
          Length = 1898

 Score =  273 bits (698), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 165/437 (37%), Positives = 250/437 (57%), Gaps = 56/437 (12%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++LN  +++      E + F+ F+  +D ++S++E S+  +QQ   K   +  K
Sbjct: 464 FTEFSPIILNNHKNKVEENKLEVINFDDFNKCVDTYFSRMELSKYDKQQEVIKIKKSLTK 523

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R A+A   +WE +   +
Sbjct: 524 MDKIKLDHERRIEQLEKEVSSLKKKISLIQMNDELVEQAIQLMRAAVATNANWEKIWEHI 583

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLL---------SNNLDEMDDEE------------ 483
           K  +K  +P+A  I  +      M LLL         S++L    +E+            
Sbjct: 584 KLFKKQNHPIALRISSVNFNNCEMELLLDDEEATEQGSDDLSSEANEQGSDDPSSEANEQ 643

Query: 484 -----------KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE 532
                       T     V ++L  S + N   + +L+KK E K  KT  + + A K  E
Sbjct: 644 QSKGKASNREVATRSRFAVTINLNNSVYGNVEDYQKLRKKAEEKIRKTKISTNFAVKKVE 703

Query: 533 KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
           KK + +I + +   NI+  R V+WFEKF+WFISSENYLVI+GRDA QNE++ +RY  K D
Sbjct: 704 KKKKKKINRRE---NITRQR-VYWFEKFHWFISSENYLVIAGRDALQNEILFRRYFQKND 759

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           +YVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWWV+ HQVS
Sbjct: 760 IYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWWVHYHQVS 819

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           K+AP GEYL  GSF+IRGKKN+LP   L MG  ++F++D ++L ++        EE  +D
Sbjct: 820 KSAPAGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAALDNN--------EENNLD 871

Query: 713 D----FEDSGHHKENSD 725
           D    FE+ G  K +SD
Sbjct: 872 DTQRSFENDG-EKRSSD 887



 Score =  116 bits (291), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   L+G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCKNILVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKSN 139


>gi|83033024|ref|XP_729296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23486663|gb|EAA20861.1| strong similarity to unknown protein-related [Plasmodium yoelii
           yoelii]
          Length = 1768

 Score =  272 bits (695), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 162/416 (38%), Positives = 240/416 (57%), Gaps = 29/416 (6%)

Query: 330 QIYDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDAAF 382
           +++ EF P+LL    ++      E +KF+ F+  +D ++SKIE  + ++ Q   K   A 
Sbjct: 437 RLFVEFSPILLKNHINKINEKKIEIIKFDNFNMCVDTYFSKIELTKYDKHQEMNKNKNAL 496

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++KI +D E R+  L++EV    K   LIE N + V  AI  +R A++   +WE +  
Sbjct: 497 TKMDKIKLDHEKRIEGLEKEVSMLKKKILLIELNYQFVGEAIKLMRSAISTSANWEKIWD 556

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLS-------------NNLDEMDDEEKTLPVE 489
            +K  +K  +P+A  I  +      M LLL              NNL     +EK +  +
Sbjct: 557 HIKLFKKRNHPIALKIMSVNFNNCEMELLLDDNDDDDVEESGDDNNLKNDKWKEKVIEEK 616

Query: 490 K----VEVDLALSAHANARRWYELKKKQESKQEK----TITAHSKAFKAAEKKTRLQILQ 541
                V ++L  S   N   + +L+KK E K  K    T  A  K  K  + K   Q  +
Sbjct: 617 NKTCAVTINLNNSVFGNIEDYEKLRKKAEEKIRKIKMSTNIAVKKVEKKKKDKDIKQKGK 676

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
            K+V  I  +RK+ WFEKFNWFISSENYLVISGRD+ QNE++ +RY    D+YVHAD+HG
Sbjct: 677 NKSVFQIKKIRKIFWFEKFNWFISSENYLVISGRDSLQNEILFRRYFQNNDIYVHADIHG 736

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           A+S +IKN   + P+P  TL++AG   +C S AW++KM+TSAWWVY HQVSKTAPTGEY+
Sbjct: 737 AASCIIKNPYKDIPIPEKTLSEAGQLAMCRSSAWNNKMITSAWWVYYHQVSKTAPTGEYI 796

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
             GSF+IRGKKN+LP   L MG  ++F++++  +  +  E ++  +E   D+ E++
Sbjct: 797 KTGSFVIRGKKNYLPYAKLEMGLCIIFQINK-KVNDNNEENKLTDDEPNCDNNEEN 851



 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 100/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKNTIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR+I  QFG   N ++
Sbjct: 53  LEAEKRVHITEWVREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVIDIQFGYDDNMYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LTDS++ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIILTDSDYKIIFILKSNDDNKKNLKI 148


>gi|170576547|ref|XP_001893673.1| Serologically defined colon cancer antigen 1 [Brugia malayi]
 gi|158600188|gb|EDP37492.1| Serologically defined colon cancer antigen 1, putative [Brugia
           malayi]
          Length = 307

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 138/277 (49%), Positives = 185/277 (66%), Gaps = 10/277 (3%)

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           +AG+P+A  I  L L  N M+LLL       D     +  +KV +D+ALS++ NAR+ + 
Sbjct: 7   EAGSPIAASIVGLNLNSNQMTLLLG------DPYRPEIDPKKVTIDIALSSYQNARKLHT 60

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            KK  + K++KTI A SKA K+ + K +  +    T A +   R V WFEKF WF+SSEN
Sbjct: 61  EKKAAQQKEQKTICASSKALKSTKMKMKETLKVVHTKAEVMKKRHVMWFEKFFWFVSSEN 120

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YLVI GRDAQQNE++VKRY+  GD+Y+HAD+ GASS +I+N      VPP TLN+A    
Sbjct: 121 YLVIGGRDAQQNELLVKRYLRPGDIYMHADVRGASSIIIRNKLGGGDVPPRTLNEAATMA 180

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           + +S AW++K+ +SAWWV+ HQVS+TAPTGEYLT GSFMIRGKKN+LP   L MGFG++F
Sbjct: 181 ISYSSAWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMF 240

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSD 725
           +LDE SL  H  ER+V      M   ED+  H+++ D
Sbjct: 241 QLDEESLERHREERKV----APMVTAEDNAMHQDDGD 273


>gi|3859683|emb|CAA22020.1| conserved hypothetical protein [Candida albicans]
          Length = 1018

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 214/751 (28%), Positives = 358/751 (47%), Gaps = 108/751 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K ++ A+Q +V A+   ED   D+I+G I  EGYI+ +
Sbjct: 216 ELIQKWFFESGIDPSQSCLSFEKNQE-ALQRVVNALGVCEDKYIDLINGAIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 LSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEESTESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           +MGFG    LD+ S   +   R  R  E G 
Sbjct: 689 VMGFGFYCLLDDESTKRYGEIRTKRELEHGF 719



 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 84/186 (45%), Gaps = 45/186 (24%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAI 950
            ++ RG++ KLKK+  KY DQDEEER +RM  L +  +V++     Q EN+    EK  ++
Sbjct: 791  QLPRGKRSKLKKIAAKYRDQDEEERKLRMDALGTLKQVEERLSKTQIENS----EKSESV 846

Query: 951  SPVDAPKVCYKCKKAGH--------LSKDCK-EHPDDSSHGVEDNPCVGLDETAEMDKVA 1001
                  +V  + KK           LS D   E  D+ SH V                  
Sbjct: 847  KKQQQKEVILERKKKQKERELQKYLLSDDNNDEETDNESHIV------------------ 888

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
                            L  +D  T  P   D ++ ++PV  P+SA+Q +KY+VKI PG+ 
Sbjct: 889  --------------NYLEILDSFTAKPSTKDTIVGLVPVFAPWSALQKFKYKVKIQPGSG 934

Query: 1062 KKGKGI 1067
            KKGK I
Sbjct: 935  KKGKCI 940


>gi|238879662|gb|EEQ43300.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1018

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 240/887 (27%), Positives = 410/887 (46%), Gaps = 129/887 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K    A+Q +V A+   ED   D+I+G I  EGYI+ +
Sbjct: 216 ELIQKCFFESGIDPSQSCLSFEK-NQGALQRVVNALGVCEDKYIDLINGAIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 FSIKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEESTESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS-------------GHHKENSDIE 727
           +MGFG    LD+ S   +   R  R  E G     D+                KEN+  E
Sbjct: 689 VMGFGFYCLLDDESTKRYGEIRTKRELEHGFAIVVDNKKKELEEIRLAQKASAKENTAQE 748

Query: 728 SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV 787
              D++ E    E     ++  P  + T + + +  E P   + +  G  SK+    + +
Sbjct: 749 QRVDESSESDDNEDDGGEDADSP-DTDTVSVDANGEEKPVIVQQLPRGKRSKL----KKI 803

Query: 788 AAPVTPQLEDLIDRALGLGS-ASISSTKHGIETTQFDLSEEDKHVER 833
           AA    Q E+  +R L + +  ++   +  +  TQ + SE+ + V++
Sbjct: 804 AAKYRDQDEE--ERKLRMDALGTLKQVEERLSKTQIENSEKSESVKK 848



 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 85/177 (48%), Gaps = 27/177 (15%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAI 950
            ++ RG++ KLKK+  KY DQDEEER +RM  L +  +V++     Q EN+    EK  ++
Sbjct: 791  QLPRGKRSKLKKIAAKYRDQDEEERKLRMDALGTLKQVEERLSKTQIENS----EKSESV 846

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
                  +V  + KK     +  K    D ++  E N                   + H +
Sbjct: 847  KKQQQKEVILERKKKQKERELQKYLLGDDNNDEETN------------------NESHIV 888

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
                   L  +D  T  P   D ++ ++PV  P+SA+Q +KY+VKI PG+ KKGK I
Sbjct: 889  N-----YLEILDSFTAKPSTKDTIVGLVPVFAPWSALQKFKYKVKIQPGSGKKGKCI 940


>gi|70949333|ref|XP_744087.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56523889|emb|CAH79538.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1345

 Score =  262 bits (669), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 155/375 (41%), Positives = 222/375 (59%), Gaps = 20/375 (5%)

Query: 330 QIYDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDAAF 382
           +++ EF P+LL    ++      E +KF  F+  +D ++SK+E  + ++ Q   K   A 
Sbjct: 403 RLFVEFSPILLKNHINKIDEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 462

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++KI +D E R+  L++EV+   K   LI+ N E V  AI  +R A++   +WE +  
Sbjct: 463 TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 522

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
            VK  +K  +PVA  I  +    NC   LL   L+E D EE +      E  +     A 
Sbjct: 523 HVKLFKKRNHPVALKIMSVNFN-NCEIELL---LNEGDTEESSSEDSSKEKGMEEKNKAC 578

Query: 503 ARRWYELKKKQESKQEK----TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
                 L+KK E K  K    T  A  K  K  + K   Q  + K+V  I  +RK+ WFE
Sbjct: 579 T-----LRKKAEEKIRKIKMSTNVAIKKVEKKKKDKDTKQKGKHKSVFQIQKLRKIFWFE 633

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KFNWF+SSENYLVISGRD+ QNE++ +RY    D+YVHAD+HGA+S +IKN   + P+P 
Sbjct: 634 KFNWFLSSENYLVISGRDSLQNEILFRRYFQNNDIYVHADIHGAASCIIKNPYKDIPIPE 693

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   +C S AW++K++TSAWWVY HQVSKTAPTGEY+  GSF+IRGKKN+LP  
Sbjct: 694 KTLAEAGQLAMCRSSAWNNKVITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYA 753

Query: 679 PLIMGFGLLFRLDES 693
            L MG  ++F+++++
Sbjct: 754 KLEMGLSIIFQVNKN 768



 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 101/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C + +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKKTIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR++  QFG   N ++
Sbjct: 53  LEAEKRMHITEWMREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVVDIQFGYDDNVYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LT++E+ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIVLTNNEYKIIFILKSNDDNKKKLKI 148


>gi|358254228|dbj|GAA54239.1| nuclear export mediator factor Nemf, partial [Clonorchis sinensis]
          Length = 527

 Score =  261 bits (668), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 194/588 (32%), Positives = 276/588 (46%), Gaps = 143/588 (24%)

Query: 525  SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            S AFKA + +  L     +TVA I+ +RK  WFEKF WFISSENYLV++GRD+QQNE++V
Sbjct: 4    SAAFKAQQTRKDL-----RTVAQITKIRKPMWFEKFFWFISSENYLVVAGRDSQQNEVLV 58

Query: 585  KRYMSKGDVYVHADLHGASSTVIKNHRP---------------EQPVPPL-TLNQAGCFT 628
            KR++   D+YVHAD+HGASS ++K  RP                 P+PP  TL +AG   
Sbjct: 59   KRHLGSDDIYVHADVHGASSVIVK-ARPLTTEESSSDSVSSTSRLPLPPPKTLIEAGTLA 117

Query: 629  VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            +  S AW++++VTSAWWV   QVSKTAP+GEYLT G+FMIRG+KN+LPP   + GFG+LF
Sbjct: 118  IVLSSAWNARVVTSAWWVRQDQVSKTAPSGEYLTTGAFMIRGRKNYLPPCHFMYGFGVLF 177

Query: 689  RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSA 748
            +LDE S+  H  ERRV                                            
Sbjct: 178  KLDEESVEHHRGERRV-----------------------------------------TRI 196

Query: 749  HPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSA 808
             P+   T+A   ++ + PAE          K+  +  +   P T QL       L L   
Sbjct: 197  DPSDDFTSAPKTNAEDVPAE----------KVEMVESDTEFPDT-QLR------LNLVKD 239

Query: 809  SISSTKHGIETTQFDLSEEDKHVERTATV---RDKPYISKAERRKLKKGQGSSVVDPKVE 865
                T    E++ F ++        T TV   +DK  + K+   KL  G   +V D +  
Sbjct: 240  DKVQTTADTESSHFTITCSRGKGASTRTVTNKKDKAIVDKSN--KLPNG---TVEDNRTS 294

Query: 866  REKERGKDASSQPESIVRKTKIEGG-----------KISRGQKGKLKKMKEKYGDQDEEE 914
             EK          +S +RK K + G           K+ +G+K KL +  ++ G    +E
Sbjct: 295  TEKSNSGPIKRGQKSKLRKIKQKYGTQDEDERMARMKLLQGEKAKLSQHHKRLGPPFTQE 354

Query: 915  RNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKE 974
             NI  A           +GD +N + S+  E+ P  +  +        +    L    +E
Sbjct: 355  SNITPA-----------EGDEENSHKSSDNEEAPKNTEEEEGVDVAISQSEDDLQ--LEE 401

Query: 975  HPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGR-------LNDVDYLTGN 1027
             P  + H   DN                        G+E++GR       L  +D  TG 
Sbjct: 402  QPPVTPHPDSDN------------------------GDEQEGRQAIEDDWLRLMDTFTGQ 437

Query: 1028 PLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            P  +DILLY +PVC PYSA+Q+YKY++K+ PGT K+GK  +   +  L
Sbjct: 438  PRENDILLYAMPVCAPYSALQNYKYKLKLTPGTVKRGKAAKTALNCFL 485


>gi|73853411|gb|AAZ86776.1| IP12823p [Drosophila melanogaster]
          Length = 489

 Score =  261 bits (668), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 192/553 (34%), Positives = 281/553 (50%), Gaps = 100/553 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP E               A 
Sbjct: 115 LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE--------------RAK 159

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE +     K+ E+  N                                      L+
Sbjct: 160 QPTKELELEALVKLLENARN-----------------------------------GDYLR 184

Query: 242 TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
            +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 185 QILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272 EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245 QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPTENGTVEFF 297

Query: 332 YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298 FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390 MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416 QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507 YELKKKQESKQEK 519
           Y++K+    K++K
Sbjct: 476 YDMKRSAAQKKKK 488


>gi|320589532|gb|EFX01993.1| duf814 domain containing protein [Grosmannia clavigera kw1407]
          Length = 1969

 Score =  260 bits (665), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 245/429 (57%), Gaps = 22/429 (5%)

Query: 327  SSTQI-YDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
            +ST++ Y +F P    QF +      + F++F+ A DEFYS ++  +A++Q   +E  AF
Sbjct: 1215 ASTKLDYVDFHPFKPRQFEADPKCVLLPFDSFNKAADEFYSHLQGLKADRQLHQQESVAF 1274

Query: 383  HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
             KL     DQ  R+ +L++    + + A  IE N E V AAI AV   L     WEDLA 
Sbjct: 1275 KKLEATRRDQAMRIESLQETQQLNTRKAAAIEANQEWVQAAIDAVNDQLHVGTDWEDLAH 1334

Query: 443  MVKEERKAGNPVAGLID-KLYLERNCMSLLLSNN--------LDEMDDEEKTLPVEKVEV 493
            ++ E     NPVA LI   + L    ++L LS+          DE ++E +   +  V V
Sbjct: 1335 LI-ENSADSNPVAALIKLPMRLADGIITLQLSDEPAADFDEDFDEDEEEAEEEELLDVNV 1393

Query: 494  DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT--RLQILQEKTVANISHM 551
             LALSA  NAR +Y+ K+   SK++KT    S A + AEKK    L+ +Q+        +
Sbjct: 1394 KLALSAWGNAREYYDQKRVAASKEQKTKEVTSMALRNAEKKVAEELKRVQKGGKPAPQLI 1453

Query: 552  RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
            R+  WFEKF WF+SS+ +LVI+ +++QQ E++ +R++ +GD+YVHAD+ G+   ++  +R
Sbjct: 1454 RRQLWFEKFLWFVSSDGHLVIAAKESQQCELMYRRHLRRGDIYVHADIRGSPGIIVVKNR 1513

Query: 612  PE----QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
            P+     P+PP TL QAGC  VC S+AWD+K    A+WV+ +QV KT  +G+ L +GSF 
Sbjct: 1514 PDVGADAPIPPGTLAQAGCLAVCASEAWDNKAGFGAYWVHANQVFKTTASGDVLPLGSFD 1573

Query: 668  IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE 727
            IRG+KN LPP   ++GFGLLF++  +    +  E  V GE+   DD E  G   ++  +E
Sbjct: 1574 IRGEKNHLPPPQRVLGFGLLFQISNARTADYA-EVEVAGEDVA-DDVESDGPEIDSCPVE 1631

Query: 728  SEKDDTDEK 736
                +++ K
Sbjct: 1632 GNAQESEVK 1640



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 79/162 (48%), Gaps = 20/162 (12%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTE-----SGESEK 55
            +K R ++ DV A    L   L G R +NVYDL P +       +S         S   +K
Sbjct: 878  MKQRFSSLDVRAISHELHHSLAGTRVTNVYDLVPPSSSASSTAASTSRALLLRFSRGQDK 937

Query: 56   VLLLMESGVRLHTTAY-ARDKK-----------NTPSGFTLKLRKHIRTRRLEDVRQLGY 103
              L+++SG R H TAY AR              + PS F  +LR  +  R +  V+Q+G 
Sbjct: 938  FQLVVDSGFRCHLTAYDARASAASKGSSAGSAPHAPSAFVARLRTFLNGRHVTAVQQVGT 997

Query: 104  DRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRS 145
            DRI+  +F  G    Y  LE +A GN++LT++E  VL L R+
Sbjct: 998  DRIVELRFSDGQLRLY--LEFFAAGNVVLTNAEAKVLALQRT 1037



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L+ +D L G P   D ++ V+PVC P+SA+   KY+VKI PG  KKG+ ++
Sbjct: 1822 LDTIDTLVGRPAVGDEIVEVVPVCAPWSALAQLKYKVKIQPGQTKKGRAMR 1872


>gi|26334499|dbj|BAC30950.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  258 bits (660), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 247/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|51593729|gb|AAH80716.1| Sdccag1 protein, partial [Mus musculus]
          Length = 443

 Score =  258 bits (660), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 247/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|74152610|dbj|BAE42589.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 246/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH++ RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKGRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|12837616|dbj|BAB23886.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  256 bits (655), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 168/477 (35%), Positives = 246/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KAXLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD  
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKP 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|12857277|dbj|BAB30959.1| unnamed protein product [Mus musculus]
          Length = 415

 Score =  255 bits (652), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 168/475 (35%), Positives = 246/475 (51%), Gaps = 69/475 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNQVTMLL 410


>gi|54887337|gb|AAH37106.2| Sdccag1 protein [Mus musculus]
          Length = 415

 Score =  255 bits (651), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 168/475 (35%), Positives = 246/475 (51%), Gaps = 69/475 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410


>gi|113911846|gb|AAI22665.1| SDCCAG1 protein [Bos taurus]
          Length = 443

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 167/477 (35%), Positives = 249/477 (52%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRN 412


>gi|241958102|ref|XP_002421770.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223645115|emb|CAX39711.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 1012

 Score =  254 bits (649), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 220/781 (28%), Positives = 363/781 (46%), Gaps = 133/781 (17%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLL 58
           +K R+ + D+      L + L   R  N+Y+++   + Y+FK         S    K ++
Sbjct: 1   MKQRITSLDLQILTSELSKELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++E G R+H T + R     P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +
Sbjct: 53  VLEYGNRIHLTDFERPATQQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KY 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN         VL L       D+   I++  R                  
Sbjct: 111 YLVLEFFSAGN---------VLLL-------DESQKILALQR------------------ 136

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNA-SKENLGGQKGGKSFD-------LSKNSNKNSN 230
             L S+KE   N+   VNE+      +  +++   +K   + D         K     ++
Sbjct: 137 --LVSAKE--ENDRYAVNEEYKMFDKSLFQQDFHYEKRLYTLDEVESWIQTHKLKLSQAS 192

Query: 231 DGARAKQPTL-KTVLGEALGYGPALSEHIILDTGLVPNMK-LSEVNKLEDN--AIQVLVL 286
           D  +AK  ++ K     A      L +    ++G+ P+   LS     EDN  A+Q +V 
Sbjct: 193 DNKKAKVFSIHKLAFINASHLSGELIQKWFFESGIDPSQSCLS----FEDNQEALQQVVN 248

Query: 287 AVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL---LLNQF 343
           A+   ED   D+I+G I  EGYI+ +     K++  +E+     IYDEF P      NQ 
Sbjct: 249 ALGVCEDKYIDLINGAIDNEGYIVAK-----KNNKASENSELEYIYDEFDPFEPYKPNQ- 302

Query: 344 RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
              +F+    ++  LD+F+S IES +   + + +++ A  +L K   +++ ++ +L  + 
Sbjct: 303 EGLKFIPVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQ 362

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLY 462
             + K  ELI+Y+ E V+     V+  +  +M W ++  ++  E+K  N +A  I   L 
Sbjct: 363 KLNAKKGELIQYHSELVEECRNYVQSFIDQQMDWTNIETVISLEQKKKNDLAKHIQLPLN 422

Query: 463 LERNCMSLLLSNNLDEMDD---------------------------------EEKTLPVE 489
           L+ N + +LL    ++ DD                                 +E  +PV+
Sbjct: 423 LKENKIKVLL----EDFDDYEESTESASATETESETESETDSDSSSESESDNDEDKIPVK 478

Query: 490 KVE-----------------VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE 532
           + +                 +DL+ SA ANAR +++ KK  E+KQ K  ++ S A K AE
Sbjct: 479 RTQRKKNAKEKPKRKTVPTWIDLSQSAFANARSYFDSKKTAETKQVKVESSTSMALKNAE 538

Query: 533 KKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           +K    + +     N  +  +R  +WFEKF WF+SSE YL ++G+DA Q +MI  R+ S 
Sbjct: 539 RKINQDLTRSLKQENETLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSD 598

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
            D  V AD+ G+    IKN    + +PP TL QAG F +  S AW+ K+ TSAW ++  +
Sbjct: 599 NDSIVSADMEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSTSSAWNGKVTTSAWVLHGTE 658

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           +SK    G  +  G F    +K +LPP  L+MGFG    LDE S   +   R  R  E G
Sbjct: 659 ISKRDYDGSIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDEESTKHYAEIRTKRELEHG 718

Query: 711 M 711
            
Sbjct: 719 F 719



 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 86/179 (48%), Gaps = 32/179 (17%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAI 950
            ++ RG++ KLKK+  KY DQDE+ER +RM  L +  +V++     Q E++    EK  ++
Sbjct: 786  QLPRGKRSKLKKIAAKYRDQDEKERKLRMEALGTLKQVEERLSKTQIEDS----EKSESV 841

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
                  ++  + KK                            +  E+ K  + +++  E 
Sbjct: 842  KKQQQKEMILERKKK--------------------------QKERELQKYLLGDDNDEET 875

Query: 1011 GEEEK--GRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
             EE      L  +D  T  P   D ++ ++PV  P+SA+Q +KY+VK+ PG+ KKGK I
Sbjct: 876  NEESHIVNYLEILDSFTAKPSTKDTIVGLVPVFAPWSALQKFKYKVKVQPGSGKKGKCI 934


>gi|254566655|ref|XP_002490438.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238030234|emb|CAY68157.1| hypothetical protein PAS_chr1-4_0316 [Komagataella pastoris GS115]
 gi|328350832|emb|CCA37232.1| Uncharacterized protein YPL009C [Komagataella pastoris CBS 7435]
          Length = 1007

 Score =  253 bits (647), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 215/782 (27%), Positives = 379/782 (48%), Gaps = 112/782 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   V  L   I G R  N+Y +  + K+Y+FK      + +S +S    L
Sbjct: 1   MKQRISALDLKLIVSELSHSIKGYRLQNIYSMINNNKSYLFKF----AIPDSKKS----L 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESGV+LH T + R     PS F +KLRKH++ +RL +++Q+G DR+++F+F  GM  +
Sbjct: 53  VVESGVKLHLTDFQRPTTQQPSNFVVKLRKHLKAKRLTNLKQVGDDRLVVFEFSDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKL 177
           Y++LE ++ GN++L D +  ++TL R   + +      +  +Y T E   +F+   A KL
Sbjct: 111 YLVLEFFSGGNVILLDQDQKIMTLQRLVSEKE------NNEKYATGEFYNMFD---AKKL 161

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
            +      E  A+           + + SKE++      + F + +         A+   
Sbjct: 162 FS------EAPADHA---------IKSYSKEDIIQWLDTQDFKIEQ---------AKKTG 197

Query: 238 PTLKTVLGEALGY--GPALSE---HIIL-DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
            T+K    + L +   P LS    HI+L + G+ P    S + + ED ++  L+ ++A+ 
Sbjct: 198 KTMKPYTIQKLLFVNAPHLSSDLIHIVLREKGIDPTSD-STLYRSED-SLAKLLESLAEA 255

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR--EFV 349
           E  L ++++     +GYI+ +   +   H P+  GS   IYDEF P      RS   +  
Sbjct: 256 EIRLSELLTRKEDVDGYIVSKRNPI---HDPSTEGSLEYIYDEFHPYEPTHKRSSDTQIK 312

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
             + ++  +D+F++ IE  +   + + ++  A  +L  +  +   ++  L +    +++ 
Sbjct: 313 TIKGYNKTIDDFFTTIEVSKHSLKEQQQKVNAERRLQSVKSENLEKIAKLTEAQLLNIQK 372

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
            E+I    + V+    AV+  L  +M W  + +++  E+K GN +A LI+  L L  N +
Sbjct: 373 GEVIMVYSDVVEQCKAAVQSLLDQQMDWNHIEKLIGVEKKRGNEIAKLINLPLNLLENKI 432

Query: 469 SLLL--------------------------------------SNNLDEMDDEEKTLPVEK 490
           SL L                                       ++      ++KT+    
Sbjct: 433 SLALPLVNFDESSEEEDESDSEDESDSEDSSSSDEQETKNKKQSSTKHSRKKDKTI---N 489

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV----- 545
           V +DL+LSA+ANA  +++ KK  + K  KT      A K+AE K    + ++K       
Sbjct: 490 VNIDLSLSAYANASTYFDAKKIAQDKLVKTEKNSELAIKSAESKINRDLKKQKKTESSQV 549

Query: 546 ----ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM-SKGDVYVHADLH 600
               A +  +R   WFEK+ WFISS+ +L ++GRD QQ + I   Y  +  D  V  +L 
Sbjct: 550 NNSNAALRQIRDKFWFEKYFWFISSDGFLCVAGRDDQQFDHIYFEYFDNDNDFLVSNELE 609

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GA   ++KN    + V P T  QAG F++  ++AW++KMV+S W V    VSK    G  
Sbjct: 610 GALKVIVKNPFLNKDVAPNTFIQAGAFSLSTTKAWENKMVSSPWIVTGSSVSKRDVDGSA 669

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           L  G   I  +K FLPP  ++MGFG+L+  D+ +   +L++ + R EE G++  + +   
Sbjct: 670 LAPGLVNITTEKQFLPPCQMVMGFGMLWLGDKRTNDDYLSKSQSRTEELGLESVDVNAFK 729

Query: 721 KE 722
           K+
Sbjct: 730 KK 731



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 110/229 (48%), Gaps = 40/229 (17%)

Query: 843  ISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI-VRKTKIEGGKISRGQKGKLK 901
            +SK E+ +L+K Q  +  +P V+  +E  K   S  E + + + + +   + RG++ KLK
Sbjct: 743  LSKYEK-ELEKKQIQNDKEPSVDNAEEDSKSIVSSLEGLDINENQTQ---VKRGRRAKLK 798

Query: 902  KMKEKYGDQDEEERNIRMALLASAGKVQKNDG---DPQNENASTHKEKKPAISPVDAPKV 958
            K+K+KY DQDEE++  RM LL +  +VQ  +     P   N++   ++  + S V   K+
Sbjct: 799  KIKQKYADQDEEDKLKRMELLGTLKQVQAQEDIERQPSKSNSTNTAQQSSSASKVQKKKL 858

Query: 959  CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRL 1018
              +  +   L ++  E PDD                         EE + E+   +    
Sbjct: 859  A-ELHQLRKLLEEF-ESPDD-------------------------EEVVPELHYTQV--- 888

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              +  +  +P  +D ++  +PV  P+S++   KY+VK+ PG  KKGK +
Sbjct: 889  --LSTVISSPKKTDTIVDAVPVFAPWSSLNKLKYKVKVQPGNNKKGKSV 935


>gi|124805420|ref|XP_001350435.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|23496557|gb|AAN36115.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 2158

 Score =  252 bits (644), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 147/405 (36%), Positives = 227/405 (56%), Gaps = 36/405 (8%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++L     +      +++ F+ ++  +D ++SK+E S+  +QQ   K   A  K
Sbjct: 436 FTEFSPIILKNHEMKLNEGKIKYISFDDYNLCVDTYFSKLELSKYDKQQEITKSKNAITK 495

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N   ++  I  +R AL+   +WE +   +
Sbjct: 496 VDKIKLDHERRIEQLEKEVLLLKKKITLIQLNDVLIEEGIKLMRSALSTSANWEKIWEHI 555

Query: 445 KEERKAGNPVAGLIDKLYLERNC-MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
           K  +K  +P+A  I  +   +NC M  LLS    + DD +      K+  D   +   + 
Sbjct: 556 KIFKKQEHPIAVRIKSVNF-KNCEMDYLLS----DCDDRKGN----KMGDDGDDNDDDDD 606

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKA----------------AEKKTRLQILQEKTVAN 547
                    +   + KT  A  K  K                  +   + +   + +V  
Sbjct: 607 GDDDNNNNNKSCVKPKTFAAEEKIRKTKMATDFAVKKVEKKKKNKDNNKQKGKAKSSVGQ 666

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY  K D+YVHAD+HGA+S +I
Sbjct: 667 IQKLRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYFQKNDIYVHADIHGAASCII 726

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN   + P+P  TL++AG   +C S AW++K++TSAWWVY +QVSK+AP+GEYL  GSF+
Sbjct: 727 KNPYKDTPIPDKTLSEAGQLAICRSSAWNNKIITSAWWVYYNQVSKSAPSGEYLKTGSFV 786

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           IRGKKN+LP   L MGF +LF+++++     LN   +  EE  +D
Sbjct: 787 IRGKKNYLPHVKLEMGFCVLFQIEKN---EDLNVENLPLEENTID 828



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 94/147 (63%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A V  C + ++G   +N+Y++S K Y+ K         S + +K+  L
Sbjct: 1   MAKQRLTALDIRAIVTLCKKNIVGCIVTNIYNISNKIYVIKC--------SRKEQKLFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ +++QLG DR+I  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSSFTMKLRKHLRSRKISNIKQLGADRVIDIQFGYDEKASH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD  + +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDENYKILSILKSN 139



 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 37/49 (75%)

Query: 1017 RLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK 1065
            +LN++  LT +P   D L + IP+C PYSA+Q++KY++K++PG  KKGK
Sbjct: 2050 KLNEIHKLTNSPNEGDNLSFAIPMCAPYSAIQTHKYKIKLVPGNTKKGK 2098


>gi|224108804|ref|XP_002314973.1| predicted protein [Populus trichocarpa]
 gi|222864013|gb|EEF01144.1| predicted protein [Populus trichocarpa]
          Length = 235

 Score =  251 bits (640), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 152/229 (66%), Positives = 174/229 (75%), Gaps = 8/229 (3%)

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
           MAE IE+NL+ VD+AILAV VALA  + WEDLARMVK+E+KAGNP+AGLIDKL+ E+NCM
Sbjct: 1   MAEFIEHNLQGVDSAILAVPVALAKGIGWEDLARMVKDEKKAGNPIAGLIDKLHFEKNCM 60

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
           +LL++  +  M        + K       S+HANA+RWYELKKKQE KQEKT TAH KAF
Sbjct: 61  ALLIA--IISMK-----WMMMKRHFQCISSSHANAQRWYELKKKQECKQEKTFTAHKKAF 113

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           KAAEKK  LQ+ QEK+VA ISHM KVHW EKFNWFI + NYLVIS RDAQQNEM VKRYM
Sbjct: 114 KAAEKKIHLQLSQEKSVATISHMHKVHWLEKFNWFIGTWNYLVISRRDAQQNEMTVKRYM 173

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           SKGD+ V     GASSTVIKNHRPEQPVPPLTLNQ G +     + W S
Sbjct: 174 SKGDLEVCPCRSGASSTVIKNHRPEQPVPPLTLNQ-GEYLTDEGEVWQS 221


>gi|66357888|ref|XP_626122.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
 gi|46227289|gb|EAK88239.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
          Length = 1378

 Score =  250 bits (639), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 140/357 (39%), Positives = 207/357 (57%), Gaps = 30/357 (8%)

Query: 351 FETFDAALDEFYSKI----ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
            + F   +DEFYS I    ES+ A Q+HK      + K++K+ +DQE R+  L  E +  
Sbjct: 370 LDNFCKCVDEFYSSIDIVKESKFATQEHKT----IYSKVDKVKIDQERRLEGLSSEKEAC 425

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
           +  A+ +E + E ++  +  +R  +A    W+D+   +++++K  +P+A  I  L L+ +
Sbjct: 426 IVRAKFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDD 485

Query: 467 CMSLLLSNNLDEMDDEEKTLPV-----EKVEVDLALSA--HANARRWYELKKKQESKQEK 519
            + +L S    + D   +T PV     + +E DL +S    +N R  Y   K    K EK
Sbjct: 486 KVKILFS----QRDLGSETTPVVDQIGKSIEFDLIISKSIQSNIRFQYMESKALAEKFEK 541

Query: 520 TITAHSKAFKA--------AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           T  A+  A K         AEK ++  +     V  I  +R  +WFEKF WFISS+ YL+
Sbjct: 542 TQLAYKIALKKVTNIAKKDAEKASKGLV---SNVPRIKKLRAQYWFEKFYWFISSDGYLI 598

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I G DA QNE++ +RY+ K D Y+HAD+HGA++ ++KN    Q +P  TL +AG  ++C+
Sbjct: 599 IGGHDASQNELLFRRYLEKNDRYIHADIHGATTCIVKNTNNVQDIPLNTLCEAGQMSICY 658

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           S+AW +K V SAWWVYP QVSK AP+GEYL+ GSF+IRGKKNFLPP  L MG  L F
Sbjct: 659 SKAWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 715



 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 92/154 (59%), Gaps = 15/154 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM + D+ A V  + + L G +  N+YD++ +TY+FK          G  EK  LL
Sbjct: 4   MVKSRMTSVDICAMVHGISKDLKGQKLINIYDINSRTYLFKF---------GGEEKKFLL 54

Query: 60  MESGVRLHTTAYARDKK-----NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +ESG+R HTT + R+ +     ++ S F  KLR++IR ++L+D+ Q+G DRI+   FG G
Sbjct: 55  VESGIRFHTTQWKRENEHKTSVSSISFFNSKLRRYIRNKKLDDISQMGMDRIVKLTFGFG 114

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD 148
            N  Y+I E +  GNI+LTD  + +L +LR   D
Sbjct: 115 DNTFYLIFEFFVAGNIILTDCNYKILVILRDTND 148



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 96/212 (45%), Gaps = 42/212 (19%)

Query: 864  VEREKERGKDASSQPESI-VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            +ER  +  ++A+S   +I     K +   + RG+K KLKK+ +KYG+QD+EER I+M L 
Sbjct: 1142 LERLPKTSEEATSTKNNINSTNNKQKNSALPRGKKSKLKKVADKYGEQDDEERKIKMMLF 1201

Query: 923  ASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHG 982
             S    + ND      + S++K K       D+ +      +  H+S+  K         
Sbjct: 1202 GSKEMKKAND------DRSSNKTK-------DSNEFLNNQNRQLHISQQEKRRK------ 1242

Query: 983  VEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDI-----LLYV 1037
                      E  +M+KV   +  I +   E +    +  Y   + LP++      ++ V
Sbjct: 1243 ----------EQEKMEKVY--KNRIVDNSTENR----EFQYFKDSLLPTNKDEDSEIIAV 1286

Query: 1038 IPVCGPYSAVQSYKYRVKIIP-GTAKKGKGIQ 1068
            IP   P++ ++ +KY  ++ P G  K+ K  Q
Sbjct: 1287 IPTFAPFTCIKDFKYCARLTPGGVIKRSKAAQ 1318


>gi|34784822|gb|AAH56687.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 426

 Score =  249 bits (635), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 161/454 (35%), Positives = 240/454 (52%), Gaps = 68/454 (14%)

Query: 24  MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
           MR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS F 
Sbjct: 1   MRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPSSFA 52

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L +L
Sbjct: 53  MKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYVILNIL 112

Query: 144 RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVS 203
           R   D+   V    R RYP +  R  E          LT  +  +             V+
Sbjct: 113 RFRTDEADDVKFAVRERYPLDHARAAE--------PLLTLERLTEI------------VA 152

Query: 204 NASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTG 263
           +A K  L                             LK VL   L YGPAL EH +L+ G
Sbjct: 153 SAPKGEL-----------------------------LKRVLNPLLPYGPALIEHCLLENG 183

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK---HLGKDH 320
              N+K+ E  KLE   I+ +++++ K ED+++   + +   +GYI+ + +    L  D 
Sbjct: 184 FSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TTSNFSGKGYIIQKREIKPCLEADK 239

Query: 321 PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDA 380
           P  +  +    Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  
Sbjct: 240 PVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQ 295

Query: 381 AFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
           A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W ++
Sbjct: 296 ALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEI 355

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
             +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 GLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRN 389


>gi|349605644|gb|AEQ00813.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 388

 Score =  248 bits (634), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 157/454 (34%), Positives = 234/454 (51%), Gaps = 70/454 (15%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS F
Sbjct: 1   GMRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPSSF 52

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTL 142
            +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L +
Sbjct: 53  AMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYLILNI 112

Query: 143 LRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHAALTSSKEPDANEPDKVNEDGNN 201
           LR   D+   V    R RYP +  R  E   T  +L   + S+                 
Sbjct: 113 LRFRTDESDDVKFAVRERYPVDHARAAEPLLTLERLTEIIASA----------------- 155

Query: 202 VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILD 261
                                             K   LK VL   L YGPAL EH +++
Sbjct: 156 ---------------------------------PKGELLKRVLNPLLPYGPALIEHCLIE 182

Query: 262 TGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHP 321
            G   N+K+ E  K E   I+ +++ + K ED+++   + +   +GYI+ + +      P
Sbjct: 183 NGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--TTSNFSGKGYIIQKREM----KP 234

Query: 322 PTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
             E    TQ    Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E
Sbjct: 235 SLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQE 294

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
             A  KL+ +  D E+R+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W 
Sbjct: 295 KQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWT 354

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 355 EIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLL 388


>gi|440301763|gb|ELP94149.1| zinc knuckle domain containing protein, partial [Entamoeba invadens
           IP1]
          Length = 703

 Score =  246 bits (629), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 215/350 (61%), Gaps = 18/350 (5%)

Query: 344 RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
           + R F  F++F  A+DEF+S IE Q  E++ + K+     K+  +    E R   L ++ 
Sbjct: 1   KGRLFDTFDSFCDAMDEFHSHIEKQEYEEELEKKDATMKKKIQAVIDGHEKRYKGLLEKA 60

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKL 461
           +  V  A+++E ++  VD  I  + V L+ +M WE +  ++ +  K  +P  VA  I K 
Sbjct: 61  EEMVVKAKVVESHIIIVDQLIKEINVFLSEKMQWERVEEII-QSAKENDPTSVAQYIKKF 119

Query: 462 YLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA--NARRWYELKKKQESKQEK 519
               + + L L N  ++           K++VD+ L+ +   N R +YE+++   +K +K
Sbjct: 120 DFANDVVVLSLENANNQ-----------KIDVDVLLTKNGFENVRNFYEMRRVVLAKADK 168

Query: 520 TITAHSKAFK-AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           T+ +   A + A +K+ R+   ++  +A++  MR+  WFEKF+WFISSEN+++ISG+DA 
Sbjct: 169 TLESRETAIQQATQKQERVAKTKQIDLADLKKMRRRFWFEKFHWFISSENFVIISGKDAL 228

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QN+++ +RYM   D+YVHAD+HGA+S +IK  +  + +   TL QAG   VC S AW +K
Sbjct: 229 QNDVMYRRYMKNTDIYVHADIHGAASCLIKGVKG-KVIGAATLEQAGKVAVCRSSAWTNK 287

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +VTSA+WVY  QVSKTAP+GEYL  GSFMIRGKKN+LPP PL+ G G++F
Sbjct: 288 IVTSAYWVYSDQVSKTAPSGEYLVTGSFMIRGKKNYLPPAPLVFGLGIVF 337



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 69/233 (29%), Positives = 111/233 (47%), Gaps = 37/233 (15%)

Query: 846  AERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKE 905
            A + ++KK Q   +     E++K+  +DA  Q     ++      K +RGQ  K KK+K 
Sbjct: 443  AHKEEMKKQQARLMY----EKQKKSEEDAKRQE----KEANKSANKKTRGQLRKEKKLK- 493

Query: 906  KYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPK-----VCY 960
            KY +QDEE+R +RMA               +       +EKKP    V+  K     +C+
Sbjct: 494  KYVEQDEEDR-LRMA---------------ERIGHKFEEEKKPVAVVVEEEKTVKELMCH 537

Query: 961  KCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLND 1020
             C    H+++DC +   + +   +D      DE A+ +KV    +D  +  EEE+  ++D
Sbjct: 538  YCGSKEHIARDCPKRLAEVNKKKQD------DEKAKAEKVEKNAKDEVDDDEEEEQGVDD 591

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSL 1073
            V ++ G       + Y  P+CGPY  V  YKY +KI PG  K GK ++   S+
Sbjct: 592  VVFV-GELKEGMNVRYAAPICGPYECVTKYKYHLKITPGKLKAGKAVKSVMSM 643


>gi|147771938|emb|CAN75699.1| hypothetical protein VITISV_035986 [Vitis vinifera]
          Length = 327

 Score =  244 bits (622), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 144/288 (50%), Positives = 172/288 (59%), Gaps = 59/288 (20%)

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           KG++      HGASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQ WDSK+VTSAWWVYPH
Sbjct: 9   KGNMISMKYPHGASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQVWDSKIVTSAWWVYPH 68

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           Q                                          SSLGSHL ERRVRGEEE
Sbjct: 69  Q------------------------------------------SSLGSHLYERRVRGEEE 86

Query: 710 GMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPN---------------SAHPAPSH 754
           G  DFE++   K NSD ESEK++TDEK  AES S+ +               SAH   + 
Sbjct: 87  GAQDFEENESLKGNSDSESEKEETDEKRTAESKSIXDPPTHQPILEGFSEISSAHNELTT 146

Query: 755 TNASNVDSHEFPAEDKTISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
           +N  +++  E P E++ + NG DS+ I DI+    + V PQLEDLID AL LGS + S  
Sbjct: 147 SNVGSINLPEVPLEERNMLNGNDSEHIDDISGRHVSSVNPQLEDLIDWALELGSNTASGK 206

Query: 814 KHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
           K+ +ET+Q DL E+  H  R A VR+KPYISKAERRKLKKGQ +S  D
Sbjct: 207 KYALETSQVDL-EDHNHEXRKAKVREKPYISKAERRKLKKGQKTSTSD 253


>gi|68533893|gb|AAH99277.1| LOC733300 protein [Xenopus laevis]
          Length = 453

 Score =  239 bits (611), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 161/474 (33%), Positives = 246/474 (51%), Gaps = 63/474 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELTDSLLGMRVHNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +             HA 
Sbjct: 113 IVELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPID-------------HAK 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP                            LS    K   D A+ K   L
Sbjct: 160 --------APEP---------------------------LLSVERLKEVLDNAK-KGDQL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E ++   ++
Sbjct: 184 KKVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEGYMD--LT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +G+I+ Q +       P ++       +EF P L  Q  +  +++ ++F+  +DE
Sbjct: 240 QNFNGKGFII-QKREKKPSLEPDKASEDIFTNEEFHPFLFAQHANSTYIELDSFNKTVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SK+E Q+ + +   +E  A  KL  +  D E+R+ +L+   D      ELIE NL+ V
Sbjct: 299 FFSKLEGQKIDIKALQQEKQALKKLGNVRKDHEHRLESLQYAQDADKAKGELIEMNLDIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++++L N
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMMLKN 412


>gi|68071251|ref|XP_677539.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497695|emb|CAH96713.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 1012

 Score =  238 bits (606), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 141/369 (38%), Positives = 216/369 (58%), Gaps = 11/369 (2%)

Query: 361 FYSKIESQRAEQ-QHKAKEDAAFHKLNKIHMDQENRVH-TLKQEVDRSVKMAELIEYNLE 418
           + +K+ES + ++ Q   K   A  K++KI +D E R+  + K++V    K   LI+ N E
Sbjct: 7   YLTKMESTKYDKHQEMNKRKNALTKIDKIKLDHERRIEGSTKKQVSILKKKISLIQLNDE 66

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            V  AI  +R A++   +WE +   +K  +K  +P+A  I  +      M LLL+++  E
Sbjct: 67  SVGEAIKLMRSAISTSANWEQIWDHIKLFKKRDHPIALKIMSVNFNNCEMELLLNDDDIE 126

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK----TITAHSKAFKAAEKK 534
            + ++  L     +  +A     N++    L+KK E K  K    T  A  K  K  + K
Sbjct: 127 ENGDDNNLKNNSWKEKIA---DKNSKTC-TLRKKAEEKIRKIKMSTNMAVKKVEKKKKDK 182

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
              Q  + K+V  I  +RKV WFEKFNWFISSENYLVISG+D+ QNE++ +RY    D+Y
Sbjct: 183 DTKQKGKNKSVFQIKKLRKVFWFEKFNWFISSENYLVISGKDSLQNEILFRRYFQNNDIY 242

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           VHAD+HGA++ +IKN   +  +P  TL +AG   +C S +W++K++TSAWWVY HQVSKT
Sbjct: 243 VHADVHGAATCIIKNPYKDISIPEKTLFEAGQLAMCRSSSWNNKIITSAWWVYYHQVSKT 302

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDF 714
           APTGEY+  GSF+IRGKKN+LP   L MG  ++F++++  +  +  E  + G+++  +  
Sbjct: 303 APTGEYIKTGSFVIRGKKNYLPYAKLEMGLCIIFQVNK-QMDDNNKENALNGDKQNYESI 361

Query: 715 EDSGHHKEN 723
                + EN
Sbjct: 362 NSGDENGEN 370


>gi|344230527|gb|EGV62412.1| hypothetical protein CANTEDRAFT_126343 [Candida tenuis ATCC 10573]
          Length = 969

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 194/715 (27%), Positives = 334/715 (46%), Gaps = 97/715 (13%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           +++E G R+H T Y R+ + TPS F  KLRKH++TRRL  ++Q+G DRI++ +F  G+  
Sbjct: 1   MIVEFGNRIHFTDYERNIEPTPSNFVTKLRKHLKTRRLSSIKQIGDDRILVMEFSDGL-- 58

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            Y++LE ++ GNI+L D +  +L L R    +D   A+        E   +F+R+     
Sbjct: 59  FYLVLEFFSAGNIVLLDHDRKILMLQRVVDSNDDKFAV-------NETYNMFDRSL---- 107

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
                   EP      +V +  + +     +    Q   K          NS+   + K+
Sbjct: 108 ---FEQEPEPYVKRQYEVEQINSWIEKEKTKVEDNQNRLKEL-------ANSHTPTKLKK 157

Query: 238 PTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
             + ++          LS  +IL T    G+  +    E +  E   +  +V  + + ED
Sbjct: 158 SKIFSIHKLLFVNASHLSSDLILKTLNENGIRSSSSCFEFHDSE--MLSTIVATMNQCED 215

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFRSREFVKFE 352
               ++ G  + EG I+ +       +   E+  + Q ++DEF P     F+     KF 
Sbjct: 216 EYVKILQGGEI-EGIIVSKKNT----NATEETAENLQYLFDEFHPF--RPFKDGSLYKFT 268

Query: 353 T---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +   ++  LD+F+S +ES + E + + ++  A  +L+K   ++  ++ +L  E + ++K 
Sbjct: 269 SIQGYNKTLDQFFSTLESLKNEIKIENQKQLAMKRLDKAKNERVKQIESLINEKNANIKK 328

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
            +LI  N   V   I  +   L  +M W D+ + ++ ++ +G+ +   I    L  N + 
Sbjct: 329 GDLIILNANLVSGCIDFINGMLEKQMDWHDIEKYIELQKSSGDDITNAIQ---LPLNLLE 385

Query: 470 LLLSNNLDEMDDEEKT-------------------------------------------- 485
             +  NL + D +E                                              
Sbjct: 386 NKIKLNLPDTDVDENVESSETSSSDTESESDSSSSDSDSDSDSDSDSDSDFRGTKKSKSK 445

Query: 486 ------LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--- 536
                 +P   V +DL+LS +ANA  +++ KK  E KQ K       A + AE+K     
Sbjct: 446 SKKTKSVPTISVWIDLSLSPYANASTFFDSKKSAEVKQLKVEKNTGIALQNAERKITHDL 505

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            + LQ ++ A ++ +R+  WFEKF WF++S+ YL +SG+D  QN+MI  RY +  D +V+
Sbjct: 506 TKALQNESEA-LNKVREKFWFEKFYWFVTSDGYLCLSGKDDLQNDMIYYRYFNDDDFFVY 564

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           +D+ GA    IKN    + VPP T+ QAG F++ +S++W +K  +SAW++    VSK   
Sbjct: 565 SDIEGALKVFIKNPYKGETVPPSTIWQAGMFSLSNSESWSNKSSSSAWYLPGPGVSKKDI 624

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
            G  L  G F  +GKK  +PP  L+MGFG+ F  D+ +      +R VR EE G+
Sbjct: 625 DGSLLRPGKFNFKGKKEHMPPVQLVMGFGIYFVGDDETTKRAREKRLVRQEEMGL 679



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 77/172 (44%), Gaps = 47/172 (27%)

Query: 903  MKEKYGDQDEEERNIRMALLASAGKVQKN-------DGDPQNENASTHKEKKPAISPVDA 955
            + EKY DQDEE+R +RM  L +  +V++N       + + QN+N+    + K        
Sbjct: 757  IAEKYADQDEEDRILRMEALGTLKQVEENRKKQIEVEQEQQNKNSKYENQDK-------- 808

Query: 956  PKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEK 1015
                 + +K     K+ +++                     +  +A ++ +I        
Sbjct: 809  ----IQQRKQKQDEKELRKYL--------------------LQDMADKQNEIE------- 837

Query: 1016 GRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
              L+  D L   P  SD ++  +PV GP+ ++Q +KY+VKI PG  KKGK I
Sbjct: 838  -YLSIFDGLIAKPTKSDTIVDFVPVFGPWFSLQKFKYKVKIQPGNNKKGKSI 888


>gi|342186351|emb|CCC95837.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1015

 Score =  235 bits (599), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 265/1115 (23%), Positives = 466/1115 (41%), Gaps = 199/1115 (17%)

Query: 1    MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV-LL 58
            MVK RM + DV A  + +   L  +R  N+Y + P+T++F+          G++EK   +
Sbjct: 1    MVKSRMTSLDVKASSQEMHAELKNLRLLNIYSIPPRTFLFRF---------GQAEKKKTV 51

Query: 59   LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NA 117
            +++ G+RLH T   R+K   PS F  K+RK +   ++  VRQL +DR++ F  G+   N+
Sbjct: 52   VLDVGIRLHLTQVVREKPQIPSAFAQKMRKLLCNWKVRSVRQLDHDRVVDFHLGMSEENS 111

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
             ++++EL+++GN+                        +++ H Y  ++  +F     +K+
Sbjct: 112  LHIVVELFSKGNL------------------------VVTDHEYRVKL--LFRTEAVNKV 145

Query: 178  HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              A+           D++      +  A  E  GGQ+      L +  N+     A+   
Sbjct: 146  TPAV-----------DEIFL--KTIPRAPLEE-GGQEQISEEMLQQEWNEKF---AQWDG 188

Query: 238  PT-LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
            P  + ++L     +G +L+ HI+   G VPN+   ++N   +   + L+  +   + W  
Sbjct: 189  PVEICSILSSMYSFGNSLAGHIMSRAG-VPNVTKDKMNCSGEEMFRKLLPGM--LDAW-- 243

Query: 297  DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
             + S  +   GY+L  +K  G++         T I   FC +   +      + F+   A
Sbjct: 244  RLFSSPLPEGGYLLKSSKRGGQE----AMIPGTMISALFCSISTRRMLWLINI-FQISVA 298

Query: 357  ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
                F+   + +R E  +   +     K  +   +   R+  LK+  + S++   LI  N
Sbjct: 299  FAMNFFHIRKKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIFQN 358

Query: 417  LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
             E +D  I  +  AL  ++ W+D   ++K+ R  G+P+A +I ++  ER  + +L++ + 
Sbjct: 359  TETIDKIIGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLMNEDA 418

Query: 477  DEMDD----------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            D+ D+          E++     ++E+DL  +AH NA  ++   K   +K ++TI A  K
Sbjct: 419  DDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAATEK 478

Query: 527  AFKAAEKKTRLQI--LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            A   AE+K R      QEK +      R   W+EKFNWF +S   LV+ GRD +  ++++
Sbjct: 479  AMAGAERKGRTVTGKTQEKKIIT---ERCRFWWEKFNWFRTSCGDLVLQGRDERSTQLLL 535

Query: 585  KRYMSKGDVYVHADLHGASSTVIK-------------NHRPEQ-------------PVPP 618
            +R M  GD+++   + G    +++                P+              PV  
Sbjct: 536  RRVMRLGDIFLCCHVVGGLPCILRPAGSVWSAVNASSKSGPDGGNGGDVCATPKMCPVRK 595

Query: 619  LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
             ++ +A  + V  S AW+SK    AWWV+  QVS     G YL        G+++ L P 
Sbjct: 596  KSVEEAASWCVSRSPAWESKFTVGAWWVHASQVSGGTSAGCYL------YEGEQHDLEPP 649

Query: 679  PLIMGFGLLFRLDE-SSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDE-K 736
               +G GLLFR+   S L            E G+     + + KE      E D   E +
Sbjct: 650  SSRLGCGLLFRVARISDLSDAFGP-----PELGLGTPAPNSYGKEGEGDFLEPDTAVELR 704

Query: 737  PVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLE 796
            P+     +PNS H             H    E  T+     +K  D+      P   + +
Sbjct: 705  PLP---PLPNSRH---------QRQGHGVTGEPPTVGPARPTKAVDL-----QPAGTEKK 747

Query: 797  DLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQG 856
                     G A I  T   ++  Q                     ++K +RRKLKK Q 
Sbjct: 748  ---------GGALIGETVAQLKCKQ---------------------LTKNDRRKLKKIQ- 776

Query: 857  SSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERN 916
                       K + +D     E  +    + G K+SR Q   L  +  +  +       
Sbjct: 777  ----------RKYKDQDE----EDCLAGALLNGNKLSRVQ---LSMLGLQMAESSSCAAV 819

Query: 917  IRMALLASAGKVQ---KNDGDPQNENASTHKEKKPAISPVDA---PKVCYKCKKAGHLSK 970
             +   L +AG+ +     DG+   + A  H  +   I   D    P V   C    HL++
Sbjct: 820  PQAKALTTAGRQRVPTTGDGEKNEKKALMHGSQLTDIDGSDTNIPPSVLRGCDD--HLTE 877

Query: 971  DCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLP 1030
              +     +   +  +P     ++  +D  A+  E +    EEE  R  +  + T NP P
Sbjct: 878  CGQPESPGAGQNIRSHP----SKSNPVDPAAVNLEPLCSANEEEFER--EWVHFTANPRP 931

Query: 1031 SDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK 1065
             D + Y +  C P SA++SYKY+ ++  G AKKG+
Sbjct: 932  DDCVQYAVVTCAPMSALESYKYKTELFYGNAKKGQ 966


>gi|403222989|dbj|BAM41120.1| uncharacterized protein TOT_030000383 [Theileria orientalis strain
           Shintoku]
          Length = 1119

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 223/422 (52%), Gaps = 57/422 (13%)

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           DIVP GYI    K                + D+F P    + ++ E+   E ++ ALD F
Sbjct: 243 DIVP-GYIYRNAKG---------------VMDDFGPF---ELQNAEY--HEDYNYALDAF 281

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++K E  + E++ ++K+     KL KI  DQ+ R   L +E+    K   ++E N++ VD
Sbjct: 282 FTKNELVKQEKKTESKKPT---KLTKIKADQDKRESKLMEEIMGYDKQIRVLEENIDIVD 338

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +   +  +A+  SW D+   ++ +RK  +P+   I ++ +    +  + +   +E DD
Sbjct: 339 NCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVCYIKEINIPNQTLVFVSNPEGNERDD 398

Query: 482 E--EKTLPVEKVEV-DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           E   K L  E+V V D  L+ + N +++Y  +KK E+K E+T      A K   K    Q
Sbjct: 399 EPERKELVEEQVVVLDYRLTGYQNLKKFYINRKKAENKLERTKIGKEYALKKVAKSLSKQ 458

Query: 539 ILQEK-----TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
              +K         IS +RK  WFEKF WFI+S+ YLV++GRD+ QNE++VK+Y++KGD+
Sbjct: 459 PEVKKGDRRTREVKISSLRKRFWFEKFYWFITSQGYLVLAGRDSLQNELLVKKYLTKGDL 518

Query: 594 YVHADLHGASSTVIKNHRPEQPVPP-------------------------LTLNQAGCFT 628
           Y HAD+HGASS ++K +  E                              +++ +A  F 
Sbjct: 519 YFHADIHGASSVILKTNSQELIKSSESAEVSEVEKAGGRGNEEEFIAKIRVSIEEAANFA 578

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           VCHS AW+ K    +WWVY HQVSKT PTGEY+  GSF+IRGKKN+L P  L MG   LF
Sbjct: 579 VCHSNAWNDKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLQPQKLEMGITYLF 638

Query: 689 RL 690
           ++
Sbjct: 639 QV 640



 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 94/171 (54%), Gaps = 9/171 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MV+ R+N  DVA  V  L++ L  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MVRERLNAIDVAISVANLKKTLDNITLVNIYDITNRLFILKF--------SRNENKIYVL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+HTT + R   + PS F  KLRKH+R RRL DV+Q+  DRII F F    +A +
Sbjct: 53  IEIGCRIHTTQFLRSVDHLPSNFNAKLRKHLRNRRLRDVKQMSQDRIIDFTFSSEEHAMH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           +I++L+  GNI LTD E+ VL +L+     D    + + +    E    FE
Sbjct: 113 LIVQLFLPGNIYLTDHEYKVLAVLKPKNTGDNFFKVGTNYVCDMEYNSWFE 163



 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 27/33 (81%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
            D +L VIP+C PYSA++ Y++ +K++PG AKKG
Sbjct: 1043 DDVLSVIPMCAPYSAIKHYRHVLKLVPGNAKKG 1075


>gi|294875379|ref|XP_002767293.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239868856|gb|EER00011.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 1087

 Score =  233 bits (594), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 142/357 (39%), Positives = 217/357 (60%), Gaps = 11/357 (3%)

Query: 348 FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
            V++ +F   +D++Y+++   + E Q   K+     K+  I  DQ  R+  L++E     
Sbjct: 365 VVEYPSFTECVDDYYTRLMRAQLEGQLVQKQSQMISKVENIKSDQRRRMGELEKEQQSLW 424

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + A  +E N    DAAI  V   LA ++ W++L   VK++++AG+P+A  I +L L++N 
Sbjct: 425 EQAVALEANTTLADAAIQMVNALLAAKLRWDELTIAVKQQQRAGHPLAMHIRQLALDKNR 484

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +S++L       DD++    VE V +DL  +A AN    +E +K  + K  KT    ++A
Sbjct: 485 ISIVLEKAASTDDDDDGATTVE-VWLDLGRTAQANVALLHEKRKGMQEKMGKTEEQMARA 543

Query: 528 FKAAEKKTRLQILQEKTVAN---------ISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
            K AEK+ + +       A          ++  RK  WF+KF WFISS+  LV++GRDAQ
Sbjct: 544 VKMAEKRLKGKGAGGNQAAAALGGAEKQLLAKRRKKFWFQKFFWFISSDRLLVLAGRDAQ 603

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QNE++ +RY++  D+YVHADL GA++ VIK  +     P  TL +AG +++C S+AWD+K
Sbjct: 604 QNELLWRRYLAPTDIYVHADLAGAATVVIKMPKGGVEPPQRTLAEAGQYSLCRSRAWDNK 663

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP-LIMGFGLLFRLDESS 694
           +VTSAWWV+  QVSKTAPTGE+L+ GSFMIRGKKNFLPP   L MG G+++ + + S
Sbjct: 664 IVTSAWWVWAKQVSKTAPTGEFLSTGSFMIRGKKNFLPPTGRLEMGLGVMWTVTDDS 720



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 93/199 (46%), Gaps = 23/199 (11%)

Query: 889  GGK---ISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS-----AGKVQKNDGDPQNENA 940
            GGK   ++R Q+ KL K++EKYGDQDEEER IRM L+ S       + Q+       E  
Sbjct: 806  GGKTKPLTRHQRKKLAKIREKYGDQDEEERLIRMKLMGSKEVKVVEEQQQQQQRQDEEED 865

Query: 941  STHKEKKPAISPVDAPKVCYKCKKAGHLSKDC----KEHPDDSSHGVEDNPCVGLDETAE 996
                E+  +        +C+KC + GHL+  C     E   +SS  V+D+      E  E
Sbjct: 866  DDVVEEASSKDVTTGKNICFKCGEEGHLASACPNAAAEAQANSSRQVDDH------EEEE 919

Query: 997  MDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNP-LPSDILLYVIPVCGPYSAVQSYKYRVK 1055
             ++   EE  +   G    G  + +D L   P    D +L  + VC PY A+     +VK
Sbjct: 920  EEEEDEEEAKVTSGG----GIAHTLDRLQSWPEWGEDEVLGAVMVCAPYQAMTQIPIKVK 975

Query: 1056 IIPGTAKKGKGIQIFYSLL 1074
              PG  K+GK  Q+   LL
Sbjct: 976  FTPGQMKRGKAAQLGLKLL 994


>gi|399216143|emb|CCF72831.1| unnamed protein product [Babesia microti strain RI]
          Length = 933

 Score =  231 bits (590), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 149/453 (32%), Positives = 246/453 (54%), Gaps = 53/453 (11%)

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPT 323
           +V N ++SE N   D   + LV A+ K  + L+ +  G+    GY+ +  K++   +   
Sbjct: 209 IVHNEQISEDNI--DQCAERLVCAILKISELLETLKKGN--NGGYVTLDPKYV---NSSL 261

Query: 324 ESGSSTQIYDEFCPLLLNQFRSREFVKFETFD------------------------AALD 359
           +   +T + D + P++  +  +R  V F +++                          LD
Sbjct: 262 DCIPATALID-YSPIIA-EIDTRNCVSFNSYNEVSYFFVRIGYYNLIIEQSKIKISKCLD 319

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            ++ K E+     +  +K +       KI +DQE R+  +K +V  + K A LI+ +   
Sbjct: 320 FYFGKFETFEKPTKKPSKAE-------KIKIDQEKRISNMKTQVQIAEKNAYLIDKHSAL 372

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  I  +R  +A    W+D+   ++ +++ G+ +A L D++  +   + L L  N D+ 
Sbjct: 373 VDECISLMRTLIATGSRWDDIWDEIELQKQMGHEIAILFDRVDFKTGEIFLSLKENSDDE 432

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
           D       V  V V +  S  +N R  + ++K   +K ++T  + + A K  +K  +   
Sbjct: 433 D-------VCIVPVSVNQSVFSNLRGIHNMRKNILAKIDRTGLSMAMAIKNVQKNDKTPN 485

Query: 540 LQEKT----VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
             +K+    V  I  ++K +WFEKF WFISS++YLV++GRD+ QNE++VKR+M   D+Y+
Sbjct: 486 KSDKSSTKQVERI-KVKKRYWFEKFKWFISSDDYLVLAGRDSIQNEILVKRHMESNDIYI 544

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
           HAD+HGA+S ++KN+  + P+P  TL +AG F+VC+S AW +K +TSAWWV   QVSKT 
Sbjct: 545 HADIHGAASCIVKNNSSD-PIPQRTLIEAGQFSVCNSSAWKAKFMTSAWWVESSQVSKTP 603

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            TGEYL  GSF+IRGKKNFLPP  L MG  +++
Sbjct: 604 ETGEYLPSGSFVIRGKKNFLPPSKLEMGLAVIY 636



 Score =  120 bits (301), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 87/139 (62%), Gaps = 9/139 (6%)

Query: 6   MNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K ++  ++G    N+YD+S K YI K+ N           K  LL+E+G 
Sbjct: 4   MTSLDICAVLKEIKEAIVGGSVINLYDVSKKVYILKVSN--------RDSKFFLLLEAGS 55

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + R K + PSGFT+KLRKH++ +R+  VRQLG DR++   FG G   H++I++ 
Sbjct: 56  RIHLTQFMRSKDSMPSGFTMKLRKHLKGKRVSKVRQLGLDRVVDIVFGTGDYEHHLIIQF 115

Query: 125 YAQGNILLTDSEFTVLTLL 143
           Y  GNI LTD+E+ +LT L
Sbjct: 116 YVSGNIFLTDNEYKILTSL 134



 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 86/199 (43%), Gaps = 21/199 (10%)

Query: 872  KDASSQPESIVRKTKIEG-----GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
            K AS   + I + + I G     GK+ RGQK   KK  +KY DQD +   IRM L+ S+ 
Sbjct: 705  KGASFTVQRIAKASNIVGKKKSDGKLVRGQK-SKKKRMKKYEDQDSDIEEIRMMLMGSSK 763

Query: 927  KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDN 986
             + K+   P  +     +  +  I  ++ P           +SK    + D S+    + 
Sbjct: 764  PI-KHKSQPDEQIVEKKQSVREDIIRIEKPFYRPPPFTTALISK--VSYTDQSTDASFEE 820

Query: 987  PCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSA 1046
              + +  + +  +   +E    EIG  +     D           ++    + +CGP+ A
Sbjct: 821  ANLTIPASTDSHRTN-DETACGEIGTAKTDDRAD-----------NVPFQCVVMCGPWEA 868

Query: 1047 VQSYKYRVKIIPGTAKKGK 1065
            +  Y+ R+K++PG  KKG+
Sbjct: 869  ICRYRLRIKLLPGNGKKGQ 887


>gi|154418675|ref|XP_001582355.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916590|gb|EAY21369.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 875

 Score =  229 bits (583), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 143/436 (32%), Positives = 231/436 (52%), Gaps = 27/436 (6%)

Query: 321 PPTESGS-STQIYDEFC-PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           PP   G   T+  D+F  P  L Q+   +   F+TFD A DEF+S  E +RA+++HK  E
Sbjct: 266 PPKPKGYVYTKGKDKFLSPFPLAQYDPSQSQVFDTFDKACDEFWSVRELERAQKEHKENE 325

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
            A   K+  +  + + +    + E+D   +   LI+ N   ++     +   +ANR+ W+
Sbjct: 326 AAPDKKVQSVKKNFDKKRKQFQDELDLLNRTGHLIQANATQIEQCRNVINSFIANRVRWD 385

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
           ++   ++  ++ GN +A +IDK+  E++    L++      D+E KT   E++ ++L  +
Sbjct: 386 EIRMSIRAYQECGNELASMIDKVDFEKSGFYCLVN------DEEGKT---ERIFIELKKT 436

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
           A+ANA  +++ +     K E       +  K  EK       ++K  + I   RK  WFE
Sbjct: 437 AYANASAYFDKRAVLVKKLEGANAKEEEVLKKVEKDAIA--AKKKVTSTIQERRKTWWFE 494

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F+WFI++ENYLVISGRD  QNE++V  Y+ K D+Y+HA++HGA+S +IKN    +PV P
Sbjct: 495 RFHWFITTENYLVISGRDKVQNEVLVAHYLKKDDIYLHAEIHGAASVIIKNPT-SKPVSP 553

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           ++L QA  F V  S AW S    + +WV+  QV K  P       G+F I G+KN +   
Sbjct: 554 ISLEQAAEFAVARSSAWKSNEPCNCFWVHADQVKKNLPGQPTAPKGTFYIVGEKNMMTMT 613

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEK-- 736
              MG G+LF + E  +  H NER++R +E+           K  S+I  E+ +T  K  
Sbjct: 614 MPQMGLGILFHVTEQHVADHANERKIRVDED----------EKPESEIPKEEGETKPKLP 663

Query: 737 PVAESLSVPNSAHPAP 752
           P  +S  +  +A P P
Sbjct: 664 PRVDSAEI-EAALPFP 678



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 75/143 (52%), Gaps = 13/143 (9%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           + ++ +V  E+  L+ LIGMR  N++ +   T   K     GV+        +L++++GV
Sbjct: 4   QFSSYEVKVEIDSLQELIGMRIGNIHQVDKDTLTMKFW-KLGVSR-------ILIVQNGV 55

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T + R+K   P  F  +LRK +R RRL D+ Q   DR + F FG       +  EL
Sbjct: 56  RFHITDFPREKPKVPPDFCCRLRKLLRFRRLNDIIQPLNDRAVYFCFG----DLRLCFEL 111

Query: 125 YAQGNILL-TDSEFTVLTLLRSH 146
           +  GNI+L  +++  +  +L+ H
Sbjct: 112 FQGGNIILFQETDKIIQAVLKYH 134



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 1004 EEDIHEIGEEEKGRLN----DVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPG 1059
            EE + EI +EE   ++     ++ LTG PLP+D       +C P SA+  +KY+VK +PG
Sbjct: 747  EEGVQEIMQEEGIPIDLDTEGINALTGEPLPTDEFFAAYVMCAPVSALLKFKYKVKFVPG 806

Query: 1060 TAKKGKGIQIFYSLLLLM 1077
              KKGK   +  +    M
Sbjct: 807  ETKKGKAWPVISNYFQSM 824


>gi|159477991|ref|XP_001697091.1| hypothetical protein CHLREDRAFT_181058 [Chlamydomonas reinhardtii]
 gi|158269999|gb|EDO96040.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 246

 Score =  228 bits (582), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 116/202 (57%), Positives = 142/202 (70%), Gaps = 16/202 (7%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V VDL+LSAHANA  +++ ++K  +K  +      +       +      Q         
Sbjct: 1   VAVDLSLSAHANASAYFDTRRKHLAKLGEQDAGCQRGGAGGGGEEGGGGTQAA------- 53

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
                      WFISSENYLV+SGRDAQQNE++VKRY  KGDVYVHA+LHGASST++KN 
Sbjct: 54  ---------LPWFISSENYLVVSGRDAQQNELLVKRYFRKGDVYVHAELHGASSTIVKNP 104

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
           +P+QP+PP+TL QAGC  VC S+AWDSK+VTSAWWV+ HQVSKTAP+GEYL  GSFMIRG
Sbjct: 105 QPDQPIPPITLQQAGCACVCRSRAWDSKIVTSAWWVHHHQVSKTAPSGEYLVTGSFMIRG 164

Query: 671 KKNFLPPHPLIMGFGLLFRLDE 692
           KKNFLPP PL+MGFG LF+ DE
Sbjct: 165 KKNFLPPQPLVMGFGFLFKWDE 186


>gi|71027701|ref|XP_763494.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350447|gb|EAN31211.1| hypothetical protein, conserved [Theileria parva]
          Length = 1249

 Score =  228 bits (581), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 132/386 (34%), Positives = 204/386 (52%), Gaps = 51/386 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+ A+D F++K E     +Q K  +D    KLNKI +DQ+ R   L +++ +     
Sbjct: 275 FEDFNDAVDAFFTKHE---LAKQEKKTQDKKPTKLNKIKIDQDKREQKLVEDIRKLDLEI 331

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +L+E N++  +  +   +  +A+  SW D+   ++ +RK  +P+   I ++ +     +L
Sbjct: 332 KLLEENVDIAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKEINIP--TQTL 389

Query: 471 LLSNNLDEMDDEEKTLPVEK-----------------VEVDLALSAHANARRWYELKKKQ 513
           +  N +   D   +     K                 V +D  L++H N ++ Y  +K+ 
Sbjct: 390 IFHNPISGSDQLSQGGQSGKPGKSGTQSKLSKDLTASVSLDYRLNSHQNLKKLYNERKRL 449

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQE----KTVANISHMRKVHWFEKFNWFISSENY 569
           E+K E+T      A K   K  + Q  ++    K    IS +RK  WFEKF WFI+S+ Y
Sbjct: 450 ENKLERTKIGKEYALKKVTKSLKKQETKKTDKNKRDVRISSVRKRFWFEKFYWFITSQGY 509

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK-----------------NHRP 612
           LV++GRDA QNE++VK+Y++ GD+Y HAD+HGA+S ++K                 N   
Sbjct: 510 LVLAGRDALQNELLVKKYLTNGDLYFHADIHGAASVILKTNSNSSSFNLTTGTTSDNTET 569

Query: 613 EQPVPPL--------TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
               PP         ++++AG F VC S AW+ K    +WWVY HQVSKT PTGEY+  G
Sbjct: 570 TNTSPPYDMIKSVKESIDEAGNFAVCLSTAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQG 629

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL 690
           SF+IRGKKN+LPP  L MG   LF++
Sbjct: 630 SFVIRGKKNYLPPQKLEMGITYLFQV 655



 Score =  128 bits (322), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 96/172 (55%), Gaps = 9/172 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIG-MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+N  DVA  V  L++LI  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MAKERLNAVDVAVVVSNLKKLISNLTLVNIYDITNRIFILKF--------SKNENKIYIL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+H T + R   + PS F  KLRKH+R RRL D+ Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHATQFLRSVDHLPSNFNAKLRKHLRNRRLRDISQISQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           +I++L+  GNI LTD E+ VLT+LR     DK   + S + Y  E    FE+
Sbjct: 113 LIVQLFLPGNIYLTDHEYKVLTVLRPQNTGDKFFKVGSNYVYDMEYNSWFEK 164



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 115/295 (38%), Gaps = 76/295 (25%)

Query: 790  PVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYIS---KA 846
            P  P+  D I   L  G +S S T   +++T    +  D  ++ +   + KP  +   K 
Sbjct: 967  PKFPKFNDFI--PLNSGDSSNSRTSSDVKST----TNSDTKLKPSENTKLKPSENTKLKP 1020

Query: 847  ERRKLKKGQGSSV-VDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKE 905
            E  KLK  + ++V V  ++ + K RG                   ++ R    K+ K+K+
Sbjct: 1021 ENTKLKPFENTNVNVKLEMTQVKSRG-----------------SSRMMRFINQKVSKIKK 1063

Query: 906  KYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHK---------------EKKPAI 950
            KY   DEE + +R  LL  + K+Q      Q+ N  +                  +K   
Sbjct: 1064 KYAQDDEETQELR-RLLTGSKKIQAKTQKSQSTNQKSQSTNQKSQSSNQKSQSSNQKSQF 1122

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
            +P    K+ Y    +   S+  KE                                I  I
Sbjct: 1123 TPNQVGKISYGNSVSTGQSEKFKE--------------------------------IETI 1150

Query: 1011 GEEE-KGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
             ++E +  +  +  LT      D ++ VIP+C P+SA++ YK  +K++PG AKKG
Sbjct: 1151 SDKELEYYMKQLSCLTKELKEDDDVINVIPMCAPFSAIKHYKNALKLVPGNAKKG 1205


>gi|304314240|ref|YP_003849387.1| hypothetical protein MTBMA_c04780 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302587699|gb|ADL58074.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 653

 Score =  227 bits (578), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 192/686 (27%), Positives = 314/686 (45%), Gaps = 109/686 (15%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A  + L  ++ G R    Y     T I +          GE  ++ ++M++GV
Sbjct: 4   MSNVDVFAVTRELNDILSGARVDKAYQPLRDTVIIRFHVP------GEG-RMDVVMQAGV 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T Y  +    P  F + LRKH+R   + +VRQ  +DRI+  +       + +++EL
Sbjct: 57  RIHRTDYPPENPKIPPSFPMLLRKHLRGGIVREVRQHSFDRIVEIEIE-KEQKYTLVVEL 115

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
           +++GNI+L + E  ++  L+     D+ +A  SR R        +E   +  +H      
Sbjct: 116 FSKGNIILLNQEGEIILPLKRKTWSDRRIA--SRER--------YEYPPSRGIH------ 159

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
             P   E  ++ E   N                  DL +   +N                
Sbjct: 160 --PLRYEIGELEEMLKNSDT---------------DLIRTLARN---------------- 186

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
               G+G   +E IIL +GL      S +++ E   I+    A+ +    L+D       
Sbjct: 187 ----GFGGLYAEEIILRSGLDKKRAASTLSRDEIEKIES---AINELFKPLRD------- 232

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
                L  N H+ K+      G       +  P+ L  +R RE   FETF+ A DEF+S 
Sbjct: 233 -----LKFNPHIIKN------GEG-----DVLPIELMVYRDREREYFETFNEAADEFFSS 276

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           I  +   + H+A+ +    K  K    Q   +   +  +D S +  +L+  +   V+  +
Sbjct: 277 IFREELRKVHEAEWEKEVEKFRKRLRIQRETLQKFQDTIDTSTRKGDLLYAHYAAVEDVL 336

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
             +R A   + SW+++ +++ + R  G   A +I ++    N M+LL+            
Sbjct: 337 RTIRDA-REKYSWKEIRKIIADARSKGMVEAQMIQEIDGMGN-MTLLIDG---------- 384

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQILQE 542
               E++ +D  L    NA  +YE  KK + K +  + A  K  +  EK  K R   L+ 
Sbjct: 385 ----ERIRIDPTLGVPENAEVYYEKAKKAKRKIKGVLQAIEKTEREIEKVEKRRDDALRN 440

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
             V      RK+ WFEKF WFISS+++LVI GRDA  NEM+VKR+M   D+Y+H+D+HGA
Sbjct: 441 IMVPQKRVKRKLRWFEKFRWFISSDDFLVIGGRDAGTNEMVVKRHMEPRDIYLHSDIHGA 500

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  VP  T+ +A  F    S AW     +   +WV+P QVSKT  +GE++
Sbjct: 501 PSVVIKSEGRE--VPETTIQEAAVFAASFSSAWTRGFTSLDVYWVHPEQVSKTPRSGEFV 558

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLL 687
             G+F+IRG +N++   PL +  G++
Sbjct: 559 ARGAFIIRGTRNYIRGVPLKVAVGVV 584


>gi|349581807|dbj|GAA26964.1| K7_Ypl009cp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1027

 Score =  224 bits (570), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 203/730 (27%), Positives = 347/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N +  KD    E      IYD F P    +N   S      E    ++  LD+F+S IE
Sbjct: 291 ENYNSEKDTADLEF-----IYDTFHPFKPYINGGDSDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PERTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 44.3 bits (103), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 44/174 (25%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG++GKLKK+++KY DQDE ER +R+  L +   ++K     Q +       K+      
Sbjct: 827  RGKRGKLKKIQKKYADQDETERLLRLEALGTLKGIEK-----QQQRKKEEIMKREVREDR 881

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
               +   K  +A   +K  K   +   H  E  P +                        
Sbjct: 882  KNKREKQKRLQALKFTKKEKARVNYDKHKSELKPSL------------------------ 917

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            +KG + D           DI+    P    + A+  YKY+VKI PG+AKK K +
Sbjct: 918  DKGDVVD-----------DIIPVFAP----WPALLKYKYKVKIQPGSAKKTKTL 956


>gi|42733496|dbj|BAD11345.1| BRI1-KD interacting protein 117 [Oryza sativa Japonica Group]
          Length = 360

 Score =  223 bits (568), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 149/296 (50%), Positives = 194/296 (65%), Gaps = 17/296 (5%)

Query: 791  VTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERR 849
            V+ QLEDL+D+ LGLG   +      + +    ++++ D    +  +VRDKPYISKA+RR
Sbjct: 17   VSSQLEDLLDKNLGLGPTKVLGRSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRR 76

Query: 850  KLKKGQ--GSSVVD-PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
            KLKKGQ  G S  D P  E  K   K  +SQ E      K    K+SRGQKGKLKK+KEK
Sbjct: 77   KLKKGQNVGDSTSDSPNGEAAK---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEK 133

Query: 907  YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
            YG+QDEEER IRMALLAS+G+  + D   ++ + +T  + KP+    D  K+CYKCKK+G
Sbjct: 134  YGEQDEEEREIRMALLASSGRASQKDKPSEDVDGATAAQSKPSTGEDDRSKICYKCKKSG 193

Query: 967  HLSKDCKEH-----PDDSSHGVEDNPCVGLDETAEM--DKVAMEEEDIHEIGEEEKGRLN 1019
            HLS+DC E      P D + G   +   G+D ++      V M+E+DIHE+G+EEK +L 
Sbjct: 194  HLSRDCPESTSEVDPADVNVGRAKD---GMDRSSAPAGSSVTMDEDDIHELGDEEKEKLI 250

Query: 1020 DVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            D+DYLTGNPLPSDILLY +PVC PY+A+Q+YKYRVKI PGTAKKGK  +   SL L
Sbjct: 251  DLDYLTGNPLPSDILLYAVPVCAPYNALQAYKYRVKITPGTAKKGKAAKTAMSLFL 306


>gi|392296002|gb|EIW07105.1| Tae2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1036

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 43.1 bits (100), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            D++  +IPV  P+ A+  YKY+VKI PG+AKK K +
Sbjct: 930  DVVDDIIPVFAPWPALLKYKYKVKIQPGSAKKTKTL 965



 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG++GKLKK+++KY DQDE ER +R+  L +   ++K
Sbjct: 836 RGKRGKLKKIQKKYADQDETERLLRLEALGTLKGIEK 872


>gi|151942783|gb|EDN61129.1| conserved protein [Saccharomyces cerevisiae YJM789]
          Length = 1040

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            D++  +IPV  P+ A+  YKY+VKI PG+AKK K +
Sbjct: 934  DVVDDIIPVFAPWPALLKYKYKVKIQPGSAKKTKTL 969



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG++GKLKK+++KY DQDE ER +R+  L +   ++K
Sbjct: 840 RGKRGKLKKIQKKYADQDETERLLRLEALGTLKGIEK 876


>gi|6325248|ref|NP_015316.1| Tae2p [Saccharomyces cerevisiae S288c]
 gi|74676621|sp|Q12532.1|TAE2_YEAST RecName: Full=Translation-associated element 2
 gi|683781|emb|CAA88377.1| unknown [Saccharomyces cerevisiae]
 gi|965084|gb|AAB68096.1| Ypl009cp [Saccharomyces cerevisiae]
 gi|1314067|emb|CAA95032.1| unknown [Saccharomyces cerevisiae]
 gi|285815527|tpg|DAA11419.1| TPA: Tae2p [Saccharomyces cerevisiae S288c]
          Length = 1038

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            D++  +IPV  P+ A+  YKY+VKI PG+AKK K +
Sbjct: 932  DVVDDIIPVFAPWPALLKYKYKVKIQPGSAKKTKTL 967



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG++GKLKK+++KY DQDE ER +R+  L +   ++K
Sbjct: 838 RGKRGKLKKIQKKYADQDETERLLRLEALGTLKGIEK 874


>gi|313215449|emb|CBY16187.1| unnamed protein product [Oikopleura dioica]
          Length = 404

 Score =  217 bits (553), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 157/474 (33%), Positives = 234/474 (49%), Gaps = 128/474 (27%)

Query: 597  ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
            AD+HGASS ++KN  P +PV P+TL++ G   VCHS AW++K++TSAWWV+ +QVSKTAP
Sbjct: 1    ADIHGASSCIVKNIDPSKPVSPVTLHEVGHAAVCHSAAWNAKVLTSAWWVHANQVSKTAP 60

Query: 657  TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
            +GEYL+ GSFMIRGKKN+LPP  L++GFG LF+LD++ +  H  ER+++G    ++D E+
Sbjct: 61   SGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLDDACVARHAGERKIKGL---VNDVEE 117

Query: 717  SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI 776
                KE S++   K++ + +P  E        +   S  + S  D  EFP          
Sbjct: 118  ----KEQSELGEIKEENENEPQLE------GENDDDSEDSDSKSDDLEFP---------- 157

Query: 777  DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
            D+KI     N+   V  ++E++++   G G  +I                          
Sbjct: 158  DTKI-----NIKYNVDTEVEEIVNVGKGAGKKNIE------------------------- 187

Query: 837  VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQ 896
                      ERRK              E EK+     + Q E   +K + +  +  RG+
Sbjct: 188  ----------ERRK--------------EAEKKSRAKPAWQLEHEEQKAEKDKFRKKRGK 223

Query: 897  KGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAP 956
             GK KKMK+KYGDQDEE+R   M  L SAG                  +K+P        
Sbjct: 224  AGKEKKMKQKYGDQDEEDRAAMMEFLGSAG-----------------AKKQP-------- 258

Query: 957  KVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKG 1016
                         K  +      S GV +   + +D+  E     ++E++I ++ EEE G
Sbjct: 259  -------------KKFQRQAKRESKGVRE---MVIDQMKE----DVDEQEITKMLEEE-G 297

Query: 1017 RLND-----VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK 1065
             + D     +D LTG P   D++ Y +PV  P S+++ YKY +K +PGT KKGK
Sbjct: 298  FVEDDDVSILDSLTGKPTDEDLVHYAVPVVAPLSSLRDYKYHIKFVPGTGKKGK 351


>gi|367000852|ref|XP_003685161.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
 gi|357523459|emb|CCE62727.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
          Length = 1016

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 217/834 (26%), Positives = 389/834 (46%), Gaps = 124/834 (14%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTL 84
           R +N+Y++S     F L      TES    K  +L++ G+R+H+T + R     PSGF +
Sbjct: 25  RLTNIYNISDSNRQFLL--KFNRTES----KCSVLVDCGLRIHSTTFNRPIPPAPSGFVV 78

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           KLRKH++++RL  +RQ+  DRI++ QF  G+  +Y++LE ++ GN++L D E  +L+L R
Sbjct: 79  KLRKHLKSKRLTALRQVKNDRILVLQFADGL--YYLVLEFFSSGNVILLDEEKKILSLQR 136

Query: 145 SHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSN 204
                     ++  H       RV E  T       +  +++P A++ +   +   +  N
Sbjct: 137 ----------VVQEHE-----NRVGEVYTMFDDSLFIGGNEKPIADKREYTEDLIESWIN 181

Query: 205 ASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII----- 259
             KE +  +           +N  S  G + K+  + ++    L   P LS  +I     
Sbjct: 182 EVKEKIAAE-----------ANVISEPGHQKKKLRVPSIHKLLLSKVPHLSSDLISKNLK 230

Query: 260 ---LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              +D  L     + +++KL     Q+LV    ++ D L++  S     +GYIL   K  
Sbjct: 231 KNEIDPSLSSLDFVDKISKLN----QLLVETEDEYTDLLKNRYS-----KGYILA--KRN 279

Query: 317 GKDHPPTESGSSTQIYDEFCPLL----LNQFRSREFVKFE-TFDAALDEFYSKIESQRAE 371
            K     +S  +  IY+ F P       N+    + ++ E  ++  LD F+S IES +  
Sbjct: 280 PKFIEEKDSKDTEYIYETFHPFAPYVDPNEIDISKVIEVEGPYNNTLDLFFSTIESSKYA 339

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +E  A  KL+    +   +++ L+     + +   LI    + ++    AV+  +
Sbjct: 340 LRIQNQEFLAKKKLDDAVNENLTKINALRDIQSINEEKGVLIIEKADLIEEVKGAVQSLI 399

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL------------------ 472
             +M W  +  +++ E+K  N +A LI   L L+ N ++++L                  
Sbjct: 400 DQQMDWNAIENIIRNEQKKRNNIARLIMLPLNLKENKINIILPAEDNNSDDSDNSSSSSD 459

Query: 473 -------------------SNNLDEMDDEE-KTLPVEKVEV--DLALSAHANARRWYELK 510
                               N +   + +  K + ++  ++  DLALSA ANA  ++  K
Sbjct: 460 SDSEYSDNSDSDSSDDDIEKNRIKRKNRKNSKNVKIKGTQITIDLALSAFANASEYFNKK 519

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSEN 568
           K    KQ+K      KA K  E++ ++Q+ ++   ++  +  +R  ++FE+FNWF SSE 
Sbjct: 520 KTSAEKQKKVEKNAEKALKNIEERIKVQLNKKLKDSHDILKKIRAPYFFERFNWFFSSEG 579

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGCF 627
           +L++ G+     + I  +Y+   D+Y+       +   IKN  PE+  +PP TL QAG  
Sbjct: 580 FLILMGKSPLDTDQIYSKYIEDDDIYMSNSF--GTQVWIKN--PEKTEIPPNTLMQAGVL 635

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRGKK--NFLPPHPLIMGF 684
            +  S+AW  K+ +S WW +   VSK +  G+  L  G F ++  K  NFLPP  L+MGF
Sbjct: 636 CMSASEAWSKKIASSPWWCFAKNVSKFSSDGKSVLEPGLFRMKNDKQQNFLPPAQLVMGF 695

Query: 685 GLLFRL---DESSLGSHLNERRVRGEEEGMDDFEDSGHHK---ENSDIESEKDDTDEKPV 738
           G L+++   DE     +LNE R    EE +   ED+   K   E++D+  + +   E   
Sbjct: 696 GFLWKVKIEDEGDADDNLNEVR----EEVLTGDEDNVVEKIVNESADVTDQNELLKEDEE 751

Query: 739 AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVT 792
            ES +  +S     ++ + +N D+    +  +T +N I+    D ++ VA  +T
Sbjct: 752 IESFNGMSSITQEINNLDITNADN---ISNQQTTTNNINE--MDASKTVATVLT 800



 Score = 47.0 bits (110), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 27/38 (71%)

Query: 1030 PSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            P D ++ +IPV  P+ A+  YKY++K+ PGTAKK K +
Sbjct: 899  PDDEIIDIIPVFAPWPALLKYKYKIKVQPGTAKKQKTV 936


>gi|242399100|ref|YP_002994524.1| fibronectin-binding protein [Thermococcus sibiricus MM 739]
 gi|242265493|gb|ACS90175.1| Predicted fibronectin-binding protein [Thermococcus sibiricus MM
           739]
          Length = 650

 Score =  215 bits (548), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 188/693 (27%), Positives = 323/693 (46%), Gaps = 122/693 (17%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I   ++++G    G ++   L++E
Sbjct: 1   MKQEMSSVDIKYIVEELKTLEGARVDKIYQDKNRVRI--KLHTTG---EGRND---LIIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y ++    PS FT+ LRK++   R+E + Q  +DRI+  + G     + +I
Sbjct: 53  AGKRIHLTTYIKEAPQHPSSFTMLLRKYLSGSRVEKIEQHDFDRIVKLKIG----NYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            EL+ +GNI+L                 D+   I+S  RY       F+  T    H  L
Sbjct: 109 AELFQKGNIILV----------------DENNVIISAMRYEE-----FKDRTIKPQHVYL 147

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P A E         N  +   EN       +  ++ +                  
Sbjct: 148 L----PPARE---------NPVDILWENFRELISSQDVEIVR------------------ 176

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L   L  G   +E I+L  G+    K    N L++N ++V+      FE  +++V + 
Sbjct: 177 -ALARKLNMGGLYAEEILLRAGI---EKTKRANALDENELKVI------FEK-IKEVFNA 225

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
               +  I+ +N     D+P            +  P+ L  + S +   F TF  ALDE+
Sbjct: 226 P--KKANIIYKN-----DNPI-----------DVVPIELKWYESYKKKFFTTFSEALDEY 267

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           + KI  + A+ +   K      +L      QE  ++  K ++  + ++ +LI  N   ++
Sbjct: 268 FGKILLESAKIERTKKLQNKKRQLEATLRKQEEMINGFKNQIQENQEIGDLIYTNFAFIE 327

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +  +  A+  ++ W++    V+  +K+GN +A +I  +                  D 
Sbjct: 328 NLLKELSKAV-EKLGWKEFKERVENGKKSGNKIAQIIKNI------------------DA 368

Query: 482 EEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           +EK + +E    KV++ L  S   NA  +YE  KK + K E    AH +  K  ++  +L
Sbjct: 369 KEKAVTIELDGKKVKLYLNKSVGENAEIYYEKAKKAKHKLEGAQKAHKETLKKIKEIEKL 428

Query: 538 QILQEKTVANISHM--RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
              +EK   ++  +  RK  WFEKF WF+SSE +L+I+G+DA  NE++VKRYMS+ D+Y 
Sbjct: 429 IEEEEKKELSVRKLEKRKKKWFEKFRWFLSSEGFLIIAGKDATTNEIVVKRYMSENDLYC 488

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKT 654
           HAD++GA   VIK+ +        TL +A  F V  S+AW   + +  A+W  P+QV+K 
Sbjct: 489 HADIYGAPHVVIKDGK---KAGEKTLFEACQFAVSMSRAWKEGLYSGDAYWTDPNQVTKK 545

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           AP+GEYL  G+FM+ GK+N++   P+ +  G++
Sbjct: 546 APSGEYLGKGAFMVYGKRNWMHGLPVKLAVGIV 578


>gi|85000891|ref|XP_955164.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303310|emb|CAI75688.1| hypothetical protein, conserved [Theileria annulata]
          Length = 1185

 Score =  215 bits (548), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/400 (32%), Positives = 209/400 (52%), Gaps = 63/400 (15%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+ A+D F++K E     +Q K   D    K+NKI +DQ  R   L +++ +     
Sbjct: 274 FEDFNDAVDTFFTKHE---LAKQEKKSVDKRPTKINKIKIDQNKRELNLMEDIQKIDSKI 330

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +L+E +++  +  +   +  +A+  SW D+   ++ +RK  +P+   I ++++    +  
Sbjct: 331 KLLEEHVDVAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKEIHIPTQTLIF 390

Query: 471 LLSNNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYELK 510
             + N D+ +++ K    ++                    VE+D  L++H N ++ Y  +
Sbjct: 391 YSNQNQDQHNEQNKQNQFQQNIQQKNENKQNKKNTRDEVVVELDYRLNSHQNLKKLYNER 450

Query: 511 KKQESKQEKTIT----AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           K+ E+K E+T      A  K  K+ +K+   +  ++     IS +R+  WFEKF WFI+S
Sbjct: 451 KRLENKLERTRIGKEYALKKVTKSLKKEENKKTDKKGRDVKISSVRRRFWFEKFYWFITS 510

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK------------------ 608
           + YLV++GRDA QNE++VK+Y++ GD+Y HAD+HGASS ++K                  
Sbjct: 511 QGYLVLAGRDALQNELLVKKYLTNGDLYFHADIHGASSVILKTNSTSNNNTFNLSNSTNT 570

Query: 609 ----------------NHRPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
                           N   E     L  ++++AG F VC S AW+ K    +WWVY HQ
Sbjct: 571 ATTSTTGTTTTSLDNENSNVEDVSKRLKESIDEAGNFAVCLSTAWNEKFSVQSWWVYWHQ 630

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           VSKT PTGEY+  GSF+IRGKKN+LPP  L MG   LF++
Sbjct: 631 VSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQV 670



 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 97/171 (56%), Gaps = 9/171 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+N  DVA  V  L++LI  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MAKERLNAVDVAVTVSNLKKLITNLTLVNIYDITNRVFILKF--------SKNENKIYIL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+H+T + R   + PS F  KLRKH+R RRL D+ Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHSTQFLRSVDHLPSNFNAKLRKHLRNRRLRDISQMSQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           +I++L+  GNI LTDSE+ VLT+LR     DK   + + + Y  +    FE
Sbjct: 113 LIVQLFLPGNIYLTDSEYKVLTVLRPQNTGDKFFKVGTNYVYDMDYNSWFE 163



 Score = 47.0 bits (110), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 29/41 (70%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
            LT +    D ++ VIP+C PYSA++ YK  +K++PG +KKG
Sbjct: 1101 LTKDLKEDDDVINVIPMCAPYSAIKHYKNALKLVPGNSKKG 1141


>gi|406604691|emb|CCH43887.1| putative RNA-binding protein [Wickerhamomyces ciferrii]
          Length = 983

 Score =  214 bits (546), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/397 (34%), Positives = 215/397 (54%), Gaps = 40/397 (10%)

Query: 331 IYDEFCPL-LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +Y++F P   +N     E V  + ++  LD+F+S IES +   + + +E+ A  +L +  
Sbjct: 282 LYEQFHPFEPINLKEDEELVPIQGYNKTLDKFFSTIESSKYALRIQNQENQAKKRLQQAR 341

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
            D++ +V  L      +    E I +N E V+ A  AV+  L  +M W+ + +++  E+ 
Sbjct: 342 DDKQQQVQRLLDVQAVNTLKGETIIFNAEIVEEAKAAVQALLDQQMDWKTMEKLINVEKA 401

Query: 450 AGNPVAGLID-KLYLERNCMSLLLSNN-------------------LDEMDDEEKTLPVE 489
            GN VA +I+  L L+ N +SL LS                      +   DE++  PV+
Sbjct: 402 KGNRVAKVINLPLNLKENKISLSLSTEDPYANDEDEDESSSESEPESESDSDEDEPKPVK 461

Query: 490 ------------KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT-- 535
                        V +DL LS++ANA  ++ +KK    KQ+K   + +KA K  E+K   
Sbjct: 462 SQAKKDNVKNTINVTIDLTLSSYANASEYFNVKKSTVEKQKKVEQSATKALKNIEQKIEK 521

Query: 536 --RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
             +  + QE  +  +  +R  ++FEKFNWFIS+ENYL++SG+D  Q ++I  RY++  D+
Sbjct: 522 DLKKNLKQENDI--LRKLRNPYFFEKFNWFISNENYLILSGKDDSQCDLIYHRYINDDDI 579

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           YVHAD+ G+S   IKN    + V P TL QAG  ++  S+AW++KMVTS+WW+Y   V+K
Sbjct: 580 YVHADIDGSSHVFIKNPNKGE-VSPSTLMQAGILSLSTSKAWENKMVTSSWWLYASDVTK 638

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
               G  L  GSF    +KNFLPP  L+MGF  L+++
Sbjct: 639 KDIDGTILNAGSFRYLKEKNFLPPSQLVMGFAFLWKV 675



 Score = 92.8 bits (229), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/126 (36%), Positives = 78/126 (61%), Gaps = 12/126 (9%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +   R  N+Y++  S K Y+ K     G+ +S ++    L+++SG + H T ++R    T
Sbjct: 13  ITNYRLQNIYNIATSNKQYLLKF----GLPDSKKN----LVLDSGFKTHITEFSRPTPQT 64

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PS F +KLRKH+++RRL  ++Q+G DR+I+  F  G  A++++LE ++ GNI+L D E  
Sbjct: 65  PSSFVVKLRKHLKSRRLSSIKQVGIDRVIVLTFSDG--AYHLVLEFFSAGNIVLLDHERR 122

Query: 139 VLTLLR 144
           +L L R
Sbjct: 123 ILALQR 128



 Score = 66.6 bits (161), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 74/175 (42%), Gaps = 45/175 (25%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG+KGK+KK+  KYGDQDEEER +RM  L          G  + +     + KK  +  +
Sbjct: 783  RGKKGKMKKIANKYGDQDEEERRLRMEAL----------GTLKQQTKKEEEFKKQQLIKI 832

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
            +  K   K K+   L+                N      E    +K+  E          
Sbjct: 833  NHLKKTEKKKRQEELTA---------------NKYANNKEVINFEKILNE---------- 867

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
                      LT      D  L  IPV  P++A+Q Y+Y++KI PG+ KKGK +Q
Sbjct: 868  ----------LTPILSKDDEPLEAIPVFAPWNALQKYRYKIKIQPGSTKKGKALQ 912


>gi|401842736|gb|EJT44818.1| TAE2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1032

 Score =  214 bits (546), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 190/733 (25%), Positives = 342/733 (46%), Gaps = 112/733 (15%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ +         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLRF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  +RQ+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTSLRQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           +++L R          ++       +I  +F+ T  +  +  +  S       P+ + E 
Sbjct: 131 IMSLQR---------VVLEHENQVGQIYEMFDETLFAAGNDFVNES-------PEIIKEK 174

Query: 199 GNNVSNASKENLGGQKGGKSFDLS-----KNSNKNSNDGARAKQPTLKTVLGEALGYGPA 253
               SN   E +   +     D++        NKN +   + K P++  +L   L   P 
Sbjct: 175 Y--TSNLVNEWIEATQSKYDSDIAVIKQLNIQNKNDSKEKKVKVPSIHKLL---LSKVPH 229

Query: 254 LSEHIILDTGLV----PNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYI 309
           LS  ++     V    P+M    +    +   ++L    +++ + L    + D   +GYI
Sbjct: 230 LSSDLLSKNLKVFNIDPSMSCLALLDRTNTLAEMLNRTQSEYNELL---TTSD--RKGYI 284

Query: 310 LM-QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYS 363
           L  +N++      P +      IYD F P    +N+  S  F   +    ++  LD+F+S
Sbjct: 285 LAKKNENFNSIKDPADLEF---IYDTFHPFRPYINEKNSGSFRIADVEGPYNKTLDKFFS 341

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
            IES +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++  
Sbjct: 342 TIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLNVQELNERKGHLIIENASLIEEV 401

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDE 482
            LAV+  +  +M W  + +++K E+K GN +A L++  L L++N +S+ L     ++  E
Sbjct: 402 KLAVQGLVDQQMDWSTIEKLIKSEQKKGNKIAQLLNLPLNLKQNKISVKL-----DISRE 456

Query: 483 EKTLPVE--------------------------------------KVEVDLALSAHANAR 504
           E+++                                          V +DL LSA+ANA 
Sbjct: 457 EESITSSDEDDESEDSSSEGSSDSGDMSTFKEENSKKKGQSNNALNVTIDLGLSAYANAS 516

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFN 561
            ++ +KK    KQ+K      KA K  E K   Q L+ K   + S ++KV   ++FEK+N
Sbjct: 517 EYFNIKKTSAEKQKKVEKNVGKAMKNIEVKIDQQ-LKRKLKESHSVLKKVRTPYFFEKYN 575

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLT 620
           WFISSE +LV+ G+   + + I  +Y+   D+Y+    +  +   IKN  P++  VPP T
Sbjct: 576 WFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--THVWIKN--PDKTEVPPNT 631

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGKK--NFLPP 677
           L QAG   +  S+AW  K+ +S WW +   V K  +     L  G+  ++ +K  N LPP
Sbjct: 632 LMQAGILCMSSSEAWSKKIASSPWWCFAKNVCKFDSSDNSILPEGALRLKNEKDLNLLPP 691

Query: 678 HPLIMGFGLLFRL 690
             L+MGF  L+++
Sbjct: 692 AQLVMGFAFLWKV 704



 Score = 45.1 bits (105), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            DI++ ++PV  P+ A+  YKY+VKI PG AKK K +
Sbjct: 918  DIVVDIVPVFAPWPALLKYKYKVKIQPGNAKKTKTL 953



 Score = 42.0 bits (97), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 25/31 (80%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
           RG++GKLKK++ KY DQDE+ER +R+  L +
Sbjct: 824 RGKRGKLKKIQRKYADQDEQERFLRLEALGT 854


>gi|255710571|ref|XP_002551569.1| KLTH0A02530p [Lachancea thermotolerans]
 gi|238932946|emb|CAR21127.1| KLTH0A02530p [Lachancea thermotolerans CBS 6340]
          Length = 1058

 Score =  214 bits (544), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 196/761 (25%), Positives = 347/761 (45%), Gaps = 128/761 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    + L+ +L G R SN+Y++  S + ++ K         +    K+  
Sbjct: 1   MKQRISSLDLELLYRELKSQLEGYRLSNIYNIAESSRQFLLKF--------NKPDSKLNA 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R    TPSGF +KLRKH++++RL  V+++  DRI++  F  G    
Sbjct: 53  IIDCGLRVHLTDFTRPVPATPSGFVVKLRKHLKSKRLTTVKRVANDRILVLSFNDGQ--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +++LE ++ GN++L DS+  ++ L R          I+  H +  ++  ++     S L 
Sbjct: 111 FLVLEFFSAGNVILLDSDRKIIVLQR----------IV--HEHENKVGHIYNMFDGSFLE 158

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                  +   +  D+VN              G  K  K F    +S+  +  G  AK  
Sbjct: 159 NTRIEPPKSKVHSADEVN--------------GWIKEAKDF---ADSSVKAKTGKGAKVL 201

Query: 239 TLKTVLGEALGYGPALSEHIIL----DTGLVPNMK-LSEVNKLEDNAIQVLVLAVAKFED 293
           ++  +L       P LS  +I       G+ PN   L+ ++K+ D  + +L    ++  +
Sbjct: 202 SIHKLL---FLREPQLSSDLISRNLKSRGIAPNSPCLNFLDKI-DEIVDLLDATESEVNE 257

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ--IYDEFCPLLLNQFRSRE-FVK 350
            L+D         GYI+ +       H  +E G +    +Y++F P   +     + + K
Sbjct: 258 LLRDGCKL-----GYIIAKKNP----HYDSEKGDANLEFVYEQFHPFPPHLSEDEKGYTK 308

Query: 351 F----ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
                  ++  +D+F+S IES +   + + +E  A ++L    +D E R+  L     ++
Sbjct: 309 IIEVPGQYNKTVDDFFSTIESSKYALRIQNQEFQAKNRLESAKLDNEKRIQALIDVQTQN 368

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
                 I    + V+ A  A++  +  +M W+ +  ++  E+K  N +A LI   L L+ 
Sbjct: 369 EVRGHAIIAAADLVEEAQNAIKALVEQQMDWKTIEVLISNEQKKNNRIARLIKLPLDLKN 428

Query: 466 NCMSLLLSNN----LDEMDDEEKTL----------------------------------- 486
           N  +L L  N     D  D+EE  L                                   
Sbjct: 429 NKFTLSLPRNDEIESDNSDEEEDNLTSSEDETSSSDSSDSSLSDFEADDNDEDELTSVSN 488

Query: 487 -------------PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
                        P     +DL LSA+ANA  ++ +KK    KQ+K      KA K  E+
Sbjct: 489 IKKDRNDNKKKEKPSIDATIDLTLSAYANASNYFNIKKSNVEKQKKVEKNAQKALKNIEQ 548

Query: 534 KTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           +    + ++   ++  ++  RK ++FEKF+WF+SSE +LV+ G+   +++ I  +Y+   
Sbjct: 549 RIEKDLKKKLKESHDVLNKTRKPYFFEKFHWFVSSEGFLVLMGKSGMESDQIYGKYIHDN 608

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           DV+V       +   IKN   E  VPP TL QAG   +  S AW  K+ +SAWW +  ++
Sbjct: 609 DVFVSNSFD--THVWIKNP-DETEVPPNTLMQAGIMCMSASPAWSKKIQSSAWWCFAKEL 665

Query: 652 SKTAPT-GEYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFR 689
           SK     GE L  G+F ++   KK+FLPP  L+MGF LL++
Sbjct: 666 SKFDNYGGEVLPAGTFRLKDEKKKSFLPPSQLVMGFALLWK 706


>gi|312136934|ref|YP_004004271.1| fibronectin-binding a domain-containing protein [Methanothermus
           fervidus DSM 2088]
 gi|311224653|gb|ADP77509.1| Fibronectin-binding A domain protein [Methanothermus fervidus DSM
           2088]
          Length = 645

 Score =  213 bits (543), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 182/691 (26%), Positives = 323/691 (46%), Gaps = 122/691 (17%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A V  L +L+ G +    Y   P+  I  L     V   G   +V +++++GV
Sbjct: 1   MSNVDVYAVVYELNKLLKGSKFVKAY--QPRKDIIVL--RFHVKNKG---RVDVIIQTGV 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILE 123
           R+H T Y+ +    P  F + LRK+++   +E V+Q  +DRI+ F    LG   + +I+E
Sbjct: 54  RIHATRYSLENPKFPPSFPMLLRKYLKGGIVESVKQHKFDRIVEFNVKVLGKKNYKLIVE 113

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
           L+ +GNI+LT+    ++  LR+ +  D+ ++   +++YP        + T SKL   L  
Sbjct: 114 LFGKGNIILTEENGKIIQPLRTEKWSDREISAGKKYKYPESRGLNPLKITKSKLKELL-- 171

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
                      +N D + V   +    GG                               
Sbjct: 172 -----------LNSDKDVVRTLALNGFGG------------------------------- 189

Query: 244 LGEALGYGPALSEHIILDTGL---VPNMKLS--EVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                      +E I+  +G+    P+  LS  E+NK+ D +I+ +  ++ ++    Q +
Sbjct: 190 ---------TYAEEIVYRSGIDKNTPSKSLSDNEINKIYD-SIEEIYGSLKEYNFKPQII 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           +  D+VP                                + L  +++ E   F+ F+ AL
Sbjct: 240 VDKDVVP--------------------------------IELKIYKNYEKRYFDNFNKAL 267

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEF++    +  +++ +        KL +I   Q+N + + K++  +  ++ +LI    E
Sbjct: 268 DEFFTPKLREELKKEKEKVWKNKIEKLERILNSQKNAIKSFKKKAKKYREIGDLIYLKYE 327

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            +   I  ++ A   + +W+++     E+ K       +     + ++ +  L   N+D 
Sbjct: 328 LISKVINTLKNA-KEKYTWKEII----EKVKKAKKENKIKIINSITKDGIVTL---NIDG 379

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
                     + V +D+  S   NA  +YE  KK   K +  I A  +  K      + +
Sbjct: 380 ----------KSVNIDINKSLEKNAEIYYEKAKKIRKKIKGAIKAMEETEKKLNNLKKKR 429

Query: 539 ILQEKTV-ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            ++ K +   I   RK+ WFEKF WFISS+ +LVI GRDAQ NE+IVK+YM + D+Y+HA
Sbjct: 430 DIEIKNILIPIKKRRKLKWFEKFRWFISSDGFLVIGGRDAQTNEIIVKKYMEENDIYLHA 489

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAP 656
           D+HGA S VIKN    + +P  T+N+A  F    S+AW   + ++  +WVYP QV+K+ P
Sbjct: 490 DIHGAPSVVIKNK--NKKIPENTINEAAIFAASFSKAWTYGLGSADVYWVYPQQVTKSPP 547

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +GEY++ G+F+IRGK+N++   P+ +  G++
Sbjct: 548 SGEYISKGAFVIRGKRNYIRNVPIELAVGIV 578


>gi|15679889|ref|NP_277007.1| hypothetical protein MTH1907 [Methanothermobacter
           thermautotrophicus str. Delta H]
 gi|2623041|gb|AAB86367.1| conserved protein [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 655

 Score =  211 bits (537), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 192/690 (27%), Positives = 302/690 (43%), Gaps = 118/690 (17%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A    L  ++ G R    Y     T I +          GE  +V ++M++GV
Sbjct: 7   MSNVDVFAVTSELNEMLRGARVDKAYQPLRDTVIIRFHVP------GEG-RVDVVMQAGV 59

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T Y       P  F + LRKH++   + +VRQ G+DR               I+E+
Sbjct: 60  RIHRTNYPPQNPKVPPSFPMLLRKHLKGGVVREVRQHGFDR---------------IVEI 104

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKG-VAIMSRHRYPTEICRVFERTTASKLHAALTS 183
             +      D E+T++  L +     KG + ++++ R   EI    +R T S    A   
Sbjct: 105 TVE-----KDQEYTLMVELFA-----KGNIILLNQQR---EIILPLKRKTWSDRRIA--- 148

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
           S+E     P +    G N  +     L         DL +   +N               
Sbjct: 149 SREIYEYPPSR----GINPLDHDPSELEDILMNSGADLIRTLARN--------------- 189

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
                G+G   +E I+L  GL  N   S  N   D+  ++       F+   +  +   I
Sbjct: 190 -----GFGGLYAEEIVLRAGLDKNTPCS--NLTPDDIRKIDAAIYETFKPLRELDLKPHI 242

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
           + +G                         ++  P+ L  +  RE   FE+F+ A DEF+S
Sbjct: 243 IGDG-------------------------EDVLPIELRVYSGRERRYFESFNDAADEFFS 277

Query: 364 KI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            I   E +RA ++   +E   F K  +I   Q   +   K+ ++ S +  +L+  N   V
Sbjct: 278 SIFREEIRRAHEEEWEREVDRFRKRLRI---QRETLEKFKKTIEVSTRRGDLLYANYSLV 334

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           +  +  +R A   + SW+++  ++ + RK G P A  I                 +D M 
Sbjct: 335 EEVLATIRRA-REKYSWDEIKNIIADARKRGLPEASNI---------------TEIDRMG 378

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQ 538
           +    L  E V +D  L    NA  +YE  KK + K +  +TA  K  K  E+  K R  
Sbjct: 379 NITIFLDGEPVRIDSKLGVPENAEVYYEKAKKAKRKIKGVMTAIEKTEKEIERIEKKRDD 438

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            L+   V      RK+ WFEKF WF+SS+ +LVI GRDA  NEM+VK++M   D+Y+H+D
Sbjct: 439 ALRNIMVPRRRVKRKLRWFEKFRWFVSSDGFLVIGGRDAGTNEMVVKKHMEPRDIYLHSD 498

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPT 657
           +HGA S VIK     + VP  T+ +A  F    S AW     +   +WV+P QVSKT  +
Sbjct: 499 IHGAPSVVIKTE--GRDVPETTIQEAAVFAASFSSAWTRGFTSLDVYWVHPEQVSKTPRS 556

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           GE++  G+F+IRG +N+L   PL +  G++
Sbjct: 557 GEFVARGAFIIRGSRNYLRGVPLKIAIGVV 586


>gi|389852774|ref|YP_006355008.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
 gi|388250080|gb|AFK22933.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
          Length = 642

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 192/692 (27%), Positives = 321/692 (46%), Gaps = 132/692 (19%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++ D+   V+ L+ +IG R   VY    +  I        + ++GE  +V LL+E+G R
Sbjct: 1   MSSVDIKYVVEELQNIIGSRVDKVYHQDNELRI-------KLHKAGEG-RVDLLIEAGKR 52

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T+Y ++    P+ F + LRK++  + L  + Q  +DRI++ +FG     + +I EL+
Sbjct: 53  IHVTSYIKENLQ-PTAFAMLLRKNLSGKFLTKIEQREFDRIVILEFG----EYKLIAELF 107

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSK 185
            +GNI+L                 DK   I+   RY        +R    K+H     SK
Sbjct: 108 GKGNIILV----------------DKDWKIIGALRYE----EFRDRAIKPKIHYQFPPSK 147

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSN-KNSNDGARAKQPTLKTVL 244
           E     P K+                      SF+  K    +   +  RA        L
Sbjct: 148 E----NPLKI----------------------SFERFKELILEEDTEIVRA--------L 173

Query: 245 GEALGYGPALSEHIILDTGL-----VPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
              L  G   SE  +L   +     V ++   E+ K+ D  I+VL L             
Sbjct: 174 ARKLSIGGLYSEETLLRANIEKTRNVKDLSEEELKKIYDTMIKVLNLE------------ 221

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
                       +N ++       ++GS   +     P+ L  + + E V +++F  ALD
Sbjct: 222 ------------KNPNI-----VYKNGSMVDV----LPVDLVWYSNYEKVFYDSFSKALD 260

Query: 360 EFYSKIESQRAEQQH-KAKEDAAFHKLNKIHMDQ-ENRVHTLKQEVDRSVKMAELIEYNL 417
           E++ K+  ++A+++  KA E+    K  +I + + E ++   ++E   + +  +L+  N 
Sbjct: 261 EYFGKLTIEKAKRERTKALEEK--RKALEISLKRIEEQIRGFEKEAQENQERGDLLYANY 318

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             V   +  +R  +   +  E++ + ++E +K G P A +I K+  +    SL++     
Sbjct: 319 TLVKEILETIRRGIKT-LGVEEVVKRIEEAKKKGYPWANIISKVSKD----SLVIE---- 369

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
                   L  +K+++D+  +   NA  +YE  KK   K E    A+ +  K  E   + 
Sbjct: 370 --------LEGKKIKLDINKTLEENAEIFYEKAKKARQKLEGARKAYEETKKKIENIEQE 421

Query: 538 QILQEKTVA-NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            + +EK +A      R+  WFEKF WFISSE +LVI G+DA  NE++VKR+MS+ D+Y H
Sbjct: 422 IMEEEKKIAVKKLEKRRKKWFEKFRWFISSEGFLVIGGKDATTNEIVVKRHMSENDLYCH 481

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTA 655
           AD+ GA   VIK  R        T+ +A  F V  S+AW   + ++ A+WVYP QVSK A
Sbjct: 482 ADIWGAPHVVIKEGR---KASEKTIFEACQFAVSMSRAWSEGLASADAYWVYPEQVSKQA 538

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           P GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 539 PAGEYLPKGAFMVYGKRNWLHGIPLKLAVGII 570


>gi|156338807|ref|XP_001620041.1| hypothetical protein NEMVEDRAFT_v1g149359 [Nematostella vectensis]
 gi|156204309|gb|EDO27941.1| predicted protein [Nematostella vectensis]
          Length = 287

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 186/360 (51%), Gaps = 81/360 (22%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y EF P L+ Q++   +++F +FD  +D+F+S I SQ+ + +   +E +A  KL  +  D
Sbjct: 8   YQEFYPFLMTQYKDHPYLEFPSFDKTVDDFFSSIGSQKLDVKALNQEKSALKKLENVKKD 67

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            E R+  L+   +  V+ A+LIE NL+ VD AIL V  A+AN++ W ++  +VKE +  G
Sbjct: 68  HEKRIQQLQSAQEADVRKAQLIEINLDLVDRAILVVNSAIANQIDWSEILNLVKEAQIQG 127

Query: 452 NPVAGLIDKLYLERNCMSLLLSN-NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           +PVA  I +L L+ N +++LL   +L  ++          V VD+ L AH NARR     
Sbjct: 128 DPVASAIRELKLQTNHITMLLRYVSLASING-------RPVRVDIDLLAHLNARR----- 175

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
                                                         FEKF WFISSENY+
Sbjct: 176 ----------------------------------------------FEKFLWFISSENYV 189

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI GRD QQNE++VKR++  G+   +     A  T+I ++            Q+   T  
Sbjct: 190 VIGGRDQQQNELVVKRHLQPGNATCNTIFSQA--TLICSY------------QSQLSTTA 235

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            + ++ S++  +        VSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGF  LFR+
Sbjct: 236 INHSYQSQLSIT--------VSKTAPTGEYLTTGSFMIRGKKNFLPPCHLIMGFSFLFRV 287


>gi|298675852|ref|YP_003727602.1| fibronectin-binding A domain-containing protein [Methanohalobium
           evestigatum Z-7303]
 gi|298288840|gb|ADI74806.1| Fibronectin-binding A domain protein [Methanohalobium evestigatum
           Z-7303]
          Length = 670

 Score =  208 bits (529), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 179/717 (24%), Positives = 312/717 (43%), Gaps = 121/717 (16%)

Query: 3   KVRMNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           K  M++AD++A +  L      ++  + + +Y  +P      +     +   G      L
Sbjct: 4   KQEMSSADISALISELSDGSNSIVDAKINKIYQPTPDEVRINIY----IPRVGRDN---L 56

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++E+G R+H + + R     P  F + LRKHI   R+  +RQ  +DRI+      G    
Sbjct: 57  VIEAGKRIHLSKHLRSNPKMPGPFPMLLRKHIMGGRITFIRQYDFDRIVEIGISKGDVDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I E ++QGN++L ++E  ++  ++      + +     + YP       E      L 
Sbjct: 117 ILIAEFFSQGNVILLNNERKIILPMKPRTFRGRKIQGGEMYEYPESQISPLE-AEKDDLE 175

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            A +SS            ED    + A+  NLGG                          
Sbjct: 176 QAFSSS------------EDDVVRTIATSFNLGG-------------------------- 197

Query: 239 TLKTVLGEALGYGPALSEHIILDTGL-----VPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
                          L+E +    G+     V ++ L E +KL D             +D
Sbjct: 198 --------------LLAEEVCARAGVDKNKPVDDVTLDEKSKLTDT-----------LKD 232

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFET 353
               +++G++ P    +++ K        T + S    Y +  P  L Q++  E   F++
Sbjct: 233 VFTPIVTGELNP---CIIKQK--------TNNQSE---YVDVLPFELEQYKEYEKQYFDS 278

Query: 354 FDAALDEFYSK--IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           F+ ALDEF+ K  +E++R  Q+   KE    ++  +    Q+  +   ++E ++   +AE
Sbjct: 279 FNKALDEFFGKEVVEAERKIQESAKKEKVDIYQ--RRLQQQQGAIEKFEKEANKYNSIAE 336

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            I  +   V+  I  +  A  +  SW+D+   +KE      P A LI  +  +   + + 
Sbjct: 337 AIYSHYPFVEEVITVLTNARKSGYSWDDIKSKLKEANDI--PSAKLIQSIDPKSGTIVM- 393

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
                 ++D  + TL       D+  S   NA+ +YE  K+   K+E  + A  +  +  
Sbjct: 394 ------DLDGTKATL-------DIRYSVPQNAQTYYEKAKRVMKKREGALRAIEETKRII 440

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E + + Q    K       +RK HW+ +F WFISS+ +LV+ GRDA  NE I K+YM K 
Sbjct: 441 ENRDKPQQQTRKRKV----IRKKHWYSRFRWFISSDGFLVVGGRDADTNEEIFKKYMEKQ 496

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS-KMVTSAWWVYPHQ 650
           D+ +H  + GA   ++K+ R    VP  T+ +A  F V +S  W S +     +WVYP+Q
Sbjct: 497 DIILHTQVPGAPLAIVKSKRYN--VPEQTMYEAAQFVVSYSSIWKSGQFGGDCYWVYPNQ 554

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           VSKT  +GE+L  GSF+IRG +N+    P+ +  GL    +   +G  L+  +  G+
Sbjct: 555 VSKTPESGEFLKKGSFIIRGDRNYFKNVPVSVAIGLELENETRVIGGPLDAVKKNGK 611


>gi|385304258|gb|EIF48283.1| tae2-like protein [Dekkera bruxellensis AWRI1499]
          Length = 979

 Score =  207 bits (527), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 191/733 (26%), Positives = 332/733 (45%), Gaps = 110/733 (15%)

Query: 23  GMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           G R SNVY LS   ++++FK              KV + +ESG +L+ T Y +     P+
Sbjct: 23  GHRLSNVYSLSSNNRSFLFKFAQPDS--------KVNVAVESGFKLYITDYQKPVLPQPT 74

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  KLRKH++++RL  V Q+G DR+++ +F  GM  +Y++LE ++ GNI+L DS   ++
Sbjct: 75  SFCTKLRKHLKSKRLTHVEQVGDDRVVVLEFSDGM--YYLVLEFFSAGNIILLDSNRQII 132

Query: 141 TLLRSHRD-----DDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKV 195
           +L R   +     D           YP+    +FE     K    +T  K       +++
Sbjct: 133 SLFRVVENKMKASDPDAFNYSIGQIYPSFDSTLFEDENM-KTREFVTYDKGLVVGWINEM 191

Query: 196 NEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALS 255
            +      N        +K G+ F ++K                          + P LS
Sbjct: 192 QQREEQNKNRETSGKKKKKKGRIFSVNK----------------------LCFMHAPYLS 229

Query: 256 EHII----LDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVIS---GDIVPEG 307
             +I    LD G+ P+   S +N LEDN+ ++ +V ++ + E+  + ++    G +  +G
Sbjct: 230 SDLIQRSLLDNGVTPSQ--SCLNMLEDNSLVEKVVTSLQESENTFKSLLQTPPGKV--QG 285

Query: 308 YILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSKI 365
           +IL +   L  +     S +    Y+EF P   +  +    +    + ++  +D F++ I
Sbjct: 286 WILRKINPLFDNTKEESSENLKYTYEEFHPFEPVHKENEDSKVDVVDGYNKTVDTFFTMI 345

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAA 423
           E  +A    + ++ AA  +L  +  + E ++  L   QE++R  K   LI  +  +++  
Sbjct: 346 ELSKASLSRQQQKAAAAKRLQLVKEENEKKLAKLDAVQELNR--KKGYLITLHSSEIEDC 403

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD------ 477
             +++  L  +M W+++ ++++ ER+ GNP A +I  L L ++  ++LL +  +      
Sbjct: 404 RSSIQALLDQQMDWQNIDKLIEVERRRGNPTAKMIKSLNLLKHEFTVLLPDEQEVVDDEN 463

Query: 478 -----------------EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKT 520
                            + + E+K   +  V +D+  SA AN+ R+++ KK  + KQEKT
Sbjct: 464 EDESDSDSDSDSDDDDDDDETEDKKSNIISVSIDIRESAFANSTRYFDAKKNAQEKQEKT 523

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
               + A K +E K    +   K + N                  S+N + I        
Sbjct: 524 KENAAIAIKNSEMKIHRDM---KRLEN-----------------ESKNTVDIHS------ 557

Query: 581 EMIVKRYMSKG-DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
             I  RY+    D  V +D+  +   VIKN    + +PP T  QAG + +  S+AWDSKM
Sbjct: 558 --IYYRYLDNNTDYLVSSDVDKSLKVVIKNPYKNKEIPPSTFVQAGIYCLTTSKAWDSKM 615

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
             S W+V    VSK    G  L  G   I+G KNFLPP  L+MG GLL+  DE +   H+
Sbjct: 616 SPSPWFVKGDAVSKKDFDGSLLPPGLLNIKGDKNFLPPSQLVMGIGLLWLPDEKTKARHI 675

Query: 700 NERRVRGEEEGMD 712
                R ++ G +
Sbjct: 676 EYMLNRNKDIGFE 688



 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 87/204 (42%), Gaps = 31/204 (15%)

Query: 869  ERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKV 928
            E   D     E +++ +K    K+ RG+K KLKK+K KY DQD+EER +RMA L   G +
Sbjct: 730  EDANDDDEIKEDVLQNSKDSTTKLLRGKKNKLKKIKRKYKDQDDEERRLRMAAL---GTL 786

Query: 929  QKNDGDPQNENASTHKEKK-----PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
             +N+ D +   +  + E K       I      K   K ++   L    +E  +  S   
Sbjct: 787  NQNNNDGEQNGSDVNGESKVDSREQKIIEASMRKEKKKQQQINQLQHLLEEIENAISESR 846

Query: 984  EDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGP 1043
            ED        T ++ KV   E                +  L   P   D +   I V  P
Sbjct: 847  EDT-------TTDVKKVYYSE----------------LFGLLNKPGKDDNIADCIVVFMP 883

Query: 1044 YSAVQSYKYRVKIIPGTAKKGKGI 1067
            + A+  Y Y+VK+  GT KKGK +
Sbjct: 884  WGALNKYJYKVKVQSGTNKKGKTL 907


>gi|190407936|gb|EDV11201.1| hypothetical protein SCRG_02481 [Saccharomyces cerevisiae RM11-1a]
          Length = 1030

 Score =  206 bits (525), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 207/750 (27%), Positives = 342/750 (45%), Gaps = 137/750 (18%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+ +        L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDES--------LFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQ--------------------RAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           S                     RAE   K +      +LN      E + H +       
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELN------ERKGHLI------- 392

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
           ++ A LIE          LAV+  +  +M W  + +++K E+K GN +A L++  L L++
Sbjct: 393 IENAPLIE-------EVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQ 445

Query: 466 NCMSL---LLSNNLDEMDDEE------------------------------KTLPVEKVE 492
           N +S+   L S  L+   DE+                              K    EK+ 
Sbjct: 446 NKISVKLDLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKIN 505

Query: 493 V--DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V  DL LSA+ANA  ++ +KK    KQ+K      KA K  E K   Q L++K   + S 
Sbjct: 506 VTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSV 564

Query: 551 MRKV---HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           ++K+   ++FEK++WFISSE +LV+ G+   + + I  +Y+   D+Y+    +  S   I
Sbjct: 565 LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWI 622

Query: 608 KNHRPEQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGS 665
           KN  PE+  VPP TL QAG   +  S+AW  K+ +S WW +   VSK        L  G+
Sbjct: 623 KN--PEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGA 680

Query: 666 FMIRGK--KNFLPPHPLIMGFGLLFRLDES 693
           F ++ +  +N LPP  L+MGFG L+++  S
Sbjct: 681 FRLKNENDQNHLPPAQLVMGFGFLWKVKTS 710



 Score = 43.1 bits (100), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            D++  +IPV  P+ A+  YKY+VKI PG+AKK K +
Sbjct: 924  DVVDDIIPVFAPWPALLKYKYKVKIQPGSAKKTKTL 959



 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 28/37 (75%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG++GKLKK+++KY DQDE ER +R+  L +   ++K
Sbjct: 830 RGKRGKLKKIQKKYADQDETERLLRLEALGTLKGIEK 866


>gi|435850617|ref|YP_007312203.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
 gi|433661247|gb|AGB48673.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
          Length = 664

 Score =  204 bits (520), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 181/690 (26%), Positives = 301/690 (43%), Gaps = 107/690 (15%)

Query: 2   VKVRMNTADVAAEVKCLRR----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVL 57
           +K  M +ADVAA V  L      LI  +   +Y        F L     V   G   +V 
Sbjct: 1   MKEEMASADVAALVAELSSGELSLIDAKVGKIYQPLEDEIRFNLF----VFGKG---RVD 53

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            ++++G R H + Y       P  F + LRKH+ + R+  ++Q  +DRII   F  G   
Sbjct: 54  FIIQAGKRAHLSQYVSPSPKLPQSFPMLLRKHVMSSRITSIKQYDFDRIIEIGFVRGGVE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
             +I EL+A+GNI+L D+E  +  +L  +    KG  + S   Y     ++     + + 
Sbjct: 114 TVLIAELFARGNIVLIDNERRI--ILPMNPTTFKGRRVRSGEIYSYPEAQISPLDASEEQ 171

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             A+  S + D              + A++ NLGG                         
Sbjct: 172 MLAVFRSSDSDVVR-----------TIATRFNLGG------------------------- 195

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
                           LSE +    G+  N+ +SEV   E      + L +   +D    
Sbjct: 196 ---------------LLSEEVCSRAGIKKNLPVSEVGSEE------ITLLLRAMKDMFSP 234

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           + +G++ P   I+M+ +           G + Q  D   P  L  +R     ++ +F+ A
Sbjct: 235 LQTGELDP--CIIMKGE-----------GDTAQSID-VVPFELEVYRELTKERYPSFNKA 280

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           LDE++ K E+    +Q  + +      L +    QE  V    +E ++   +AE I  N 
Sbjct: 281 LDEYFGKREAASITEQAFSVKKEKVDLLERRLRQQEEAVEKYGKESEKHTSIAETIYANY 340

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + V+  +  + +A     SW+ +   +K  +          D +   ++ +S+  +  + 
Sbjct: 341 QAVEDVLKVLAIARDKGYSWDQIKSTIKAAK----------DSVPAAKSILSIDSATGIV 390

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            +D     L   K  +D+  +   NA+ +YE  KK   KQE  I +  +   A +KK + 
Sbjct: 391 VLD-----LMGMKTNIDVTKTVPQNAQVYYERSKKLAKKQEGAIRSIEQTKLAMQKKEKT 445

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
              +  TV     ++K  W+++F WF+SS+ +LVI GRDA  NE I  +YM K D+ +H 
Sbjct: 446 ATRKRGTV----RIKK-QWYDRFRWFVSSDGFLVIGGRDADTNEEIFVKYMEKRDIVLHT 500

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAP 656
            + GA  TVIK    E  VP  T+ +A  F V +S  W S   ++  +WV P QVSKT  
Sbjct: 501 QMPGAPLTVIKTGGKE--VPSQTIEEAARFVVSYSSVWKSGQFSADCYWVNPTQVSKTPE 558

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +GEY+  GSF+IRG++N+L   P+ +  G+
Sbjct: 559 SGEYVKKGSFIIRGERNYLKDVPVGVAVGI 588


>gi|367011407|ref|XP_003680204.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
 gi|359747863|emb|CCE90993.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
          Length = 1016

 Score =  204 bits (520), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 200/761 (26%), Positives = 345/761 (45%), Gaps = 118/761 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+    + LR  L G R SN+Y++  S + ++ K         +    K  +
Sbjct: 1   MKQRISALDIQILAEELRAHLEGHRLSNIYNIADSSRQFLLKF--------NKPDSKFSV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T Y R     PS F +KLRKH++++RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  VVDCGLRIHLTDYDRPIPPGPSSFVVKLRKHLKSKRLSALRQVKNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R   + +  V       Y      +F    +S+  
Sbjct: 111 YLVLEFFSAGNVILLDENKKILSLQRIVHEHENKVG----ETYTMFDDSLFNVNNSSQSA 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                SK  D        E+       SK +L       +  + ++S K      +A +P
Sbjct: 167 DQTIKSKSYDVELVRVWLEEAQ-----SKFSLQSSMQADAMKVKQSSKK------KALKP 215

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN---------KLEDNAIQVLVLAVA 289
              T+    L   P LS  +     L  N+K+ ++N           ED  + +L     
Sbjct: 216 L--TIHKLLLSKEPHLSSDL-----LSKNLKMRKINPSSPCIEFLAKEDVLVDLLNYTEI 268

Query: 290 KFEDWLQDVISGDIVPEGYILMQ---NKHLGKDHPPTESGSSTQIYDEFCPLL-----LN 341
           ++ D L +  S      G+IL +   N  LGKD    E      I++ F P        +
Sbjct: 269 EYHDVLSNKDS-----RGFILAKKNVNYTLGKDSEDLEF-----IFENFHPFKPFIEEQD 318

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
           Q RSR       ++  LD F+S IES +   + + +E  A  K+    ++ + R+  L  
Sbjct: 319 QGRSRITEVPGEYNKTLDTFFSTIESSKYALRIQQQEQLAKKKIEDARLENQKRIQALLD 378

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
               + +    I  N + V+ A +AV+  +  +M W+ + ++++ E+   N +A +ID  
Sbjct: 379 VQSSNEQKGHAIIANADLVEEAKIAVQGLIDQQMDWQTIEKLIRNEQLKKNKIAMVIDLP 438

Query: 461 LYLERNCMSLLLS-------NNLDEMD-------------------DEEKTLPVE----- 489
           L L+ N +++L+        NN  E D                   D+ +    E     
Sbjct: 439 LNLKENAVNILVPVSHDDEHNNESESDESFVESSSDESDSDEGTDSDDSEVSDFETEESR 498

Query: 490 --------------KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
                         K+ +DL LSA+ANA +++ +KK    KQ+K      KA K  E++ 
Sbjct: 499 NESRTSKRKVENKLKIRIDLGLSAYANASKYFTVKKTSADKQKKVEKNVEKAMKNIEQRI 558

Query: 536 RLQILQ--EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
             Q+ Q  +++ + +   R  ++FEK  WF SSE +LV+ GR   + + I  +Y+   D+
Sbjct: 559 DKQLKQKLKESHSVLKRARSPYFFEKHFWFYSSEGFLVLMGRSPLETDQIYSKYIEDDDI 618

Query: 594 YVHADLHGASSTVIKN-HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           Y+ +     +   IKN +R E  VPP TL QAG F +  S+AW  K+ +S  W +   ++
Sbjct: 619 YMCSSFD--TQVWIKNPNRTE--VPPNTLMQAGVFCMAASEAWSKKVSSSPQWCFAKNIT 674

Query: 653 KTAPTGE-YLTVGSFMIRGKKNF--LPPHPLIMGFGLLFRL 690
           K   T +  L  G + I+ +     LPP  L+MGFG L+++
Sbjct: 675 KFDHTNKGVLDPGLYRIKKESEMSHLPPAQLVMGFGFLWKV 715



 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 28/44 (63%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            L  NP   D ++  IPV  P+ A+  YKY+VK+ PG+AKK K +
Sbjct: 905  LKSNPDKDDEVVDAIPVFAPWPALLKYKYKVKVQPGSAKKTKTL 948


>gi|302309325|ref|NP_986649.2| AGL017Wp [Ashbya gossypii ATCC 10895]
 gi|299788305|gb|AAS54473.2| AGL017Wp [Ashbya gossypii ATCC 10895]
          Length = 1006

 Score =  204 bits (518), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 211/826 (25%), Positives = 374/826 (45%), Gaps = 138/826 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ D+    + L+ +L G R +N+Y+++  +  F L  + G        K+ +L+
Sbjct: 1   MKQRISSLDLQLLARELKAQLEGCRLANLYNVADASKQFLLKFTKG------ESKISILI 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+++  T ++R    +P  F  KLRKH++ +RL  V+Q+G DRI++  F  G+   ++
Sbjct: 55  DCGLKIFATEFSRPIPPSPGPFVAKLRKHLKAKRLTTVKQVGADRILVLSFADGL--FFL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GN++L D++  +L L R  RD ++ V          EI  +F+      +   
Sbjct: 113 VLEFFAAGNVILLDADRRILALQRVVRDHEQKVG---------EIYNMFDDHFLEDVSLP 163

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA---KQ 237
           +    + D +    V E       A++E             SK     +  G R    K 
Sbjct: 164 VP---KLDTHTLPVVQELLIKTKTAAEE-------------SKAVMPAAPVGGRKQSLKV 207

Query: 238 PTLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWL 295
           P++  +L  +  Y  + L   I+ + G+ P+    E   L D+A Q++ +L +A+ E ++
Sbjct: 208 PSIHKLLFSSYPYLSSDLLNKILKEHGIDPSQSFLE---LFDSADQLVDILNIAEKEAYM 264

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET 353
             +++ +    GYIL +   L  +    E    T  Y++F P    L     ++F   E 
Sbjct: 265 --LLTSE-KKNGYILARENPLYDEKKDAEGIRLT--YEQFHPFRPYLPDGSQKKFEIVEV 319

Query: 354 ---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
              ++  +D+F+S I+S +   + + +E  A  KL K   + + ++  L +    + +  
Sbjct: 320 DGDYNRTVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQALVEVQHTNEQRG 379

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N+E V+ A  A++  L  +M W  + +++K E+   N +A +I   L L+ N +S
Sbjct: 380 NAIINNIELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKIS 439

Query: 470 --LLLSN-----------------------------NLDEMDDE---------------- 482
             L LSN                              L + D E                
Sbjct: 440 VELPLSNEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKV 499

Query: 483 -EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
             K++    V +DL++SA+ANA  ++E+KK    KQ        KA K  E+K    + +
Sbjct: 500 SNKSI---TVSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKK 556

Query: 542 EKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +    +  +  +R  ++FEK+ WFIS+E +LV+ G+   + + I  +Y+   DVYV    
Sbjct: 557 KLKEQHDVLQVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVS--- 613

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT-G 658
           +G  S V   +     +PP TL QAG F    S+AW  K+ TS WW     +SK     G
Sbjct: 614 NGFGSQVWIKNFERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGG 673

Query: 659 EYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
             L  G+F ++    KNFLPP  L+MGF  ++++                     DD ++
Sbjct: 674 GLLPSGAFRLKSDEAKNFLPPAQLVMGFAFMWKIK-------------------TDDDQE 714

Query: 717 SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPA----PSHTNAS 758
           +G+ +   D+ +E D+  E        V  S  PA    PS++N S
Sbjct: 715 AGYEE---DMPAEIDEMGEVSHPSEEMVEESIGPADNLLPSNSNQS 757



 Score = 43.1 bits (100), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            +L  IPV  P+ A+  YKY+VK+ PGTAKK K +
Sbjct: 906  VLAPIPVFAPWPALTKYKYKVKVQPGTAKKTKSV 939


>gi|343472755|emb|CCD15168.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 559

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 163/626 (26%), Positives = 302/626 (48%), Gaps = 86/626 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV-LL 58
           MVK RM + DV A  + +   L  +R  N+Y + P+T++F+          G++EK   +
Sbjct: 1   MVKSRMTSLDVKASSQEMHAELKNLRLLNIYSIPPRTFLFRF---------GQAEKKKTV 51

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NA 117
           +++ G+RLH T   R+K   PS F  K+RK +   ++  VRQL +DR++ F  G+   N+
Sbjct: 52  VLDVGIRLHLTQVVREKPQIPSAFAQKMRKLLCNWKVRSVRQLDHDRVVDFHLGMSEENS 111

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            ++++EL+++GN+++TD                        H Y  ++  +F     +K+
Sbjct: 112 LHIVVELFSKGNLVVTD------------------------HEYRVKL--LFRTEAVNKV 145

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             A+           D++      +  A  E  GGQ+      L +  N+     A+   
Sbjct: 146 TPAV-----------DEIF--LKTIPRAPLEE-GGQEQISEEMLQQEWNEKF---AQWDG 188

Query: 238 PT-LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
           P  + ++L     +G +L+ HI+   G VPN+   ++N   +   + L+  +   + W  
Sbjct: 189 PVEICSILSSMYSFGNSLAGHIMSRAG-VPNVTKDKMNCSGEEMFRKLLPGM--LDAW-- 243

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR--SREFVKFETF 354
            + S  +   GY+L  +K  G++       ++   YD+F P+LL+Q++  +  +  F  F
Sbjct: 244 RLFSSPLPEGGYLLKSSKRGGQE-------ANDSRYDDFSPVLLDQYKKDAVAYQHFPNF 296

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
            +  DEF+S  E +R E  +   +     K  +   +   R+  LK+  + S++   LI 
Sbjct: 297 SSVCDEFFSYSEKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIF 356

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N E +D  I  +  AL  ++ W+D   ++K+ R  G+P+A +I ++  ER  + +L++ 
Sbjct: 357 QNTETIDKIIGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLMNE 416

Query: 475 NLDEMDD-----------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           + D+ DD           E++     ++E+DL  +AH NA  ++   K   +K ++TI A
Sbjct: 417 DADDDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAA 476

Query: 524 HSKAFKAAEKKTRLQI--LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
             KA   AE+K R      QEK +      R   W+EKFNWF +S   LV+ GRD +  +
Sbjct: 477 TEKAMAGAERKGRTVTGKTQEKKIIT---ERCRFWWEKFNWFRTSCGDLVLQGRDERSTQ 533

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVI 607
           ++++R M  GD+++   + G    ++
Sbjct: 534 LLLRRVMRLGDIFLCCHVVGGLPCIL 559


>gi|190345457|gb|EDK37344.2| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 130/429 (30%), Positives = 208/429 (48%), Gaps = 49/429 (11%)

Query: 331 IYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
           +YDEF P         S +F +   ++  +D F+S ++S++ E + + ++  A  +L   
Sbjct: 159 LYDEFHPFKPYKENLESFKFTEIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRLLNA 218

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
             +++ ++  L+ + + + K  + I Y+ + V   I +V+  L  +M W ++  ++K E+
Sbjct: 219 REERDKQIDNLRIQQEMNSKKGDAIIYHADLVSECIASVQTLLDQQMDWANIESLIKLEQ 278

Query: 449 KAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD-------------------------- 481
             GN VA  I   L L  N + L L +  D M D                          
Sbjct: 279 SRGNSVAKTIKLPLNLTENKIGLKLPDT-DSMYDPADIDSESDSETSSESETESESESES 337

Query: 482 -----------------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                            + K +P   V +DL+LS  ANAR ++E KK+ ESKQEK     
Sbjct: 338 GSESEDETPPKRMSKKAKSKEIPALSVWIDLSLSPFANARTYFESKKQAESKQEKVEKNT 397

Query: 525 SKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
             A + A+KK    + +     N  +  +R  +WFEKF WF+SSE YL I+GRD  Q +M
Sbjct: 398 DMALRNAQKKIEQDLAKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQVDM 457

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           I  R+ S  D +V +D+ G+   V+KN    + +PP TL QAG F +  S AW+ K+ TS
Sbjct: 458 IYYRHFSDNDFFVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKITTS 517

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
            W++  + V+K    G  +  G+F  +GKK FLPP  L+MG G  F  D+ +   +   R
Sbjct: 518 PWFLAGNDVTKLDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYFLGDDDTTKKYGETR 577

Query: 703 RVRGEEEGM 711
             R  E G+
Sbjct: 578 ITRQNESGL 586



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 84/183 (45%), Gaps = 38/183 (20%)

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
            E  K++RG++ K+K+  +KY DQDE+ER +RM +L +  +++        E     +E  
Sbjct: 656  EPHKLTRGKRSKMKRAAKKYADQDEDERKLRMEMLGTLKQLE--------EIKKKRQENA 707

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
                     +   K K+     ++ +E+                          M EE  
Sbjct: 708  DQDKQQAQQQQNDKLKQTRKAKQEQREYLK-----------------------YMREE-- 742

Query: 1008 HEIGEEEKGRLNDVDYL---TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
              + E+E   +N +D L      P PSD L+ ++PV  P+ ++  +KY+VKI PG AKKG
Sbjct: 743  --VNEDESSMVNYLDILDSFIAKPQPSDKLVAIVPVFAPWYSLNKFKYKVKIQPGMAKKG 800

Query: 1065 KGI 1067
            K I
Sbjct: 801  KSI 803


>gi|410077749|ref|XP_003956456.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
 gi|372463040|emb|CCF57321.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
          Length = 1038

 Score =  202 bits (515), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 201/766 (26%), Positives = 350/766 (45%), Gaps = 136/766 (17%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    + L++ I G R SN+Y++  S + ++ K         +    K+ +
Sbjct: 28  MKQRISSLDLKLLAQELQKAIEGYRLSNIYNVADSKRQFLLKF--------NKPDSKINV 79

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+++H T Y R     PSGF  KLRKH++++RL  +RQ+  DRI++ +F  G+  +
Sbjct: 80  IVDCGLKVHVTEYTRPTPQLPSGFVAKLRKHLKSKRLTALRQVDNDRILVLEFSDGL--Y 137

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN+LL D+   ++ L R   + +  V          E+ ++F+ T   +  
Sbjct: 138 YLVLEFFSAGNVLLLDNNRCIMALQRIVEEHENKVG---------ELYKIFDSTLFKE-- 186

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSN-KNSNDGARAKQ 237
                        PD   E         +E +   K     D + NSN K   D  + K 
Sbjct: 187 ------------NPDNPLERQFYTEELVREWISSAK-----DTTSNSNTKGPTDKKKIKV 229

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN---------KLEDNAIQVLVLAV 288
            ++  +L   L   P LS  +     L  N+K + +N           E   + +L    
Sbjct: 230 FSIHKLL---LSKQPHLSSDL-----LQKNLKEAGINCASSCLDFVNREQTIVSLLNTTA 281

Query: 289 AKFEDWLQDVISGDIVPEGYILMQ---NKHLGKDHPPTESGSSTQIYDEFCP----LLLN 341
            +++  LQ         +G+IL +   N    KD P  E      +Y+ F P    +   
Sbjct: 282 KEYKQLLQTEFK-----KGFILAKKNVNYDSLKDKPELE-----YLYENFHPFKPYISGA 331

Query: 342 QFRSREFVKFE-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
           + +S   ++ E +++  LD F+S IES +   + + +E  A  KL     D + R+ +L 
Sbjct: 332 EEKSVRILEIEGSYNRTLDVFFSTIESLKYSLRIQNQELQAKKKLEDARSDNQKRIQSLS 391

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID- 459
                +   A  I  N + VD+A  AV+  L  +  W  + +++  E+K  N +A +I+ 
Sbjct: 392 DVQILNETKANAILNNTDLVDSAKQAVQDLLEQQTDWNMIEKLIMNEKKRRNKIAEIIEL 451

Query: 460 KLYLERNCMSL-------------LLSNN----------------LDEMDD--------- 481
            L L+ N +++               S+N                  E+ D         
Sbjct: 452 PLNLKNNKINIKIPLQSPSQFEEETFSDNESVKSSLSDSDFSDESDSELSDFSMEEVVGR 511

Query: 482 EEKTLPV------EKVEVDLAL--SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
            E T  +      + V V + L  S++ANA +++  KK    KQ+K     +KA    E 
Sbjct: 512 HENTRKIRAKDDKQHVTVTIDLSLSSYANASQYFNSKKDSAEKQKKMEKHMAKAMTNIEN 571

Query: 534 KTRLQI---LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           +   Q+   L+E     +  +RK ++FEK+NWFISSE YLV++G+ A +N+ I  +Y+  
Sbjct: 572 RIDQQLKKKLRESHTV-LKKIRKPYFFEKYNWFISSEGYLVMTGKSALENDQIYMKYIED 630

Query: 591 GDVYVHADLHGASSTVIKNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            D+++       S   IKN  P++  +PP TL QAG F    S+AW +K+V S  W Y  
Sbjct: 631 DDIFMSTSF--GSKAWIKN--PDRGEIPPNTLMQAGIFCASSSKAWSNKVVCSPKWCYAR 686

Query: 650 QVSKTAPTGEYLT-VGSFMI--RGKKNFLPPHPLIMGFGLLFRLDE 692
            ++K    G  +   G F++    K++ LPP  LIMG G L++L +
Sbjct: 687 NITKFTQDGSIVAETGEFVLIDEQKQSTLPPAQLIMGIGFLWKLKQ 732



 Score = 43.9 bits (102), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 27/38 (71%)

Query: 1030 PSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            P D ++ ++PV  P+ A+  YKY+VKI PG++KK K +
Sbjct: 916  PGDEVVDIVPVFAPWPALLKYKYKVKIQPGSSKKTKSM 953


>gi|303290793|ref|XP_003064683.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453709|gb|EEH51017.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 807

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 156/497 (31%), Positives = 228/497 (45%), Gaps = 105/497 (21%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-------------PPTES-- 325
           ++ L+  ++  +DW + V  G  VP G +  + K                   PP ++  
Sbjct: 239 VERLLRQLSVLDDWFEGVGDGSAVPTGVVTRRRKPGATGDDDDAFVVDDFSPLPPIDAID 298

Query: 326 --GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
              +ST   D+                +E+FD ALD +++  E+Q A +Q +  E A   
Sbjct: 299 SNANSTATDDD----------DARVQAYESFDDALDAYFASFETQAATRQRERAEKAVVD 348

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           +L K+  DQ  R   L++E +     A LIEYNLE VD A+ AV  ALA  M W DL  M
Sbjct: 349 RLEKVRKDQSQRAAALEREREADELRATLIEYNLERVDVALAAVNNALAGGMGWGDLEIM 408

Query: 444 VKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK------------- 490
           ++EE +AGNPVAG I  L L  N +++ L+N+LD+ +D+E     +              
Sbjct: 409 IREETRAGNPVAGTIKSLDLANNKITVTLANHLDDDEDDEDEEEEDGEDEDKDGDEDDAG 468

Query: 491 --------------------------VEVDL--ALSAHANARRWYELKKKQESKQEKTIT 522
                                     V V+L  +LSA+ANAR  +E KKK  +K +KT+ 
Sbjct: 469 EGDDEKSSERKRKQQQKKLRRKRRKAVAVELDLSLSAYANARTHFEKKKKHATKHDKTLA 528

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
              +A                               KF WF+++EN LV+S RDA Q + 
Sbjct: 529 QTERA-------------------------------KFWWFVTTENCLVVSARDAAQTDA 557

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           ++K+Y   G   V     G       N      VPP +L QAG   +C S AWDS+ V S
Sbjct: 558 MLKKYAPPGSSVVVGGGGGGGGAGWCNG-----VPPASLAQAGAACLCRSNAWDSRQVIS 612

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNE 701
           AW+V P Q+ K  P GE L  G     GKK FLPP PL+MGF  +F L D++S+ +H  +
Sbjct: 613 AWYVKPEQIRKETPEGEPLLNGVVWTVGKKTFLPPAPLVMGFAYMFVLGDDASVEAHAGD 672

Query: 702 RRVRGEEEGMDDFEDSG 718
           R V+ +   + + +  G
Sbjct: 673 RVVKQQMAALGNADGEG 689



 Score =  177 bits (448), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 92/199 (46%), Positives = 120/199 (60%), Gaps = 12/199 (6%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K + N  D+ AEV CLR RL+G   +NVYD    +++FK   S G TESGE EK+ ++
Sbjct: 1   MPKQKFNNYDIRAEVACLRARLVGTWLTNVYDRDKTSFVFKFTRSGGATESGEGEKINVV 60

Query: 60  MESGVRLHTTAYARDKK-----------NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +ESG R H T++AR              + PS F  KLR H+R +RL  + Q+G DR + 
Sbjct: 61  IESGTRFHCTSHARASASGGGGGKASSTDQPSKFNAKLRMHLRGKRLNAIDQIGSDRAVD 120

Query: 109 FQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRV 168
           F F  G   H++I+ELYAQGN+LL D +  VLTLLR+HRDDDKGV I+  HRYP E  R 
Sbjct: 121 FTFSSGDTEHHLIVELYAQGNVLLLDKDDVVLTLLRTHRDDDKGVKILGNHRYPRERFRT 180

Query: 169 FERTTASKLHAALTSSKEP 187
            +R T   L  AL   + P
Sbjct: 181 HKRVTLHDLEGALGLGQNP 199


>gi|374109900|gb|AEY98805.1| FAGL017Wp [Ashbya gossypii FDAG1]
          Length = 1006

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 210/826 (25%), Positives = 374/826 (45%), Gaps = 138/826 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ D+    + L+ +L G R +N+Y+++  +  F L  + G        K+ +L+
Sbjct: 1   MKQRISSLDLQLLARELKAQLEGCRLANLYNVADASKQFLLKFTKG------ESKISILI 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+++  T ++R    +P  F  KLRKH++ +RL  V+Q+G DRI++  F  G+   ++
Sbjct: 55  DCGLKIFATEFSRPIPPSPGPFVAKLRKHLKAKRLTTVKQVGADRILVLSFADGL--FFL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GN++L D++  +L L R  RD ++ V          EI  +F+      +   
Sbjct: 113 VLEFFAAGNVILLDADRRILALQRVVRDHEQKVG---------EIYNMFDDHFLEDVSLP 163

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA---KQ 237
           +    + D +    V E       A++E             SK     +  G R    K 
Sbjct: 164 VP---KLDTHTLPVVQELLIKTKTAAEE-------------SKAVMPAAPVGGRKQSLKV 207

Query: 238 PTLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWL 295
           P++  +L  +  Y  + L   I+ + G+ P+    E   L D+A Q++ +L +A+ E ++
Sbjct: 208 PSIHKLLFSSYPYLSSDLLNKILKEHGIDPSQSFLE---LFDSADQLVDILNIAEKEAYM 264

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET 353
             +++ +    GYI+ +   L  +    E    T  Y++F P    L     ++F   E 
Sbjct: 265 --LLTSE-KKNGYIVARENPLYDEKKDAEGIRLT--YEQFHPFRPYLPDGSQKKFEIVEV 319

Query: 354 ---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
              ++  +D+F+S I+S +   + + +E  A  KL K   + + ++  L +    + +  
Sbjct: 320 DGDYNRTVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQELVEVQHTNEQRG 379

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N+E V+ A  A++  L  +M W  + +++K E+   N +A +I   L L+ N +S
Sbjct: 380 NAIINNIELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKIS 439

Query: 470 --LLLSN-----------------------------NLDEMDDE---------------- 482
             L LSN                              L + D E                
Sbjct: 440 VELPLSNEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKV 499

Query: 483 -EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
             K++    V +DL++SA+ANA  ++E+KK    KQ        KA K  E+K    + +
Sbjct: 500 SNKSI---TVSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKK 556

Query: 542 EKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +    +  +  +R  ++FEK+ WFIS+E +LV+ G+   + + I  +Y+   DVYV    
Sbjct: 557 KLKEQHDVLQVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVS--- 613

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT-G 658
           +G  S V   +     +PP TL QAG F    S+AW  K+ TS WW     +SK     G
Sbjct: 614 NGFGSQVWIKNFERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGG 673

Query: 659 EYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
             L  G+F ++    KNFLPP  L+MGF  ++++                     DD ++
Sbjct: 674 GLLPSGAFRLKSDEAKNFLPPAQLVMGFAFMWKIK-------------------TDDDQE 714

Query: 717 SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPA----PSHTNAS 758
           +G+ +   D+ +E D+  E        V  S  PA    PS++N S
Sbjct: 715 AGYEE---DMPAEIDEMGEVSHPSEEMVEESIGPADNLLPSNSNQS 757



 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            +L  IPV  P+ A+  YKY+VK+ PGTAKK K +
Sbjct: 906  VLAPIPVFAPWPALTKYKYKVKVQPGTAKKTKSV 939


>gi|408381973|ref|ZP_11179520.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
 gi|407815421|gb|EKF86006.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
          Length = 711

 Score =  202 bits (514), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 173/642 (26%), Positives = 297/642 (46%), Gaps = 57/642 (8%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V +  ++G+R+HTT Y  +    P  F + LRKH++   ++ VRQ  +DRI+  +  + 
Sbjct: 47  RVDVAFQAGLRVHTTQYPPENPKVPPSFPMLLRKHLKNATVKGVRQHNFDRIL--EIDIQ 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTT 173
               + +++EL++QGNI+L D E  ++  L+      + +     ++YP       E   
Sbjct: 105 KEHRFTLVVELFSQGNIILLDEENQIILPLKHRHAQGRKITSKEEYQYP-------EERG 157

Query: 174 ASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGA 233
              L+  L   KE  AN       D + +   ++  LGG    + F  S         G 
Sbjct: 158 IHILNVELEDLKELFANS------DSDLIRTLARSGLGGMYSEEIFLRS---------GV 202

Query: 234 RAKQPTLKTVLGEALGYGPALSEHI--ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
             KQP  +T   E      +++E    +      P +    V   E    +       K 
Sbjct: 203 DKKQPANETSESEIESIYQSMTELFKPLKTFKFQPQIVKEVVEGEEKENEEKTGKEEGK- 261

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
              ++D+       E     + K   K     E   + +  ++  PL +  +++    +F
Sbjct: 262 ---VKDISKTKKGKEDSKTKKGKEDSKTKKGKEDSKTKKGKEDVLPLDILTYQNFHKERF 318

Query: 352 ETFDAALDEFYSK---IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ETF+ A DEFYS     + ++ ++   AKE   + K  +I   QE  +   ++ +  + +
Sbjct: 319 ETFNQAADEFYSGKVGADIKKVQEDIWAKEVGKYEKRLRI---QEETLEKFQKTIVETKR 375

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
              LI  +  ++   +  +  A   + SW ++A  +K+ RK G   A +I  +    + M
Sbjct: 376 KGNLIYSHYSEIQNLLDIIHQA-REKFSWMEIASKLKKARKEGMVQAQIIQSM----DKM 430

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            +L  N           L  E V VD  L    NA ++Y   KK + K +    A  +  
Sbjct: 431 GVLTLN-----------LEGETVTVDANLEIPENAEKYYNKGKKAKRKIKGVNMAIERTK 479

Query: 529 KAAEKK-TRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           K  E+K  + ++  E+       +RK + WFEK  WF+SS+ +LVI GRDA  NEM+VKR
Sbjct: 480 KDVERKRNKRELALERVRVPQKRVRKELKWFEKLRWFLSSDGFLVIGGRDAGTNEMVVKR 539

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWW 645
           ++   D+Y+H+D+HGA S VIK    E+ +P  T+++AG      S AW     +   +W
Sbjct: 540 HLDNPDIYLHSDIHGAPSVVIKKGEAEE-IPESTIHEAGNLAASFSSAWSKGYGSQDVYW 598

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V+P QVSKT  +GE++  G+F+IRG +N+L   PL +  G++
Sbjct: 599 VHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGIV 640


>gi|255722283|ref|XP_002546076.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136565|gb|EER36118.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 857

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 148/503 (29%), Positives = 245/503 (48%), Gaps = 58/503 (11%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL-- 338
           +Q +V A+   ED   D+ISG    +GYI+ +     K+   +E      I DEF P   
Sbjct: 89  LQKVVDALHVCEDKYMDLISGKTETQGYIVSR-----KNKNASEDSEFDYICDEFHPFKP 143

Query: 339 LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
             +     +F +   ++  +D+F+S +ES +   + + +++ A  +L K   +++ ++ +
Sbjct: 144 YKSNVTDLKFTEVSGYNKTVDQFFSTLESSKFSLKIEQQKENASKRLEKAKSERDKQIES 203

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           L  +   + K  ELI+Y+ E V+     V+  L  +M W ++  ++  E+K  NP +  I
Sbjct: 204 LVAQQQLNSKKGELIQYHSELVEECRRYVQQYLDQQMDWTNIETVIALEQKKNNPTSKSI 263

Query: 459 D-KLYLERNCMSLLLSNNLDEMDDEEKT-------------------------LPVEKVE 492
              L L+ N + +LL +  D  D E  +                         +PV++V+
Sbjct: 264 QLPLNLKDNKIKVLLPDFEDYSDSESASATETESESETESESESDSDSDSDDDIPVKRVQ 323

Query: 493 ------------------VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
                             +DL+LS+ ANAR +++ KK  E+KQ K   + + A K AE+K
Sbjct: 324 KPAKTKAPKKKQNIIPTWIDLSLSSFANARTYFDSKKTAETKQVKVENSTNLALKNAERK 383

Query: 535 TRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
               + +     N  +  +R  +WFEKF WF+SSE YL ++G+D  Q +MI  R+ S  D
Sbjct: 384 INQDLAKALKQENETLKEIRPKYWFEKFYWFVSSEGYLCLAGKDNSQIDMIYYRHFSDND 443

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
             V AD+ G+    IKN    + +PP TL QAG F++  S AW+ K+ TSAW ++  ++S
Sbjct: 444 SIVSADMEGSLKVFIKNPFQGEAIPPSTLMQAGIFSMSASTAWNGKVTTSAWVLHGTEIS 503

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM- 711
           K    G  +  G F    KK +LPP  L+MG G    +DE S   +   R  R +E G+ 
Sbjct: 504 KRDFDGSIVPDGEFKYLAKKEYLPPAQLVMGLGFYCLVDEESTKKYAEIRSNREKEHGLT 563

Query: 712 ----DDFEDSGHHKENSDIESEK 730
               +  +D  + K N  +ESEK
Sbjct: 564 IVVDNKKKDLENIKLNMPVESEK 586



 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 95/194 (48%), Gaps = 36/194 (18%)

Query: 874  ASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDG 933
            A+++P+SI   T +      RG+K KLKK   KY DQDEEER +RM  L +  ++++ + 
Sbjct: 614  AATEPDSIKSNTPV-----PRGKKSKLKKTAAKYRDQDEEERRLRMDALGTLKQLEEQEE 668

Query: 934  DPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDE 993
              + + ++    K+ A+  V   ++  + +K     ++ K++  D               
Sbjct: 669  KTKAQVSA----KEEALKKVQERELAIERRKK-QKERELKKYLAD--------------- 708

Query: 994  TAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYR 1053
                D+   +E  I          L  +D     P  +D ++ ++PV  P+S++Q +KY+
Sbjct: 709  ----DQETNDESHI-------TNYLEILDSFRSKPSVNDKIIGIVPVFAPWSSLQKFKYK 757

Query: 1054 VKIIPGTAKKGKGI 1067
            VKI PG+ KKGK I
Sbjct: 758  VKIQPGSGKKGKCI 771


>gi|261335340|emb|CBH18334.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1100

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 157/548 (28%), Positives = 254/548 (46%), Gaps = 99/548 (18%)

Query: 235 AKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
           A+  T ++ L     +GP+L++HI+  TG V ++K + +    D   + L+  +   E W
Sbjct: 192 AEYETTRSTLSATHHFGPSLADHILTVTG-VKSVKKANMTCSGDEMFEKLLPGM--LEAW 248

Query: 295 LQDVISGDIVPEGYILMQ---------NKHLGKDHPPTESGSSTQI-------------- 331
                +   +P G  L+           +  GK  P  ++G  T                
Sbjct: 249 R---FAFSPLPTGGYLISKTAATKGRGTQERGKAPPHVDAGVGTTADGGEAGSGVEKQPR 305

Query: 332 -------YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
                  Y++F P+LL Q+R          +F +  D F+   E ++ EQ +        
Sbjct: 306 PHLQGVQYEDFSPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVL 365

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K  K   D   R+  L++  + + +  ELI  N E +D AI  +  ALA  + WE L R
Sbjct: 366 SKKEKFERDHNRRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRR 425

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM---DDEEKTLPV----------- 488
           ++K+    G+PVA ++ +L+L+RN +S+L+  N +++   +DEE  + V           
Sbjct: 426 LLKQRHAEGHPVAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGG 485

Query: 489 ---EK-------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE 532
              EK             +EVDL+ +A+ANA  ++  KK   +K EKTI A +KA   AE
Sbjct: 486 NSGEKKDRVEGCSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAE 545

Query: 533 KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
           KK      +++T   I+  R   W+EKFNWF +S   LV+ G D Q  E++V+R M  GD
Sbjct: 546 KKGERLAAKKQTKKAIATERHRCWWEKFNWFRTSCGDLVLQGHDTQSTELLVRRIMRLGD 605

Query: 593 VYVHADLHGA-------------SSTVIKNHRPEQP------------VPPLTLNQAGCF 627
           V+VH+D+ G              +ST       E+             +  ++L++A  +
Sbjct: 606 VFVHSDVEGGLPCILRAAGSAWDASTAFGEGESEENSIQVGESTKGWLIHMISLDEAAAW 665

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW+SK    AWWV+  Q+      G YL      + G+KN+L P PL++G GLL
Sbjct: 666 CVCRSSAWESKFSVGAWWVHASQIVGGTAAGCYL------LSGEKNYLRPRPLMLGCGLL 719

Query: 688 FRLDESSL 695
           FR+   ++
Sbjct: 720 FRISSRAI 727



 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/164 (46%), Positives = 109/164 (66%), Gaps = 12/164 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L G+R +NVYD+ P+T++FK  NS         +K  LL
Sbjct: 1   MVKQRMTALDVRASVEEMRTELQGLRLTNVYDIPPRTFLFKFGNSE--------KKRTLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+GVRLH T   R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+   A Y
Sbjct: 53  LENGVRLHLTQLVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVAEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+++GNI+LTD E+ ++ LLR+H+DD  GV +  R  YP
Sbjct: 113 HIIVELFSKGNIVLTDHEYRIMLLLRAHKDD--GVNMFVRELYP 154



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/216 (25%), Positives = 92/216 (42%), Gaps = 38/216 (17%)

Query: 891  KISRGQKGKLKKMKEKYGDQDEEER------------NIRMALLASAGKVQKNDGDPQNE 938
            ++++ Q+ KLKK+++KY DQD+E+R             +++ LLAS    Q N+   +  
Sbjct: 847  QLTKHQRKKLKKIQQKYKDQDDEDRLTGALLNGNQLSKVQLELLASERAKQTNE-IVRTS 905

Query: 939  NASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMD 998
            +A +      A           +C   G +         D+ H +  +P  G D  A+ +
Sbjct: 906  SAGSSSAAGEAGERCGGEAWGEEC--VGEVRGRAPAKGGDAGHLLAASPSCGSDGPADNE 963

Query: 999  KVAMEEEDIHEIGEEEKGRL--------------NDVDY------LTGNPLPSDILLYVI 1038
            +   E+ +      + + R               ND ++       T  P P D + Y +
Sbjct: 964  RTPREDNEPSTGEPQPRSRAIDSTAASLEATRAANDAEFNREWIHFTAKPQPGDCVEYAV 1023

Query: 1039 PVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLL 1074
             VC P  +V SYKYR ++  G AKKG   Q+  SL+
Sbjct: 1024 AVCAPMGSVISYKYRAELSCGNAKKG---QVALSLI 1056


>gi|294496348|ref|YP_003542841.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
 gi|292667347|gb|ADE37196.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
          Length = 662

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 185/720 (25%), Positives = 310/720 (43%), Gaps = 127/720 (17%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVYD-----LSPKTYIFKLMNSSGVTESGE 52
           +K  M +ADVAA    L      L+  +   +Y      L    YIFK            
Sbjct: 1   MKEEMTSADVAALATELGTGENSLVDSKIGKIYQPGESLLRIHLYIFK------------ 48

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             K  LL+E+G RLH + Y       P  F + LRKHI   R+   RQ  +DRII     
Sbjct: 49  KGKANLLIEAGSRLHLSEYIPPSPKNPQSFPMLLRKHIMGGRITYFRQYDFDRIIEIGIK 108

Query: 113 LGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT 172
            G +   +++E++ QGNI+L DS+  ++  +       + +     ++YP          
Sbjct: 109 RGDDETVLVVEIFGQGNIILLDSDRKIILPMNPVTFKGRRIRSGEIYQYP---------- 158

Query: 173 TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
                 A LT         P  VNED                      L +  + + +D 
Sbjct: 159 -----EAQLT---------PLDVNED---------------------QLCEVFSNSDSDV 183

Query: 233 ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
            R         L      G  LSE + L +G+  N+  SEV+                  
Sbjct: 184 VRT--------LATRFNLGGILSEEVCLRSGVDKNLPASEVDPQ---------------- 219

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE 352
                 I+  ++    +L      G+  P T S   ++   +  P  L ++   E   ++
Sbjct: 220 ------IASKLIEAIGVLFSPLEKGQLKPCTVSKPGSKETFDVVPFDLEKYADFEKNYYD 273

Query: 353 TFDAALDEFYSKIESQRAEQQHKA----KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           +F+ ALD+F+ K  +   EQ+ +A    K +  F +  K    QE  +   +++++++  
Sbjct: 274 SFNKALDDFFGKRAAISLEQKKEASVKEKTEDVFQRRLK---QQEGAIKKFEKDIEKNTS 330

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
           +AE I  + +D++  +  +  A     SW+++  ++ + +          D+L   +  +
Sbjct: 331 IAEKIYEHYQDIELLLQTLLDAREKDYSWKEIQSIISDAK----------DELPAAKKII 380

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
           ++  S  L  +D + K     K  +D+ L+   NA R+YE  KK E K++  + A     
Sbjct: 381 NIDGSQGLVLLDLDGK-----KANIDVRLTVPQNAMRYYEKAKKLEKKRKGALAA----- 430

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
              + K  ++  +     +   + K HW+E+F WF SS+ +LV+ GRDA  NE IVK+YM
Sbjct: 431 -IEDTKNAMKKKKAAPKKHFKVVHKKHWYERFRWFFSSDGFLVVGGRDATTNEEIVKKYM 489

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
            K D+  H    GA  TV+K    E  +P  TL +A  F V  S  W     +   +W+Y
Sbjct: 490 EKRDLVFHTQAPGAPITVVKTGGKE--IPDTTLQEAAEFVVSFSSIWKGGQFSGDCYWIY 547

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           P QV+KT  +GEYL  GSF+IRG++N+    P+    GL  + +  ++G  ++  + RGE
Sbjct: 548 PEQVTKTPESGEYLKKGSFIIRGERNYYRDVPVRAAVGLELKPETRAIGGPVSAVKARGE 607


>gi|157865120|ref|XP_001681268.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124563|emb|CAJ02783.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1224

 Score =  199 bits (506), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 139/484 (28%), Positives = 236/484 (48%), Gaps = 46/484 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ-DV 298
           ++T++     +GP L++H++  TG VPN       +  ++    L   + +  D  + D+
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTG-VPNAPRKSWTQSTESIFATLCPGLLEAFDLAKVDL 313

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESG------------SSTQIYDEFCPLLLNQFRSR 346
            S      GY++      G     +               +  + Y+ F P+LL Q+ + 
Sbjct: 314 TSAG----GYLIKPKARPGSAAHASAPPAPGASAGAADLVAVAERYESFTPILLAQYAND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +++ A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLLTETERIDASNAKRKNTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE + EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETALDEENGEEDCDVPPLVVEVALSKTAHANAADYFSKQKQYRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAGQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H ++ GA   +++                        QPV   ++ +A
Sbjct: 610 VRRVMHLGDLFIHCEVDGALPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALRSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLF 688
            LLF
Sbjct: 724 ALLF 727


>gi|146419620|ref|XP_001485771.1| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  199 bits (506), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 131/432 (30%), Positives = 208/432 (48%), Gaps = 55/432 (12%)

Query: 331 IYDEFCPLL-----LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKL 385
           +YDEF P       L  F+   F +   ++  +D F+S ++S++ E + + ++  A  +L
Sbjct: 159 LYDEFHPFKPYKENLELFK---FTEIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRL 215

Query: 386 NKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVK 445
                +++ ++  L+ + + + K  + I Y+ + V   I +V+  L  +M W ++  ++K
Sbjct: 216 LNAREERDKQIDNLRIQQEMNSKKGDAIIYHADLVSECIASVQTLLDQQMDWANIESLIK 275

Query: 446 EERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD----------------------- 481
            E+  GN VA  I   L L  N + L L +  D M D                       
Sbjct: 276 LEQSRGNSVAKTIKLPLNLTENKIGLKLPDT-DSMYDPADIDSELDSETSSESETESESE 334

Query: 482 --------------------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTI 521
                               + K +P   V +DL LS  ANAR ++E KK+ ESKQEK  
Sbjct: 335 SESGSESEDETPPKRMSKKAKSKEIPALSVWIDLLLSPFANARTYFESKKQAESKQEKVE 394

Query: 522 TAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
                A + A+KK    + +     N  +  +R  +WFEKF WF+SSE YL I+GRD  Q
Sbjct: 395 KNTDMALRNAQKKIEQDLAKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQ 454

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
            +MI  R+ S  D +V +D+ G+   V+KN    + +PP TL QAG F +  S AW+ K+
Sbjct: 455 VDMIYYRHFSDNDFFVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKI 514

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
            TS W++  + V+K    G  +  G+F  +GKK FLPP  L+MG G  F  D+ +   + 
Sbjct: 515 TTSPWFLAGNDVTKLDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYFLGDDDTTKKYG 574

Query: 700 NERRVRGEEEGM 711
             R  R  E G+
Sbjct: 575 ETRITRQNESGL 586



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 84/183 (45%), Gaps = 38/183 (20%)

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
            E  K++RG++ K+K+  +KY DQDE+ER +RM +L +  +++        E     +E  
Sbjct: 656  EPHKLTRGKRSKMKRAAKKYADQDEDERKLRMEMLGTLKQLE--------EIKKKRQENA 707

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
                     +   K K+     ++ +E+                          M EE  
Sbjct: 708  DQDKQQAQQQQNDKLKQTRKAKQEQREYLK-----------------------YMREE-- 742

Query: 1008 HEIGEEEKGRLNDVDYL---TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
              + E+E   +N +D L      P PSD L+ ++PV  P+ ++  +KY+VKI PG AKKG
Sbjct: 743  --VNEDESSMVNYLDILDSFIAKPQPSDKLVAIVPVFAPWYSLNKFKYKVKIQPGMAKKG 800

Query: 1065 KGI 1067
            K I
Sbjct: 801  KSI 803


>gi|396081612|gb|AFN83228.1| putative RNA-binding protein [Encephalitozoon romaleae SJ-2008]
          Length = 648

 Score =  199 bits (505), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 114/341 (33%), Positives = 187/341 (54%), Gaps = 40/341 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K   +    K++K+   QEN +  ++QE     K A
Sbjct: 245 FSTFNDAAEFFF--------QNRKKFGRNDRESKVDKVRKRQENYMKEMEQERQSYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   ++   N++ W D  +  ++E K GN ++  I K  ++   C  
Sbjct: 297 ELLEENADFVNKILDIFKIVKKNKVRWTDFEKFREQENKKGNEISKAIVKTDFISHTCTI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                           L  E++++D   S  +N  R+Y+  KK E K +KT  +  +  K
Sbjct: 357 ---------------ALEGEEIQIDFETSLFSNISRFYQKNKKLEEKIKKTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K     ++ K V      R ++WFEKF++F SS+  LVI G++AQQNE++VK+++ 
Sbjct: 402 KVAPK-----VETKKVT-----RALYWFEKFHFFFSSDGILVIGGKNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H D+HG+SS ++K  +P Q     T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PTDLYFHGDMHGSSSIIVK--KPTQK----TIEEAASMALCMSKCWEANVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           QVSKTAP+GEYLT GSFMI+GKKN++  H +  G GLLF++
Sbjct: 506 QVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116



 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 29/35 (82%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +++ +PVCGP+S + +YKY+V+++PG  KKGK IQ
Sbjct: 576  IVHSMPVCGPWSVISTYKYKVRLVPGREKKGKLIQ 610


>gi|18977764|ref|NP_579121.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|397651884|ref|YP_006492465.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
 gi|18893505|gb|AAL81516.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|393189475|gb|AFN04173.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
          Length = 649

 Score =  199 bits (505), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 192/695 (27%), Positives = 321/695 (46%), Gaps = 127/695 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+    + L+ +I G R   +Y    +   FKL + +GV       +V LL+
Sbjct: 1   MKESMSSVDIKYITEELKDMIVGSRVEKIYHEGNEIR-FKL-HKTGVG------RVDLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T Y ++    P+ F + LRK++  + LED+RQ  +DR+++  FG     +++
Sbjct: 53  EAGKRIHITTYVKENLQ-PTSFAMLLRKYLSGKFLEDIRQYEFDRVVILSFG----EYFL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I EL+ +GNI+    ++ ++  LR     D+  AI  + +Y      VF  + A+ L  +
Sbjct: 108 IAELFGRGNIIFVTKDWEIIGALRYEEFKDR--AIKPKIKY------VFPPSRANPLKVS 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
               KE        +N  G  +  A               L+KN                
Sbjct: 160 FEEFKEII------LNSQGTEIVRA---------------LAKN---------------- 182

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED-----WL 295
                     G   SE  +L   +  + K+ E+++ E   +   +L V   E      + 
Sbjct: 183 -------FSIGGLYSEETLLRAKIDKDRKVDELSEEELRLVYDTLLTVLNDEKKPNIVYN 235

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDE-FCPLLLNQFRSREFVKFETF 354
           ++ +  D+VP   I +Q     +++      S ++  DE F  L + + R  +  + E  
Sbjct: 236 KEGVMVDVVP---IDLQ---WYREYTKRYYESFSEALDEYFGKLTIEKARLEKTKQLEER 289

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
             AL+     I  +R E+Q K  E  A    N+   D     +++  E+ R +  A L +
Sbjct: 290 RKALE-----ISLRRIEEQIKGFEKEAM--TNQEKGDALYAHYSIVNEILRVISSA-LKQ 341

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           Y +E+V                     + ++E +KAG P A +I  + +  N ++L    
Sbjct: 342 YGVEEVK--------------------KRIEEGKKAGYPWAKMI--IDVTDNKVTL---- 375

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA-FKAAEK 533
           NLD +          KV +D+  S   NA  +YE  KK + K E    A+ +   K  E 
Sbjct: 376 NLDGI----------KVSLDVEKSLEENAELYYERAKKAKKKLEGAKIAYEETKRKLIEL 425

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           +  ++   ++        +K  WFEKF WFISSE +LVI G+DA  NE++VK++M + D+
Sbjct: 426 EKEIERESKEINIKKITRKKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVKKHMDENDI 485

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVS 652
           Y HAD+ GA   +IKN R        T+ +A  F V  S+AW   + ++ A+WVYP QVS
Sbjct: 486 YCHADIWGAPHVIIKNGR---NASEKTIREACQFAVAMSRAWSEGLASADAYWVYPEQVS 542

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           K AP GEYL  G+FM+ GK+N++   PL +  G++
Sbjct: 543 KQAPAGEYLPKGAFMVYGKRNWIHGIPLKLAVGIV 577


>gi|257215816|emb|CAX83060.1| Serologically defined colon cancer antigen 1 [Schistosoma
           japonicum]
          Length = 521

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/181 (53%), Positives = 122/181 (67%), Gaps = 19/181 (10%)

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           KT+A I+ +RK  WFEKF WFISSENYLV++G D+QQNE++VKRY+  GD++VHAD+HGA
Sbjct: 5   KTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQNEVLVKRYLKSGDIFVHADIHGA 64

Query: 603 SSTVIKN-------------------HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           S+ +IK                    HR     PP TL +A    V  S AW S ++T A
Sbjct: 65  STVIIKARHLTSEESDFSKHESLLHLHRSLPLPPPKTLLEAANMAVVLSSAWQSHVLTRA 124

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
           WWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG++F+L E S+  H  ERR
Sbjct: 125 WWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHEDSVFKHKGERR 184

Query: 704 V 704
           +
Sbjct: 185 I 185



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 98/187 (52%), Gaps = 20/187 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQN-----ENASTHKEK 946
            + RGQK KLKK+K+KY +QDEEER++RM +L      Q +D  P       E   +  + 
Sbjct: 292  LKRGQKSKLKKIKQKYKEQDEEERSLRMRIL------QGDDAKPSQYHQILERDHSLNQV 345

Query: 947  KPAISPVDAPKVC-----YKCKKAGHLSKDCKEHPDDSSHGVEDN-PCVGLDETAEMDKV 1000
            K + S +D   VC        +   + + D  +H  +S  G E++  C  +D      K 
Sbjct: 346  KTSNSILDTQTVCDSDVIRNDQPDNNANLDIDDHFTESDDGSEESLRCSDVDNLKS--KD 403

Query: 1001 AMEEEDIHEIGEEEKGRL-NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPG 1059
              + +D  ++  E K  L + ++ LTG P   D+LLY IPVC PYS +  +K+RVK+ PG
Sbjct: 404  NDDGDDDEDLSSESKDDLISLLNSLTGQPNDDDLLLYAIPVCAPYSVLLKFKFRVKLNPG 463

Query: 1060 TAKKGKG 1066
              K+GK 
Sbjct: 464  NTKRGKA 470


>gi|84489327|ref|YP_447559.1| RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
 gi|84372646|gb|ABC56916.1| predicted RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
          Length = 666

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 187/692 (27%), Positives = 301/692 (43%), Gaps = 116/692 (16%)

Query: 6   MNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  D+   V  L + LI  R    Y     T   KL       ++GE  K L++ ++GV
Sbjct: 1   MSNVDIHRMVNELNKELINTRIDKAYQPDVDTIRIKL------RKAGEGRKDLVI-QAGV 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-E 123
           R+H T Y +     P  F + LRKH+    +  + Q  +DRII  +  +     Y IL E
Sbjct: 54  RIHLTNYPQPNPTIPPNFPMLLRKHLSGGSITSIEQHNFDRII--KIKVQKKEEYTILVE 111

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
           L+++GNI+L                 DK   I+S  ++ T              H    +
Sbjct: 112 LFSKGNIILL----------------DKDNNIISPLKHKT-------------WHDRKIT 142

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
           + E     P+K    G N++N   E+L         D+++    N   G  A+       
Sbjct: 143 AHEEYKYPPEK----GININNCRFEDLKTVINTSDRDITRTLATNGLGGLYAE------- 191

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
             E + Y     E       L   +   E+ +L +NAI  L   +             + 
Sbjct: 192 --EVISYTSINKEK------LAKELTDDEITQL-NNAINELFNKI-------------ET 229

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
            P+  I++      KD                 P+ LN++   +   FETF+ A DEFYS
Sbjct: 230 NPQPQIILDENDKNKD---------------LVPITLNKYAQFKSKSFETFNMAADEFYS 274

Query: 364 K---IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           K    + +  E++  AK    F K  K+   QE  +    + ++      + I  +  ++
Sbjct: 275 KKIVSDIKNKEEKLWAKRIGKFEKRLKM---QEETLEGFYKTIEDKQHKGDTIYAHYNEI 331

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGN-PVAGLIDKLYLERNCMSLLLSNNLDEM 479
              I  +  A  N  SW+++  ++K+ +K G  P   +I+ +    + M ++   NL ++
Sbjct: 332 QQIINVIHQAREN-YSWKEIGSIIKKSKKEGKIPELEMIESI----DKMGVI---NL-KL 382

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEKKTR 536
           DD         V++D  +    +  ++Y   KK + K +   K I       K  E K  
Sbjct: 383 DDTH-------VQIDSNIGIPESTEKYYNKGKKAKRKIDGVNKAIENTKSEIKKLEDKKE 435

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
           + I   +        R++ W+EK  WFIS + YLVI GRDA  NE +VK+Y    D+Y+H
Sbjct: 436 VAIELLRQKQEKREKRELKWYEKLRWFISRDGYLVIGGRDANSNEQVVKKYSKNNDIYLH 495

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTA 655
            D+HGA ST+I+N + E  +P  TL  A CF    S AW     +  A+WV   QVSKT 
Sbjct: 496 CDIHGAPSTIIQN-KNEDEIPESTLYDAACFASSFSSAWTEGFSSYDAYWVTLDQVSKTP 554

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +GE+L  G+F+IRGKKNF+   P+++  G++
Sbjct: 555 QSGEFLKKGAFVIRGKKNFIRNVPVLIAIGVV 586


>gi|398011164|ref|XP_003858778.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322496988|emb|CBZ32058.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 1228

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 139/484 (28%), Positives = 234/484 (48%), Gaps = 46/484 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           ++T++     +GP L++H++  TG++ N       +  DN  + L   + +      D+ 
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTGVL-NTPRKSWTQSADNVFEALRPGLLE----AFDLA 309

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSST-------------QIYDEFCPLLLNQFRSR 346
             D+   G  L++ K        T +  +              + Y+ F P+LL Q+ + 
Sbjct: 310 KVDLTSAGGYLIKPKAKPASTAHTPAPPAPGASAAAADLVAVAEQYESFTPILLAQYTND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +   A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE   EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETVLDEEKGEEDCDVPPLVVEVTLSKTAHANAADYFSKQKQHRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAAQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H D+ G+   +++                        QPV   ++ +A
Sbjct: 610 VRRVMRLGDLFIHCDVDGSLPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALHSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLF 688
            LLF
Sbjct: 724 ALLF 727



 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/165 (44%), Positives = 106/165 (64%), Gaps = 12/165 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYNIGNKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--A 117
            ESG R H T  AR+K   PS FTLKLRKHIR  RL+ + QL +DR I   FG+      
Sbjct: 54  -ESGTRFHLTELAREKPKVPSQFTLKLRKHIRAWRLDSIAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++I+EL+++GN++LTD  +T++ LLR+HRDD+ G+ +M    YP
Sbjct: 113 FHIIVELFSKGNVILTDYAYTIMMLLRTHRDDE-GLKLMVNQVYP 156


>gi|146078492|ref|XP_001463556.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134067642|emb|CAM65921.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1228

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 139/484 (28%), Positives = 234/484 (48%), Gaps = 46/484 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           ++T++     +GP L++H++  TG++ N       +  DN  + L   + +      D+ 
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTGVL-NTPRKSWTQSADNVFEALRPGLLE----AFDLA 309

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSST-------------QIYDEFCPLLLNQFRSR 346
             D+   G  L++ K        T +  +              + Y+ F P+LL Q+ + 
Sbjct: 310 KVDLTSAGGYLIKPKAKPASTAHTPAPPAPGASAAAADLVAVAEQYESFTPILLAQYTND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +   A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE   EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETVLDEEKGEEDCDVPPLVVEVTLSKTAHANAADYFSKQKQHRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAAQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H D+ G+   +++                        QPV   ++ +A
Sbjct: 610 VRRVMRLGDLFIHCDVDGSLPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALHSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLF 688
            LLF
Sbjct: 724 ALLF 727



 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/165 (44%), Positives = 106/165 (64%), Gaps = 12/165 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYNIGNKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--A 117
            ESG R H T  AR+K   PS FTLKLRKHIR  RL+ + QL +DR I   FG+      
Sbjct: 54  -ESGTRFHLTELAREKPKVPSQFTLKLRKHIRAWRLDSIAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++I+EL+++GN++LTD  +T++ LLR+HRDD+ G+ +M    YP
Sbjct: 113 FHIIVELFSKGNVILTDYAYTIMMLLRTHRDDE-GLKLMVNQVYP 156


>gi|336476370|ref|YP_004615511.1| fibronectin-binding A domain-containing protein [Methanosalsum
           zhilinae DSM 4017]
 gi|335929751|gb|AEH60292.1| Fibronectin-binding A domain protein [Methanosalsum zhilinae DSM
           4017]
          Length = 660

 Score =  196 bits (498), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 183/697 (26%), Positives = 307/697 (44%), Gaps = 123/697 (17%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVYD-LSPKTYIFKLMNSSGVTESGESEKV 56
           +K  M++ADV+A V  L      LI  +   +Y   S +  I   ++  G          
Sbjct: 1   MKDEMSSADVSALVYELVHGPYNLIDAKIGKIYQPFSDEIRINLFIHGKGRDN------- 53

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            L++E+G R H +         P  F + LRKH+   R+ D+ Q  +DRII  +   G  
Sbjct: 54  -LILEAGKRAHISKNLPPNPKLPPSFPMLLRKHLSGGRILDISQYDFDRIIEIRIVRGGV 112

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
              ++ EL+A+GNI+L DSE  ++  ++      + +     + YP       E  T  K
Sbjct: 113 ETVLVAELFARGNIVLLDSERKIILPMKPVTFRGRKIRSGETYEYPESKVNPLE-ITEEK 171

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
           +   L +S        D V       + A+K NLGG                        
Sbjct: 172 MKDLLYTSTS------DLVR------TIATKMNLGGN----------------------- 196

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                            LSE I L +G+  N    E++   D  I +L  +V    D L 
Sbjct: 197 -----------------LSEEICLVSGIDKNRSAKEID---DQEISILCESV---NDVLS 233

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE---- 352
            ++SGD+ P    +++ K+                 D+  P+ +N F  + F K+E    
Sbjct: 234 PLVSGDLKPN---IVKKKN-----------------DDLEPINVNPFDLKIFEKYEKEYY 273

Query: 353 -TFDAALDEFYSKIESQRA-EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
            +F+ ALDE++ K   ++  E+    K+D A     +    Q+  +   +++ ++ V+ A
Sbjct: 274 ESFNEALDEYFGKASLEKVDEKVETVKKDKA-GVFERRLQQQKTAISKFEKQAEKYVQAA 332

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E I    +D++    A+  A +   SW ++  ++K  + +      +I+   ++     +
Sbjct: 333 EKIYSYYQDIEHITDALNNARSKGYSWSEIKSIIKSSKDSTQAAKSIIN---IDPGKGII 389

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           +L  +LD  +          VE+++  S   NA  +YE  KK   K++  + A  +   +
Sbjct: 390 VL--DLDGTN----------VEININKSIPQNAEMYYEKAKKVTRKRDGALKALEETKAS 437

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
            +KK + +  + K +      RK  W+E+F WFISS+ +LV+ GRDA  NE IVK+YM K
Sbjct: 438 MQKKEKKEPSKRKII------RKPSWYERFRWFISSDGFLVVGGRDADTNEEIVKKYMEK 491

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD-SKMVTSAWWVYPH 649
            D++ H    GA  T+IK    E  VP  T+ +A  F V +S  W         + V P 
Sbjct: 492 RDLFFHTQAPGAPVTIIKTEGKE--VPSTTIEEASRFVVSYSSLWKLGHFAGDCYMVKPE 549

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           QVSKT  +GEYL  GSF+IRG++N+    P+ +  G+
Sbjct: 550 QVSKTPESGEYLKKGSFVIRGERNYFKNVPMRVAVGI 586


>gi|332157694|ref|YP_004422973.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
 gi|331033157|gb|AEC50969.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
          Length = 650

 Score =  194 bits (493), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 198/358 (55%), Gaps = 28/358 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V F+TF  ALDE++ K+  ++A ++   K +    +L      QEN 
Sbjct: 243 VPIDLRWYDGYEKVYFDTFSKALDEYFGKLTIEKAREEKTKKLEEKKKQLIATLKRQENM 302

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   K+E+ R+ ++A+LI  N + VD  +  +  A+  R+ WE+L R V+E +K GN +A
Sbjct: 303 IKGFKEEMRRNQEIADLIYANYQLVDNLLKELSKAV-ERLGWEELIRRVEEGKKKGNRIA 361

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            +I  +  + N +++       E++D++  L +++         + NA  +YE  KK + 
Sbjct: 362 MMIKSINPQENSVTI-------EIEDKKVRLYIDR-------DINENAEIYYEKAKKAKH 407

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-----WFEKFNWFISSENYL 570
           K E       KA++  +KK      + +       ++K+      WFEKF WFISSE +L
Sbjct: 408 KLE----GAKKAYEELKKKLEQVEKEIEEEEKKVQVKKIERRKKKWFEKFRWFISSEGFL 463

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI G+DA  NE++V++YM + D+Y HAD+ GA   +IK+ R        T+ +A  F V 
Sbjct: 464 VIGGKDATTNEIVVRKYMGENDIYCHADIWGAPHVIIKDGR---RASEKTIFEACQFAVS 520

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            S+AW   + ++ A+WVYP QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 521 MSRAWSEGLYSADAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMHGIPLKLAVGII 578



 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 83/162 (51%), Gaps = 13/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  L G R   VY    +  I        + ++GE  + L++ 
Sbjct: 1   MKEEMSSVDIRYIVQELKEELKGARIDKVYHEGDEVRI-------KLHKTGEGRRDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G RLH T Y ++  ++PS F + LRK++    ++++ Q  +DRI+  + G       +
Sbjct: 53  EAGKRLHLTTYIKESSSSPSSFAMLLRKYLSGAFVDEIEQHDFDRIVKIRVG----KFTI 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++L D    +L  +R     D+ +     +++P
Sbjct: 109 IAELFRRGNVILVDENNVILGAIRYEEFKDRSIKPKHEYKFP 150


>gi|401826788|ref|XP_003887487.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
 gi|395460005|gb|AFM98506.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
          Length = 648

 Score =  192 bits (489), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 112/341 (32%), Positives = 187/341 (54%), Gaps = 40/341 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K  ++    K++K+   QEN +  ++QE +   K A
Sbjct: 245 FPTFNDAAEFFF--------QSRKKFGKNDRESKVDKVRKRQENYMKEMEQEGESYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   N++ W D  +  ++E + GN ++  I K  ++   C  
Sbjct: 297 ELLEANADFVNKILDIFKVVKKNKVKWTDFEKFREQENRKGNEISKAIVKTDFISHTCTI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +L                 E++++D  ++   N  R+Y+  KK E K  KT  +  +  K
Sbjct: 357 VLEG---------------EEIQIDFEVTLFNNVSRFYQKSKKLEEKIMKTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K   + +           R ++WFEKF++F SS+  LVI GR+AQQNE++VK+++ 
Sbjct: 402 KIAPKVETKKIT----------RALYWFEKFHFFFSSDGVLVIGGRNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H D+HG+SS ++K     +P P  T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PNDLYFHGDMHGSSSIIVK-----KPTPK-TIEEAASMALCMSKCWEANVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           QVSKTAP+GEYLT GSFMI+GKKN++  H +  G GLLF++
Sbjct: 506 QVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116



 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 29/35 (82%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +++ +PVCGP+S + +YKY+V+++PG  KKG+ +Q
Sbjct: 576  IVHSMPVCGPWSVISAYKYKVRLVPGREKKGRLVQ 610


>gi|117938818|gb|AAH06001.1| SDCCAG1 protein [Homo sapiens]
          Length = 398

 Score =  192 bits (488), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 222/477 (46%), Gaps = 111/477 (23%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K                                   
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMK----------------------------------- 77

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
                  GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 78  -------GNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE--------PL 122

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             V++A K  L                             L
Sbjct: 123 LTLERLTEI------------VASAPKGEL-----------------------------L 141

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 142 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 197

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD A
Sbjct: 198 SNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKA 253

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 254 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 313

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 314 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRN 370


>gi|347524253|ref|YP_004781823.1| hypothetical protein Pyrfu_1716 [Pyrolobus fumarii 1A]
 gi|343461135|gb|AEM39571.1| protein of unknown function DUF814 [Pyrolobus fumarii 1A]
          Length = 668

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 186/687 (27%), Positives = 298/687 (43%), Gaps = 125/687 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  M   DVA+ V+ L  L G R  N+Y++    Y+ +L  +             ++ E 
Sbjct: 5   KTSMTAFDVASVVRELEELKGARLVNIYEVFENVYLLRLRGTRDAR---------VIAEP 55

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
           G R+H T+Y    K  P    + LRKHIR  RL  V+QLG+DRIILF+F    N + +++
Sbjct: 56  GRRVHETSYDVTGKEQPPPLIMALRKHIRGERLSTVKQLGFDRIILFEFA---NGYKLVV 112

Query: 123 ELYAQGNILLTDSEFTVL--TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           EL  +G + L D + ++L  +  R  RD                  RV +R    K    
Sbjct: 113 ELLPRGVLALLDEKGSILHASEWREMRD------------------RVIKRGVEYK---- 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                 P A  P+ + ED        +E L G  G                        +
Sbjct: 151 ---QPPPAAVHPENLTED------VVRERLAGASG-----------------------EV 178

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
             VL   LGY   + E  +   G+    K + V KL  + I  +V A+       + +  
Sbjct: 179 VRVLVRKLGYPGEVVEEALFRAGI---EKTTPVEKLGASDIGAIVEAI-------RGIYR 228

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
             +   GYI+   K L     P            F P   + +  R +   E+   ALDE
Sbjct: 229 ESLEARGYIVYDEKGLVLTVVP------------FKP---SMYEGR-YRAVESISKALDE 272

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++ ++E  RA ++   K +    KL       E  +   +++  +  K+A L+  N   V
Sbjct: 273 YFVELEKARAVEEAVEKLEEEKGKLRAAISKTEELIREYEEKKVKLEKLALLVAENAALV 332

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           D A+   R  +     W+ +          GN   G++D +   R  + L +  ++ E+D
Sbjct: 333 DQALECAR-RMREGSGWDYIP---------GN-CPGVVD-VEPSRGVVKLNIGGSIVEVD 380

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
                     +  D A   +   R+  EL+KK+ S+  +T+    K  ++ E    L+I 
Sbjct: 381 ----------IRSDSARLINELYRKIGELEKKR-SRALRTLEELKKKLESLE----LEIR 425

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +E   A     RK  W+EK++W  +S   LVI GRDA QNE +VKRY+ + ++++HAD+ 
Sbjct: 426 EEARRARARIRRK-EWYEKYHWMFTSHWLLVIGGRDASQNESVVKRYLGENNIFMHADIR 484

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
           GA + V+     E P     + +A     C+S+AW   +     +WV+  QVSK AP GE
Sbjct: 485 GAPAVVVFAGGKEPPEE--DIREAAVIAACYSRAWKEGLGAIDVYWVWGRQVSKAAPPGE 542

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           YLT G+FM+ G++N++    L +  GL
Sbjct: 543 YLTKGAFMVYGERNYIRGVELKLAIGL 569


>gi|19074389|ref|NP_585895.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi GB-M1]
 gi|19069031|emb|CAD25499.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329389|gb|AGE95661.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi]
          Length = 648

 Score =  190 bits (483), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 116/343 (33%), Positives = 183/343 (53%), Gaps = 40/343 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FETF+ A  EFY +   +  +   ++K D       K+   QE  V  ++Q+ +   + A
Sbjct: 245 FETFNEAA-EFYFQSRKKFGKNDRESKVD-------KVRKRQEEYVKEMEQQGELLRRKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   NR+ W D  +   +E K GN V+  I K  ++   C  
Sbjct: 297 ELLERNSKLVNRILDIFKVVKKNRIKWTDFEKFWGQENKKGNEVSKAIVKTDFMAHKCWI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +L                 E++E+D   S  +N    Y+  KK E K  +T  +  +  K
Sbjct: 357 VLEG---------------EEIEIDFDSSLFSNISGLYQKSKKLEEKIRRTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K     ++ K +      R  +WFEKF++F SS+  LVI G++AQQNE++VK+++ 
Sbjct: 402 RIAPK-----IESKKIT-----RAPYWFEKFHFFFSSDGVLVIGGKNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            GD+Y H+D+HG+SS ++K    +      T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PGDLYFHSDMHGSSSIIVKKATQK------TIEEAASMALCMSKCWEANVVSPVWYVYGD 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           QVSKTAP+GEYL  GSFMI GKKN++  H +  G GLLFR+ E
Sbjct: 506 QVSKTAPSGEYLKKGSFMITGKKNYVECHRIEYGLGLLFRVSE 548



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL      N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLKEKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEYDTDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 29/35 (82%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +++ +PVCGP+S + +YKY+V+++PG  KKGK +Q
Sbjct: 576  IVHSMPVCGPWSVISAYKYKVRLVPGREKKGKLVQ 610


>gi|303389736|ref|XP_003073100.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302244|gb|ADM11740.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
          Length = 648

 Score =  189 bits (479), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 112/343 (32%), Positives = 184/343 (53%), Gaps = 40/343 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K  ++    K++K+   QEN +  ++Q+ +   K A
Sbjct: 245 FNTFNDAAEYFF--------QGRKKFGKNDRETKVDKVRKRQENYMKEMEQQGECYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   N++ W D  +  ++E K G+ V+  I K  ++   C  
Sbjct: 297 ELLEKNADLVNRILEIFKVVRKNKVKWTDFEKFREQENKKGSEVSKAIVKTDFVSHTCWI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                          TL  E++ +D  +S   N   +Y+  KK E K  KT  +  +  K
Sbjct: 357 ---------------TLEGEEIPIDFNISLFNNVSEFYQKSKKLEEKIRKTRDSLGEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K   + +           R ++WFEKF++F SS+  LVI G+ AQQNE++VK+++ 
Sbjct: 402 KIAPKVETKKIT----------RTLYWFEKFHFFFSSDGVLVIGGKTAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H+D+HGASS ++K  +P +     T+ +     +C S+ W++ +V+  W+VY  
Sbjct: 452 PTDLYFHSDVHGASSIIVK--KPTEK----TIVETASMALCMSRCWETNVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           QVSKTAP+GEYL  GSFMI+GKKN++  H +  G GLLFR+ E
Sbjct: 506 QVSKTAPSGEYLGKGSFMIKGKKNYVDCHKIEYGLGLLFRVFE 548



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L+ RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELKPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116



 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 29/35 (82%)

Query: 1034 LLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +++ +PVCGP+S + +YKY+V+++PG  +KGK +Q
Sbjct: 576  IVHSMPVCGPWSVISTYKYKVRLVPGRERKGKLVQ 610


>gi|159111661|ref|XP_001706061.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
 gi|157434154|gb|EDO78387.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
          Length = 1063

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 137/215 (63%), Gaps = 17/215 (7%)

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI--LQEKTVANI---SHMR 552
           +AH  A+  +E  K  E K ++T+   S  F   EKK    I  + ++T A +    H R
Sbjct: 537 TAHIIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKATADIDSVMKETDAELIALQHQR 596

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR- 611
              WFEKF+WF S++ YLV+SGRDAQ NE++VK++MS  D++VH++ HGA+ T++K  R 
Sbjct: 597 SPLWFEKFHWFFSTDGYLVLSGRDAQSNELLVKKFMSSNDIFVHSEAHGAACTIVKAPRL 656

Query: 612 -----PEQP-----VPPL-TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
                P+Q      VPP+ T+ +AG FTV HS+ W  K+ T ++WVY  QVSKTAP G Y
Sbjct: 657 TTTDIPQQNTVLRWVPPVQTMLEAGAFTVIHSKMWAQKVGTQSYWVYADQVSKTAPAGMY 716

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           +  GSF+IRGK+NF+P  PL +G  LL+R D +++
Sbjct: 717 IGTGSFVIRGKRNFIPQQPLELGVALLWRYDTANV 751



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 77/150 (51%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE------- 54
           K+  ++ DVA   K L   L+  R ++V +LS  TY+ +   S+ V +  +++       
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSVTNLSKTTYLLRFHASTTVIDQCQTKNQTLIDT 65

Query: 55  --KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             K  +++E G  +H T +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSIIIEPGFYMHATRFDWSKAIPPTAFSNRLRTEICNMICTGVSQFYFDRVIILEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN++LTD  + VL
Sbjct: 126 RYNSDLKRYLIVELYGRGNLILTDEAYKVL 155


>gi|337284225|ref|YP_004623699.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
 gi|334900159|gb|AEH24427.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
          Length = 648

 Score =  187 bits (474), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 120/357 (33%), Positives = 195/357 (54%), Gaps = 26/357 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSK--IESQRAEQQHKAKEDAAFHKLNKIHMDQ- 392
            P+ L  +   E   FETF  ALDE++ K  +E  +AE+  K +E     K  +I +++ 
Sbjct: 241 VPIELKWYDGYERKYFETFSEALDEYFGKLTVEKAKAEKTRKLEEK---RKALEISLERI 297

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
             ++   ++E  ++ ++ +LI  N   V+  +  +R A+  ++ WE+L R V+E +K GN
Sbjct: 298 REQMMAFEEEAKKNQELGDLIYANYSLVERLLEELRAAV-KKLGWEELERRVEEGKKTGN 356

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
             A +I  ++   N +++       E+D +        +++ L  S   NA  +YE  K+
Sbjct: 357 KAAEVIKGIHPSENAVTV-------EIDGK-------AIKLYLNRSLGENAELYYERAKR 402

Query: 513 QESKQEKTITAHSKA-FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
            ++K E    A+ +   K  E +  ++   +K        RK  WFEKF WFISSE +LV
Sbjct: 403 AKAKLEGARKAYEETKIKIEELERLIEEEGKKVGVKKLERRKKKWFEKFRWFISSEGFLV 462

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I G+DA  NEM+VKR+M + D+Y HAD++GA   VIK+ R        T+ +A  F V  
Sbjct: 463 IGGKDATTNEMVVKRHMEENDIYCHADVYGAPHVVIKDGR---KAGERTIFEACQFAVSM 519

Query: 632 SQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           S+AW   + ++ A+WVYP QVSK +P GEYL  G+FM+ GK+N+    PL +  G++
Sbjct: 520 SRAWGQGLYSADAYWVYPEQVSKKSPAGEYLPKGAFMVYGKRNWFHGIPLKLAVGVV 576



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 13/157 (8%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M + D+   VK LR L+G R   VY    +  I          ++GE  K L++ E+G R
Sbjct: 5   MTSVDIRYIVKELRELVGARVDKVYHEGNEIRI-------KFHKAGEGRKDLII-EAGKR 56

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T Y ++   TP+ F + LRKH+    L  + Q  +DRI+   F      + +++EL+
Sbjct: 57  IHLTTYIKEI-PTPTSFAMLLRKHLGGAFLSGIEQHDFDRIVKLSF----RDYTLVVELF 111

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +GN++L   +  ++  LR     D+ +     +++P
Sbjct: 112 GKGNLVLVGPDGLIIAALRYEEFRDRAIKPKVEYKFP 148


>gi|50312521|ref|XP_456296.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49645432|emb|CAG99004.1| KLLA0F27335p [Kluyveromyces lactis]
          Length = 1027

 Score =  186 bits (473), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 200/800 (25%), Positives = 365/800 (45%), Gaps = 131/800 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    K L  +++G R  N+Y++  S + ++ K     G  +S    K+ +
Sbjct: 1   MKQRLSSLDLQLISKELENQIVGFRLRNIYNIADSNRQFLLKF----GKPDS----KLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+HTT + R    TPS F  KLR +++ +RL  V+Q+  DRII+F F  G   +
Sbjct: 53  VIDCGLRVHTTDFTRPIPPTPSWFVSKLRSYLKEKRLTAVKQIPNDRIIVFTFADG--KY 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN+LL D++  +L L R   D            Y  ++   ++    ++++
Sbjct: 111 YLVLEFFSAGNVLLLDADQKILLLQRVVDD------------YSMKVGEFYDMANFAEIN 158

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
              TS+  PD  E  +     N +++  KE     K      +     K      +A  P
Sbjct: 159 Q--TSTTVPDPKEYFE-----NEIADWLKEADVKAKST----IVPGEAKKGKLKGKASVP 207

Query: 239 TLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
           +++ +L     + P LS  +I ++    G+ P+    E      + + VLV  ++  E  
Sbjct: 208 SIQKLL---FVHAPHLSSDLIQNSLKAIGIDPSSSCLEFK----HNVSVLVDLMSSLEVQ 260

Query: 295 LQDVISGDIVPEGYILMQNKHLG---KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
              +IS      GYI+     L    +D P  E   S   +  F P + +    +     
Sbjct: 261 ANKLISTTSTRIGYIVAHKNKLYDPLRDKPELEYTFSN--FHPFKPFVGDSTDVKIIEIG 318

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
             ++  +D F+S IES +   + + ++  A  KL++   + E  + +L      + +   
Sbjct: 319 GMYNNTVDTFFSTIESNKYASRIQNQDFQAQKKLDEAKNNNETIIKSLLHAQQTNEEKGN 378

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
           ++  N   V+ A  AV+  L  +M W+ +  ++  E++ GN +A +I   + L  N +++
Sbjct: 379 ILIANANLVEEAKNAVKSLLDQQMDWQSMETLIANEQRKGNKIARIIKLPMDLPNNKITI 438

Query: 471 LLSNNLDEMDD------EEKTLPVEKVEVD---LALSAHANARRWYEL---KKKQESKQE 518
            L  +    DD       E      + +V+    ++S+  +   + EL   K KQ+S+++
Sbjct: 439 ELPKDGYSEDDSTEHHQSEADYSSNESDVNQSDSSVSSDYSDSDFEELTSSKSKQQSRRK 498

Query: 519 KTITAHSK------------AFKAA-----------EKKTRLQILQEKTVANI------- 548
             IT+  +            AF  A           EK+ +++   EK + +I       
Sbjct: 499 SKITSEKRETVLLTVDLSLSAFANASSYFNAKKATSEKQKKVEKNAEKALKSIQQKIEKD 558

Query: 549 -------SH-----MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
                  SH     +R  ++FEK+ WFISSE++LV+ G+   + + +  +Y++  D+ V 
Sbjct: 559 LQKKSKESHDILKAIRTPYFFEKYYWFISSESFLVLMGKSPVETDQLYAKYVNDDDIMV- 617

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
            +     + ++   + E  VPP TL QAG F    S AW  K+ +S WW +   V+K   
Sbjct: 618 TNAFDVKAWILNPQKTE--VPPNTLMQAGTFANSASDAWSKKIASSPWWCFAKNVTKFDD 675

Query: 657 T-GEYLTVGSFMIR--GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
             G  L VGSF ++    KN LPP  L+MG GL++              +V+ E    D 
Sbjct: 676 IDGSVLPVGSFRMKQPKAKNMLPPAQLVMGLGLVW--------------KVKTE----DS 717

Query: 714 FEDSGHHKENSDIESEKDDT 733
            E  G +++NSD+E+  DDT
Sbjct: 718 EEKEGEYEQNSDLEASDDDT 737



 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 23/31 (74%)

Query: 1037 VIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            VIPV  P++A+   KY+VKI PGTAKK K I
Sbjct: 919  VIPVYAPWAALTKNKYKVKIQPGTAKKSKSI 949



 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 29/37 (78%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG++GKLKK+++KY DQDEEER +R+  L +   +++
Sbjct: 820 RGKRGKLKKIQKKYFDQDEEERLLRLEALGTLKGIER 856


>gi|308160802|gb|EFO63274.1| Serologically defined colon cancer antigen 1 [Giardia lamblia P15]
          Length = 1063

 Score =  186 bits (471), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 135/422 (31%), Positives = 203/422 (48%), Gaps = 74/422 (17%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
           +R+ +  ++E+++  LDE+ S + + RA Q        A   L       ENRV +L   
Sbjct: 327 YRAEDIREYESYNKTLDEYNSLLVTARAYQNRAQLVQKAKLTLAHAQDTTENRVASLLNS 386

Query: 403 VDRSVKMAELI-------EYNLEDVDAAILAVRVALANRMSWEDLARM----------VK 445
             R   +AE I       +Y  + ++      RV   + + W +   M          V 
Sbjct: 387 ATRKRLLAECILWKAAEIDYLTKQMEFLFKTERVTWNDVIVWMNYGSMDVPLLEAISSVD 446

Query: 446 EERKAGN----PVAGLIDKLYLERNCMSLLLSNNL-------------DEMDDEEK---- 484
             RK  +      A  I  ++ E     L LS +              DE +D ++    
Sbjct: 447 VVRKVVSFNISIFASDIHDMHYEDCTPFLALSKSRATAKQEIPDLEASDETEDNDEQQGY 506

Query: 485 ------------TLPVEKVEVDLAL------SAHANARRWYELKKKQESKQEKTITAHSK 526
                       T P+  + VD+        +AH  A+  +E  K  E K ++T+   S 
Sbjct: 507 GSCENTRIMPDPTEPI-IISVDVPFKGTAGTNAHTIAKTLFEAAKAAEEKCKRTLGHSSA 565

Query: 527 AFKAAEKKTRLQI--LQEKTVANI---SHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            F   EKK    I  + ++T A +    H R   WFEKF+WF S+  YLV+SGRDAQ NE
Sbjct: 566 YFDKVEKKATADIDSVMKETDAELIALQHQRSPLWFEKFHWFFSTNGYLVLSGRDAQSNE 625

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHR------PEQP-----VPP-LTLNQAGCFTV 629
           ++VK++MS  D++VH++ HGA+ T++K  R      P++      VPP  T+ +AG FTV
Sbjct: 626 LLVKKFMSPNDIFVHSEAHGAACTIVKAPRLTTTDAPQENTVLRWVPPEQTMLEAGAFTV 685

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
            HS+ W  K+ T ++WVY  QVSKTAP G Y+  GSF+IRGK+NF+P  PL +G  LL+R
Sbjct: 686 IHSKMWTQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWR 745

Query: 690 LD 691
            D
Sbjct: 746 YD 747



 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 76/150 (50%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL-- 59
           K+  ++ DVA   K L   L+  R ++V +LS  TY+ +   S+ V +  +++   L+  
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSVTNLSKTTYLLRFHASTTVIDQCQTKNQTLIDT 65

Query: 60  -------MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
                  +E G  +H T +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSVIIEPGFYMHATRFDWSKAIPPTVFSNRLRTEICNMICTGVSQFYFDRVIILEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN++LTD  + VL
Sbjct: 126 RYNSELKRYLIVELYGRGNLILTDETYKVL 155


>gi|448583074|ref|ZP_21646543.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
 gi|445730031|gb|ELZ81623.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
          Length = 702

 Score =  186 bits (471), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 177/707 (25%), Positives = 291/707 (41%), Gaps = 130/707 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N  ++  D  R    
Sbjct: 165 ------------DPLTVSRDA---------------------LGRNMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +++    + +A+   ++      D  Q V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVEKTLDIADATAEDYDAVYDAIV------DLRQQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEF-CPLLLNQFRSREFVKFETFDAA 357
            SG+  P  Y+                G   ++ D    PL  +Q    +   ++TF+ A
Sbjct: 238 RSGEFDPRLYL----------------GDDGEVVDVTPFPLREHQNAGLDEEAYDTFNDA 281

Query: 358 LDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           LDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+ +   + AEL+ 
Sbjct: 282 LDEYFFRLDLTADEQEATSNRPDFEEEIAKQQRIIDQQEGAIEGFEQQAEDERERAELLY 341

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      +++    
Sbjct: 342 ANYDLVDDVLSTVRGAREEGVPWDDIAARLEEGAEQGIPEAEAVTNVDGANGTVTI---- 397

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAE 532
              E+DD   TL V       ++    NA R Y   K+ E K+E  + A   ++   AA 
Sbjct: 398 ---ELDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAV 447

Query: 533 KKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSENYLVISG 574
           KK R +   +                    + ++      HWFE+F WF +S  YLV+ G
Sbjct: 448 KKRRDEWEADDDEDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYLVVGG 507

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTV 629
           R+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V
Sbjct: 508 RNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSDETLREAAQFAV 567

Query: 630 CHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 568 SYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYF 614


>gi|409095360|ref|ZP_11215384.1| Fibronectin-binding protein A (FbpA) [Thermococcus zilligii AN1]
          Length = 650

 Score =  184 bits (468), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 191/361 (52%), Gaps = 33/361 (9%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ +I  ++A  +   K +A   +L    M QE  
Sbjct: 242 VPIELKVYGGLEKKYFSTFSEALDEYFGRITVEKARIEQTQKLEAKKKQLLTTLMMQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ +  + ++ +LI  N   V+  +   + A   ++ WE+  + ++E +KAGN VA
Sbjct: 302 LRGFEKAMKENQELGDLIYANYPVVERLLEEFKRA-TEKLGWEEFKKRIEEGKKAGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE------KVEVDLALSAHANARRWYEL 509
            ++                   E+D +EK + VE      K+ VD +L    NA  +YE 
Sbjct: 361 LMVK------------------EIDPKEKAVTVELEGKEVKLHVDRSLGE--NAELYYEN 400

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--RKVHWFEKFNWFISSE 567
            KK   K E  + A+    +  E+  +L   + K   N+  +  RK  WFEKF WF+SSE
Sbjct: 401 AKKFRHKYEGALKAYEDTRRKIEEIEKLIEEEMKKELNVRRIEGRKKRWFEKFRWFVSSE 460

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LV++G+DA  NE +VK++M K D+Y HAD++GA   VIK+    Q     T+ +A  F
Sbjct: 461 GFLVLAGKDANTNETLVKKHMDKNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQF 517

Query: 628 TVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V  S+AW   + ++ A+W YP QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G+
Sbjct: 518 AVSMSRAWSQGLYSADAYWAYPEQVTKQAPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 577

Query: 687 L 687
           +
Sbjct: 578 V 578



 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYIVRELQWLVGSRVDKVYHEGDEIRI-KLHTKEGRAD--------LVLQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PSGFT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKEPSGFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+  GNI+L DSE  +++ LR     D+ +   + + +P
Sbjct: 108 GELFRSGNIVLVDSENRIISALRYEEYRDRAIKPNAEYIFP 148


>gi|282165250|ref|YP_003357635.1| hypothetical protein MCP_2580 [Methanocella paludicola SANAE]
 gi|282157564|dbj|BAI62652.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 666

 Score =  184 bits (468), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 115/379 (30%), Positives = 197/379 (51%), Gaps = 23/379 (6%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           +  P+ L+++   + V FE+F+ ALDE+YSK     A+ +   K+      L +    QE
Sbjct: 249 DVLPIELSRYAGYQKVYFESFNKALDEYYSKHIVAEAKAEVVEKKAEKLGVLERRLKQQE 308

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
           + +   ++E    V+  ELI      VD  I  ++ A +  +SW+D+ +++K+ +KAGNP
Sbjct: 309 DAIAKFEKEEKEYVRKGELIYAEYGAVDDIIKVIKGARSRGISWDDIRKILKDAKKAGNP 368

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
            A +I  +    N +++                P   + +++ L+   N++ +Y+  KK 
Sbjct: 369 AASMIQSVDPAANTVAV--------------KFPEATININVDLTVPQNSQTYYDKAKKV 414

Query: 514 ESKQEKTITAHSKAFKA-AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           +SK++  + A     +A A++  R +  + K  A     RK  W+EK+ WF +S+ +LVI
Sbjct: 415 QSKKDGALKAIEDTKRAMAKEMPREKPAEPKKPAVKMKPRKPKWYEKYRWFFTSDGFLVI 474

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +GRDA QNE IVK+Y+ K D++ HA   GA  TV+K    E  + P  + +   F V +S
Sbjct: 475 AGRDADQNEEIVKKYLDKKDIFFHAQAFGAPITVVKTEGRE--ITPEAIAEVAQFAVAYS 532

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
             W S   +   +WV P QVSKT  +GEY+  G+F+IRG +N++         G+  R D
Sbjct: 533 SVWKSGQSSGDCFWVRPEQVSKTPESGEYVAKGAFIIRGDRNYVKNVEARAAVGI--RFD 590

Query: 692 ESS---LGSHLNERRVRGE 707
           E+    +G  +   + RG+
Sbjct: 591 ETGCYVVGGPVAAVKARGK 609



 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 80/161 (49%), Gaps = 7/161 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ DV A V  L+ LI  +    Y  +      KL       +  ++ K  L++E
Sbjct: 1   MKEEMSSVDVYAVVMELQFLIDSKLEKAYQHTADEIRLKL-------QEFKTGKYDLILE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R+    P  F + LRK++   R+  + Q  +DRI+          + ++
Sbjct: 54  AGKRLHLTEHPRESPKLPPSFPMMLRKYMMGGRITRIAQHNFDRIVEIDVVRAGVMNTLV 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL++QGN++L D +  ++  LRS +  D+ V    ++ +P
Sbjct: 114 AELFSQGNVILLDQDRRIMMPLRSLKMKDRDVLRGEQYEFP 154


>gi|358339725|dbj|GAA47729.1| nuclear export mediator factor NEMF [Clonorchis sinensis]
          Length = 449

 Score =  183 bits (464), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 152/486 (31%), Positives = 232/486 (47%), Gaps = 76/486 (15%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF++KLRKHI+ ++L +V+QLG DRI+ FQFG   +  ++I+ELY +GN+ LTD  +T
Sbjct: 2   PSGFSMKLRKHIKNKKLSNVKQLGMDRIVDFQFGFDEHLFHLIIELYDRGNMCLTDHSYT 61

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           +L LLR   D ++ V   +  +YP ++     RT    L              PD +N D
Sbjct: 62  ILHLLRPRTDANQDVRYAAHEKYPLDLV----RTVPECLQGL-----------PDDINID 106

Query: 199 GNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
           G       K  LG     K     + SN+       A +P  K +L     YG    EH 
Sbjct: 107 G-----VCKRVLGLLDEAKGPWCPRGSNE-------ALKPVQK-LLSSEFSYGQPCVEHC 153

Query: 259 I----------LDTGLVPNMKLSEVNKL----EDNAIQVLV----LAVAKFEDWLQDVIS 300
                      L T    N+ + E ++L    ED A   ++    L +A +     +V  
Sbjct: 154 CRLANMAVQSTLKTSATENVPVDEEDRLRQIKEDYAKHFVMALRNLLLAAYLVGTDNVEM 213

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G  +  GYI       GK   P +   S Q  ++F P L +QFR+R  V F TF  A+D 
Sbjct: 214 G--MSRGYI------FGKKLQPEDEELSRQ--EDFQPFLFDQFRNRPHVAFPTFSKAVDT 263

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++SKIE  +  +     E+ A  K   I  D E R+  LK + ++ V  A+L+E N + V
Sbjct: 264 YFSKIERDKTTELLVQNENKANKKFENIKKDHELRLAALKADQEQDVHKAQLLEKNRQLV 323

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL---------L 471
           D  IL +  AL+N++ W  L  M++E R  G+ +A  I +L L++N +++         L
Sbjct: 324 DNIILMINHALSNQLDWGTLDTMIQEARARGDLLASHIVQLNLQQNQITVSLKYGFSLYL 383

Query: 472 LSNNLDEMD----------DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTI 521
           L    D  +          D+  + P E V + L L+A  NAR++Y+ K+    K+EKT+
Sbjct: 384 LIMPRDPFESESEGENCERDQTISAPTEVV-ISLDLNALNNARKYYDRKRAALKKEEKTL 442

Query: 522 TAHSKA 527
            A  K 
Sbjct: 443 IASRKV 448


>gi|212223298|ref|YP_002306534.1| fibronectin-binding protein [Thermococcus onnurineus NA1]
 gi|212008255|gb|ACJ15637.1| predicted fibronectin-binding protein [Thermococcus onnurineus NA1]
          Length = 649

 Score =  182 bits (463), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 189/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  + + E   F TF  ALDE++ K+  ++A+ +   K +A   +L      QE  
Sbjct: 241 VPVELKVYENFEKRYFSTFSEALDEYFGKVTLEKAKIEQTKKLEAKKRQLLMTLKKQEEL 300

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   +++   + ++ +LI  N   V+  +   R A   R+ WE+  + + E +KAGN  A
Sbjct: 301 LKGFEEQAKANQEIGDLIYANFTMVERLLDEFRKA-TERLGWEEFKKRIDEGKKAGNKAA 359

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    KV + L  S   NA  +YE  K
Sbjct: 360 LMVKSI------------------DPKEKAVTIELEGKKVRLYLNKSIGENAELYYEKAK 401

Query: 512 KQESKQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K + K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WF+SSE 
Sbjct: 402 KAKHKLEGALKAYEDTKRKLDEIEKLIEEEMKKELAVKRIER-RKKKWFEKFRWFVSSEG 460

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE ++K++M + D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 461 FLVLAGKDASTNENLIKKHMDENDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFA 517

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 518 VSMSKAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 577



 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 82/161 (50%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   +Y    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQSLVGSRVDKIYHDGDEIRI-KLRTKEGRQD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHVTTYVKEAPKMPSSFTMLLRKHLSGGFIDAIEQHDFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L D E  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIILVDGENRIVAALRYEEFKDRAIKPKAEYKFP 148


>gi|240103770|ref|YP_002960079.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
 gi|239911324|gb|ACS34215.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
          Length = 650

 Score =  182 bits (463), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 190/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F+TF  ALDE++ K+  ++A+ +   K ++   +L      QE  
Sbjct: 242 VPIELKIYEGLEKRYFKTFSEALDEYFGKLTIEKAKIEKTRKLESKKKQLLATLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ ++ + ++ +LI  N   V+  +   R A   ++ WE+  R ++  +K GN VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYAMVERLLDEFRKA-TEKLGWEEFKRRIEAGKKEGNKVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EKT+ +E    KV++ L  S   NA  +YE  K
Sbjct: 361 LMVKAI------------------DPKEKTVTIELEGRKVKLYLNKSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAHS---KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WFISSE 
Sbjct: 403 KFRHKYEGALKAYEDTRRKLDEVEKLIEEEMKKELNVKRIER-RKKKWFEKFRWFISSEG 461

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE ++K++MS  D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 462 FLVLAGKDASTNETLIKKHMSDNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFA 518

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   +  + A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 519 VSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 578



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIVLVDSENRIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|349602918|gb|AEP98908.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 517

 Score =  182 bits (463), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 80/122 (65%), Positives = 98/122 (80%), Gaps = 1/122 (0%)

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQ
Sbjct: 1   GDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQ 59

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E 
Sbjct: 60  VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDED 119

Query: 711 MD 712
           M+
Sbjct: 120 ME 121


>gi|315231919|ref|YP_004072355.1| RNA-binding protein [Thermococcus barophilus MP]
 gi|315184947|gb|ADT85132.1| RNA-binding protein [Thermococcus barophilus MP]
          Length = 650

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 192/359 (53%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  + + E   FETF  ALDE++ KI  ++A+ +   + +    ++      QE +
Sbjct: 242 VPIELKWYENYEKKYFETFSEALDEYFGKITVEKAKIERTKRLEEKKRQILATLRRQEEQ 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   + E+ ++ ++ +LI  N   +D  +     A+  ++ WE+  + ++E +KAGN +A
Sbjct: 302 MKGFEAEMKKNQELGDLIYANFTFIDNLLREFSKAV-EKLGWEEFKKRIEEGKKAGNKIA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    K+++ L  S   NA  +YE  K
Sbjct: 361 LMVKSI------------------DPKEKAVTIEIEGRKIKLYLNKSIGENAEIYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K + K E    A+    K  ++  +L  + ++++        RK  WFEKF WFISSE +
Sbjct: 403 KAKHKLEGAKRAYEDTKKKLQEIEKLIEEEMKKELKVKKLEKRKKKWFEKFRWFISSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI G+DA  NEM+VKR+M   D+Y HAD+HGA   VIK+    Q     T+ +A  F V
Sbjct: 463 LVIGGKDATTNEMVVKRHMGDNDLYCHADVHGAPHVVIKDG---QKAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK+N+    PL +  G++
Sbjct: 520 SMSKAWSEGVYSADAYWAYPNQVTKKAPSGEYLGKGAFMVYGKRNWYHGIPLKLAVGII 578



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 83/160 (51%), Gaps = 14/160 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I        + ++GE  K L++ E
Sbjct: 1   MKEEMSSVDIKYIVEELKSLKGARIDKIYHDGSEIRI-------KLHKAGEGRKDLII-E 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T+Y R+    PS FT+ LRKH+     +++ Q  +DRI+  + G     + +I
Sbjct: 53  AGKRIHLTSYIREAPKMPSSFTMLLRKHLSGGFFDNIEQHDFDRIVKIRIG----NYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
            EL+ +GNI+L D    ++  LR     D+  AI  +H Y
Sbjct: 109 AELFRKGNIILVDENNIIIGALRYEEFKDR--AIKPKHEY 146


>gi|253745574|gb|EET01418.1| Serologically defined colon cancer antigen 1 [Giardia intestinalis
           ATCC 50581]
          Length = 1065

 Score =  181 bits (459), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 131/216 (60%), Gaps = 19/216 (8%)

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI------LQEKTVANISHM 551
           +AH  A   +E  K+ E K E+T+   S  F   EKK   +I         K +A + H 
Sbjct: 539 NAHTIANTLFEAAKEAEQKCERTLGHSSAYFNKVEKKATAEIDSAIKETDAKLIA-LQHQ 597

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R   WFEKF+WF S++ YLV+SGRDAQ NE++VK++MS  D++VH++ HGA+ T++K  R
Sbjct: 598 RPPLWFEKFHWFFSTDGYLVLSGRDAQSNELLVKKFMSPHDIFVHSEAHGAACTIVKAPR 657

Query: 612 PEQP-----------VPP-LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
                          +PP  T+ +AG FTV HS+ W  K+   ++WVY  QVSKTAP G 
Sbjct: 658 LTTADTIQQNKILRWIPPEQTMLEAGAFTVIHSKMWAQKIGAQSYWVYADQVSKTAPPGM 717

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           Y+  GSF+IRGK+NF+P  PL +G  LL+R D +++
Sbjct: 718 YIGTGSFVIRGKRNFIPQQPLELGVALLWRYDAANV 753



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 77/150 (51%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL-- 59
           K+  ++ DVA   K L   L+  R +++ +LS  TY+ +   S+   +  +++  +L+  
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSITNLSKTTYLLRFHASTTAIDQCQTKDQMLIDT 65

Query: 60  -------MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
                  +E G  +HTT +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSVIIEPGFYMHTTRFDWSKAIPPTAFSNRLRTEICNLICTGVSQFYFDRVIIMEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN+LLTD  + VL
Sbjct: 126 RYNSEFKRYLIVELYGRGNLLLTDENYKVL 155


>gi|313126151|ref|YP_004036421.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|448285991|ref|ZP_21477228.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|312292516|gb|ADQ66976.1| predicted RNA-binding protein, snRNP like protein [Halogeometricum
           borinquense DSM 11551]
 gi|445575584|gb|ELY30057.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
          Length = 702

 Score =  180 bits (456), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 173/712 (24%), Positives = 292/712 (41%), Gaps = 140/712 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D++A V  L R  G +    Y         ++ +        +  +V L++E 
Sbjct: 4   KRELTSVDLSALVTELNRYEGAKVDKAYLYGDNLLRLRMRDF-------DRGRVELILEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT    +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDVKRAHTAKPEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFDFERGDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGN+ + D    V+  L + R   + VA  +++ +P+           S+LH
Sbjct: 117 EIVVELFGQGNVAVLDETGEVVRSLETVRLKSRTVAPGAQYEFPS-----------SRLH 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+ +G                       +    +  D  R    
Sbjct: 166 -------------PFTVSYEG---------------------FKRRMEDSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   +  G   +E      G+   M++S+     D   + +  A+  F D L+  
Sbjct: 188 ----TLATQVNLGGLYAEEFCTRAGVEKTMEISDAG---DEEYRAIYDAIQTFHDRLK-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y              TE G+         PL  ++        ++TF+ AL
Sbjct: 239 -SGDFDPRVY--------------TEDGNVVDATP--FPLKEHEAEGLNSESYDTFNEAL 281

Query: 359 DEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           DE++      ++ E +     ++   +A   K  +I   QE  +   +Q+ +R  + AEL
Sbjct: 282 DEYFFAFDRSAEDEPEEEPGSNRPDFEAEIEKKKRIIEQQEGAIEGFEQQAERERERAEL 341

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLL 471
           +  N E VD  +  VR A    + W+++ + +++  + G P A  ++D            
Sbjct: 342 LYANYELVDEVLSTVRSARDESVPWDEIRQTLEDGAERGIPAAEAVVD------------ 389

Query: 472 LSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITA-HSK 526
                  +D  E T+ +E    ++EV++ +    NA R Y+  K+ E K+E  + A    
Sbjct: 390 -------VDGAEGTVTIEIDGTRIEVEVDMGVEKNADRLYKEAKRVEGKKEGAMAAIEDT 442

Query: 527 AFKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENY 569
             + AE K R    +E                  + ++I    +  W+E+F WF +S+ Y
Sbjct: 443 REELAEVKARRDAWEEDDEDDDEEPEEPEDIDWLSRSSIPLKTEEQWYEQFRWFHTSDGY 502

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-----PPLTLNQA 624
           LVI GR+A QNE IVK+Y++K D++ H   HG   TV+K   P +P      P  T  +A
Sbjct: 503 LVIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTVVKATGPSEPAQEVEFPDSTKREA 562

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             F V +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 563 AQFAVSYSSIWKEGRYADDAYMVTPDQVSKTPESGEYIEKGSFVIRGDRTYF 614


>gi|390960715|ref|YP_006424549.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
 gi|390519023|gb|AFL94755.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
          Length = 649

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 188/356 (52%), Gaps = 23/356 (6%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ +I  ++A+ +   K +    +L      QE  
Sbjct: 241 VPIELKIYEGLEKKYFNTFSEALDEYFGRITIEKAKIERTRKLENKKRQLLMTLRKQEEM 300

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   +  +  + ++ +LI  N   ++  +   R A   ++ WE+  + ++E +KAGN VA
Sbjct: 301 LKGFEGAMRENQEIGDLIYANYALIERLLDEFRKA-TEKLGWEEFRKRIEEGKKAGNRVA 359

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            ++  +  +   +++       E+D +       KV++ L  S   NA  +YE  KK   
Sbjct: 360 MMVKGINPKEKAVTI-------ELDGK-------KVKLYLNRSIGENAELYYEKAKKFRH 405

Query: 516 KQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WFISSE +LV+
Sbjct: 406 KHEGALKAYEDTKRKLNEVEKLIEEEMKKELNVKRIER-RKKKWFEKFRWFISSEGFLVL 464

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NE+++KR+M + D+Y HAD++GA   VIK+    Q     T+ +A  F V  S
Sbjct: 465 AGKDASTNEILIKRHMGENDLYCHADVYGAPHVVIKDG---QKAGERTIFEACQFAVSMS 521

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + +  A+W YP+QV+K  P+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 522 KAWSRGVYSEDAYWAYPNQVTKQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGVI 577



 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 85/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q G+DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYIKEAPKQPSSFTMLLRKHLSGGFIDAIEQHGFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GN++L DSE  ++  LR     D+ +   + +RYP
Sbjct: 108 GELFRRGNVILVDSENRIVAALRYEEYKDRAIKPKAEYRYP 148


>gi|223478404|ref|YP_002582764.1| fibronectin-binding protein A domain-containing protein
           [Thermococcus sp. AM4]
 gi|214033630|gb|EEB74457.1| Fibronectin-binding protein A domain protein [Thermococcus sp. AM4]
          Length = 650

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 190/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F+TF  ALDE++ K+  ++A+ +   K +    +L      QE  
Sbjct: 242 VPIELKIYEGLEKHYFKTFSEALDEYFGKLTIEKAKIERTRKLENKKRQLLATLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ ++ + ++ +LI  N   ++  +   R A   ++ WE+  + ++  +K GN VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYALIERLLEEFRKA-TEKLGWEEFKKRIEAGKKEGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    KV++ L  S   NA  +YE  K
Sbjct: 361 LMVKSI------------------DPKEKAVTIELEGKKVKLYLNKSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WF+SSE 
Sbjct: 403 KFRHKYEGALKAYEDTKRKLDEVEKLIEEEMRKELNVKRIER-RKKKWFEKFRWFVSSEG 461

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE+++K++M++ D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 462 FLVLAGKDASTNEVLIKKHMTENDLYCHADVYGAPHVVIKDG---QKAGERTIFEACQFA 518

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   +  + A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 519 VSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 578



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIVLVDSENRIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|333987711|ref|YP_004520318.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. SWAN-1]
 gi|333825855|gb|AEG18517.1| Fibronectin-binding A domain protein [Methanobacterium sp. SWAN-1]
          Length = 663

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 132/402 (32%), Positives = 193/402 (48%), Gaps = 31/402 (7%)

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY----DEFCPLLLNQFRSREFVKF 351
           +D  S DI PE    + N       P   +    QI     D+  PL L ++   E   F
Sbjct: 204 KDKPSSDITPEELDFIHNAMSDVFSPLKTAQFHPQIISSEKDDVLPLNLTKYEKYEKKTF 263

Query: 352 ETFDAALDEFYSKIESQRAEQQHK---AKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ETF+ A DEFYS I     +Q H+   A E   F K  KI M+    +   K  + ++  
Sbjct: 264 ETFNQAADEFYSSIVGDDIKQVHEDVWAAEVGKFEKRLKIQMET---LEKFKDTIVKTKI 320

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
             E I  N +++   +  +  A   R ++  L  +   ++     V+GL           
Sbjct: 321 KGEAIYSNYQNIQNILDIIHNA---RETYSWLDIIDIIKKGKKEKVSGLD---------- 367

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
              +  +LD+M      L    V VD  +S   NA  +Y   KK + K      A  K  
Sbjct: 368 ---IIESLDKMGVLTLNLDGTIVNVDSNMSIPENAEIYYNKGKKAKRKISGVNIAIEKTM 424

Query: 529 KAAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           K  E+ K + +I  EK +     +RK + WFEK  WF+SS+  LVI GRDA  NEMIVK+
Sbjct: 425 KEVERAKNKREIAMEKVLVPQKRVRKELKWFEKLRWFLSSDGLLVIGGRDATTNEMIVKK 484

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWW 645
           +M   D+Y H+D+HGA+S V+K    E  VP  TLN+   F    S AW +    T  +W
Sbjct: 485 HMENRDIYFHSDIHGAASVVVKAGEGE--VPESTLNETASFAGSFSSAWSAGFGSTDVYW 542

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V+P QVSKT  +GE++  G+F+IRG +NF+   PL++  G++
Sbjct: 543 VHPDQVSKTPQSGEFVGKGAFIIRGSRNFIRNAPLLVAVGIV 584



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 65/110 (59%), Gaps = 1/110 (0%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V ++ ++G+R+HTT Y  +    P  F + LRKH++   +  V+Q  +DRI+       
Sbjct: 47  RVDVVFQAGLRVHTTQYPPENPQIPPSFPMILRKHLKGGNVTCVKQHNFDRILKINIQ-K 105

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + + +++EL+A+GNI+L D E T++  L+    +D+ ++    ++YP E
Sbjct: 106 EHKYSLVIELFAKGNIILLDEEGTIIMPLKRKLWEDRNISSKEEYKYPPE 155


>gi|14520906|ref|NP_126381.1| hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|5458123|emb|CAB49612.1| Hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|380741455|tpe|CCE70089.1| TPA: hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
          Length = 649

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 110/354 (31%), Positives = 187/354 (52%), Gaps = 20/354 (5%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V FETF  ALDE++ K+  ++A+++   K +    +L      QE  
Sbjct: 242 VPIELKWYEGYERVYFETFSQALDEYFGKLTIEKAKEERTRKLEEKKKQLMATLERQERM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E  ++ ++ +LI  N   +D  +     A+  +  W +  + ++E +K GN +A
Sbjct: 302 IKGFEEEARKNQEIGDLIYANYTIIDGILREFSKAV-EKFGWNEFKKRLEEGKKQGNKIA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            L+  +  E + +++ L                 K+++ L  S + NA  +YE  KK + 
Sbjct: 361 LLVKNVNPEEDSITIELEGR--------------KIKLYLNRSINDNAELYYEKAKKAKH 406

Query: 516 KQEKTITAHSK-AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           K E    A+ +   K  + +  ++  ++K        RK  WFEKF WFISSE +LVI G
Sbjct: 407 KLEGAKKAYEELKRKLEQIEKEIEEEEKKIQVKKIEKRKKKWFEKFRWFISSEGFLVIGG 466

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           +DA  NE++V++YM + D+Y HAD+ GA   +IK+    Q     T+ +A  F V  S+A
Sbjct: 467 KDATTNEIVVRKYMQENDIYCHADIWGAPHVIIKDG---QKASERTIFEACQFAVSMSRA 523

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           W   + +  A+WVYP QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 524 WSEGLYSGDAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMHGIPLKLAVGVV 577



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/162 (27%), Positives = 83/162 (51%), Gaps = 14/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  ++G R   VY    +  I        + ++GE  K L++ 
Sbjct: 1   MKEEMSSVDIRYIVQELKEEIVGARVDKVYHEGNEVRI-------KLHKAGEGRKDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T+Y ++    PS F + LRKH+    ++ + Q  +DRI+  + G       +
Sbjct: 53  EAGKRIHLTSYIKESPQ-PSSFAMLLRKHLSGSFVDGIEQHDFDRIVKIRIG----KFTI 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++L D   T++  +R     D+ +     ++YP
Sbjct: 108 IAELFRRGNVILVDENNTIIGAIRYEEFKDRAIKPKLEYKYP 149


>gi|448565126|ref|ZP_21636097.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
 gi|445715785|gb|ELZ67538.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
          Length = 702

 Score =  177 bits (448), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 175/706 (24%), Positives = 287/706 (40%), Gaps = 128/706 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N  ++  D  R    
Sbjct: 165 ------------DPLTVSRDA---------------------LGRNMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +++    + +A+   ++      D  Q V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVEKTLDIADATADDYDAVYDAIV------DLRQQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y+              E G    +     PL  +Q    +   ++TF+ AL
Sbjct: 238 RSGEFDPRLYL-------------DEDGEVVDVTP--FPLREHQNAGLDEEAYDTFNDAL 282

Query: 359 DEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N + VD  +  VR A    + W+D+   + E  + G P A  +  +      +++     
Sbjct: 343 NYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGTVTV----- 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
             ++DD   TL V       ++    NA R Y   K+ E K+E  + A   ++   AA K
Sbjct: 398 --DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVK 448

Query: 534 KTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           K R +   +                    + ++      HWFE+F WF +S  YLV+ GR
Sbjct: 449 KRRDEWEADDDEDDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYLVVGGR 508

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVC 630
           +A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V 
Sbjct: 509 NADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLREAAQFAVS 568

Query: 631 HSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 569 YSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYF 614


>gi|300176454|emb|CBK23765.2| unnamed protein product [Blastocystis hominis]
          Length = 767

 Score =  176 bits (447), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 92/233 (39%), Positives = 150/233 (64%), Gaps = 6/233 (2%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK--AAEKKTRLQILQEKTVANI 548
           V+V+L+L+ + N    +  KK  + K +KT+ A   A    + +++T L++ +    A I
Sbjct: 151 VDVELSLNCNQNISLLFSQKKDLQDKLDKTVQAAQAAVAEASKQRQTELRVAEAAHPAEI 210

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           +  R+  WFEKF+W ++++ ++V++G+  +QNE++V+RY+  GD+++HAD+HGA++ V++
Sbjct: 211 ARQREKRWFEKFDWCVTTDGFIVLAGKSGEQNEILVRRYLRPGDLFLHADVHGAATVVLR 270

Query: 609 NHR-PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           N+R PE P     L QA  F +CHS AWD++++   +WV   QVSKTAP+GEYL  GSFM
Sbjct: 271 NYRAPELP-GEAALLQAAAFALCHSSAWDAQLLCKVYWVPARQVSKTAPSGEYLPTGSFM 329

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           IRGKKNFL P+ + MG  +LF +    +  H  +R+ R  E+   D+E    H
Sbjct: 330 IRGKKNFLAPYRMEMGLTVLFEVRPEDVQRHFYDRKPREMEDA--DWETLVKH 380


>gi|57641373|ref|YP_183851.1| fibronectin-binding protein [Thermococcus kodakarensis KOD1]
 gi|57159697|dbj|BAD85627.1| predicted fibronectin-binding protein [Thermococcus kodakarensis
           KOD1]
          Length = 650

 Score =  176 bits (447), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 191/376 (50%), Gaps = 29/376 (7%)

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           D P         +  +  P+ L  +   E   F TF  ALDE++ KI  ++A+ +   K 
Sbjct: 225 DEPKPNIVFKDGVMHDVVPIELKIYEGFEKRYFPTFSEALDEYFGKITLEKAKIEQTKKL 284

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +     L      QE  +   ++ +  + ++ +LI  N   ++  +   R A    + W+
Sbjct: 285 EEKKRGLMATLRKQEEMLKGFEKAMRENQEIGDLIYANYTLIERLLEEFRKA-TETLGWD 343

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVD 494
           +  R + E +K GN VA ++  +                  D +EK + +E    KV++ 
Sbjct: 344 EFRRRIDEGKKTGNKVALMVKGI------------------DPKEKAVTIELDGKKVKLY 385

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--R 552
           L  S   NA  +YE  KK   K E  + A+    +  E+  +L   ++K   N+  +  R
Sbjct: 386 LEKSLGENAEIYYEKAKKFRHKYEGALKAYEDTKRKLEEIEKLIEEEQKKELNVKKLERR 445

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
           K  WFEKF WF+SSE +LV++G+DA  NE++VK++M   D+Y HAD++GA   VIK+   
Sbjct: 446 KRKWFEKFRWFVSSEGFLVLAGKDASTNEVLVKKHMEDNDLYCHADVYGAPHVVIKDG-- 503

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
            Q     T+ +A  F V  S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK
Sbjct: 504 -QKAGEKTIFEACQFAVSMSRAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGK 562

Query: 672 KNFLPPHPLIMGFGLL 687
           +N++   PL +  G++
Sbjct: 563 RNWMHGLPLKLAVGVI 578



 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +   FKL    G  +        L++E
Sbjct: 1   MKEEMSSVDIRYIVRELQWLVGSRVDKVYHDGDEVR-FKLRTKEGRAD--------LILE 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYIKEAPKQPSSFTMLLRKHLGGGFIDAIEQHQFDRIVKIRIG----NYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIILVDSENKIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|448491980|ref|ZP_21608648.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
 gi|445692198|gb|ELZ44379.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
          Length = 729

 Score =  176 bits (446), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 183/732 (25%), Positives = 303/732 (41%), Gaps = 130/732 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDNVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + + +  D+ +  L  A+++ ++ L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIEEATDDQLGALHDALSRLDERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD----EFCPLLLNQFRSREFVKFETF 354
            SGD+ P  Y         ++    + G  T   D    +  P  L +      V F+TF
Sbjct: 239 -SGDVDPRVY---------EESVEGDGGDETDERDPRVVDVTPFPLAEHEGLPSVGFDTF 288

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRS 406
           +AA+DE++ ++ ++  ++     +  A          K  +I   Q   +   +++    
Sbjct: 289 NAAVDEYFYRLGNEETDEGEAPADAGASRPDFEEEIAKQERIIEQQLGAIEGFEEQAQAE 348

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AEL+  + + VD  +  VR A  N + W+++A  +    + G P A  +    ++ +
Sbjct: 349 RERAELLYAHYDLVDEVLSTVREARENEVPWDEIAATLDAGAERGIPAAAAV----VDVD 404

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTITA 523
                ++  LDE  D E T+    VE+D +     NA R Y   K+ E K+   ++ I +
Sbjct: 405 GGEGTVTVELDEEGDGEGTV---TVELDASEGVEVNADRLYREAKRVEEKKAGAKEAIES 461

Query: 524 HSKAFKAA-EKKTRLQILQEK----------------------TVANISHMRKVHWFEKF 560
             +  +A  E+K   +  Q                        + ++I       WFE+F
Sbjct: 462 TREELEAVKERKAEWEEQQAADDGSGGDDGGEDDEEEYETDWLSRSSIPIRSPDDWFERF 521

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL- 619
            WF +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+ 
Sbjct: 522 RWFRTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTILKASGPSESADPVD 581

Query: 620 ----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
               TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + +
Sbjct: 582 FSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTY 641

Query: 675 LPPHPLIMGFGL 686
               P  +  G+
Sbjct: 642 FEDVPCRIAVGV 653


>gi|76156824|gb|AAX27946.2| SJCHGC07203 protein [Schistosoma japonicum]
          Length = 184

 Score =  176 bits (445), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 90/180 (50%), Positives = 116/180 (64%), Gaps = 19/180 (10%)

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           ++  K+A  K    +   KT+A I+ +RK  WFEKF WFISSENYLV++G D+QQNE++V
Sbjct: 5   AQILKSAIHKAEATMKTAKTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQNEVLV 64

Query: 585 KRYMSKGDVYVHADLHGASSTVIKN-------------------HRPEQPVPPLTLNQAG 625
           KRY+  GD++VHAD+HGAS+ +IK                    HR     PP TL +A 
Sbjct: 65  KRYLKSGDIFVHADIHGASTVIIKARHLTSEESDFSKHESLLHLHRSLPLPPPKTLLEAA 124

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              V  S AW + ++T AWWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG
Sbjct: 125 NMAVVLSSAWQNHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFG 184


>gi|448528898|ref|ZP_21620278.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
 gi|445710346|gb|ELZ62165.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
          Length = 740

 Score =  176 bits (445), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 184/736 (25%), Positives = 297/736 (40%), Gaps = 127/736 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDHVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP            S+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGSQYEYP-----------GSRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                              D  +VS          +GG      ++  ++ +D  R    
Sbjct: 165 -------------------DPLDVS----------RGG----FERHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+     + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKETPIEEAT---DDQLRALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGY---ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            SGDI P  Y   I           P        ++ D   P  L +      V F++F+
Sbjct: 239 -SGDIDPRVYEESIDGDGNADDDADP--------RVVD-VTPFPLAEHEDLPSVGFDSFN 288

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSV 407
           AA+DE++ ++ S+ AE      + +A          K  +I   QE  +   +++     
Sbjct: 289 AAVDEYFYRLGSEDAEAGDAPADASASRPDFEGEIAKQQRIIEQQEGAIEGFEEQAQAER 348

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERN 466
           + AEL+  N + VD  +  VR A  + + W+++   +    + G P A  ++D    E  
Sbjct: 349 ERAELLYANYDLVDEVLSTVREARESEVPWDEIEETLDAGAERGIPAAEAVVDVDGGEGT 408

Query: 467 CMSLLLSNNLDEMDDEE-KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
               L   + D+ DDE        ++E+D +     NA R Y+  K+ E K+E  + A  
Sbjct: 409 VTVELADESGDDADDEGGANGGTTRIELDASEGVEVNADRLYQEAKRVEEKKEGAMAAIE 468

Query: 526 KAFKAAEK-KTRLQILQEKTVAN----------------------------ISHMRKVHW 556
              +  E  K R    +E+  AN                            I      +W
Sbjct: 469 STREELEAVKERKAEWEEQQAANDGSGQGDDGDDGADDEEEYETDWLSRASIPIRSPDNW 528

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV 616
           +++F WF +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +  
Sbjct: 529 YDRFRWFHTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTILKASGPSESA 588

Query: 617 PPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG
Sbjct: 589 DPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRG 648

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P  +  G+
Sbjct: 649 DRTYFEDVPCRIAVGV 664


>gi|13542268|ref|NP_111956.1| RNA-binding protein snRNP [Thermoplasma volcanium GSS1]
 gi|14325702|dbj|BAB60605.1| hypothetical protein [Thermoplasma volcanium GSS1]
          Length = 604

 Score =  174 bits (442), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 138/200 (69%), Gaps = 12/200 (6%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           E +++D   SA  NA R+++L K    K    I    KA + AE++ R++ LQEK V ++
Sbjct: 343 EDIDIDYTKSAGENANRYFDLSKDYRKK----IEGAKKAIEEAEQE-RIK-LQEKKVKSV 396

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           +  R++ WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+IK
Sbjct: 397 N--RRIFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLKEGDLYVHADMYGAPSTIIK 454

Query: 609 NHRPEQPVPPL-TLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           +    +P+P   T+ QA  F +C S+AW + + + +A+WVYP QVSKT  +GEY++ GS+
Sbjct: 455 SE--GKPMPGEDTIRQAAAFAICFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVSTGSW 512

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           +IRGK+N++    L +  GL
Sbjct: 513 IIRGKRNYVTNLKLELCIGL 532



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 12/123 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           RL+G     VY   P  ++ +L  S       +   +L+ ++ G+   +     +  +T 
Sbjct: 20  RLVGSFVKKVYQTGPDDFLIQLYRSDL-----KRFDMLVSLKKGIFFKS----EETPDTA 70

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S   + LRK I  RR+  V Q+ +DR++ F F  G     +ILEL+  GN++ TD +   
Sbjct: 71  SQTAMVLRKTISDRRIVSVEQVNFDRVVKFVFHTG---QALILELFRDGNLIATDGDKIT 127

Query: 140 LTL 142
             L
Sbjct: 128 FVL 130


>gi|410722235|ref|ZP_11361543.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
 gi|410597380|gb|EKQ52002.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
          Length = 742

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 186/364 (51%), Gaps = 25/364 (6%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK---IESQRAEQQHKAKEDAAFHKLN 386
           ++ ++  PL +  +++    +F+TF+ A DEFYS     + ++ ++   AKE   + K  
Sbjct: 328 KVKEDVLPLDILTYQNFHKERFDTFNQAADEFYSGKVGADIKKVQEDIWAKEVGKYEKRL 387

Query: 387 KIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
           +I   QE  +   ++ +  + K   L+  +  ++   +  +  A   + SW ++A   K+
Sbjct: 388 RI---QEETLEKFQKTIVETKKKGNLLYSHYSEIQDLLDIIHQA-REKFSWMEIASKFKK 443

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
            RK G   A +I+ +    + M +L  N           L  E+V VD  L    NA ++
Sbjct: 444 ARKEGMKEAQIIESM----DKMGVLTLN-----------LEGERVTVDANLEIPENAEKY 488

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKK--TRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           Y   KK + K      A  +  K  E+K   R   L+   V      +++ WFEK  WF+
Sbjct: 489 YNKGKKAKRKIRGVNIAIERTKKDVERKRNKREMALERVRVPQKRVRKELKWFEKLRWFL 548

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SS+ YLVI GRDA  NEM+VKR++   D+Y+H+D+HGA S VIK    E  +P  T+ +A
Sbjct: 549 SSDGYLVIGGRDAGTNEMVVKRHLDNQDIYLHSDIHGAPSVVIKKGEVEGEIPESTVQEA 608

Query: 625 GCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           G      S AW     +   +WV+P QVSKT  +GE++  G+F+IRG +N+L   PL + 
Sbjct: 609 GTLAASFSSAWSKGYGSQDVYWVHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIA 668

Query: 684 FGLL 687
            G++
Sbjct: 669 VGIV 672



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 60/110 (54%), Gaps = 1/110 (0%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V +  ++G+R+HTT Y  +    P  F + LRKH++   ++ VRQ  +DRI+       
Sbjct: 47  RVDVAFQAGLRVHTTQYPPENPKVPPSFPMLLRKHLKNATVKGVRQHNFDRILEIDIQ-K 105

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +   +++EL++QGNI+L D +  ++  L+      + +     ++YP E
Sbjct: 106 EHRFTLVVELFSQGNIILLDEDNQIILPLKHRHAQGRKITSKEEYQYPEE 155


>gi|383318475|ref|YP_005379316.1| RNA-binding protein, eukaryotic snRNP-like protein [Methanocella
           conradii HZ254]
 gi|379319845|gb|AFC98797.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Methanocella conradii HZ254]
          Length = 662

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 185/359 (51%), Gaps = 22/359 (6%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L+++ S + V FE+F+ ALDE++S+  +  A+ +   ++        +    QE  
Sbjct: 251 LPIELSRYSSHQKVYFESFNQALDEYFSRHVAAEAKAEVVERKAEKLGVYERRLRQQEEA 310

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E   +V+  E I      +   I  +R A A   SW+D+ +++++ RKAGN  A
Sbjct: 311 IAKFEREEAENVRKGEAIYAEYNTISEVIGVIRGARAKGYSWDDIRKILRDARKAGNKAA 370

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            LI  +    N +++ LS+                V V++ L+   NA+ +Y+  KK   
Sbjct: 371 SLIQSVDPAANTVNVKLSSV--------------SVNVNIDLTVPQNAQAYYDKAKKARL 416

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           K+E  + A  +  KA  K+T     +    A   H RK  W+EK+ WF +S+ +LVI GR
Sbjct: 417 KKEGALKAIEETKKAMAKETPAPPREPSAKA---HPRKPRWYEKYRWFYTSDGFLVIGGR 473

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
           DA QNE +VK+YM K DV+ HA   GA  T++K     + V P  L +A  F V +S  W
Sbjct: 474 DADQNEELVKKYMEKSDVFFHAQAFGAPITIVKAG--GRDVTPAALAEAAQFAVSYSSVW 531

Query: 636 DSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
            S   +   +WV P QVSKT   GEY+  G+F+IRG +N++    +    G+  R DE+
Sbjct: 532 KSGQYSGDCFWVRPEQVSKTPEHGEYVAKGAFIIRGDRNYVKNVEVRAAVGI--RFDET 588



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 81/161 (50%), Gaps = 7/161 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ DV A V+ L+ L+  +    Y  +      +L       +  ++ K  L++E
Sbjct: 1   MKEEMSSVDVYAAVRELQFLVDAKVEKAYQHTADEIRIRL-------QEFKTGKYDLVIE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R+    P  F + LRKH+   R+  + Q  +DRI+  +         ++
Sbjct: 54  AGKRLHLTRHPRESPKLPPSFPMMLRKHMMGGRITRIAQHNFDRIVEIEVARAGVKSTLV 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+AQGN++L D E  ++  LRS +  D+ V    ++ YP
Sbjct: 114 AELFAQGNVILLDGERRIMMPLRSMKMKDRDVVRGEQYEYP 154


>gi|386003039|ref|YP_005921338.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
 gi|357211095|gb|AET65715.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
          Length = 668

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 180/706 (25%), Positives = 296/706 (41%), Gaps = 132/706 (18%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M+  DVAA V+ L+ +L+G      Y LSP   +          +S  S K+ LL+
Sbjct: 1   MKKAMSNVDVAAVVEELQEKLVGGFVGKSYQLSPDRVVISF-------QSPASGKLDLLL 53

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T   R+    P  F   LR  +   R+  VRQ G+DR+   +   G + + +
Sbjct: 54  EAGRRIHLTEKPREAPKMPPQFPTMLRSRLSGGRVAAVRQHGFDRVAEIEIERGDDRYTL 113

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I E++ +GN+LL DS   ++  LR     D+                        KL A 
Sbjct: 114 IAEIFPKGNVLLLDSGGRIVLPLRPLAFRDR------------------------KLLAG 149

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T     D  +P  V          S+ +L       +F L+ + ++            L
Sbjct: 150 ETYQYREDQVDPRTV----------SRNDL-------AFILASSDSE------------L 180

Query: 241 KTVLGEALGYGPALSEHIILDTGL---VPNMKLS--EVNKLEDNAIQVLVLAVAKFEDWL 295
              L   L  G   +E I L  G+   VP   L+  E+++L     +V  LA    E + 
Sbjct: 181 VRTLVRGLNMGGTYAEEICLRAGINKTVPAFALAGEEIDRLHWALGEVFGLA----EAYP 236

Query: 296 QDVISG----DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
             V  G    D+VP                     +   +YD             E  +F
Sbjct: 237 HLVAEGERIVDVVP---------------------APLAVYDGL-----------ERREF 264

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F  ALDEF+S  E++  E    AK   A  +  ++   QE  +   ++      ++ E
Sbjct: 265 GSFSEALDEFFSSKEAEAEE----AKPKTALERRREM---QERSIQEFRERERELARLGE 317

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            +     +V+A + A+        ++ ++   +K    +G P+A  I  L  +      L
Sbjct: 318 KVYERYGEVEAVLAAISKGFERGFTYSEILAKIK---TSGLPIAEKILALDYQGELRLRL 374

Query: 472 LSNNLDEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKTI 521
                 +  + +     +           +E++  L+   NA+R+Y+L K+Q  K+E   
Sbjct: 375 DDPGDGDGGEGKGGTVGDTGGKGEARGAVLELNSNLTVPQNAQRYYDLAKEQAKKREGAE 434

Query: 522 TAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            A  +  +   +K   +  + KT A +   RK  W+E+F WF SS+ +LVI GRDA  NE
Sbjct: 435 KALEETIRLIARKAGPE--KAKTRA-VYRRRKPKWYERFRWFTSSDGFLVIGGRDATSNE 491

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT 641
            I  +Y+ K D+ +H D  GA  TVIK     + VP  TL +A  F V +S  W + +  
Sbjct: 492 EIYAKYLEKRDLALHTDAPGAPLTVIKTL--GEAVPESTLEEAASFAVSYSSLWKAGLFE 549

Query: 642 S-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              + V   QV+KT   GE+L  G+F++RG++ +    PL +  G+
Sbjct: 550 GDCYLVAADQVTKTPEPGEFLKKGAFVVRGERRYYRDVPLGLALGI 595


>gi|375084281|ref|ZP_09731287.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
 gi|374741041|gb|EHR77473.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
          Length = 650

 Score =  172 bits (437), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 188/359 (52%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   FETF  ALDE++ KI  + A+ +   K       L      QE  
Sbjct: 242 LPIELKWYEGYEKKFFETFSEALDEYFGKILIESAKIERTKKLQDKKRGLEVTLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++++  + ++ +LI  N   V+  +  +  A+  ++ WE+  + ++E RK+GN VA
Sbjct: 302 IKGFERQMQENQEIGDLIYANFTFVENLLKELSKAV-EKLGWEEFKKRIEEGRKSGNKVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            +I  +                  D +EK + VE    KV++ L  S   NA  +YE  K
Sbjct: 361 QIIKGI------------------DPKEKAVTVELEGKKVKLYLNKSIGENAEIYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH--WFEKFNWFISSENY 569
           K + K E    A+    K  ++  +L   +EK   ++  + K    WFEKF WF+SSE +
Sbjct: 403 KAKHKLEGARKAYEDTLKKIQEIEKLIEEEEKKELSVKKLEKRKKKWFEKFRWFVSSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI G+DA  NE++VKR+MS+ D+Y HAD++GA   VIK+ +        T+ +A  F V
Sbjct: 463 LVIGGKDATTNEIVVKRHMSENDLYCHADIYGAPHVVIKDGK---KAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + +  A+W  P QV+K AP+GEYL  G+FM+ GK+N++   P+ +  G++
Sbjct: 520 SMSRAWKDGIYSGDAYWADPSQVTKKAPSGEYLGKGAFMVYGKRNWMHGLPVKLAIGIV 578



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 84/160 (52%), Gaps = 14/160 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I        +  +GE  K L++ E
Sbjct: 1   MKQEMSSVDIKYIVEELKSLEGARVDKIYHDGDQIRI-------KLHIAGEGRKDLII-E 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y ++    PS FT+ LRK++   RLE + Q  +DRI+  + G     + +I
Sbjct: 53  AGRRIHLTTYIKEAPQQPSSFTMLLRKYLSGLRLEKIEQHDFDRIVKLKIG----EYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
            EL+ +GN++L D +  +++ +R     D+  AI  +H Y
Sbjct: 109 AELFKRGNVILVDKDNVIISAMRHEEFKDR--AIKPKHEY 146


>gi|448725341|ref|ZP_21707802.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
 gi|445798677|gb|EMA49073.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
          Length = 695

 Score =  172 bits (437), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 165/714 (23%), Positives = 288/714 (40%), Gaps = 128/714 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P GF   LR  +       V Q G+DR++ F+F  G    
Sbjct: 57  GETKRAHVVSPEHVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFERGDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            V+ EL+ +GN+ + D+   V+  L + R   + VA  +++ +P           +++  
Sbjct: 117 KVVAELFGEGNVAVLDATGEVVDCLNTVRLQSRTVAPGAQYEFP-----------STRF- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ DG                      +    +++ D       
Sbjct: 165 ------------DPLAVDYDG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G    E +    G+   + + E ++ E    +VL  A+    + L   
Sbjct: 185 -LVRTLATQLNFGGLYGEELCTRAGVEKELAIEEADETE---FEVLYDALTGLSEQLS-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y          D  P +            P  L++    +  +F++F AAL
Sbjct: 239 -SGDFDPRIYR--------DDGEPVD----------VTPFPLDERAEFDSEEFDSFTAAL 279

Query: 359 DEFYSKI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ ++   E + + ++ +   +    +  +I   QE  +   + + DR  + AE +  
Sbjct: 280 DAYFVELDTTEDEESGERERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E VD  +  VR A    + WE +     E  + G   AG +  +      +++     
Sbjct: 340 NYELVDEILTTVRNAREEGIGWEAIEERFAEGEERGIAAAGAVTGIEPSEGTVTI----- 394

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQ--EKTITAHSKAFKAAE 532
             E+DD +       VE+D       NA R Y E K+  E K+  E+ +    +  +A E
Sbjct: 395 --EIDDRD-------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVVETREELEAIE 445

Query: 533 KK--------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           ++                 + +   +  +I   +   W+E+F WF +S+ YLVI GR+A 
Sbjct: 446 RQRDEWEAGDVDDDPDEESEDVDWLSRRSIPTRKNEQWYERFRWFHTSDGYLVIGGRNAD 505

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCFTVCHSQ 633
           QNE +VK+Y+ +GD + H  + G   T++K   P +P     +P  +L +A  F V +S 
Sbjct: 506 QNEDLVKKYLDRGDRFFHTQVQGGPVTILKATGPSEPTREIDLPDRSLEEAAQFAVSYST 565

Query: 634 AW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            W + +    A+   P QVSKT  +GEYL  G F IRG + +     + +  G+
Sbjct: 566 VWKNGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGI 619


>gi|448612034|ref|ZP_21662464.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
 gi|445742795|gb|ELZ94289.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
          Length = 701

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 167/716 (23%), Positives = 284/716 (39%), Gaps = 127/716 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  + R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTEMNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L                    E  R+  RT A    
Sbjct: 117 KIVVELFGQGNIAILDETGEVVRSL--------------------ETVRLKSRTVAPGSQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  ++ D                      L ++  ++  D  R    
Sbjct: 157 YEYPSSR----LDPLTISRDA---------------------LGRHMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                +   L  G   +E +    G+   + + +  + + +AI   ++ +       Q V
Sbjct: 188 ----TIATQLNLGGLYAEELCTRAGVEKTLDIEDATEDDYDAIYDAIVNLR------QQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y+    + +  D  P              PL  +Q    +   +ETF+ AL
Sbjct: 238 RSGEFDPRLYLADDGEVV--DVTP-------------FPLQEHQNAGLDEEAYETFNEAL 282

Query: 359 DEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+  +     +    K  +I   QE  +    Q+ D   + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSNRPDFEEQIAKQERIIEQQEQAIEGFDQQADEERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N +  D  +  VR A    + W+++A  + E    G P A  +  +      +++ L   
Sbjct: 343 NYDLADDVLSTVRDAREQGVPWDEIAVTLDEGADQGIPAAEAVTNVDSANGTVTVELDGT 402

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
                          V +D+++    NA R Y   K+ + K+E  + A   ++    A K
Sbjct: 403 --------------SVTLDVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEAAK 448

Query: 534 KTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           + R +   +                  ++ ++      HWFE+F WF +S  YLV+ GR+
Sbjct: 449 RRRDEWEADDGGGDADEDDEPEETDWLSLESVPVKSTEHWFERFRWFYTSSGYLVVGGRN 508

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVCH 631
           A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V +
Sbjct: 509 ADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQKVDFSEETLREAAQFAVAY 568

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  +  G+
Sbjct: 569 SSIWKEGRFADDAYMVEPSQVSKTPESGEYIDKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|448474105|ref|ZP_21602073.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
 gi|445818385|gb|EMA68244.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
          Length = 731

 Score =  172 bits (436), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 180/734 (24%), Positives = 303/734 (41%), Gaps = 132/734 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYAGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVADPEHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEAT---DDQLRALHEALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSST----QIYDEFCPLLLNQFRSREFVKFETF 354
            SGDI P  Y    +   G++    ++GS T    ++ D   P  L++      V F++F
Sbjct: 239 -SGDIDPRVYEEALDGD-GEEDGNGDAGSDTDRDPRVVD-VTPFPLSEHEGLPSVGFDSF 295

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRS 406
           +AA+DE++ ++E +  +      + +A          K  +I   Q   +    ++  + 
Sbjct: 296 NAAVDEYFYRLEHEDTDAGEAPADASASRPDFEEEIAKQERIIEQQRGAIEGFDEQAAQE 355

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            + AEL+  EY+L  VD  +  VR A AN + W+++A  +    + G P A  +  +   
Sbjct: 356 RERAELLYAEYDL--VDEVLSTVRDARANDVPWDEIADTLAAGAERGIPAAEAVVDVDGS 413

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTI 521
              +++ L ++              +VE+D       NA R Y+  K+ E K+   E+ I
Sbjct: 414 DGTVTVELGDD------------GTRVEIDTGAGVEVNADRLYQEAKRIEDKKAGAEQAI 461

Query: 522 TAHSKAFKAA-EKKTRLQILQEK----------------------TVANISHMRKVHWFE 558
            +     +A  E+K      Q                        + ++I   R   W+E
Sbjct: 462 ESTRAELEAVKERKAEWAAQQAAADDDQSDSEEDDDEEEHEIDWLSRSSIPIRRPEDWYE 521

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F WF ++  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   P +   P
Sbjct: 522 RFRWFHTASGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAGPSESADP 581

Query: 619 L-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
           +     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG +
Sbjct: 582 VDFSEQTLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDR 641

Query: 673 NFLPPHPLIMGFGL 686
            +    P  +  G+
Sbjct: 642 TYFEDVPCRVAVGV 655


>gi|16082623|ref|NP_394872.1| RNA-binding protein snRNP [Thermoplasma acidophilum DSM 1728]
          Length = 601

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 151/260 (58%), Gaps = 16/260 (6%)

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE-- 489
           + + S E+  ++  E+++ G  +   + ++  +    S    N    +D   K + V+  
Sbjct: 283 SQKKSIEEFEKIANEKQEIGRAIMERLQEI--DGAIRSARSGNYAGNIDRARKVITVDMD 340

Query: 490 --KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
              VE+D  +SA  NA R++   K    K E  +    KA + AEK    Q L E   A 
Sbjct: 341 GKPVEIDYTVSAGENANRYFSQAKDYRRKIEGAM----KAIEEAEK----QRLTEMQKAE 392

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
               RKV WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+I
Sbjct: 393 KKKRRKVFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLQEGDIYVHADMYGAPSTII 452

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K+   +QP    TL +A  F V  S+AW + + + +A+WVYP QVSKT  +GEY+  GS+
Sbjct: 453 KSS-GKQPPGEATLREAASFAVSFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVATGSW 511

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           +IRGK+N++    L +  G+
Sbjct: 512 IIRGKRNYITDLKLELCIGM 531



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 86/175 (49%), Gaps = 18/175 (10%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + ++ D  A V   R R +G     VY + P  ++ ++  S       +   VL+ +
Sbjct: 1   MKDKESSIDFYAFVNIYRDRFVGSFVKKVYQVGPDDFMVQIYRSDI-----KRMDVLISL 55

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+   T     +   T +   + LRK I  RR+  +RQ+ +DR++ F F  G     +
Sbjct: 56  KHGIFFKTV----ETPETATQTAMVLRKTISDRRIVGIRQINFDRVVEFTFHTGQK---L 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
           ILEL+ +GN++ TD +  +  +LR  +  ++ + +   ++ P+     F+ ++AS
Sbjct: 109 ILELFREGNLIATDGD-RITFVLRPRKWKNRDLEVGGTYQPPSS----FDPSSAS 158


>gi|10640760|emb|CAC12538.1| conserved hypothetical protein [Thermoplasma acidophilum]
          Length = 588

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 151/260 (58%), Gaps = 16/260 (6%)

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE-- 489
           + + S E+  ++  E+++ G  +   + ++  +    S    N    +D   K + V+  
Sbjct: 270 SQKKSIEEFEKIANEKQEIGRAIMERLQEI--DGAIRSARSGNYAGNIDRARKVITVDMD 327

Query: 490 --KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
              VE+D  +SA  NA R++   K    K E  +    KA + AEK    Q L E   A 
Sbjct: 328 GKPVEIDYTVSAGENANRYFSQAKDYRRKIEGAM----KAIEEAEK----QRLTEMQKAE 379

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
               RKV WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+I
Sbjct: 380 KKKRRKVFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLQEGDIYVHADMYGAPSTII 439

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K+   +QP    TL +A  F V  S+AW + + + +A+WVYP QVSKT  +GEY+  GS+
Sbjct: 440 KSS-GKQPPGEATLREAASFAVSFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVATGSW 498

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           +IRGK+N++    L +  G+
Sbjct: 499 IIRGKRNYITDLKLELCIGM 518



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 77/156 (49%), Gaps = 17/156 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           R +G     VY + P  ++ ++  S       +   VL+ ++ G+   T     +   T 
Sbjct: 7   RFVGSFVKKVYQVGPDDFMVQIYRSDI-----KRMDVLISLKHGIFFKTV----ETPETA 57

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           +   + LRK I  RR+  +RQ+ +DR++ F F  G     +ILEL+ +GN++ TD +  +
Sbjct: 58  TQTAMVLRKTISDRRIVGIRQINFDRVVEFTFHTGQK---LILELFREGNLIATDGD-RI 113

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
             +LR  +  ++ + +   ++ P+     F+ ++AS
Sbjct: 114 TFVLRPRKWKNRDLEVGGTYQPPSS----FDPSSAS 145


>gi|294658357|ref|XP_002770767.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
 gi|202953070|emb|CAR66294.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
          Length = 1064

 Score =  171 bits (432), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 95/257 (36%), Positives = 144/257 (56%), Gaps = 8/257 (3%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +D++LS  ANAR ++E KK  ESKQ K   +   A K A+KK    +  +    N  +
Sbjct: 514 VWIDISLSPFANARVYFESKKSAESKQIKVEKSTEFALKNAKKKIEQDLNNKLKNENDSL 573

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  +WFEKF WF+SSE YL ++GRD  Q +MI  R+ +  D ++ +D+ G+    IK
Sbjct: 574 KQIRPKYWFEKFLWFVSSEGYLCLAGRDNSQIDMIYYRHFNDNDYFISSDIEGSLKVFIK 633

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N    + +PP TL QAG F +  S AW+ K+ TSAW ++   +SK    G  ++ G+F  
Sbjct: 634 NPFKGESIPPSTLMQAGIFAISASSAWNGKVTTSAWLLHGADISKKDFDGTLISSGNFNY 693

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG----MDDFEDSGHH--KE 722
           + KK +LPP  LIMGFG  +  DE +   +   R  R EE G    MD+ +    H  K 
Sbjct: 694 KAKKTYLPPCQLIMGFGFYWLGDEETTKKYTETRLSREEEHGLKIVMDNKKQDLEHSSKS 753

Query: 723 NSDIESEKDDTDEKPVA 739
           ++ I+S  ++ D++ V+
Sbjct: 754 SNKIQSSLNEVDDEKVS 770



 Score =  119 bits (298), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 121/487 (24%), Positives = 234/487 (48%), Gaps = 58/487 (11%)

Query: 11  VAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTA 70
           + AE+   + ++  R  N+Y++S  +  F L  S       +S+KV++L + G +LH T 
Sbjct: 13  ITAELS--KEILNYRLQNIYNVSSSSRQFLLKFSIP-----DSKKVVVL-DCGNKLHLTE 64

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNI 130
           + R    TPS F  KLRKH++TRRL  ++Q+G DR+++ +F  G+   Y+ LE ++ GNI
Sbjct: 65  FDRPTTQTPSNFVTKLRKHLKTRRLSQIKQIGNDRVLVLEFSDGL--FYLALEFFSAGNI 122

Query: 131 LLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKLHAALTSSKEPDA 189
           LL D +  +L+L R     DKG       RY   EI ++F+ +             + D 
Sbjct: 123 LLLDQDRKILSLQRMV--SDKG----GNDRYAVNEIYKMFDESLF-----------KSDF 165

Query: 190 NEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL---KTVLGE 246
           N   K           SKE + G    +   L    ++ S DG + K       K +   
Sbjct: 166 NYERKT---------YSKEQVQGWIKSQRDKL----DQRSQDGNKKKNKVFSIHKLLFVN 212

Query: 247 ALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE-DWLQDVISGDIVP 305
           +      L +  ++  G+  +    +    +D  + +++ A+ + E D++  +   +   
Sbjct: 213 SSHLSSDLVQLNLIKNGISSSASCFDFEN-DDAKMDLIIKALEEAESDYINLLEKSEDAI 271

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLL-----LNQFRSREFVKFETFDAALD 359
            GYI+ + K+L  +    +S +  + I DEF P       ++ +R   F + + ++  +D
Sbjct: 272 NGYIVSK-KNLSYNPDNDDSTNDLEYIMDEFYPYKPYKSDMDNYR---FTEIQGYNRTMD 327

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            F+S IES +   +   ++  A  +L+    +++ ++ +L  + + ++K  + I Y  + 
Sbjct: 328 SFFSTIESTKYALRIDQQKQQATKRLDYAREERDKQIQSLLAQQESNIKKGDAIMYYADL 387

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDE 478
           VD    +V   +  +M W ++  +++ E+  GN +A  I+  L L+ N ++L L  ++DE
Sbjct: 388 VDQCKDSVVKLIDQQMDWTNIESLIELEQSRGNKIARFINLPLNLKENKINLHLP-DMDE 446

Query: 479 MDDEEKT 485
            ++E KT
Sbjct: 447 ENEENKT 453



 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 56/243 (23%)

Query: 840  KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS------ 893
            K  +S  ERR L+KG+     D KV   ++  +D     E  ++  K+E  K        
Sbjct: 794  KKRLSAKERRMLRKGK-----DIKVSENEDTDEDVFDPIEQEMKNLKLEETKKKTAEPSS 848

Query: 894  ------RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKK 947
                  RG+K K+KK+  KY DQDEEER IRM  L +  +V+               EK+
Sbjct: 849  QKPPNVRGKKSKMKKIAAKYADQDEEERKIRMEALGTLKQVEA--------------EKQ 894

Query: 948  PAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
              I                   ++ KE  D   +   +       E  E  K  MEE + 
Sbjct: 895  KQID-----------------EEENKESKDKYVNEALNAERRKNQEEREYRKYIMEEAN- 936

Query: 1008 HEIGEEEKGRLNDV---DYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
                E+E   +N +   D     P P D L+ ++PV  P+SA+  +KY+VKI PG  KKG
Sbjct: 937  ----EDESSVVNYLEILDSFISKPQPDDCLVNLVPVFAPWSALTKFKYKVKIQPGGGKKG 992

Query: 1065 KGI 1067
            K I
Sbjct: 993  KCI 995


>gi|429961918|gb|ELA41462.1| hypothetical protein VICG_01446 [Vittaforma corneae ATCC 50505]
          Length = 351

 Score =  171 bits (432), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 159/285 (55%), Gaps = 31/285 (10%)

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL-ERNCM 468
            E++  N   ++  +   +     +M W       +EE++ GNP A  I    L ER C+
Sbjct: 5   TEILNENRVFINEILGIFKKVFETKMEWSAFEAFWEEEKRNGNPYAKAIVSYDLSERKCI 64

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            L+                   +E+D+++    N  +++  +KK   K +KT        
Sbjct: 65  VLIDHRY---------------IELDVSMPLSKNIEKYFSKRKKALDKSDKT-------- 101

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           KAA +    +++ +K +   +  R+++WFEKF++FIS+EN LVI G++AQQNE+IVK+++
Sbjct: 102 KAALENIVDKLIPKKAIVP-AQKRELYWFEKFHFFISTENELVIGGKNAQQNEIIVKKHL 160

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
              D+Y H D+HGASS   K  R E     +T+ +A    +C S+ WD  ++   ++V P
Sbjct: 161 EPTDLYFHCDIHGASSIACKG-RSE-----VTIEEASYMALCMSKCWDEGVIKPVFYVEP 214

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
            QVSK+AP+GEY+T GSFMI+GK+N + P+ L  G GLLF+L+ S
Sbjct: 215 DQVSKSAPSGEYITKGSFMIKGKRNIMNPYRLEYGIGLLFKLEGS 259



 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/45 (35%), Positives = 27/45 (60%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             + NP     +LY +PV  P+  V++YKY++++ P + KK K  Q
Sbjct: 265  FSSNPADDAKILYGLPVSAPWICVKNYKYKIRLCPASEKKSKLCQ 309


>gi|341581973|ref|YP_004762465.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
 gi|340809631|gb|AEK72788.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
          Length = 650

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 184/359 (51%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ KI  ++A  +   + +A   +L      QE  
Sbjct: 242 VPIELRIYEGFEKRYFTTFSEALDEYFGKITMEKARVEQTKRLEAKKRQLLMTLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++    + ++ +LI  N   ++  +   R A    + W++  + ++E ++AGN VA
Sbjct: 302 LKGFEEGAKANQEIGDLIYANYALIERLLEEFRKA-TETLGWDEFKKRIEEGKRAGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++                     D +EK + +E    KV + L  S   NA  +YE  K
Sbjct: 361 LMVKG------------------TDPKEKAVTIELEGKKVRLYLNRSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--RKVHWFEKFNWFISSENY 569
           K   K E  + A+    +  ++  RL   + K   ++  +  RK  WFEKF WF+SSE +
Sbjct: 403 KFRHKHEGALKAYEDTKRKLDEIERLIEEELKKELSVKRIERRKKKWFEKFRWFVSSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LV++G+DA  NE+++KR+M   D+Y HAD++GA   VIK+    Q     T+ +A  F V
Sbjct: 463 LVLAGKDAGTNEILIKRHMDDNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + +  A+W +P+QV+K  P+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 520 SMSKAWSRGVYSEDAYWAHPNQVTKQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGVI 578



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 83/161 (51%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   +Y    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYIVRELQSLVGSRVDKIYHDGDEIRI-KLRTKEGRQD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y ++    PS FT+ LRKH+    ++ + Q G+DRI+  + G     + ++
Sbjct: 52  AGKRFHVTTYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHGFDRIVKIRVG----DYTLV 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GN++L D E  ++  LR     D+ +   + ++YP
Sbjct: 108 GELFRRGNVILVDGENRIVAALRYEEYKDRRIMPKAEYQYP 148


>gi|448578556|ref|ZP_21643976.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
 gi|445725734|gb|ELZ77354.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
          Length = 702

 Score =  169 bits (429), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 169/705 (23%), Positives = 290/705 (41%), Gaps = 128/705 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLVEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYDFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNISVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N +++  D  R    
Sbjct: 165 ------------DPLSVSRDA---------------------LGRNMDESDTDIVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +S+  + + +A+   ++      D  + V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVDKTLDISDATEEDYDAVFDAIV------DLREQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y L  ++++    P               PL  +Q    +   +++F+ AL
Sbjct: 238 RAGEFDPRLY-LDDDENVVDVTP--------------FPLREHQNDGLDEEAYDSFNEAL 282

Query: 359 DEFYSKIESQRAEQQ----HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           DE++ +++    EQQ    ++   +A   K  +I   QE  +    +      + AEL+ 
Sbjct: 283 DEYFFRLDLTADEQQDVGSNRPDFEAQIAKQERIIEQQEGAIEGFDERAAAERERAELLY 342

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N + VD  +  VR A    + W+D+A  +    + G P A  +  +      +++    
Sbjct: 343 ANYDLVDDVLSTVRDAREEGVPWDDIAEKLDAGAEQGIPAAEAVTNVDGAEGTVTI---- 398

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
              E+DD   TL       D+++    NA R Y   K+ + K+E  + A     +  E+ 
Sbjct: 399 ---ELDDSTITL-------DVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEEV 448

Query: 535 TRLQILQEK-------------------TVANISHMRKVHWFEKFNWFISSENYLVISGR 575
            R +   E                    ++ ++      +W+E+F WF +S+ YLV+ GR
Sbjct: 449 KRRRDEWEADDDEDDAEDEEEQEETDWLSLQSVPVKSTDYWYEQFRWFHTSDGYLVVGGR 508

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-----PPLTLNQAGCFTVC 630
           +A QNE +VK+YM K D + H    G   T++K   P +P      P  +L++A  F V 
Sbjct: 509 NADQNEALVKKYMDKHDRFFHTQARGGPVTLLKATGPSEPAKEVDFPESSLHEAAQFAVS 568

Query: 631 HSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + +
Sbjct: 569 YSSIWKDGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRTY 613


>gi|322368861|ref|ZP_08043428.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
 gi|320551592|gb|EFW93239.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
          Length = 711

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 164/713 (23%), Positives = 290/713 (40%), Gaps = 128/713 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA  + L    G +    Y         K+ +        +  ++ LL+E 
Sbjct: 22  KRELSSIDLAAITRELNSFEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLVEV 74

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  +       P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 75  GEVKRAHTVAPEHVPPAPGRPPNFAMMLRNRLSGADFAGVEQFEFDRILQFHFKREDGDT 134

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+ + D    V+  L                    +  R+  RT A    
Sbjct: 135 TIVAELFGQGNVAVLDENNEVIDCL--------------------DTVRLKSRTVAPGSQ 174

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+      P +++ +                     +     N +  D  R    
Sbjct: 175 YEFPSSR----VNPLEIDYE---------------------EFEYRMNDSDTDVVR---- 205

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L +G   +E +    G+    K++++   +++  + L  A+ +  + L+  
Sbjct: 206 ----TLATQLNFGGLYAEEVCTRAGV---EKVTDIADADEDEYERLYAAIERLREPLE-- 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +GD  P  Y                      +  +  P  L ++   +   F++F+AA+
Sbjct: 257 -TGDFDPRVYY------------------EDDVRVDVTPFPLEEYEGLDSEAFDSFNAAV 297

Query: 359 DEFYSKI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D++++ +   E++ A +  K    A   K  +I   QE  +   +++ D   + AEL+  
Sbjct: 298 DDYFTNLDVSENEDAGEPQKPDFQAQIEKQQRIIEQQEGAIEGFERKADAEREKAELLYA 357

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N   VD  +  VR A A    W D+    +E  + G   A  +  +      +++     
Sbjct: 358 NYGFVDEILATVRNARAEDTPWADIEARFEEGAERGIEAAEAVQGIDPSEGTVTV----- 412

Query: 476 LDEMDDEEKTL-PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK- 533
             E+DD + TL P + VE         NA R Y+  K+ E K+E  + A     +  E+ 
Sbjct: 413 --EIDDTKITLFPDDGVE--------KNANRLYQEAKRIEEKKEGALAAIEDTREELEEV 462

Query: 534 KTRLQILQEK--------------TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           K R +  +E+              + A+I   ++  W+E+F WF +S+ +LV+ GR+A +
Sbjct: 463 KKRAEQWEEEPEEERTEPENIDWLSRASIPVRKQEQWYERFRWFRTSDGFLVLGGRNADE 522

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCFTVCHSQA 634
           NE +VK+YM + D++ H+  HG   T++K   P +P     VP  +  +A  F V +S  
Sbjct: 523 NEELVKKYMDRNDLFFHSQAHGGPITILKTSDPSEPSKDVDVPEQSKREAAQFAVSYSSV 582

Query: 635 W-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           W D +    A+ V P QVSKT  +GEYL  G F IRG + +    P+ +  G+
Sbjct: 583 WKDGRFAGDAYMVTPDQVSKTPESGEYLEKGGFAIRGDRTYFEDTPVGVAVGI 635


>gi|14591254|ref|NP_143331.1| hypothetical protein PH1465 [Pyrococcus horikoshii OT3]
 gi|3257889|dbj|BAA30572.1| 650aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 650

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 190/358 (53%), Gaps = 28/358 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V FETF  ALDE++ K+  ++A+ +   K +    +L      QE  
Sbjct: 243 VPIDLKWYEGYEKVYFETFSQALDEYFGKLTIEKAKAEKTKKLEEKRKQLLATLKRQEEM 302

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E+ ++ ++  LI  N   +D  +     A+ N + W++  + ++E +K GN +A
Sbjct: 303 IKGFEKELKKNQEIGNLIYANYTLIDGLLREFSKAVKN-LGWDEFKKRIEEGKKKGNKIA 361

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            ++  +  E N +++ +                ++V++ L    + NA  +YE  KK + 
Sbjct: 362 LMVKGIEPESNSITVEIEG--------------KRVKLYLDKDLNENAEIYYEKAKKAKH 407

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-----WFEKFNWFISSENYL 570
           K E       KA++  ++K      + +       ++K+      WFEKF WFISSE +L
Sbjct: 408 KLE----GARKAYEDLKRKLESIEREIEEEEKKIQVKKIEKRKKKWFEKFRWFISSEGFL 463

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI G+DA  NE++V++Y+ + D+Y HAD+ GA   VIK+    Q     T+ +A  F V 
Sbjct: 464 VIGGKDATTNEIVVRKYLEENDLYCHADIWGAPHVVIKDG---QKAGEKTIFEACQFAVS 520

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            S+AW   + ++ A+WVYP+QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 521 MSRAWSEGLYSADAYWVYPNQVKKQAPSGEFLPKGAFMVYGKRNWMYGIPLKLAVGII 578



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 83/162 (51%), Gaps = 13/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  +IG R   VY    +  I        + ++GE  K L++ 
Sbjct: 1   MKEEMSSVDIRYIVEELKSEIIGARVDKVYHEGDEVRI-------KLHKTGEGRKDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T+Y ++  + PS F + LRKHI    +ED+ Q  +DRI+  +    +    +
Sbjct: 53  EAGKRIHLTSYIKESSSQPSSFAMLLRKHISGNFVEDIEQHDFDRIVKIK----IGKFKI 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++    +  +L  +R     D+ +     ++YP
Sbjct: 109 IAELFKKGNVVFVTEDNIILGAIRYEEFKDRVIKPKHEYKYP 150


>gi|395504204|ref|XP_003756446.1| PREDICTED: nuclear export mediator factor NEMF [Sarcophilus
           harrisii]
          Length = 996

 Score =  169 bits (428), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 117/349 (33%), Positives = 189/349 (54%), Gaps = 41/349 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           L+ +L   L YG  L EH + + G     ++ E  K E   I+ +++ + K ED ++ + 
Sbjct: 183 LRRILNPYLPYGATLIEHCLRENGFSSYFRVDE--KFETGDIEKVLVCLQKAEDHMKTM- 239

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
             +   +GYI+ Q K       P +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 240 -SNFSGKGYII-QKKEKKPSLEPDKQSEDILTYEEFHPFLFSQHSKCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNL 417
           EFYSK+E Q+ + +   +E  A  KL+ +  D E+R+  L   QE+D+ +K  ELIE NL
Sbjct: 298 EFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDK-IK-GELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+ VA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDIVANAIRELKLQTNHVTMLLKNPYL 415

Query: 475 -----------NLDEMDDEE-----------------KTLPVEKVEVDLALSAHANARRW 506
                      N+++ + EE                 K  P+  V+VDL+LSA+ANA+++
Sbjct: 416 ISDEEEEDDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPL-LVDVDLSLSAYANAKKY 474

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH 555
           Y+ K+    K +KT+ A  KAF++AEKKT+  + + + V  I   RKV+
Sbjct: 475 YDHKRHAARKTQKTVEAAEKAFRSAEKKTKQTLKEVQMVTTIQKARKVY 523



 Score =  147 bits (370), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 152/450 (33%), Positives = 222/450 (49%), Gaps = 75/450 (16%)

Query: 650  QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
            QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DE  +  H  ER+VRG++E
Sbjct: 536  QVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDEPCVWRHRGERKVRGQDE 595

Query: 710  GMDDFEDSGHHKENSDIESEKDDTDEKPVAESLS--VPNSAHPAPSHTNASNVDSHEFP- 766
                           D+E+         VA S S  V     P  +  ++S  D  E   
Sbjct: 596  ---------------DLET---------VASSTSKLVSEEMEPLDNGDSSSGEDQEETSE 631

Query: 767  -AEDKTISNGIDSKIFDIARNVAAPV--TPQLEDLID--RALGLGSASISSTKHGIET-- 819
              E++ + N ID ++  I  +   P   + Q E   D  ++  + S    ++K   E+  
Sbjct: 632  TVEEREVVNQIDEEVISIQNDKNRPKEGSAQEESSDDDGKSQRMKSDQEIASKRKDESEM 691

Query: 820  ------TQFDLS--EEDKHVERTATVRDKPYISKAE---RRKL---------KKGQGSSV 859
                  T  DLS  +  +  ++T T  D   ++ ++   RR L         KK Q +  
Sbjct: 692  SLNYPDTTIDLSHLQSQRSFQKTVTREDASDVNDSKLHGRRHLSAKERREMKKKKQPNDS 751

Query: 860  VDPKVEREKERGKDASSQPESIVRKTKIEGG--KISRGQKGKLKKMKEKYGDQDEEERNI 917
             D  +  +K  GK+ + + E     +K   G   + RGQK K+KKMKEKY DQDEE+R +
Sbjct: 752  TDLDILEDK--GKENTLKTEVFPNTSKTVSGPQPMKRGQKSKIKKMKEKYKDQDEEDREL 809

Query: 918  RMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPD 977
             M LL SAG    +  +   +       K    +    P+      + G   K  KE P 
Sbjct: 810  IMKLLGSAG---SSKEEKGKKGKKGKTGKTKEEATKKQPQKFRSELRIGDRIK--KETPL 864

Query: 978  DSS-HGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLY 1036
            ++  H +++   + +D+  + DK   EE+D+ + G EE    N +D LTG P   D+LL+
Sbjct: 865  EAVIHELQE---ITMDDQPD-DK---EEQDVDQQGNEE----NLLDSLTGQPHSEDVLLF 913

Query: 1037 VIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             IPVC PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 914  AIPVCAPYTTMTNYKYKVKLTPGVQKKGKA 943



 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +     RL+GMR  N+YD+  KTY+ +L             KV LL+
Sbjct: 1   MKTRFSSVDICAILSEFNARLLGMRVYNIYDVDNKTYLIRLQKPDF--------KVTLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LT+ E+ +L +LR   D+   V    R +YP +  RV E
Sbjct: 113 IIELYDKGNIVLTNYEYLILNILRFRSDEADDVKFAVREKYPIDHARVME 162


>gi|448435995|ref|ZP_21587011.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
 gi|445683155|gb|ELZ35558.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
          Length = 743

 Score =  168 bits (426), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 175/733 (23%), Positives = 301/733 (41%), Gaps = 118/733 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     R  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDRVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP            S+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGSQYEYP-----------GSRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                          L   +GG      ++  ++ +D  R    
Sbjct: 166 P------------------------------LDVSRGG----FERHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+     + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKETPIEEAT---DDQLRALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y   ++     D       +  ++ D   P  L +  +   V F++F+ A+
Sbjct: 239 -SGDIDPRVY--EESIDGSGDGDGNADDADPRVVD-VTPFPLAEHENLPSVGFDSFNDAV 294

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++ S+  E      + +A          K  +I   QE  +   +++     + A
Sbjct: 295 DEYFYRLGSEDTEAGDAPADASASRPDFEGEIAKQERIIEQQEGAIEGFEEQAQAERERA 354

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  N + VD  I  VR A  + + W+++   +    + G P A  +  +      +++
Sbjct: 355 ELLYANYDLVDEVISTVREARESEVPWDEIEETLDAGAERGIPAAEAVVDVDGGEGTVTV 414

Query: 471 LLSNNLDE--MDDEE-KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
            L+   D+  +D E+  +    ++E+D +     NA R Y+  K+ E K+E  + A   +
Sbjct: 415 ELAEEPDDDAVDGEDGASGGTTRIELDASEGVEVNADRLYQEAKRVEEKKEGAVAAIEST 474

Query: 526 KAFKAAEKKTRLQILQEKTV--------------------------ANISHMRKVHWFEK 559
           +A   A K+ + +  +++                            A+I       W+++
Sbjct: 475 RAELEAVKERKAEWEEQQAADDGSAQGGDGDDEDDDEEYETDWLSRASIPIRSPDDWYDR 534

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   P +   P+
Sbjct: 535 FRWFHTSTGYLVIGGRNADQNEELVKKYMDKHDRFFHTQAHGGPVTILKAAGPSESAEPV 594

Query: 620 -----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
                TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + 
Sbjct: 595 DFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRGDRT 654

Query: 674 FLPPHPLIMGFGL 686
           +    P  +  G+
Sbjct: 655 YFEDVPCRIAVGV 667


>gi|448454957|ref|ZP_21594359.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
 gi|445814337|gb|EMA64302.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
          Length = 736

 Score =  166 bits (421), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 177/731 (24%), Positives = 303/731 (41%), Gaps = 121/731 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADAEHVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALQTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEAT---DDQLRALHEALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y   +    G      +     ++ D   P  L++      V F++F+AA+
Sbjct: 239 -SGDIDPRVY---EEDLDGAGSEDADGDGDPRVVD-VTPFPLSEHEGLPSVGFDSFNAAV 293

Query: 359 DEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           DE++ ++E +  +   +A  DA+           K  +I   Q   +   +++ +   + 
Sbjct: 294 DEYFYRLEREDGDA-GEAPADASPSRPEFEEEIAKQERIIEQQRGAIEGFEEQAEAERER 352

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           AEL+    + VD  +  VR A  N + W+++A  ++   + G P A  +  +      ++
Sbjct: 353 AELLYARYDLVDEVLSTVREARENEVPWDEIAETLEAGAERGIPAAEAVADVDGGEGTVT 412

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSK 526
           + L     E  D E    V +VE+D +     NA R Y+  K+ E K+E   + I +  +
Sbjct: 413 VELDREGGE--DGESGDSV-RVELDASTGVEVNADRLYQEAKRIEGKKEGAMEAIESTRR 469

Query: 527 AFKAAE-KKTRLQILQEK------------------------TVANISHMRKVHWFEKFN 561
             +A E +K   + ++                          + ++I       W+++F 
Sbjct: 470 ELEAVEERKAEWEAMEAADDGDGDGGDSEDEDDEEEYETDWLSRSSIPIRSPDDWYDRFR 529

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-- 619
           WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   P +   P+  
Sbjct: 530 WFHTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAGPSESADPVDF 589

Query: 620 ---TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
              TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 590 SEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 649

Query: 676 PPHPLIMGFGL 686
              P  +  G+
Sbjct: 650 EDVPCRIAVGV 660


>gi|354544800|emb|CCE41525.1| hypothetical protein CPAR2_800770 [Candida parapsilosis]
          Length = 661

 Score =  166 bits (421), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 128/224 (57%), Gaps = 3/224 (1%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVAN 547
           V +D  LS++ANA  ++E KK  ESKQ K       A+K AEKK    +   L+ +   +
Sbjct: 200 VSIDYTLSSYANASIYFESKKAAESKQAKIEKGAEIAYKNAEKKINQDLVKNLRRENGTS 259

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
            +  R+  WFE F WF+SSE YL ++GR   Q +++  +Y S  D  V +++ G+    +
Sbjct: 260 SNAEREKFWFESFYWFVSSEGYLCLAGRSKSQTDLLYFKYFSDDDFLVSSEIEGSLKVFV 319

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN    + VPP T+ QAG F +  SQAW+ K+ T+AW ++  ++SK   +G  L  G F 
Sbjct: 320 KNPLKGESVPPTTILQAGIFAMAASQAWNGKINTAAWVLHGSEISKYNSSGALLPAGEFE 379

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
              KK+FLPP  L+MGFGL F +DE S   H  +R  + +E G+
Sbjct: 380 YLAKKHFLPPAQLVMGFGLYFLVDEGSAEGHKIQRVQKEKEHGL 423



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 34/50 (68%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
             + +D+L+  P   D +L ++PV  P+SA+Q +KY+ KI PG AKKGK I
Sbjct: 557  FDTLDFLSSKPAVGDTVLDIVPVFAPWSALQRFKYKAKIQPGLAKKGKSI 606


>gi|448089209|ref|XP_004196743.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|448093427|ref|XP_004197774.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359378165|emb|CCE84424.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359379196|emb|CCE83393.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
          Length = 1056

 Score =  166 bits (420), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 132/229 (57%), Gaps = 2/229 (0%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EK 543
           +P+ +V +DL+LS+ AN+R +++ KK  E+KQ K       A + AEKK    +    +K
Sbjct: 508 VPLLEVSIDLSLSSFANSRIYFDNKKNAETKQAKVEKNTEIALRNAEKKINRDLSSNLKK 567

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
               +  +R   WFEKF WF+S+E YL ++G D  Q +MI  R+ +  D +V +D+ G+ 
Sbjct: 568 ESETLKQIRPKFWFEKFYWFVSNEGYLCLAGNDDTQTDMIYYRHFNDNDYFVTSDIEGSL 627

Query: 604 STVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
              +KN    + V P TL QAG F++  S+AWD+K+ TSAW++   +VSK    G  ++ 
Sbjct: 628 KVFVKNPYQGKEVSPSTLTQAGIFSMSASKAWDNKITTSAWYLKGSEVSKKDFDGSLVSF 687

Query: 664 GSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           G+F  +G+K FLPP  L+MG    F  DE +   + + R  R  E G++
Sbjct: 688 GNFNYKGEKQFLPPSQLVMGLAFYFLGDEETTQRYRSTRLERQAEFGLE 736



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 123/490 (25%), Positives = 240/490 (48%), Gaps = 67/490 (13%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+   D+    K L+  ++  R  NVY    S K YI K      V +S    K L+
Sbjct: 1   MKQRVTGLDLQILCKELQEEIVSYRLQNVYGTAKSNKQYILKF----SVADS----KKLV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +E+G R+H T Y R  +  PS F  K+RKH+++RRL  V+Q+  DR+++ +F  G  A 
Sbjct: 53  ALETGNRIHLTEYERATEAFPSSFVTKMRKHLKSRRLTGVKQVANDRVLVLEFSDG--AF 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKL 177
           Y+ LE ++ GNI+L D    +L+L R+ +  +KG       +Y   E   +F+++   K 
Sbjct: 111 YLALEFFSAGNIILLDENLKILSLQRTVQ--EKG----GNDKYAVNETYSMFDKSLFQKE 164

Query: 178 HAALTSSKEPD------ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK----NSNK 227
                 S  PD      A++  ++     +V++ASK      K  K + + K    N++ 
Sbjct: 165 IQIPKISFTPDLISEWIASQKTRL----EDVTDASK------KKKKVYSIHKLLFVNASH 214

Query: 228 NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
            S D        L++++ + +    +  +++    GL             ++ ++ L   
Sbjct: 215 LSGD------LILRSLVKQGINPSSSCFDYVEDTQGL-------------EDIVRALQET 255

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR--S 345
            A++   L+ V S   V    ++++NK    + P  +S     I DEF P   ++    S
Sbjct: 256 QAEY---LEIVESPSRVKGCIVMVKNKLYNPEDP--DSKDLKYIMDEFHPYKPHKENEDS 310

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
            +F++ E ++  LD ++S IES R   + + +++ A  +L K   +++ ++ +L  + + 
Sbjct: 311 YQFMEVEGYNKTLDTYFSTIESSRYALRIEQQKEQARKRLEKARNERDKQIQSLLDQKNL 370

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
           ++K  E I Y+ + ++    +V   +  +M WE++ ++++ E+  GN +A +I   L L 
Sbjct: 371 NIKKGEAIIYHADVIEECKESVLQLIRQQMDWENIEKLIQLEQTRGNKLAQMIKLPLNLV 430

Query: 465 RNCMSLLLSN 474
           +N +++LL++
Sbjct: 431 QNKINVLLTD 440



 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 94/199 (47%), Gaps = 39/199 (19%)

Query: 882  VRKTKIEGGKISRGQKGKLKKMK----EKYGDQDEEERNIRMALLASAGKVQKNDGDPQN 937
            +R  ++E     +G K + KKMK     KY DQDEE+R +RM  L +  +VQ+N    + 
Sbjct: 843  LRNLRVEEKSTQKGPKVRGKKMKLQKAAKYADQDEEDRRLRMEALGTWKQVQENKK-KRA 901

Query: 938  ENASTHKEKKPAISPVDAPKVCYKCKKAGHLSK-DCKEHPDDSSHGVEDNPCVGLDETAE 996
            E A    +++   +P   P        A   S+ +  E+       + DN      E++ 
Sbjct: 902  EGAQNTGQRRNGTAPQQKP--------ASRRSRQELAEYRKYVMSEINDN------ESSV 947

Query: 997  MDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKI 1056
            +D +A+                  +D   G P  +D L YV+PV  P+SA+   KY+VKI
Sbjct: 948  VDPLAI------------------LDSFIGTPTSTDKLCYVVPVFAPWSALSKLKYKVKI 989

Query: 1057 IPGTAKKGKGI-QIFYSLL 1074
             PG  KKGK + ++ ++LL
Sbjct: 990  QPGNMKKGKCVSEVIHALL 1008


>gi|448424081|ref|ZP_21582207.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|448478971|ref|ZP_21603977.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
 gi|445682746|gb|ELZ35159.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|445822801|gb|EMA72563.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
          Length = 735

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 179/718 (24%), Positives = 296/718 (41%), Gaps = 118/718 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDGDGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVAGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTV---TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV------------------------ANISHMRKVHWFEKFNWF 563
            +A +++ R    Q+                           ++I       WFE+F WF
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWF 530

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL---- 619
            +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+    
Sbjct: 531 HTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFSE 590

Query: 620 -TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 ETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 648


>gi|448688255|ref|ZP_21694088.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
 gi|445779316|gb|EMA30246.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
          Length = 717

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 161/663 (24%), Positives = 269/663 (40%), Gaps = 103/663 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQ--PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+    L   L   L +G    E +    G+  N+    V+ LE++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLEESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   G++ P  Y    +   G D    + G   +  D   P+ L+++     
Sbjct: 232 DEMGTRLRE---GNVDPRVYYETLDDDDGADSGEADDGPDRRRVD-VTPIPLSEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F++ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNSALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARR-WYEL 509
               ++L +               N DE+  E K +  +K   + AL+A  N R    E+
Sbjct: 406 SEGTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEEV 462

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K++++  +       +   ++ ++ T    +Q     ++      HW+E+F WF +S+ +
Sbjct: 463 KERRDEWEADDGDDETDEDQSEDEPTDWLSMQ-----SVPTRSTEHWYEQFRWFHTSDGF 517

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA
Sbjct: 518 LVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQASLDQA 577

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
             F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +    P+ + 
Sbjct: 578 AQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVA 637

Query: 684 FGL 686
            G+
Sbjct: 638 VGI 640


>gi|389848295|ref|YP_006350534.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|448618500|ref|ZP_21666737.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
 gi|388245601|gb|AFK20547.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|445746871|gb|ELZ98329.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
          Length = 701

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 166/716 (23%), Positives = 287/716 (40%), Gaps = 127/716 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  + R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTEMNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHIAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI +                D+ G  + S      E  R+  RT A    
Sbjct: 117 KIVVELFGQGNIAVL---------------DETGEVVRS-----LETVRLKSRTVAPGSQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  ++ D                      L ++  ++  D  R    
Sbjct: 157 YEYPSSR----LDPLTISRDA---------------------LGRHMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                +   L  G   +E +    G+   + +++      +AI   ++ +       Q V
Sbjct: 188 ----TIATQLNLGGLYAEELCTRAGVEKTLDIADATDDHYDAIYDAIVNLR------QQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y    +  +     P +   +  + +E                ++TF+ AL
Sbjct: 238 RSGEFDPRLYTDDDDAVVDVTPFPLQEHQNAGLDEE---------------AYDTFNEAL 282

Query: 359 DEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+  +     +    K  +I   Q+  +    ++ +   + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSNRPDFEEQIAKQERIIEQQKQAIEGFDEQANEERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N + VD  +  VR A    + W+D+A  + E  + G P A  +  +      +++     
Sbjct: 343 NYDLVDDVLSTVREAREQGVPWDDIAVTLDEGAEQGIPAAEAVTNVDGANGTVTI----- 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQEK--TITAHSKAFKAAE 532
             ++DD   TL       D+++    NA R Y E K+ QE KQ     I    +  +AA+
Sbjct: 398 --KLDDATVTL-------DVSMGVEKNADRLYTEAKRIQEKKQGALAAIEDTREELEAAK 448

Query: 533 KK----------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           ++                   +     ++ ++      HWFE+F WF +S  YLV+ GR+
Sbjct: 449 RRRDEWEADDQEDESDEDEEPEETDWLSLDSVPVKSTEHWFERFRWFHTSSGYLVVGGRN 508

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVCH 631
           A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V +
Sbjct: 509 ADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQKVDFSEETLQEAAQFAVSY 568

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  +  G+
Sbjct: 569 SSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|147920849|ref|YP_685344.1| hypothetical protein RCIX612 [Methanocella arvoryzae MRE50]
 gi|110620740|emb|CAJ36018.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
          Length = 670

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 189/366 (51%), Gaps = 33/366 (9%)

Query: 334 EFCPLLLNQFRSREF--VKFETFDAALDEFYS---KIESQRAEQQHKAKEDAAFHKLNKI 388
           +  P+ L ++    +  V FETF+ A+D ++    K E++ A  + KA++   F +  + 
Sbjct: 249 DVLPIELKRYEGEGYEKVYFETFNKAVDAYFGARIKTEAKAAIVEKKAEKLGVFERRLR- 307

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              Q++ +   ++E   + +  E+I    + V+  I  ++ A     SW+D+ +++K+ +
Sbjct: 308 --QQQDAIAKFEREEQENARKGEVIYAEYQKVEEIIKVIKGARDRGYSWDDIRKILKDAK 365

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           KAGN  A  I  +      ++++L              P   V +D+ L+   NA+ +Y+
Sbjct: 366 KAGNQAAAAIQAIDSATGLITVVL--------------PEATVNIDVKLTVPQNAQAYYD 411

Query: 509 LKKKQESKQE---KTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWF 563
             KK ++K+E   K I    KA   A+ K     + +Q+K  A     RK  W+++F WF
Sbjct: 412 KVKKVQAKKEGALKAIEETRKAMAKAQPKVAEPGKPVQKKVSAK---PRKPKWYDRFRWF 468

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
            +S+ +LV++GRDA  NE IVK+YM K DV+ HA  HGA  TV+K     +PV    L +
Sbjct: 469 FTSDGFLVVAGRDADTNEEIVKKYMEKNDVFFHAQAHGAPITVLKTA--GKPVTEQALAE 526

Query: 624 AGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
              F V +S  W +   +   +WV P QVSKT   GEY+  G+F++RG++N++    +  
Sbjct: 527 VAQFAVSYSSVWKAGQFSGDCYWVKPEQVSKTPEPGEYVAKGAFIVRGERNYVKDVQVRA 586

Query: 683 GFGLLF 688
             G+ F
Sbjct: 587 AIGIRF 592



 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 84/162 (51%), Gaps = 7/162 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M + DV A VK L+ L+  +    Y  S      +L       +  ++ K  L+ E
Sbjct: 1   MKEEMTSVDVYAVVKELQFLVDAKLEKAYQTSADEIRLRL-------QEFKTGKYDLIAE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH TA A +    P  F + LRK+    R+  +RQ G+DRI+  +       + +I
Sbjct: 54  AGKRLHITANAPESPKLPPAFAMILRKYTMGGRITAIRQHGFDRIVEIETVRAGEGNILI 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +E++A+GNI+L D+E  ++  L+S +  D+ V    ++ YP+
Sbjct: 114 VEMFARGNIILADAERKIIMPLKSLKMRDRDVVRGEKYEYPS 155


>gi|448448413|ref|ZP_21591226.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
 gi|445814829|gb|EMA64787.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
          Length = 736

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 179/719 (24%), Positives = 296/719 (41%), Gaps = 119/719 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDADGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVVGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTM---AVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV-------------------------ANISHMRKVHWFEKFNW 562
            +A +++ R    Q+                            ++I       WFE+F W
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEGEEDEEYETDWLARSSIPIRSPDDWFERFRW 530

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL--- 619
           F +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+   
Sbjct: 531 FHTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFS 590

Query: 620 --TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 EETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 649


>gi|448512226|ref|ZP_21616340.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|448520849|ref|ZP_21618182.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
 gi|445694546|gb|ELZ46671.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|445702985|gb|ELZ54924.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
          Length = 735

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 179/718 (24%), Positives = 296/718 (41%), Gaps = 118/718 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDGDGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVVGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTV---TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV------------------------ANISHMRKVHWFEKFNWF 563
            +A +++ R    Q+                           ++I       WFE+F WF
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWF 530

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL---- 619
            +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+    
Sbjct: 531 HTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFSE 590

Query: 620 -TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 ETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 648


>gi|448502987|ref|ZP_21612851.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
 gi|445693389|gb|ELZ45541.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
          Length = 730

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 181/744 (24%), Positives = 296/744 (39%), Gaps = 153/744 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDNVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+     + E     D+ +  L  A+++ ++ L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKETPIEEAT---DDQLGALHDALSRLDERLR-- 238

Query: 299 ISGDIVPEGY---ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            SGDI P  Y   +           P        ++ D   P  L +      V F++F+
Sbjct: 239 -SGDIDPRVYEESVDGDGSEDDGGDP--------RVVD-VTPFPLAEHEGLPSVGFDSFN 288

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRS 406
           AA+DE++ ++ ++ A    +A  DA            K  +I   Q   +   +++    
Sbjct: 289 AAVDEYFYRLGNE-ATDDGEAPADATASRPDFEAEIAKQERIVEQQRGAIEGFEEQAQAE 347

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AEL+  N + VD  +  VR A  N + W+++A  +    + G P A  +  +     
Sbjct: 348 RERAELLYANYDLVDEVLSTVREARENEVPWDEIAATLDAGAERGIPAAAAVVDVDGGEG 407

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            +++ L       DDE+      ++++D +     NA R Y+  K+ E K+         
Sbjct: 408 TVTVAL-------DDEDGG--SVRIDLDASEGVEVNADRLYQEAKRVEEKKAGA------ 452

Query: 527 AFKAAEKKTR--LQILQEK------------------------------------TVANI 548
             KAA + TR  L+ + E+                                    + ++I
Sbjct: 453 --KAAIESTREELEAVNERKAEWEEQEAAADESAGADGDGEDGEDGDEAYETDWLSRSSI 510

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K
Sbjct: 511 PIRSPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTILK 570

Query: 609 NHRPEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+ 
Sbjct: 571 ASGPSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIE 630

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            GSF+IRG + +    P  +  G+
Sbjct: 631 KGSFVIRGDRTYFEDVPCRIAVGV 654


>gi|448414286|ref|ZP_21577425.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
 gi|445682579|gb|ELZ34996.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
          Length = 701

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 170/711 (23%), Positives = 287/711 (40%), Gaps = 138/711 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D++A V  L R  G +    Y         ++ +        +  +V L++E 
Sbjct: 4   KRELTSVDLSALVTELNRYEGAKVDKAYLYGDDLLRLRMRDF-------DRGRVELILEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F + LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAAKPEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFEFERDDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ +GNI + D    V+  L + R   + VA  +++ +P+           S+LH
Sbjct: 117 QIVVELFGEGNIAVLDETGEVVRSLETVRLKSRTVAPGAQYEFPS-----------SRLH 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+ +G                       +    +  D  R    
Sbjct: 166 -------------PFTVSYEG---------------------FKRRMEDSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   +  G   +E      G+   M ++E     D   + +  A+  F D L+  
Sbjct: 188 ----TLATQVNLGGLYAEEFCTRAGVDKTMDITEAG---DEEFRAVYDAIQSFRDRLK-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y   +      D  P              PL  ++         +TF+ AL
Sbjct: 239 -SGDFDPRVY---EEDESVVDATP-------------FPLEEHEAEGLNSESHDTFNDAL 281

Query: 359 DEFYSKIESQRAEQ------QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           DE++ +++    ++       ++   +A   K  +I   QE  +   +++     + AEL
Sbjct: 282 DEYFFRLDRTAEDEPDEEPGSNRPDFEAEIEKKKRIIQQQEGAIEGFEEQAQEERERAEL 341

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           +  + + VD  +  VR A    + W+D+ + ++E  + G P A                 
Sbjct: 342 LYAHYDLVDEVLTTVRDAREENVPWDDIRQRLEEGAERGIPAA----------------- 384

Query: 473 SNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITA-HSKA 527
             ++ ++D  E T+ VE    ++EV +      NA R Y   K+ E K+E  + A     
Sbjct: 385 -ESVVDVDGAEGTVTVELEDTRIEVVVDTGVEKNADRLYTEAKRVEGKKEGALAAVEDTR 443

Query: 528 FKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENYL 570
            + AE K R +  +E+                 + ++I    + HWFE+F WF +S+ YL
Sbjct: 444 EELAEAKRRREEWEEEDEDDEEEDEEPEDIDWLSRSSIPLRTEEHWFERFRWFHTSDGYL 503

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           VI GR+A QNE IVK+Y++K D++ H   HG   TV+K   P +P      P  T  +A 
Sbjct: 504 VIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTVVKATGPSEPSEAVEFPDATKREAA 563

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            F V +S  W + +    A+ V P QVSKT  +GEYL  GSF+IRG + + 
Sbjct: 564 QFAVSYSSIWKEGRYAGEAYMVTPDQVSKTPESGEYLEKGSFVIRGDRTYF 614


>gi|345005767|ref|YP_004808620.1| fibronectin-binding A domain-containing protein [halophilic
           archaeon DL31]
 gi|344321393|gb|AEN06247.1| Fibronectin-binding A domain protein [halophilic archaeon DL31]
          Length = 717

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 171/709 (24%), Positives = 288/709 (40%), Gaps = 120/709 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA    L R  G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELSSVDLAALATELSRYEGAKLDKAYLYGEDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  I +  L  V Q  +DRI++F+F       
Sbjct: 57  GDTKRAHVAAQEHVPDAPGRPPEFAMMLRGRIESADLVSVEQYEFDRILVFEFERPDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+  GN+ + D    V+  L + R   + VA  + + +P            S+L+
Sbjct: 117 TLVVELFGDGNVAVLDGNGEVVRSLETVRLKSRTVAPGTPYGFPQ-----------SRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S +  +A   D   +    V  A++ NLGG  G                       
Sbjct: 166 PLEMSYEALEARMEDSDTDVVRTV--ATQLNLGGFWG----------------------- 200

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                            E +    G+   M + +  + E  A+   ++++A        +
Sbjct: 201 -----------------EELCRRAGVEKAMDIEDAGEAEYRAVHRELMSLA------DTL 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPP-TESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            SG   P  Y+   +     D    TE G    +     P+ L +      V F++F+AA
Sbjct: 238 TSGQFDPRVYVEETDGESDDDDKSLTERGKVVDV----SPVALKERSELLSVAFDSFNAA 293

Query: 358 LDEFYSKIESQRAEQQHKAKE-----DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           LDE++ ++  Q   ++          +A   K  +I   QE  +   ++E ++  + AEL
Sbjct: 294 LDEYFYRLTHQERREEEGGGRKRPDFEADIEKEKRIIQQQEGAIEGFEEEAEQRRREAEL 353

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
                E VD  +  ++ A      W+++   + +  + G P A  +  +   +  +++  
Sbjct: 354 CYERYELVDEVLSTIQQARQQEHGWDEIQETLAQGAEQGIPAAEAVVDVNSAKGMVTI-- 411

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFK 529
                E+DD   TL       D ++    NA R Y   K+ E K+E   + I    K  +
Sbjct: 412 -----ELDDHRITL-------DASMGVEKNADRLYREAKRVEGKKEGAREAIEDTRKRLE 459

Query: 530 AAEKK-----------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           AA+++                    + +   T  +I   +  HW+E+F WF +S+ YLVI
Sbjct: 460 AAKQRREEWEAEDDPEPEPDPDEEQEEVDWLTREDIPIRQPEHWYEEFRWFRTSDGYLVI 519

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCF 627
            GR+A QNE +VK+Y+ K D + H   HG   T++K   P +   P+     TL +A  F
Sbjct: 520 GGRNADQNEALVKKYLDKHDRFFHTQAHGGPVTLLKASGPSEAASPVDFPDATLQEAAQF 579

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            V +S  W D +    A+ V P QVSKT  +GEYL  G F IRG + + 
Sbjct: 580 AVSYSSVWKDGRGAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDREYF 628


>gi|448508289|ref|XP_003865916.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
 gi|380350254|emb|CCG20475.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
          Length = 654

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 128/224 (57%), Gaps = 3/224 (1%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVAN 547
           V +D  LS++ANA  ++E KK  E+KQ K       A+K AEKK    +   L+ +   +
Sbjct: 197 VSIDYTLSSYANASVYFENKKAAEAKQTKVEKGAEIAYKNAEKKINQDLVKNLRRENGTS 256

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
               R+  WFE F WF+SSE YL ++GR   Q +++  +Y S  D +V +++ G+    +
Sbjct: 257 SKSEREKFWFESFYWFVSSEGYLCLAGRTKSQIDLLYFKYFSDDDFFVSSEIEGSLKVFV 316

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN    + VPP T+ QAG F +  SQAW+ K+ T+AW ++  +VSK   +G  L  G F 
Sbjct: 317 KNPLKGESVPPSTILQAGIFAMSASQAWNGKINTAAWVLHGSEVSKYNQSGALLPPGEFE 376

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
              +K+FLPP  L+MGFGL F +DE S   H  +R  + +E G+
Sbjct: 377 YLARKHFLPPAQLVMGFGLYFLVDEGSAEGHKQQRVQKEKEHGL 420



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 33/50 (66%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
             + +D+LT  P   D +L ++PV  P+SA+Q  KY+ KI PG AKKGK I
Sbjct: 550  FDTLDHLTPKPAVGDTVLDIVPVFAPWSALQKLKYKAKIQPGLAKKGKSI 599


>gi|448470211|ref|ZP_21600408.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
 gi|445808289|gb|EMA58361.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
          Length = 735

 Score =  163 bits (413), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 181/743 (24%), Positives = 302/743 (40%), Gaps = 146/743 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVADAEHVADAPGRPPNFAKMLRNRMAGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  +++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALSTVRLKSRTVAPGAQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+  G                       ++  ++ +D  R    
Sbjct: 166 -------------PLDVSPGG---------------------FERHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + EV    D+ ++ L  A+++  D L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEVT---DDQLRALHEALSRIGDRLR-- 238

Query: 299 ISGDIVPEGYI-LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            SGDI P  Y   +     G+D    ES        +  P  L++      V F++F+AA
Sbjct: 239 -SGDIDPRVYEEALDGGDGGED---AESDDRDPRVVDVTPFPLSEHEGLPSVGFDSFNAA 294

Query: 358 LDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +DE++ ++E++  +      + +A          K  +I   Q   +   +++ +   + 
Sbjct: 295 VDEYFYRLEAEDTDAGEAPADASASRPDFEEEIAKQERIIEQQRGAIEGFEEQAEAERER 354

Query: 410 AELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           AEL+  EY+L  VD  +  V+ A    + W+++A  +      G P A  +  +      
Sbjct: 355 AELLYAEYDL--VDEVLSTVQEAREAEVPWDEIAETLDAGADRGIPAAEAVVDVDGGEGT 412

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +++       E+DDE+      +VE+D +     NA R Y+  K+ E K+E        A
Sbjct: 413 VTV-------ELDDEDGD--SVRVELDASAGVEVNADRLYQEAKRIEGKKEG-------A 456

Query: 528 FKAAEKKTR-LQILQEK-------------------------------------TVANIS 549
            +A E   R L+ ++E+                                     + ++I 
Sbjct: 457 MEAIESTRRELEAVKERKAEWEAKEAAADETPGGGGDGDGDDDADDEEYETDWLSRSSIP 516

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                 WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K 
Sbjct: 517 IRSPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKA 576

Query: 610 HRPEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
             P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  
Sbjct: 577 AGPSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEK 636

Query: 664 GSFMIRGKKNFLPPHPLIMGFGL 686
           GSF+IRG + +    P  +  G+
Sbjct: 637 GSFVIRGDRTYFEDVPCRVAVGV 659


>gi|378754807|gb|EHY64836.1| hypothetical protein NERG_02239 [Nematocida sp. 1 ERTm2]
          Length = 697

 Score =  162 bits (411), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 172/360 (47%), Gaps = 43/360 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F AA+D  ++  E      Q K +         KI   QE  +H   +E+      A
Sbjct: 269 FEGFGAAMDAVFNVQEITETASQKKQR---------KIREAQERDLHKKIEEMTILKDKA 319

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  N  +V   I  +  A A  +S ++  R  +E  K  NP A +I K+      + L
Sbjct: 320 ELLSENQAEVKNVISVIEAANAASLSEKEFERF-RETEKDTNPTAQIIKKVNFGNKTVDL 378

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA----HSK 526
                         TL  + V +D   S        Y+  KK E K +KT  A      K
Sbjct: 379 --------------TLDKKAVSIDYTKSIFEQINMLYQKAKKIEEKLKKTRKALDESKHK 424

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
             + A K  +++ ++          R   WFEKF WFI+ ++ L+I+GRD++QNE++VK+
Sbjct: 425 EVEIASKVEKIEKIE----------RNPFWFEKFRWFITKDSDLIIAGRDSKQNEILVKK 474

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+   D Y HAD+ G SS ++  H  +      T   A    +  S+AW++ ++T  + V
Sbjct: 475 YLLDTDYYFHADIRGGSSVIVGEHATDH-----TKEIAASMAMHLSKAWENNLITEVYCV 529

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
              QVSKTAP GEYLT GSFMI GKK F  P  L  GF L++++++  +    + R+V G
Sbjct: 530 RGDQVSKTAPAGEYLTHGSFMITGKKEFYHPTRLEYGFSLIYKIEDEEITISDDNRKVTG 589



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 73/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KDQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T  + +K N TP    L LR+ I   R+E + QLG+DR+ + +   G     +
Sbjct: 50  PPSKFHLTHKSYEKVNLTP--LALYLRREISNYRVEKITQLGFDRVAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IVEMYANGNIILTDEELNIINLLR 131


>gi|260942807|ref|XP_002615702.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
 gi|238850992|gb|EEQ40456.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
          Length = 605

 Score =  162 bits (410), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 154/299 (51%), Gaps = 7/299 (2%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQE 542
           +P   V++DLALSA ANA  ++E KK   +KQ +       A K AE+K +  +   L+ 
Sbjct: 70  MPTLTVDIDLALSAFANASVYFESKKVAVTKQTRVEKNTKIALKNAERKIQSDLNKNLKN 129

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           +T  ++   R   WFEK+ WF +S+ YL ++GRD  Q +MI  R+ S GD +V +DL GA
Sbjct: 130 ET-ESLRAFRHKFWFEKYFWFTTSDGYLCLAGRDDLQTDMIYYRHFSDGDYFVSSDLDGA 188

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
           +   I N    Q V P  L QAG F +  S AW +K+ +SAWW+    V+K    G  L 
Sbjct: 189 AKVFILNPYKAQNVSPSALFQAGIFALSTSTAWSAKISSSAWWMSGADVTKREFDGSLLG 248

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD-DFEDSGHHK 721
            G    + KKN++PP  ++MGFG  +  DE +   +   R  R EE G+   F +     
Sbjct: 249 PGILKYKAKKNYMPPAQMVMGFGFYWLCDEETTQKYKIAREKRQEEHGLKVSFSNKKSDL 308

Query: 722 ENSDIESEKDDT-DEKPVAESLSVP-NSAHPAPSHTNASNVDSHEFPAEDKTISNGIDS 778
           ++  I+S  + T +E  + E+   P NS  P+     +   DS     E+K     ++S
Sbjct: 309 DDMSIKSSMNSTKEEASLEETQKEPENSDEPSKKDAYSPIEDSEASHPEEKETETMVES 367



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 40/178 (22%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG+KGKLKK+  KY DQDEEER +RM +L +  +++        E     +E + A S  
Sbjct: 394  RGKKGKLKKINAKYADQDEEERRLRMEMLGTLKQME--------ELERKRREAEKAKS-- 443

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAM----EEEDIHE 1009
                                +   +  HG   N      + AE  ++      E ++ + 
Sbjct: 444  --------------------DQQQNEKHG---NKAASKQQKAEERELQRYLKGEMDEDNA 480

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            IG      L  +D L   P   DI+  ++PV  P++++  +KY+VKI PG  KKGK +
Sbjct: 481  IG---TNYLELLDSLVAKPARDDIVADLVPVFAPWASMAKFKYKVKIQPGLGKKGKSL 535


>gi|297619525|ref|YP_003707630.1| Fibronectin-binding A domain-containing protein [Methanococcus
           voltae A3]
 gi|297378502|gb|ADI36657.1| Fibronectin-binding A domain protein [Methanococcus voltae A3]
          Length = 722

 Score =  162 bits (410), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 114/344 (33%), Positives = 185/344 (53%), Gaps = 16/344 (4%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           +E +  ALDE++S+   Q+  ++ + K D    K  +I   Q       +++  ++ +  
Sbjct: 311 YENYLNALDEYFSQFILQKDIKKEETKLDKLIRKQERIVNSQIETKAKYEKQSAKNHQKG 370

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +LI  N  ++D  I  +R A   +M W+ + ++V E +   NP+   I+ +  +   ++L
Sbjct: 371 DLIYANFTEIDEIINTIRSA-REKMEWKQIKKIVSENK--DNPILSKIESINEKNAELNL 427

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
            L   + E   E  T+    V +D+  SA  NA  +Y   KK ++K    I A   + K 
Sbjct: 428 KL---IAEYGGELGTIK-GNVAIDIRESAFENANSYYTKAKKFKNKVSGVIVALEISQKK 483

Query: 531 AEK---KTRL--QILQEKTVANISHMRKV-HWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            EK   +T L  ++L++K        R+V  W+EK  W I  +NYL+I+G+DA  NE+IV
Sbjct: 484 LEKIRQQTELDAELLKQKQQNIKKKERRVLKWYEKLKWTII-DNYLIIAGKDATTNEIIV 542

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-A 643
           K+Y+ K DV  H  + GA  TVIKN   E P    TL +   F V HS+AW   + ++  
Sbjct: 543 KKYLEKNDVVFHTLMEGAPFTVIKNTSEETPSEE-TLLEVAKFAVSHSKAWKLGLGSADV 601

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +WV P Q+SKTA +GE+L  G+F+IRGK+NF+   PL +G G++
Sbjct: 602 YWVLPEQISKTAESGEFLKKGAFVIRGKRNFIRSAPLDLGVGIV 645



 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 79/169 (46%), Gaps = 16/169 (9%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLLL 59
           K  M   D+   VK L+ LI  +    + ++    +  I K+ N    TE G  E V+  
Sbjct: 15  KKEMTNIDICVAVKELQNLINAKFDKAFLVNNQDGRELILKVHN----TEMGTQEIVI-- 68

Query: 60  MESGV----RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
              GV     +  T Y R K   P  F + LRK++R  ++  + Q  +DRI+   F    
Sbjct: 69  ---GVGKYKYITKTEYDRQKPKNPHSFVMLLRKNLRNIKITKIEQHNFDRIVKITFEWNE 125

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
             + +I+EL+  GN++L D E  ++  LR+ R  D+ +     +++P +
Sbjct: 126 LKYTLIIELFKDGNVILLDKENKIVMPLRNERFSDRKLIPKEEYKFPAQ 174


>gi|110667755|ref|YP_657566.1| hypothetical protein HQ1801A [Haloquadratum walsbyi DSM 16790]
 gi|109625502|emb|CAJ51929.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 719

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 163/724 (22%), Positives = 288/724 (39%), Gaps = 147/724 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  LRR  G +    Y        F++ +        +  ++ LL+E 
Sbjct: 4   KQELTSVDIAALVTELRRYTGAKVDKTYRYGDDLLRFRMRDF-------DRGRLELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT    +  D    P  F + LR  +    L +V Q  +DRI++  F  G    
Sbjct: 57  GTQKRIHTADPDHVPDAPERPPNFAMMLRNRLSGADLVNVEQFEFDRIMILSFERGEEMT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+  GN+ + DS   V+  L                    E  R+  RT A    
Sbjct: 117 RIIVELFGDGNVAVVDSAGEVIQSL--------------------ETVRLKSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                S+      P +V  D                  +   L   S+ +          
Sbjct: 157 YEFPDSR----VNPLQVTYD------------------RFISLMNESDTD---------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L  G   +E +    G+    K +++    D   + +  A+      LQ  
Sbjct: 185 -IVRTLATQLNLGGLYAEEVCARAGI---DKTTQITNTSDKIYRAIYTALESLGTQLQ-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P    L  +     D  P              PL   + ++ +   +++F+ AL
Sbjct: 239 -SGDFEPR---LYTDDDAVIDATP-------------FPLEERKQQNLDVTTYDSFNGAL 281

Query: 359 DEFYSKIE-SQRAEQQHKAKED--AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ +++ +  AE+  + + D  A   K  +I   QE  +   +Q  +     AEL+  
Sbjct: 282 DVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQEGAIDDFEQRAEAERSRAELLYA 341

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+  I  ++ A A   SW+++        + G   A  +    +  +    +++  
Sbjct: 342 NYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGIDAAAAV----VSVDGAEAMVTIE 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
           +D+M          +V V++ +    NA + Y   K+ E K+E  +TA  +++    A K
Sbjct: 398 IDDM----------RVPVNVDVGVEKNADQRYTEAKRIEEKKEGALTAIENTREELNAVK 447

Query: 534 KTR------------------LQILQEK------------------TVANISHMRKVHWF 557
           + R                   + + +K                  ++ +I   +   W+
Sbjct: 448 QRRDAWDREDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTDDEWLSMTSIPLQKNDDWY 507

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           E+F WF +S  YLV+ GR+A QNE +VK+Y++K D + H + HG   T++K   P +P  
Sbjct: 508 EQFRWFHTSTGYLVVGGRNADQNETLVKKYLNKHDRFFHTEAHGGPITILKASGPSEPAE 567

Query: 618 PLTLN-----QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ L      +   F + +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG 
Sbjct: 568 PIELTAETRREVAQFAISYSSIWKEGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGD 627

Query: 672 KNFL 675
           + ++
Sbjct: 628 RTYI 631


>gi|422295934|gb|EKU23233.1| zinc knuckle (cchc-type) family protein, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 397

 Score =  160 bits (405), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 130/413 (31%), Positives = 192/413 (46%), Gaps = 90/413 (21%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK +  T DV A V+ LR +++G++  N+YD+  +TY FKL    G       EKV LL
Sbjct: 1   MVKTKFTTPDVRAMVRDLRTKVLGLKVVNIYDIDNRTYTFKLAVPGG-------EKVTLL 53

Query: 60  MESGVRLHTTAYARDKK---NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           +ESG R HTTAYAR++      P+ F +KLRK++R + LEDVRQLG DR+++F+FG G  
Sbjct: 54  LESGARFHTTAYARERSVPGELPNVFAMKLRKYLRGKGLEDVRQLGMDRVVVFRFGQGEG 113

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSH----------------------------RD 148
           A ++ILELYA GN++LTD+ + +L LLR+H                            R 
Sbjct: 114 ALHLILELYASGNLVLTDANYLILALLRTHQYDQGPEKAVDGEVVGKDAEAGAGTVEGRV 173

Query: 149 DDKGVAIMSRHRYP----------TEICRVFERTTASK----------LHAALTSSKEPD 188
           ++ G  +   H YP          T      E+   +K              L + +E  
Sbjct: 174 EESGRVVRVGHVYPLAFASNALAATRSSAGVEKDAGAKQDPPPWLAVTAETVLAALREVV 233

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
             E  K  ++GN  S+ ++   GG++G         ++  S    +    T KT+   AL
Sbjct: 234 VREKGKAGKEGNGTSSMAQ---GGKRGRTKRGGQAGASARSKVNLKMALMTSKTLDLSAL 290

Query: 249 GYGPALSEHIILDTGLVPNMKL-----------------SEVNKLEDNAIQVLVLAVAKF 291
             GPA+ EH +L+ GL P ++L                  +   L +     L  AV   
Sbjct: 291 --GPAIVEHAVLEAGLRPLLRLMPPASAVALGEDEEEGEGQREGLTEEEAARLAEAVQGL 348

Query: 292 EDWLQDV-ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQ 342
           +  L+ + + G    EGYIL +      D   T  G   +I Y+EF PL L Q
Sbjct: 349 DGRLRRLDLPGQ---EGYILCRK----ADGAGTRGGEEDEIMYEEFHPLRLRQ 394


>gi|448391228|ref|ZP_21566471.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
 gi|445666097|gb|ELZ18766.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
          Length = 723

 Score =  160 bits (404), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 179/691 (25%), Positives = 280/691 (40%), Gaps = 152/691 (21%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGETKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDS------ 162

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT        LT S+E                               +FD  +    +  
Sbjct: 163 RTNP------LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E I    G+   M ++E +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEICTRAGVEKAMDIAEAD--EDVYDRIYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESG-------SSTQIYDEFCP 337
            L          D+ +G+  P  Y+  +    G D   +ES        SS     +  P
Sbjct: 234 AL----------DLRNGNFDPRLYLADE----GDDDNESESDENGGDGDSSPDRVVDATP 279

Query: 338 LLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ-----HKAKEDAAFHKLNKIHMDQ 392
             L +        +++F AALD+++ ++E    E++      +   +    K  +I   Q
Sbjct: 280 FPLEEHVELASEPYDSFLAALDDYFYRLELAEDEEETDPTTQRPDFEEEIAKYERIIEQQ 339

Query: 393 ENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           +  +   +QE D   + AEL+  EY L  VD  +  V+ A A    WE++     EER  
Sbjct: 340 QGAIEGFEQEADALREQAELLYAEYGL--VDDILSTVQEARAQDRPWEEI-----EER-- 390

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRW 506
                      + E     +  +  + ++D  E T+ VE    ++++        NA R 
Sbjct: 391 -----------FAEGADRGIAAAEAVVDVDGSEGTVTVELDGERIDLVAKQGVEQNADRL 439

Query: 507 YELKKKQESKQEKTITAHSKAFK-AAEKKTR----------------------LQILQEK 543
           Y   K+ E K+E  + A     +  AE K R                         L E 
Sbjct: 440 YTEAKRVEEKKEGALAAIEDTREDLAEAKARRDRWEEEDAAAEGDDDEDEDDDRDWLSEP 499

Query: 544 TVANISHMRKVH-WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           +V     +R+   WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG 
Sbjct: 500 SVP----IRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGG 555

Query: 603 SSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTA 655
             TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV+KT 
Sbjct: 556 PVTVLKATDPSEASSSDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTP 615

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            +GEYL  G F IRG + +    P+ +  G+
Sbjct: 616 ESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646


>gi|448737510|ref|ZP_21719550.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
 gi|445803654|gb|EMA53937.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
          Length = 695

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 158/720 (21%), Positives = 278/720 (38%), Gaps = 140/720 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P GF   LR  +       V Q G+DR++ F+F  G    
Sbjct: 57  GETKRAHVVSPEHVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFERGDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            V+ EL+ +GN+ + D+   V+  L +                     R+  RT A    
Sbjct: 117 KVVAELFGEGNVAVLDATGEVIDCLNT--------------------VRLQSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S++     +P  V+ DG                      +    +++ D       
Sbjct: 157 YEFPSAR----FDPLAVDYDG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G    E +    G+   + + E ++ +  A+   +  ++      + +
Sbjct: 185 -LVRTLATQLNFGGLYGEELCTRAGVEKELAIEEADETDFEALYDALTGLS------EQL 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y          D  P +            P  L++    +   F++F AAL
Sbjct: 238 SSGDFNPRIYR--------DDGDPVD----------VTPFPLDERAELDSEGFDSFTAAL 279

Query: 359 DEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ ++++   E+   + +   +    +  +I   QE  +   + + DR  + AE +  
Sbjct: 280 DAYFVELDTTEDEESGGRERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E VD  +  VR A    + WE +     E  + G   A  +  +      +++    +
Sbjct: 340 NYELVDEILTTVRNAREEGIGWEAIEERFAEGEERGIAAAEAVSGIEPSEGTVTV----D 395

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
           +D+ D          VE+D       NA R Y   K+   K+E    A       AE + 
Sbjct: 396 IDDRD----------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEA------VAETRE 439

Query: 536 RLQILQEK-----------------------TVANISHMRKVHWFEKFNWFISSENYLVI 572
            L+ ++ +                       +  +I       W+E+F WF +S+ +LV+
Sbjct: 440 ELEAIERQRDEWEAGDVDDDPDEESEDVDWLSQRSIPVRTDEQWYERFRWFHTSDGFLVL 499

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCF 627
            GR+A QNE +VK+Y+ +GD + H  + G   T++K   P +P     +P  +L +A  F
Sbjct: 500 GGRNADQNEDLVKKYLDRGDRFFHTQVQGGPVTILKATGPSEPTREIDLPDRSLEEAAKF 559

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V +S  W + +    A+   P QVSKT  +GEYL  G F IRG + +     + +  G+
Sbjct: 560 AVSYSTVWKNGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGI 619


>gi|91773364|ref|YP_566056.1| hypothetical protein Mbur_1391 [Methanococcoides burtonii DSM 6242]
 gi|91712379|gb|ABE52306.1| FbpA, DUF814 containing protein [Methanococcoides burtonii DSM
           6242]
          Length = 663

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 111/379 (29%), Positives = 182/379 (48%), Gaps = 32/379 (8%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ----HKAKEDAAFHKLNKIH 389
           +  PL L Q+   E   + +F+ ALDEF+ K  S+   +Q     K KED    +L K  
Sbjct: 257 DVLPLELTQYSDAEKEFYPSFNKALDEFFGKKASEEVIEQVVAKKKEKEDVFERRLRK-- 314

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             Q+  +   + +  R   +AE I  N + V+  +  +  A     SW+D+   +K+ + 
Sbjct: 315 --QQEAILKFETDSTRYTLIAESIYGNYQTVEEVLSVLEAARDKGYSWKDIWDTLKKAK- 371

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
                    D L   +  +S+  +     +D     L V    +++  +   NA+ +Y  
Sbjct: 372 ---------DTLPAAKAIVSIDPAEGSVVVD-----LDVVNANINVRKTIPQNAQMYYNK 417

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            KK   K++  + A     +A +K+      ++K       + K HW+++F WF SS+ +
Sbjct: 418 AKKISKKRDGALIAIEDTKRAMQKR------EQKVSKRRKAVFKKHWYDRFRWFFSSDGF 471

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI GRD+  NE IVK+YM K D+  H  + GA  TVIK    +  +P  TL +A  F V
Sbjct: 472 LVIGGRDSDTNEEIVKKYMEKRDIVFHTQVPGAPITVIKTEGKD--IPETTLEEAARFVV 529

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            +S  W S   +   +W+ P QVSKT  +GEYL  GSF+IRG++N+    P+ +  GL  
Sbjct: 530 SYSSVWKSGQFSGDCYWIKPEQVSKTPESGEYLKKGSFIIRGERNYYKDVPVGVAIGLDL 589

Query: 689 RLDESSLGSHLNERRVRGE 707
             +   +G  L+  +  G+
Sbjct: 590 GAETRVIGGPLSAVQSNGK 608



 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 72/147 (48%), Gaps = 11/147 (7%)

Query: 2   VKVRMNTADVAAEVKCLR----RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVL 57
           +K  M +ADVAA V  L      LI  +   +Y  +P      L     +   G      
Sbjct: 1   MKQEMTSADVAALVSELGDGEGSLIDSKIGKIYQPAPDEIRINLF----IFGKGRYN--- 53

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++E+G R H + Y R+   TP  F + LRKHI   R+  ++Q  +DRII      G   
Sbjct: 54  LVIEAGKRAHMSNYVRESPKTPQAFPMLLRKHILGGRITSIKQYDFDRIIEMGVIRGGIE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLR 144
             ++ EL+++GNI+L +S+  ++  ++
Sbjct: 114 TILVCELFSRGNIVLLNSDRKIILPMK 140


>gi|268323401|emb|CBH36989.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal domain, DUF814 family [uncultured
           archaeon]
 gi|268324037|emb|CBH37625.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal (FbpA) domain and DUF814 domain
           [uncultured archaeon]
          Length = 631

 Score =  159 bits (402), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 161/681 (23%), Positives = 271/681 (39%), Gaps = 142/681 (20%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+AA V  L+ L+G R    Y    +    KL +   +          L++E
Sbjct: 1   MKESMSSVDIAAIVIELQELLGARLVKAYQPGREEIRLKLHHKGSLD---------LIIE 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y R     PS F + LRKH+   R+  +RQL +DRI+            +I
Sbjct: 52  AGKRIHLTKYKRASPRMPSNFAMYLRKHLSGARIAQIRQLDFDRIVEITIERWDKKLRLI 111

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            EL  +GNI++                D+ G  ++   R         +   + K+    
Sbjct: 112 AELLPRGNIVVV---------------DEDGTILLPLRR---------KSFASRKIKVGE 147

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
              + P    P  ++E                      DL  N  K   D A        
Sbjct: 148 KYERPPSRANPLTMSES---------------------DLM-NLCKRDKDIA-------- 177

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
           +V    L +G   +E +    G+   M+  E+   E NAI   +  +  FE         
Sbjct: 178 SVFASELSFGGLYAEEVCAKAGIDKRMRADELTATEINAIHETIHTL--FEP-------- 227

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
                  I+  +K   K H   E         +F P  L+ + ++E   F + + A DE+
Sbjct: 228 -------IITNDKSTLKAHIVIEGEDKI----DFVPFELSSYENKEKQFFPSLNDAADEY 276

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++   ++  E+Q K++ D    K  +I  +Q   +H  + +   S K  E+I  +     
Sbjct: 277 FTTQIAEVVEEQAKSEHDTVIGKYERILNEQLEALHKFELKEAESTKKGEMIYAH----- 331

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
                          + +L  M++E  K              +R  ++L L        D
Sbjct: 332 ---------------YLELEEMLQEPDK--------------KRKVVTLTLP-------D 355

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYE----LKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            + +L     E+D ++S H NA  +Y+     +KK+E  +        K     EK+ R+
Sbjct: 356 TDISL-----EIDTSVSLHKNAGAYYDKAKVFRKKREGVEPAIEMTKEKIRTEKEKEVRI 410

Query: 538 QILQEKTVANISHMR--KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +   E+ +     +R  K  W+EKF WF +S+ +LV+ G+DA  NE++ K++M   D++ 
Sbjct: 411 E---EELIPTKKEVRTEKEEWYEKFRWFETSDGFLVVGGKDATTNEILAKKHMEPNDLFF 467

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKT 654
           H    GA   + K    E  +    L +   F   +S  W         + V   QVSKT
Sbjct: 468 HTQAEGAPVVIAKAGGKE--ISESGLKEIAQFAASYSNLWKYGFYEGECYCVVGEQVSKT 525

Query: 655 APTGEYLTVGSFMIRGKKNFL 675
            P+GEY+  GSFM+RGK+ + 
Sbjct: 526 PPSGEYIKKGSFMVRGKRKYF 546


>gi|296109018|ref|YP_003615967.1| Fibronectin-binding A domain protein [Methanocaldococcus infernus
           ME]
 gi|295433832|gb|ADG13003.1| Fibronectin-binding A domain protein [Methanocaldococcus infernus
           ME]
          Length = 666

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 194/355 (54%), Gaps = 16/355 (4%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L +++  E   F +F  ALDE+++K  +    ++ K+K +    K   I   Q   
Sbjct: 257 VPIELRKYKDYEKRYFNSFYEALDEYFAKFLTSVEIKKEKSKLEKEIEKQESILRRQLET 316

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++EV ++    +LI  N + V+  + A+RVA  ++  WE++ R+++E ++  +P+ 
Sbjct: 317 LKAYEEEVRKNQIKGDLIYSNYQLVEEILNAIRVA-KDKKGWEEVKRVIRENKE--HPII 373

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            LI+ +  ++  + + LS++LD   +E       +V +D+  S   NA  +Y   KK +S
Sbjct: 374 KLIEGVNEKKGEIIVRLSSDLDGKIEE-------RVVLDIRKSTFENAESYYNKAKKFKS 426

Query: 516 KQE--KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVIS 573
           K E  K     SK      KK R   ++EK        ++  W+EKF W + + N+LVI+
Sbjct: 427 KIEGIKKAIEMSKKKLEELKKKRDVEIEEKKALKKKVKKERKWYEKFKWTVIN-NFLVIA 485

Query: 574 GRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQ 633
           G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK +  E  V   TL +   F+V HS+
Sbjct: 486 GKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTNGRE--VDEETLMEVAKFSVSHSK 543

Query: 634 AWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           AW         +WV P Q+SK A +GEYL  G+F+IRGK+N++   PL +G G+L
Sbjct: 544 AWKLGYGALDTYWVKPDQISKRAESGEYLKRGAFVIRGKRNYIRNVPLELGIGVL 598



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 55/94 (58%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T+Y R+K   P  F + LRK+++  +L  + Q+ +DRI+L +F +G   + +I EL+  G
Sbjct: 64  TSYEREKPKLPPSFAMLLRKYLKNAKLLRIDQVEFDRILLLEFSIGEKKYKIIAELFKDG 123

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           NI+  D E  ++  LR     ++ +A   ++++P
Sbjct: 124 NIIFLDEEDNIIAPLRVEVFSNRKIAPKEKYQFP 157


>gi|448666601|ref|ZP_21685246.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
 gi|445771732|gb|EMA22788.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
          Length = 717

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 160/669 (23%), Positives = 263/669 (39%), Gaps = 115/669 (17%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    A+  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHAADPAHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEHGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                     +++    +++
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGF--------------------VARIKESDAD 185

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
                    L   L   L +G    E +    G+  N+    V++L+++  + L   + +
Sbjct: 186 ---------LVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDELDESDFERLYELIDQ 233

Query: 291 FEDWLQDVISGDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
               L++   GD+ P  Y   L      G   P  E         +  P+ L ++     
Sbjct: 234 MGTRLRE---GDVDPRVYYEALDDGDGAGSADPDDEPDRRRV---DVTPIPLEEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F+ ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNPALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A A+ +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAARADDVSWDDIEAKFNEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
               ++L +                 +V VD       NA   Y+  K+ E K+E  + A
Sbjct: 406 SEGTVTLDIDGT--------------RVTVDAFTGVEKNADELYKEAKRIEEKKEGALAA 451

Query: 524 --HSKAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWF 563
             +++    A K+ R +   +                   ++ +I      HW+E+F WF
Sbjct: 452 IENTREDLEAVKERRDEWEADDGDDEADEDEGEDEPTDWLSMQSIPTRSTEHWYEQFRWF 511

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPP 618
            +S+ +LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P 
Sbjct: 512 HTSDGFLVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSTEVDFPQ 571

Query: 619 LTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            +L+QA  F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +   
Sbjct: 572 SSLDQAAQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFES 631

Query: 678 HPLIMGFGL 686
            P  +  G+
Sbjct: 632 TPAGIAVGI 640


>gi|448677723|ref|ZP_21688913.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
 gi|445773398|gb|EMA24431.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
          Length = 717

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 160/662 (24%), Positives = 264/662 (39%), Gaps = 101/662 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQ--PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+    L   L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   G++ P  Y    +   G  +  +      +  D   P+ L+++     
Sbjct: 232 DEMGTRLRE---GNVDPRVYYETLDDGDGAGNGESGDDPDRRRVD-VTPIPLSEYEGLYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F++ALD+++     QR E+       +   +    K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNSALDDYFFNF--QREEEVEGGETQRPDFEVEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  VR A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVRAAREDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
               ++L +               N DE+  E K +  +K   + AL+A  N R   E  
Sbjct: 406 SEGTVTLDIGGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAV 462

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           K++  + E    A     + AE +   +     ++ +I       W+E+F WF +S+ +L
Sbjct: 463 KERRDEWE----ADDGEDEVAEDEGEDEPTDWLSMQSIPTRSTERWYEQFRWFHTSDGFL 518

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           VI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA 
Sbjct: 519 VIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKKVDFPQSSLDQAA 578

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +    P+ +  
Sbjct: 579 QFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAV 638

Query: 685 GL 686
           G+
Sbjct: 639 GI 640


>gi|34364937|emb|CAE45889.1| hypothetical protein [Homo sapiens]
          Length = 505

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 86/111 (77%), Gaps = 1/111 (0%)

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           A+S VIKN   E P+PP TL + G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYL
Sbjct: 1   ATSCVIKNPTGE-PIPPRTLTEVGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYL 59

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           T GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 60  TTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 110


>gi|452210388|ref|YP_007490502.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
 gi|452100290|gb|AGF97230.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
          Length = 775

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 113/390 (28%), Positives = 191/390 (48%), Gaps = 22/390 (5%)

Query: 320 HPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE 378
           H   E     + +D   P  LN++   E   F++F+ ALDEF+ K    Q AE +   K+
Sbjct: 269 HIKQEINGKMETFD-VVPFDLNRYSEYEKEYFDSFNTALDEFFGKKALEQVAEVKEAEKK 327

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +       +  M QE  +   ++E++++  +AE +  N + ++     +  A A   SW+
Sbjct: 328 EKTLGVFERRLMQQEESLAKFEKEIEKNNALAETVYANYQIIEELFSVLNGARAKGYSWD 387

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
           ++  ++K+ +K   P A  I  +  +   +++    NLD           + + +D+  +
Sbjct: 388 EIRSILKQAKKT-VPAAQTITNIDQKTGTVTV----NLDG----------KSINLDIRKT 432

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
              NA+ +YE  KK   K++  I A     KA EKK   +  +      +   RK HW++
Sbjct: 433 VPQNAQEYYEKVKKFTKKKDGAIRAIEDTKKAMEKKAATKSAKAGR--KLQASRKKHWYD 490

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F WF+SS+ +LV+ GRDA  NE I K+YM K D+  H    GA  TV+K    E  VP 
Sbjct: 491 RFRWFVSSDGFLVVGGRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTVVKTGGKE--VPD 548

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            TL +   F V +S  W +   +   +W+   QV+KT  +GEYL  G+F+IRG++N+   
Sbjct: 549 STLQEVSQFAVSYSSLWKAGQFSGDCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKD 608

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
            PL +  GL  + +   +G   +  R  G+
Sbjct: 609 VPLGIAVGLELKGETRIIGGPASAVRKHGD 638



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 11/139 (7%)

Query: 6   MNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      L++E
Sbjct: 1   MSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN---LVIE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R     P  F + LRK++   R+  V Q  +DRI+            +I
Sbjct: 54  AGKRLHMTKHIRPSPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIVKIGIERAGVRSTLI 113

Query: 122 LELYAQGNILLTDSEFTVL 140
           +EL+A+GN+L+ DSE  ++
Sbjct: 114 VELFARGNVLIVDSENKII 132


>gi|410670434|ref|YP_006922805.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
 gi|409169562|gb|AFV23437.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
          Length = 664

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 175/362 (48%), Gaps = 31/362 (8%)

Query: 351 FETFDAALDEFYSK----IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           F +F+ ALD F+ K      ++  E   K K D    +L K    QE  +    +E +R 
Sbjct: 274 FPSFNKALDGFFGKRSAEEVTEVVEAVKKEKVDVFERRLRK----QEEAIENFGREAERH 329

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
           V +AE I  + + ++  I  +  A  N  SW+++  ++K  ++   P A  I  +     
Sbjct: 330 VDVAEKIYAHYQVIEDVIGVLEKARQNGYSWDEIKSILKGAKET-VPAAKSISSIDSATG 388

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            + L L                 K  +D+ L+   NA+ +YE  KK   K+E  I A   
Sbjct: 389 RIVLDLEGT--------------KATIDIKLTIPQNAQSYYEKAKKLTRKKEGAIRAIED 434

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
              A +KK +     ++ V    HM+K HW+++F WF SSE +LV+ GRDA+ NE +VK+
Sbjct: 435 TRVAMQKKEKKVSGNKRKV----HMKK-HWYDRFRWFYSSEGFLVVGGRDAETNEELVKK 489

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWW 645
           YM K DV  H    GA  T++K     +PV   TL +A  F V +S  W S   +   +W
Sbjct: 490 YMDKSDVVFHTQDPGAPMTIVKAQ--GKPVTEQTLMEAAQFVVSYSSVWKSGQFSGDCYW 547

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           V P QVSKT  +GEY+  G+F+IRG++N+     + M   L    +   +G  ++  R  
Sbjct: 548 VLPEQVSKTPESGEYVKKGAFIIRGERNYFRDVQVGMAVALELGAETRVIGGPVSAVRQH 607

Query: 706 GE 707
           G+
Sbjct: 608 GQ 609



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 55/105 (52%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++E+G R H + + R     P  F + LRKHI   R+  VRQ  +DRII F    G   
Sbjct: 54  LVIEAGKRAHLSEHIRQSPKIPHSFPMLLRKHIFAGRITYVRQYDFDRIIEFGMVRGGVE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
             ++ EL++ GNI+L DSE  ++  ++      + +     ++YP
Sbjct: 114 TVLVAELFSPGNIVLLDSERKIILPMKPVTFKGRKIRSGEVYQYP 158


>gi|395506524|ref|XP_003757582.1| PREDICTED: uncharacterized protein LOC100920250 [Sarcophilus
           harrisii]
          Length = 231

 Score =  156 bits (395), Expect = 6e-35,   Method: Composition-based stats.
 Identities = 70/113 (61%), Positives = 86/113 (76%), Gaps = 2/113 (1%)

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
           H RK   FEKF WFISSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN
Sbjct: 46  HQRKCG-FEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTPGDIYVHADLHGATSCVIKN 104

Query: 610 HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              E P+PP TL +AG   +C+S AWD++++TSAWWVY HQ+      G+ L+
Sbjct: 105 PTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQLRSAFRVGDSLS 156


>gi|448329966|ref|ZP_21519260.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
 gi|445613154|gb|ELY66864.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
          Length = 720

 Score =  156 bits (395), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 166/714 (23%), Positives = 292/714 (40%), Gaps = 104/714 (14%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E + +  G+   M   +++  +++  + L   + +      D+ +
Sbjct: 186 VRTLATQLNFGGLYAEELCVRAGVEKGM---DIDDADEDVYERLYETIERL---ALDIRN 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G+  P  Y+   ++        +E   +  +  +  P  L +    +   +++F +ALD+
Sbjct: 240 GNFDPRLYLERDDEEADDGEGESEDADANVV--DVTPFPLEEHDDLDGEAYDSFLSALDD 297

Query: 361 FYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI--E 414
           ++ ++E    E+     +   F     K  +I   Q+  +   +QE +   + AEL+  E
Sbjct: 298 YFFRLELAEEEESDPTDQRPDFESEIAKQERIIEQQQGAIEGFEQEAEELREQAELLYAE 357

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLID--------KLYLER 465
           Y L  VD  +  ++ A     SW+++    +E  + G   A  ++D         + ++ 
Sbjct: 358 YGL--VDDILSTIQGAREQDRSWDEIRERFEEGAEQGIDAAEAVVDVDGSDGTVTVDIDG 415

Query: 466 NCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTI 521
             + L+    +  N D +  E K +  +K   + AL+A  N R   E  K++  + E   
Sbjct: 416 ERIGLVAGRGVEQNADRLYTEAKRVEEKK---EGALAAIENTREDLEEAKRRRDEWEADE 472

Query: 522 TAHSKAFKAAEKKTRLQ--ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           +  +   ++ E +   Q   L E +   I       WF++F WF +S++YLVI GR+A Q
Sbjct: 473 SGPAAETESDEDEEETQRDWLSEPS---IPIRENEPWFDRFRWFQTSDDYLVIGGRNADQ 529

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQ 633
           NE +VK+Y+  GD   H   HG   TV+K   P +       +P  ++ +A  F V ++ 
Sbjct: 530 NEELVKKYLEPGDKVFHTQAHGGPVTVLKATDPSEASSSDIELPESSIEEAAQFAVSYAS 589

Query: 634 AW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            W D +     + V   QVSKT  +GEYL  G F IRG + +    P+    G+
Sbjct: 590 VWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 643


>gi|448439536|ref|ZP_21588100.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
 gi|445691070|gb|ELZ43265.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
          Length = 733

 Score =  156 bits (394), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 181/741 (24%), Positives = 296/741 (39%), Gaps = 144/741 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADPDNVSDAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  S++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALQTVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +     +     + E     D+ ++ L  A+ +  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRASVEKETPIEEAT---DDQLRALHEALERIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD+ P  Y    ++  G      E+        +  P  L++      V F++F+AA+
Sbjct: 239 -SGDVDPRVYEEELDEGDGDGGEDDEADDRDPRVVDVTPFPLSEHEGLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++E + ++      + +A          K  +I   Q+  +   +++ +   + A
Sbjct: 298 DEYFYRLEHEESDAGEAPTDASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAEAERERA 357

Query: 411 ELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNC 467
           EL+  EY+L  VD  +  V+ A  N + W+++A  +    + G P A  ++D        
Sbjct: 358 ELLYAEYDL--VDEVLSTVQEARENDVPWDEIAETLDAGAERGIPAAEAVVD-------- 407

Query: 468 MSLLLSNNLDEMDDEEKTLPVE------KVEVDLALSAHANARRWYELKKKQESKQEKTI 521
                      +D  E T+ VE      +VE+D +     NA R Y+  K+ E K+E  +
Sbjct: 408 -----------VDGGEGTVTVELGEDDTRVELDASAGVEVNADRLYQEAKRIEGKKEGAM 456

Query: 522 TAHSKAFKAAEK-KTRLQILQEKTVAN-----------------------------ISHM 551
            A     +  E  K R    + K  A+                             I   
Sbjct: 457 EAIESTRQDLEAVKERKAEWKAKEAADDEEGGSDAGGGEGDEGEEEYETDWLSRSSIPIR 516

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   
Sbjct: 517 SPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAG 576

Query: 612 PEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GS
Sbjct: 577 PSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGS 636

Query: 666 FMIRGKKNFLPPHPLIMGFGL 686
           F+IRG + +    P  +  G+
Sbjct: 637 FVIRGDRTYFEDVPCRVAVGV 657


>gi|448339346|ref|ZP_21528374.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
 gi|445620575|gb|ELY74071.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
          Length = 721

 Score =  156 bits (394), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 166/713 (23%), Positives = 280/713 (39%), Gaps = 125/713 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRDF-------DRGRMELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFIFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT      
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP---- 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             LT S+E   +E D  + D                                        
Sbjct: 167 --LTVSREAFDHEMDDSDTD---------------------------------------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L +G   +E +    G+   M + + ++   + +   +  +A       D+
Sbjct: 185 -VVRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADEAVYDRLYETIERLA------LDI 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y+   ++    D   T  G    + D   P  L +    +   +++F +AL
Sbjct: 238 RNGNFDPRLYLETDDEDDDADGDGTPEGGDAHVVD-VTPFPLEEHEDLDGEPYDSFLSAL 296

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE +   + AEL+ 
Sbjct: 297 DDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAESLREQAELLY 356

Query: 414 -EYNL-EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            EY L +D+ + IL  R       SW+D+    +E  + G   A  +    ++ +     
Sbjct: 357 AEYGLVDDILSTILGAR---KRDRSWDDIRDRFEEGAEQGIDAAEAV----VDVDGSDGT 409

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           ++ ++D+          E++ +D       NA R Y   K+ E K+E  + A        
Sbjct: 410 VTVDIDD----------ERISLDAQQGVEQNADRLYTEAKRVEEKKEGALAAIENTRDDL 459

Query: 532 EKKTRLQILQEK-----------------------TVANISHMRKVHWFEKFNWFISSEN 568
           E   R +   E                        +  +I       WF++F WF +S+ 
Sbjct: 460 EDAKRRRDEWEDDESGGADEAEADEDEEDSQRDWLSEPSIPIRENEPWFDRFRWFHTSDG 519

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLN 622
           YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ 
Sbjct: 520 YLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPESSVA 579

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           +A  F V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 580 EAAQFAVSYSSVWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 632


>gi|385803199|ref|YP_005839599.1| hypothetical protein Hqrw_1937 [Haloquadratum walsbyi C23]
 gi|339728691|emb|CCC39852.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
          Length = 719

 Score =  155 bits (391), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 163/724 (22%), Positives = 286/724 (39%), Gaps = 147/724 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  LRR  G +    Y        F++ +        +  ++ LL+E 
Sbjct: 4   KQELTSVDIAALVTELRRYTGAKVDKTYRYGDDLLRFRMRDF-------DRGRLELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT    +  D    P  F + LR  +    L +V Q  +DRI++  F  G    
Sbjct: 57  GTQKRIHTADPDHVPDAPERPPNFAMMLRNRLSGADLVNVEQFEFDRIMILSFERGEEMT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+  GN+ +                D  G  I S      E  R+  RT A    
Sbjct: 117 RIIVELFGDGNVAVV---------------DSAGEVIQS-----LETVRLKSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                S+      P +V  D                           N++  D  R    
Sbjct: 157 YEFPDSR----VNPLQVTYDR---------------------FVSLMNESDTDIVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+    K +++    D   + +  A+      LQ  
Sbjct: 188 ----TLATQLNLGGLYAEEVCARAGI---DKTTQITNTSDKIYRAIYTALESLGTQLQ-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P    L  +     D  P              PL   + ++ +   +++F+ AL
Sbjct: 239 -SGDFEPR---LYADDDAVIDATP-------------FPLEERKQQNLDVTAYDSFNGAL 281

Query: 359 DEFYSKIE-SQRAEQQHKAKED--AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ +++ +  AE+  + + D  A   K  +I   QE  +   +Q  +     AEL+  
Sbjct: 282 DVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQEGAIDDFEQRAEAERSRAELLYA 341

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+  I  ++ A A   SW+++        + G   A  +    +  +    +++  
Sbjct: 342 NYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGIDAAAAV----VSVDGAEAMVTIE 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
           +D++          +V V++ +    NA + Y   K+ E K+E  +TA  +++    A K
Sbjct: 398 IDDV----------RVPVNVDVGVEKNADQRYTEAKRIEEKKEGALTAIENTREELNAVK 447

Query: 534 KTR------------------LQILQEK------------------TVANISHMRKVHWF 557
           + R                   + + +K                  ++ +I   +   W+
Sbjct: 448 QRRDAWDREDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTNDEWLSMTSIPLQKNDDWY 507

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           E+F WF +S  YLV+ GR+A QNE +VK+Y++K D + H + HG   T++K   P +P  
Sbjct: 508 EQFRWFHTSTGYLVVGGRNADQNETLVKKYLNKHDRFFHTEAHGGPITILKASGPSEPAE 567

Query: 618 PLTLN-----QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ L      +   F + +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG 
Sbjct: 568 PIELTAETRREVAQFAISYSSIWKEGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGD 627

Query: 672 KNFL 675
           + ++
Sbjct: 628 RTYI 631


>gi|222479900|ref|YP_002566137.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
 gi|222452802|gb|ACM57067.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
          Length = 733

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 178/741 (24%), Positives = 308/741 (41%), Gaps = 144/741 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADPENVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLQTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+    K + ++ + D+ ++ L  A+ +  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGV---EKETPIDDVTDDQLRALHEALERIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD+ P  Y    +    +D  P       ++ D   P  L++      V F++F+AA+
Sbjct: 239 -SGDVDPRVYEEELSDDEAEDRDP-------RVVD-VTPFPLSEHEGLPSVGFDSFNAAV 289

Query: 359 DEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           DE++ +++   +E+  +A  DA+           K  +I   Q+  +   +++ +   + 
Sbjct: 290 DEYFYRLDRDGSEE-GEAPADASPSRPDFEEEIGKQERIVEQQQGAIEGFEEQAEAERER 348

Query: 410 AELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           AEL+  EY+L  VD  +  V+ A    + W+++A  +    + G P A  +  +      
Sbjct: 349 AELLYAEYDL--VDEVLSTVQEAREAEVPWDEIAETLDAGAEQGIPAAETVVDVDGGEGT 406

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +++ L     E DD E T    ++E+D +     NA R Y+  K+ E K+E  +    +A
Sbjct: 407 VTVELRGGDGEDDDGETT----RIELDASAGVEVNADRLYQEAKRIEGKKEGAM----EA 458

Query: 528 FKAAEKKTRLQILQEK------------------------------------TVANISHM 551
            K+   +  L+ ++E+                                    + ++I   
Sbjct: 459 IKST--RAELEAVKERKAEWEAKEAAADETAGDGADDGEEEEDGEEYQTDWLSRSSIPIR 516

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               W+++F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   
Sbjct: 517 SPDDWYDRFRWFYTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAG 576

Query: 612 PEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +   P+     TL +   F V +S  W D +    A+ V P QVSKT  +GEY+  GS
Sbjct: 577 PSESADPVDFSEETLREVAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGS 636

Query: 666 FMIRGKKNFLPPHPLIMGFGL 686
           F+IRG + +    P  +  G+
Sbjct: 637 FVIRGDRTYFEDVPCRIAVGV 657


>gi|300706574|ref|XP_002995542.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
 gi|239604689|gb|EEQ81871.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
          Length = 644

 Score =  154 bits (389), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/340 (29%), Positives = 176/340 (51%), Gaps = 39/340 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F++F+ A++ F+     ++ E+  K         L KI   Q   +  L+  V      A
Sbjct: 239 FQSFNEAVEFFFMDRRKKKIEKVDK---------LQKIRNKQYEHIKELENMVKDMTMKA 289

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           +LI  N + V+  +      + N+++W D  +  ++E+  GN +A +I K  +  ++C+ 
Sbjct: 290 DLILKNADIVENVLDIHNYVIKNKLNWNDFLKFKEDEKSKGNEIADIIVKSDFKNKSCI- 348

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                 +D  D+E+       +E+    S H+NA+ ++E +KK E K  KT     KA  
Sbjct: 349 ------IDLKDNEDSHF----IEISFDKSLHSNAQNYFEKRKKFEEKILKT----EKAID 394

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             + KT  +  +EK    I   R V WFEKFN+  +++  LVI G++AQQNE+IVK++++
Sbjct: 395 TIKIKTYTK--EEK----IKIQRSVFWFEKFNFCFTTDKKLVIGGKNAQQNEIIVKKHLT 448

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
              +Y H +  G SS + +          + +++     +C+S  W+  +V+  ++V   
Sbjct: 449 PNHLYFHTESSGGSSVISE--------ADVNIDEVALVALCNSACWEVNVVSPVFYVKSD 500

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           QVSKT PTG++L  GSF+IRG K ++  + L  G GLLF+
Sbjct: 501 QVSKTPPTGQFLPKGSFLIRGTKTYVNVYKLEYGVGLLFK 540



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 14/148 (9%)

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
           S K +LL+E G+R+H T+ A D     S F   LRK  R  ++ D+ Q+G+DR+I+F+  
Sbjct: 42  SSKDILLIEPGIRIHLTSEADD---GISHFCNILRKKARRDKVVDIYQVGFDRVIVFE-- 96

Query: 113 LGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTEICRVF 169
             ++   +++E ++ GN+ + D    ++ + R  ++ D    I+   +Y   P E    +
Sbjct: 97  --LSRQKIVIEFFSGGNVFILDEFDKIVEVFRVVKELD----IIKNTQYVFNPAEFDFSW 150

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNE 197
           E     +    L   KE   N   K+N+
Sbjct: 151 ENFCNMEFKEFLPFEKELVDNLIKKINK 178


>gi|448313587|ref|ZP_21503301.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
 gi|445597955|gb|ELY52026.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
          Length = 723

 Score =  154 bits (389), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 164/680 (24%), Positives = 274/680 (40%), Gaps = 130/680 (19%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDSLETVRLKSRTVVPGSRYEFPE------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+++  LT S+E                               +FD  +    +  
Sbjct: 162 ----SRINP-LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E +    G+   M + + +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEVCTRAGVEKAMDIEDAD--EDVYDRLYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSS---TQIYDEFCPLLLN 341
            L          D+ +G+  P  Y+   +   G D    ESG+      + D   P  L 
Sbjct: 234 AL----------DLRNGNFEPRLYVDDGDDENGDDSEDDESGADEGPAPVVDA-TPFPLE 282

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQ----QHKAKEDAAFHKLNKIHMDQENRVH 397
           +        +++F AALD+++ ++E    E+      +   D    K  +I   QE  + 
Sbjct: 283 EHVELASEPYDSFLAALDDYFHRLELAEEEEPDPTDQRPDFDEQIAKHERIIEQQEGAIE 342

Query: 398 TLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
             ++E D     AEL+  EY L  VD  +  VR A      W+++    +E  + G   A
Sbjct: 343 GFEREADELRDQAELLYAEYGL--VDEILSTVRQAREQDRPWDEIEERFEEGAERGIEAA 400

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
             +  +      +++ +                E++E+        NA R Y   K+ E 
Sbjct: 401 EAVVGVDGSEGIVTVSVDG--------------ERIELVAQQGVEQNADRLYTEAKRVEE 446

Query: 516 KQEKTITA----HSKAFKAAEKKTRLQILQEK------------------TVANISHMRK 553
           K+E  + A      +  +  +++ R +    +                  + +++     
Sbjct: 447 KKEGALAAIEDTREELEEIVDRRDRWEAEDAETDEADEADEEEGEDRDWLSESSVPIREN 506

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P 
Sbjct: 507 EPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPS 566

Query: 614 QP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           +       +P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F
Sbjct: 567 EASSSDIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGF 626

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
            IRG + +    P+ +  G+
Sbjct: 627 AIRGDRTYYDDTPVGVAVGI 646


>gi|387592702|gb|EIJ87726.1| hypothetical protein NEQG_02273 [Nematocida parisii ERTm3]
          Length = 700

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 112/350 (32%), Positives = 169/350 (48%), Gaps = 30/350 (8%)

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
            D F S +++  A Q+    E A+  K  KI   QE  +H    E+      AEL+  N 
Sbjct: 269 FDGFGSAMDAAFAVQE--ITETASQKKHRKIREAQERDLHKKIDEMTILKTKAELLSENQ 326

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
            +V   I  +  A A  +S ++  R  KE  K  NP A +I K    +  + L++   L 
Sbjct: 327 AEVKNVISVIEAAHAASLSEKEFERF-KESEKDKNPTAKIIKKANFGKKTVDLIIDKQL- 384

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
                        V +D   S        Y+  KK E K +KT  A        E +T+ 
Sbjct: 385 -------------VTIDYTASIFEQINALYQKAKKIEEKLKKTRVA------LEESRTK- 424

Query: 538 QILQEKTVANISHM-RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
           +I   K +  I  + R V WFEKF W I+ ++ L+++GRD++QNE++VK+++   D Y H
Sbjct: 425 EIEVTKRIEKIEKIDRNVFWFEKFRWLITKDSDLILAGRDSKQNEILVKKHLLDTDYYFH 484

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           AD+ G SS ++  +         T   A    +  S+AW++  +T  + V   QVSKTAP
Sbjct: 485 ADVRGGSSVIVGENATVH-----TKEVAAAMALHLSKAWENSTITEVYCVRGEQVSKTAP 539

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
            GEYLT GSFMI GKK F  P  L  GF ++++L +  +    + R+V G
Sbjct: 540 AGEYLTHGSFMITGKKEFYHPTKLEYGFSIMYKLKDKEIEISDDNRQVSG 589



 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 72/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KEQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T    +K N TP    L LR+ I   R+E V QLG+DRI + +   G     +
Sbjct: 50  PPSKFHLTHKNYEKVNLTP--LALYLRREISNYRVEKVTQLGFDRIAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IIEMYANGNIILTDEELNIINLLR 131


>gi|448385151|ref|ZP_21563730.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
 gi|445657436|gb|ELZ10264.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
          Length = 719

 Score =  154 bits (388), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 171/727 (23%), Positives = 279/727 (38%), Gaps = 131/727 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +        +  ++ L++E 
Sbjct: 4   KRELTSVDLAALVGELGTYEGAKVDKAYLYGDDLVRLKMRDF-------DRGRLELILEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT      
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP---- 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             LT S+E   +E D  + D                                        
Sbjct: 167 --LTVSREAFDHEMDDSDTD---------------------------------------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L +G   +E +    G+   + + + ++       V     A  E    D+
Sbjct: 185 -VVRTLATQLNFGGLYAEEVCTRAGVEKGLDIDDADE------DVYDRIYAAIERLALDI 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y    ++  G D           + D   P  L +        +++F +AL
Sbjct: 238 RNGNFDPRLYFAGDDEADGDDESEETDAGDGPVVD-VTPFPLEEHADLPAEGYDSFLSAL 296

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE ++  + AEL+ 
Sbjct: 297 DDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAEQLRERAELLY 356

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLL 471
            EY L  VD  +  V+ A     +W+++    +E    G   A  +ID            
Sbjct: 357 AEYGL--VDEILSTVQQAREQDRAWDEIRERFEEGADRGIAAAEAVID------------ 402

Query: 472 LSNNLDEMDDEEKTLPV----EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  +D  E T+ V    E++E+        NA R Y   K+ E K+E  + A    
Sbjct: 403 -------VDGSEGTVTVDLDGERIELVADRGVEQNADRLYTEAKRVEDKKEGALAAIENT 455

Query: 528 FKAAEKKTRLQILQEKTVA---------------------NISHMRKVHWFEKFNWFISS 566
            +  E   R +   E   A                     +I       WF++F WF +S
Sbjct: 456 REDLEDAKRRRDEWEAQDAASDDEDEADDEGPKRDWLADPSIPIRENEPWFDRFRWFHTS 515

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLT 620
           ++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +       +P  +
Sbjct: 516 DDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESS 575

Query: 621 LNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           + +A  F V ++  W D +     + V   QVSKT  +GEYL  G F IRG + +    P
Sbjct: 576 IEEAAQFAVSYASVWKDGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTP 635

Query: 680 LIMGFGL 686
           +    G+
Sbjct: 636 VGAAVGI 642


>gi|126466189|ref|YP_001041298.1| hypothetical protein Smar_1299 [Staphylothermus marinus F1]
 gi|126015012|gb|ABN70390.1| protein of unknown function DUF814 [Staphylothermus marinus F1]
          Length = 663

 Score =  154 bits (388), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 123/441 (27%), Positives = 209/441 (47%), Gaps = 64/441 (14%)

Query: 243 VLGEALGYG-PA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           V G   G+G P  ++E +I   GL    K  ++N +E   +  L+     FE  + +V+ 
Sbjct: 179 VRGIVKGWGLPGYIAEELIYRAGLYEK-KNYKINMIEKTDLYSLIYI---FEKIINEVLE 234

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GY++  N             +   IY  + P L  +       K++  +  LD 
Sbjct: 235 G----KGYLVKLN-------------NEPHIYTSYEPKLYKELYELNVEKYDELNHVLDI 277

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           +Y + E +   +Q   K+     K+ K   +Q+  +    +E ++  K +E +  N  +V
Sbjct: 278 YYGEYEKRIYYEQKTTKQQMLIEKIKKNIEEQQKIIKKYIEESEKYRKFSETLVTNY-NV 336

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
              IL           WE +                 I + Y ++  +       + ++D
Sbjct: 337 LEKILKCVHETRRTSGWEKIVENCPN-----------IVEFYKDKGIV-------IVKLD 378

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKK---KQESKQEKTITAHSKAFKAAEKKTRL 537
           D E       + +D+ L    N  R+ +L     K+  + E+ +    K+ + A  K   
Sbjct: 379 DYE-------IPIDIRLDTWNNILRYKKLSGELLKKAKRAEEALRELEKSLEEAVNKK-- 429

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           Q++++KT   I   +   W+E+F+W I+SE +LVI+GRDA QNE+IVK+YM   D+++HA
Sbjct: 430 QLIEKKTEIGI---KPRLWYERFHWMITSEGFLVIAGRDADQNELIVKKYMEPHDIFLHA 486

Query: 598 DLHGASSTVIKNHR--PEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKT 654
           D+HGA +TVIK H   P Q     ++ +A     C+S+AW+        +WV+  QVSKT
Sbjct: 487 DIHGAPATVIKTHNRMPSQK----SIEEAAVIAACYSKAWNEGFGAIDVFWVHASQVSKT 542

Query: 655 APTGEYLTVGSFMIRGKKNFL 675
            P+GEYL+ G+FMI GKKN++
Sbjct: 543 PPSGEYLSKGAFMIYGKKNYV 563



 Score = 47.0 bits (110), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 78/163 (47%), Gaps = 10/163 (6%)

Query: 1   MVKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  D+ +      +++IG    N+Y  +   ++ K+         G+S    L 
Sbjct: 1   MIKKAMDILDIYSWTNNFGKQVIGCFIENIY-FTGFYWLLKIRCPG----KGKS---YLK 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E  +RLH +     +K     F+  +RK+IR  R+ DV+QLG++RII          + 
Sbjct: 53  IEPSIRLHVSNIDPLEKKIDK-FSSFMRKYIRGARIVDVKQLGWERIIELHVKSRNKKYI 111

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           +I E+  +G ++LT+  + +L   R     D+ +   S++  P
Sbjct: 112 LINEIMPRGFLVLTNETYNILYANRFQELRDRIIKRGSKYTPP 154


>gi|269862824|ref|XP_002650989.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065304|gb|EED43067.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 506

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 176 MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 226

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 227 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 286

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 287 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 324

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 325 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 379

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 380 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 433

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 434 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 480


>gi|20089538|ref|NP_615613.1| hypothetical protein MA0651 [Methanosarcina acetivorans C2A]
 gi|19914450|gb|AAM04093.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 788

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 190/394 (48%), Gaps = 30/394 (7%)

Query: 320 HPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE 378
           H   E     + +D   P  L ++   E   F++F+ ALDEF+ K    Q AE +   K+
Sbjct: 271 HVKKEINGKIETFD-VVPFDLIRYSEFEKEYFDSFNTALDEFFGKKALEQVAEVKAAEKK 329

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +       +  + QE  +    +E++++  +AE++  N + ++     +  A A   SW+
Sbjct: 330 EKTLGVYERRLLQQEESLAKFGKEIEKNNTLAEIVYANYQLIEELFSVLNGARAKGYSWD 389

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVD 494
           ++  ++K+ +K                   ++  +  +  +D +  T+ V+     V +D
Sbjct: 390 EIRSILKQAKK-------------------TVPAAQKITNIDQKTGTVTVDLDGRNVNLD 430

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV 554
           +  +   NA+ +YE  KK   K++  + A  +  KA EKK   +  +      +   RK 
Sbjct: 431 IRKTVPQNAQEYYEKVKKFSKKRDGALKAIEETKKAMEKKAASKAAKAGR--KLQAFRKK 488

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
           HW+++F WF+SS+ +LV+ GRDA  NE I K+Y+ K D+  H    GA  TV+K    E 
Sbjct: 489 HWYDRFRWFVSSDGFLVVGGRDADTNEEIFKKYLEKRDIVFHTQTPGAPLTVVKTGGEE- 547

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
            +P  TL +   F V +S  W S   +   +W+   QV+KT  +GEYL  G+F+IRG++N
Sbjct: 548 -IPESTLLEVARFAVSYSSLWKSGQFSGDCYWIKAEQVTKTPESGEYLKKGAFVIRGERN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           +    PL +  GL  + +   +G   +  R  G+
Sbjct: 607 YFKDIPLGVAVGLELKGETRVIGGPASAVRKHGD 640



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 68/139 (48%), Gaps = 11/139 (7%)

Query: 6   MNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      L++E
Sbjct: 5   MSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN---LVIE 57

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T Y R     P  F + LRK++   R+  V Q  +DRII            +I
Sbjct: 58  AGKRLHMTKYVRASPTLPQAFPMLLRKYLMGGRIISVEQHDFDRIIKIGIERAGVRSTLI 117

Query: 122 LELYAQGNILLTDSEFTVL 140
           +EL+A+GN+L+ DSE  ++
Sbjct: 118 VELFARGNVLIVDSENKII 136


>gi|448340269|ref|ZP_21529242.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
 gi|445630575|gb|ELY83836.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
          Length = 722

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 166/712 (23%), Positives = 282/712 (39%), Gaps = 122/712 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E +    G+   M + + ++      +V        E    D+ +
Sbjct: 186 VRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADE------EVYGRLYETIERLALDIRN 239

Query: 301 GDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           G   P  Y+                ESG++  +  +  P  L +    E   +++F +AL
Sbjct: 240 GTFDPRLYLEPDDAAGDDADGDGTAESGAARVV--DVTPFPLEEHDDLEGEPYDSFLSAL 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE     + AEL+ 
Sbjct: 298 DDYFFRLELAAEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAASLREQAELLY 357

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
            EY L  VD  +  ++ A     SW+++    +E  + G   A  I  +      +++  
Sbjct: 358 AEYGL--VDEILSTIQGARERERSWDEIRERFEEGAEQGIDAAEAIVDIDGSDGTVTV-- 413

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKA 530
                E+DDE       ++++D       NA R Y   K+ E K++  + A  +++   A
Sbjct: 414 -----EIDDE-------RIDLDAQQGVEQNADRLYTEAKRVEEKKDGALAAIENTRQDLA 461

Query: 531 AEKKTRLQILQEKT---------------------VANISHMRKVHWFEKFNWFISSENY 569
             K+ R +   +++                      ++I       WF++F WF +S+ +
Sbjct: 462 DAKRRRDEWEADESGGEDDDETDADGDDLPRDWLSESSIPIRENEPWFDRFRWFHTSDGF 521

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 522 LVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPDSSVAE 581

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           A  F+V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 582 AAQFSVSYSSVWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 633


>gi|387595331|gb|EIJ92956.1| hypothetical protein NEPG_02355 [Nematocida parisii ERTm1]
          Length = 700

 Score =  153 bits (387), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 169/357 (47%), Gaps = 37/357 (10%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F +A+D  ++  E      Q K +         KI   QE  +H    E+      A
Sbjct: 269 FDGFGSAMDAAFAVQEITETVSQKKHR---------KIREAQERDLHKKIDEMTILKTKA 319

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  N  +V   I  +  A A  +S ++  R  KE  K  NP A +I K    +  + L
Sbjct: 320 ELLSENQAEVKNVISVIEAAHAASLSEKEFERF-KESEKDKNPTAKIIKKANFGKKTVDL 378

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           ++   L              V +D   S        Y+  KK E K +KT  A       
Sbjct: 379 IIDKQL--------------VTIDYTASIFEQINALYQKAKKIEEKLKKTRVA------L 418

Query: 531 AEKKTRLQILQEKTVANISHM-RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
            E +T+ +I   K +  I  + R V WFEKF W I+ ++ L+++GRD++QNE++VK+++ 
Sbjct: 419 EESRTK-EIEVTKRIEKIEKIDRNVFWFEKFRWLITKDSDLILAGRDSKQNEILVKKHLL 477

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D Y HAD+ G SS ++  +         T   A    +  S+AW++  +T  + V   
Sbjct: 478 DTDYYFHADVRGGSSVIVGENATVH-----TKEVAAAMALHLSKAWENSTITEVYCVRGE 532

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
           QVSKTAP GEYLT GSFMI GKK F  P  L  GF ++++L +  +    + R+V G
Sbjct: 533 QVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIMYKLKDKEIEISDDNRQVSG 589



 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 72/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KEQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T    +K N TP    L LR+ I   R+E V QLG+DRI + +   G     +
Sbjct: 50  PPSKFHLTHKNYEKVNLTP--LALYLRREISNYRVEKVTQLGFDRIAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IIEMYANGNIILTDEELNIINLLR 131


>gi|433431126|ref|ZP_20407596.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|448568141|ref|ZP_21637718.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|448601017|ref|ZP_21656300.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
 gi|432194170|gb|ELK50822.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|445727091|gb|ELZ78705.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|445734620|gb|ELZ86178.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
          Length = 702

 Score =  153 bits (387), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDGDDDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|292656996|ref|YP_003536893.1| hypothetical protein HVO_2883 [Haloferax volcanii DS2]
 gi|448293595|ref|ZP_21483700.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
 gi|291371020|gb|ADE03247.1| conserved protein [Haloferax volcanii DS2]
 gi|445570456|gb|ELY25019.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
          Length = 702

 Score =  153 bits (387), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDGDEDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|297527127|ref|YP_003669151.1| hypothetical protein Shell_1151 [Staphylothermus hellenicus DSM
           12710]
 gi|297256043|gb|ADI32252.1| protein of unknown function DUF814 [Staphylothermus hellenicus DSM
           12710]
          Length = 663

 Score =  153 bits (387), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 180/356 (50%), Gaps = 49/356 (13%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
            IY  + P L  +       K++  +  LD +YS+ E +   +Q   K+     K+ K +
Sbjct: 247 HIYTSYEPKLYKELYDVSVEKYDKLNHVLDIYYSEYEKRIYYEQRTIKQRILIEKIKK-N 305

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
           +D++ ++  +K+ ++ S K  E                R  + N    E +   V + RK
Sbjct: 306 IDKQQKI--IKKYIEESEKYKEF--------------SRTLVTNYNLLEKILECVNKTRK 349

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARR 505
                    DK+    NC       N+ +   ++ T+ V+    ++ +D+ L+A  N  R
Sbjct: 350 TSG-----WDKIV--ENC------PNIVKYYKDKGTVIVKFNEYEIPIDIRLNAWNNILR 396

Query: 506 WYELKK---KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           + +L     K+  K E+ +    ++ + A  K   Q++Q +T   I   +   W+E+F+W
Sbjct: 397 YKKLSGELLKKAKKAEEALRELERSLEEAVNKK--QLIQRRTEIGI---KPRLWYERFHW 451

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPPLT 620
            I+SE +LVI+GRD  QNE+IVK+YM   D+++HAD+HGA +TVIK H   P Q     +
Sbjct: 452 MITSEGFLVIAGRDIDQNELIVKKYMEPHDIFLHADIHGAPATVIKTHNRMPSQK----S 507

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           + +A     C+S+AW         +WVY +QVSKT P+GEYL  G+FMI GKKN++
Sbjct: 508 IKEAAVIAACYSKAWKEGFGAIDVFWVYANQVSKTPPSGEYLPKGAFMIYGKKNYV 563



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 77/152 (50%), Gaps = 10/152 (6%)

Query: 1   MVKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  DV +      +++IG    N+Y  +   ++ K+  S      G+S    L 
Sbjct: 1   MIKKSMDILDVYSWTNNFGKQIIGCFIENIY-FTGFYWLIKIRCSG----KGKS---YLK 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E  +RLH +     +K     F+  +RKHIR  R+ DV+QLG++RII        N + 
Sbjct: 53  IEPSIRLHISNIEPLEKKIDK-FSSFMRKHIRGARIIDVKQLGWERIIELHVKSRKNEYI 111

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDK 151
           +I E+  +G ++LT+ ++++L   R     D+
Sbjct: 112 LINEILPRGFLVLTNEKYSILYANRFQELRDR 143


>gi|284166116|ref|YP_003404395.1| fibronectin-binding A domain-containing protein [Haloterrigena
           turkmenica DSM 5511]
 gi|284015771|gb|ADB61722.1| Fibronectin-binding A domain protein [Haloterrigena turkmenica DSM
           5511]
          Length = 723

 Score =  153 bits (386), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 170/684 (24%), Positives = 274/684 (40%), Gaps = 138/684 (20%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGETKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDS------ 162

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT        LT S+E                               +FD  +    +  
Sbjct: 163 RTNP------LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E I    G+   M ++E +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEICTRAGVEKAMDIAEAD--EDVYDRIYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESG---SSTQIYDEFCPLLLN 341
            L          D+ +G+  P  Y+   +    +     E+G   SS ++ D   P  L 
Sbjct: 234 AL----------DLRNGNFDPRLYVADDDGDEDESESGDENGDDSSSDRVVDA-TPFPLE 282

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQ-----HKAKEDAAFHKLNKIHMDQENRV 396
           +        +++F AALD+++ ++E    E++      +   +    K  +I   Q   +
Sbjct: 283 EHVELASEPYDSFLAALDDYFYRLELADDEEETDPTTQRPDFEEEIAKYERIIEQQRGAI 342

Query: 397 HTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
              +QE D   + AEL+  EY L  VD  +  V+ A A    W+++     EER      
Sbjct: 343 EGFEQEADALREQAELLYAEYGL--VDDILSTVQEARAQDRPWDEI-----EER------ 389

Query: 455 AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELK 510
                  + E     +  +  +  +D  E T+ VE    ++++        NA R Y   
Sbjct: 390 -------FAEGADRGIAAAEAVVNVDGSEGTVTVELDGERIDLVAKQGVEQNADRLYTEA 442

Query: 511 KKQESKQEKTITA----HSKAFKAAEKKTRLQILQEKTVA-----------------NIS 549
           K+   K+E  + A         +A  ++ R +                         ++ 
Sbjct: 443 KRVGEKKEGALAAIEDTREDLGEAKARRDRWEEADAADEGEDDEDDEGEERDWLSEPSVP 502

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                 WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K 
Sbjct: 503 IRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKA 562

Query: 610 HRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
             P +       +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL 
Sbjct: 563 TDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLE 622

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            G F IRG + +    P+ +  G+
Sbjct: 623 KGGFAIRGDRTYYRDTPVDVAVGI 646


>gi|448303302|ref|ZP_21493251.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
 gi|445593087|gb|ELY47265.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
          Length = 716

 Score =  153 bits (386), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 162/668 (24%), Positives = 279/668 (41%), Gaps = 113/668 (16%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPD------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+L+  LT S+E                               +FDL    +    
Sbjct: 162 ----SRLNP-LTVSRE-------------------------------AFDLEMEDSDTD- 184

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDN------AIQVL 284
                    +   L   L +G   +E I    G+   M +++ +  ED+      AI+ L
Sbjct: 185 ---------IVRTLATQLNFGGLYAEEICTRAGIEKGMDIADAD--EDDYDRLYEAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR 344
            L          D+ + +  P  Y+         D     + S+  +  +  P  L +  
Sbjct: 234 AL----------DLRNANFEPRLYLEDGEDGDDDDESDDSTESARVV--DATPFPLEEHA 281

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQENRVHTLK 400
                 +++F AALD+++ ++E    E+     +   F     K  +I   Q+  +   +
Sbjct: 282 ELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIIDQQQGAIEGFE 341

Query: 401 QEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKE--ER--KAGNPV 454
           Q+ D   + AEL+  EY L  VD  +  ++ A A    W+++    +E  ER  +A   V
Sbjct: 342 QQADDLREQAELLYAEYGL--VDDILSTIQDARAQDRPWDEIEARFEEGAERGIEAAEAV 399

Query: 455 AGL-----IDKLYLERNCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
            G+     I  + ++ + + L+    +  N D +  E K +  +K   + AL+A  + R 
Sbjct: 400 VGIDSSEGIVTVDIDGDRIDLVAHDGVEQNADRLYTEAKRVAEKK---EGALAAIEDTRE 456

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
             E  K++  + +       +A     ++T         + +I       WF++F WF +
Sbjct: 457 DLEDAKRRRDEWDADDEGDEQADDEDTEETNWL-----EMPSIPIRENEPWFDRFRWFHT 511

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPL 619
           S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  
Sbjct: 512 SDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPDS 571

Query: 620 TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG++ +    
Sbjct: 572 SIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGERTYHRDT 631

Query: 679 PLIMGFGL 686
           P+ +  G+
Sbjct: 632 PVGVAVGI 639


>gi|269862592|ref|XP_002650899.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065446|gb|EED43157.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 480

 Score =  153 bits (386), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 97  MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 147

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 148 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 207

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 208 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 245

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 246 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 300

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 301 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 354

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 355 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 401


>gi|167044451|gb|ABZ09127.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_APKG6D9]
          Length = 648

 Score =  152 bits (385), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 164/685 (23%), Positives = 293/685 (42%), Gaps = 151/685 (22%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           G   SN+Y ++  + +FKL ++       +S+  +++  SGV L  TA   D+   P+  
Sbjct: 21  GYYISNIYGITKDSILFKLHHTE------KSDLFMMVSTSGVWL--TAVKIDQME-PNRL 71

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLT 141
             +LR  +   +L+ + Q+G +RI  F F  G    +V++ E +  GNILL   E  +L 
Sbjct: 72  LKRLRSDLLRLKLKKIEQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCSKEMKILA 130

Query: 142 LLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNN 201
           L  S         I  RHR               KL   L   + P+         +G +
Sbjct: 131 LQHS---------IEVRHR---------------KLSVGLEYVQPPN---------NGLD 157

Query: 202 VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG----YGPALSEH 257
           + N  + +         FD+ K S     D   AK        G  LG    Y   + E 
Sbjct: 158 IFNILESD---------FDVLKTS-----DLVSAKW------FGRTLGLPKKYVEGIFEI 197

Query: 258 IILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
             +D   + N+  + E+ K+ +   +V++           DVISG+  P   I+++N+  
Sbjct: 198 ANIDPKKIGNLLTNDEITKIFETTKKVVL-----------DVISGNHKP---IIIRNEK- 242

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
                            E  P+ L +    E V   +F   LD  Y++    + +    +
Sbjct: 243 ----------------TEILPIKLGKMDG-EIVDVNSFIEGLDTVYTENIVTKGKSIQSS 285

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
             D    +      +QE  + T+K   DRS  +  +     E V + IL++  A A ++ 
Sbjct: 286 GSDKKIKEFQTQISEQEKAIQTVK---DRSKNITNVANSLFEMVSSGILSIEDASAQKIL 342

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
               A++  E+                    +SL++  +             EK++++  
Sbjct: 343 VNHNAKLTSEK-------------------GISLIIVQD-------------EKIKIN-- 368

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT-----VANISHM 551
             A +  +    L   +  KQ + I++  +     EKK  L+  Q KT     +  ++ +
Sbjct: 369 --AKSPLQSIASLLFNEAKKQSRAISSIEEIKSKTEKK--LEKFQNKTESEQDIMLVTEI 424

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           RK  W+E++ WF +++ YL + GRDA  N  +V++++ K D   HAD+ G+   +IK+  
Sbjct: 425 RKKSWYERYRWFYTTDGYLAVGGRDAASNSAVVRKHLVKNDKIFHADIFGSPFFIIKD-- 482

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             +  P  ++++    TVC S+AW   +    A+W++P QV K+AP+GE+L  GSF I G
Sbjct: 483 -AEHAPATSMDEVAHATVCFSRAWREGLYGVKAYWIHPEQVKKSAPSGEFLPKGSFTIEG 541

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSL 695
           ++NF+    L +  G++ + D  +L
Sbjct: 542 QRNFINSKNLKLAVGIIQQEDGHAL 566


>gi|261350362|ref|ZP_05975779.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
 gi|288861145|gb|EFC93443.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
          Length = 668

 Score =  152 bits (385), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 168/340 (49%), Gaps = 20/340 (5%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K G   A + +           
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKKNGLKEAEIFE----------- 376

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
               ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K 
Sbjct: 377 ----SIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKKQ 432

Query: 531 AEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
            EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+YM
Sbjct: 433 LEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNETVVKKYM 492

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
              D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV 
Sbjct: 493 DNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVN 550

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 551 PEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|269863550|ref|XP_002651263.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064852|gb|EED42793.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 335

 Score =  152 bits (385), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 102/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 1   MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 51

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 52  TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 111

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 112 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 149

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 150 -----KIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 204

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 205 NKYMEDRDLYFHCDVKGASSVICKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 258

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 259 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 305


>gi|397772651|ref|YP_006540197.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
 gi|397681744|gb|AFO56121.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
          Length = 722

 Score =  152 bits (385), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 166/712 (23%), Positives = 281/712 (39%), Gaps = 122/712 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E +    G+   M + + ++      +V        E    D+ +
Sbjct: 186 VRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADE------EVYGRLYETIERLALDIRN 239

Query: 301 GDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           G   P  Y+                ESG++  +  +  P  L +    E   +++F +AL
Sbjct: 240 GTFDPRLYLEPDDAAGDDADGDGTAESGAARVV--DVTPFPLEEHDDLEGEPYDSFLSAL 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE     + AEL+ 
Sbjct: 298 DDYFFRLELAAEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAASLREQAELLY 357

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
            EY L  VD  +  ++ A     SW+++    +E  + G   A  I  +      +++  
Sbjct: 358 AEYGL--VDEILSTIQGARERERSWDEIRERFEEGAEQGIDAAEAIVDIDGSDGTVTV-- 413

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKA 530
                E+DDE       ++++D       NA R Y   K+ E K++  + A  +++   A
Sbjct: 414 -----EIDDE-------RIDLDAQQGVEQNADRLYTEAKRVEEKKDGALAAIENTRQDLA 461

Query: 531 AEKKTRLQILQEKT---------------------VANISHMRKVHWFEKFNWFISSENY 569
             K+ R +   +++                      ++I       WF++F WF +S+ +
Sbjct: 462 DAKRRRDEWEADESGGEDDDETDADGDDLPRDWLSESSIPIRENEPWFDRFRWFHTSDGF 521

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 522 LVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPESSVAE 581

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           A  F V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 582 AAQFAVSYSSVWKDGRYAGDIYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 633


>gi|222445070|ref|ZP_03607585.1| hypothetical protein METSMIALI_00687 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434635|gb|EEE41800.1| fibronectin-binding protein A domain protein [Methanobrevibacter
           smithii DSM 2375]
          Length = 668

 Score =  152 bits (385), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 168/340 (49%), Gaps = 20/340 (5%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K G   A + +           
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKKNGLKEAEIFE----------- 376

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
               ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K 
Sbjct: 377 ----SIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKKQ 432

Query: 531 AEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
            EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+YM
Sbjct: 433 LEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYM 492

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
              D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV 
Sbjct: 493 DNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVN 550

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 551 PEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|149246271|ref|XP_001527605.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146447559|gb|EDK41947.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 701

 Score =  152 bits (384), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 129/243 (53%), Gaps = 8/243 (3%)

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
           +NL ++    K +PV+   +DL  S+ ANAR +++ KK  E  Q K       A++ AEK
Sbjct: 196 DNLGKLGSGRKGVPVK---IDLTQSSFANARIYFDSKKAAEQLQLKVEKGAEIAYRNAEK 252

Query: 534 KTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           K     +    +E    + S +R   WFEKF WF+SSE YL ++GRD  Q +MI  +Y+ 
Sbjct: 253 KISQDFVRNVKKELGSTDSSALRSKLWFEKFYWFVSSEGYLCLAGRDKTQVDMIYFKYVG 312

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D  V +++ G+    IKN   ++ +PP T+ QAG F +  S AW  K+ T+AW +   
Sbjct: 313 DDDYLVSSEIEGSLKVFIKNPIKDEAIPPSTILQAGIFAMSASHAWSGKVNTAAWVMQAS 372

Query: 650 QVSK-TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
            VSK  +  G  L  G F    KK+ LPP  L+MGFG    +DE S   H   R  R +E
Sbjct: 373 DVSKYDSAAGNLLPPGEFEYFAKKDLLPPAQLVMGFGFYCDVDEESAKKHAAIRVEREQE 432

Query: 709 EGM 711
            G+
Sbjct: 433 HGL 435



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 93/202 (46%), Gaps = 49/202 (24%)

Query: 867  EKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG 926
            +K  GKD+ S P             +SRG++ KLKK   KY DQDEEE+ +RM +L    
Sbjct: 492  DKSFGKDSKSSP------------MVSRGKQNKLKKAAAKYADQDEEEKALRMKVLGLNK 539

Query: 927  KVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDD-SSHGVED 985
             +++ D           KEK  ++    +P+   +      L +  K H  D  ++ V+ 
Sbjct: 540  SLKRKDS----------KEKSLSLP---SPQPVSRLSDQDELERKRKLHQQDVETYLVDP 586

Query: 986  NPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYS 1045
             P + L +                         N +D LT  PL SD +  ++PV  P+S
Sbjct: 587  QPKIDLADY-----------------------FNAMDQLTPKPLSSDTIFDMVPVFAPWS 623

Query: 1046 AVQSYKYRVKIIPGTAKKGKGI 1067
            A+Q +KY+VKI PG AKKGK I
Sbjct: 624  ALQKFKYKVKIQPGLAKKGKCI 645


>gi|444317477|ref|XP_004179396.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
 gi|387512437|emb|CCH59877.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
          Length = 1053

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/456 (27%), Positives = 215/456 (47%), Gaps = 78/456 (17%)

Query: 307 GYILMQ---NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR--SREFVKFETFDAALDEF 361
           GYI+ +   N  +G+D    E    T  ++ F P +    R  S+  +    ++  LD+F
Sbjct: 274 GYIVAKKNPNYVIGRDADDLEYVYET--FNPFEPFIDETHRTNSKIIIVDGPYNLTLDKF 331

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++ IES +   + + +E+ A  K+   H++ + R+  L      + +    I  N E ++
Sbjct: 332 FTTIESSKYALKIQTQEEQAKKKIEDAHLENKKRIDALINVQTSNEQKGYAIIANTELIE 391

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLLL-------- 472
               AV+  +  +M W  + +++K E+  GN VA  +I  L L+ N ++++L        
Sbjct: 392 TTKYAVQGLVDQQMDWNTIEKLIKNEQVRGNEVAENIILPLNLKENTINMILPLKSETSS 451

Query: 473 ---------------SNNL---DEMDDEEKTLPVEK------------------------ 490
                          S+N    +   DEE  + VE+                        
Sbjct: 452 IENSSSEEQDEYCSESDNEPANENTSDEESDISVEQDVSDFVEVTTIGNSPLISKKSKHK 511

Query: 491 ----------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQ 538
                     V +DL+LSA+ANA R+++ KKK   KQ++      KA K  E+  +T LQ
Sbjct: 512 RLQNNENSIIVSIDLSLSAYANASRYFDTKKKTAEKQKRVEENAEKAMKNIEQGIETSLQ 571

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
              +++   +  +RK ++FEK++WFISSE  LV+ G+ + + + I  +Y+   D+Y+   
Sbjct: 572 RKLKESHEVLKKIRKPYFFEKYHWFISSEKILVLMGKSSTETDQIYSKYIEDDDIYMSNS 631

Query: 599 LHGASSTVIKNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
               +   IKN  PE+  + P TL QAG F +  S+AW  K+ +S WW     VSK    
Sbjct: 632 FD--TQVWIKN--PEKIEISPNTLMQAGVFCMSSSEAWSKKIASSPWWCKAKNVSKFDKE 687

Query: 658 GEY-LTVGSFMIR--GKKNFLPPHPLIMGFGLLFRL 690
           G   L  G F+++   +K+ LPP  L+MG GLL+++
Sbjct: 688 GNTCLEPGKFILKNENEKHSLPPAQLVMGIGLLWKV 723



 Score = 86.7 bits (213), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 81/137 (59%), Gaps = 12/137 (8%)

Query: 20  RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
           +L G R +N+Y++  + + ++ K         +    K+ ++++ G+R+H T + R    
Sbjct: 20  KLEGYRLTNIYNIADTKRQFLLKF--------NKPDSKLNVVVDCGLRIHLTDFTRHIPQ 71

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
            PS F +KLRKH++++RL  +RQ+  DRII+ QF  G+   Y++LE ++ GN++L D   
Sbjct: 72  FPSDFVIKLRKHLKSKRLTKLRQVPGDRIIVLQFAEGL--FYLVLEFFSAGNVILLDENK 129

Query: 138 TVLTLLRSHRDDDKGVA 154
           T+L+L R  ++ +  V 
Sbjct: 130 TILSLQRVVKEHENKVG 146


>gi|333910763|ref|YP_004484496.1| fibronectin-binding A domain-containing protein [Methanotorris
           igneus Kol 5]
 gi|333751352|gb|AEF96431.1| Fibronectin-binding A domain protein [Methanotorris igneus Kol 5]
          Length = 675

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 189/368 (51%), Gaps = 30/368 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++ + E  ++  F  ALD+++++  ++   ++ ++K      K  +I   
Sbjct: 257 YVDVVPINLKKYENFEKKEYGEFLEALDDYFAQFMAKVETKKEESKLQKLIKKQERILKT 316

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   ++++  + +  +LI  N   VD  +  +R A   +M W  + ++V E +   
Sbjct: 317 QLETLEKYEKQMQENQEKGDLIYANYTLVDEILNTLRNA-REKMEWYKIKKIVNEHK--D 373

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI  +  +   + + LS +  +   E+       V +D+  +A  NA  +Y   K
Sbjct: 374 HPILGLIQNINEKNGEIVIKLSADYGDKKIEKN------VSLDIRKNAFENAETYYTKSK 427

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEK-----------TVANISHMRKVHWFEKF 560
           K +SK    I    +A K +EKK  L  L+EK                   ++  W+EKF
Sbjct: 428 KLKSK----IEGIKEAIKLSEKK--LAELKEKGEIELKELKEKEKIKKKERKERKWYEKF 481

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            W + +  +LVI+G+DA  NE+++K+Y    D+  HA + GA  TVIK ++  + V   T
Sbjct: 482 KWTVIN-GFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTNK--RIVDEET 538

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           LN+   F+V HS+AW         +WV P QVSKTA +GEYL  G+F+IRGK+NF+   P
Sbjct: 539 LNEVAKFSVAHSRAWKLGWGALDTYWVKPEQVSKTAESGEYLKKGAFVIRGKRNFIRNVP 598

Query: 680 LIMGFGLL 687
           L +G G++
Sbjct: 599 LELGIGII 606



 Score = 63.2 bits (152), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 53/97 (54%)

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +  T Y R+K   P  F + LRKH++  ++  + Q  +DRI++  F      + +++EL+
Sbjct: 63  ITMTNYEREKPKIPPTFAMLLRKHLKNIKITKIEQHDFDRIVIITFEWNETVYKLVIELF 122

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +GN++L D E  ++  L+  R   + +A    +++P
Sbjct: 123 GEGNVILLDKEDRIIMPLKIERWSTRTIAPKEIYKFP 159


>gi|148642838|ref|YP_001273351.1| RNA-binding protein snRNP-like protein [Methanobrevibacter smithii
           ATCC 35061]
 gi|148551855|gb|ABQ86983.1| predicted RNA-binding protein, eukaryotic snRNP-like protein
           [Methanobrevibacter smithii ATCC 35061]
          Length = 668

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 168/341 (49%), Gaps = 22/341 (6%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K                NC+  
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKK----------------NCLKE 371

Query: 471 L-LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
             +  ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K
Sbjct: 372 AEIFESIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKK 431

Query: 530 AAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
             EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+Y
Sbjct: 432 QLEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKY 491

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWV 646
           M   D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV
Sbjct: 492 MDNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWV 549

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 550 NPEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|170582502|ref|XP_001896158.1| hypothetical protein [Brugia malayi]
 gi|158596691|gb|EDP34993.1| conserved hypothetical protein [Brugia malayi]
          Length = 643

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 92/131 (70%), Gaps = 4/131 (3%)

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           +HAD+ GASS +I+N      VPP TLN+A    + +S AW++K+ +SAWWV+ HQVS+T
Sbjct: 1   MHADVRGASSIIIRNKLGGGDVPPRTLNEAATMAISYSSAWEAKITSSAWWVHQHQVSRT 60

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDF 714
           APTGEYLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V      M   
Sbjct: 61  APTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHREERKV----APMVTA 116

Query: 715 EDSGHHKENSD 725
           ED+  H+++ D
Sbjct: 117 EDNAMHQDDGD 127



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 94/185 (50%), Gaps = 18/185 (9%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            ++R QK K +K+K+KYGDQDEEER +R+ LLAS  K    D + +N N    ++ K   +
Sbjct: 217  MTRRQKHKAEKIKKKYGDQDEEERQLRLMLLASKPK-DTRDLEKKNINEKALEKAKKKNA 275

Query: 952  PVDAPKVC--YKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
                  +   ++C +   + ++ K  P   +   ED     L ET   D   M+ E    
Sbjct: 276  KDGKVSLTSQFECVRNASVVEE-KAEPSTIAKEEEDEQ---LLETDMADMAVMDAE---- 327

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
                E   LN    LT  PL  D+LL+ + V  PY  +Q++KY+VK+ PGT K+GK  + 
Sbjct: 328  ----ETKMLNS---LTWRPLDEDVLLFALVVVAPYQTMQNFKYKVKLTPGTGKRGKAAKS 380

Query: 1070 FYSLL 1074
              +L 
Sbjct: 381  AIALF 385


>gi|256811227|ref|YP_003128596.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           fervens AG86]
 gi|256794427|gb|ACV25096.1| Fibronectin-binding A domain protein [Methanocaldococcus fervens
           AG86]
          Length = 671

 Score =  150 bits (379), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 189/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E   + +F  A+D++++K  +    ++ K+K +    K   I   
Sbjct: 255 YFDVVPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLTNIVVKKEKSKIEREIEKQENILKR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q + +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++V+E ++  
Sbjct: 315 QMDTLKKYKEDAEKNQIKGDLIYANYQIVEELLSAIRQA-REKMDWARIKKIVRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENVGEIVIRLKSEVDDKVIEER------VSLDIRKNAFENAENYYEKAK 425

Query: 512 KQESKQEKTITA----HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           K ++K E    A      K  +  +K       +EK        ++  W+EKF W + + 
Sbjct: 426 KLKNKIEGIENAIELTKKKIEELKKKGEEELKEKEKLKMKKKVRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK    E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTEGRE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 79/164 (48%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDL---SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   DV   V  L+ LI  R    + +   + +  I K+     V E G  E V+ 
Sbjct: 1   MKTEMTNVDVCCVVDELQSLINGRLDKAFLIDNENNRELILKIH----VPEGGSRELVIS 56

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           + +    +  T Y R+K   P  F + LRK+++  +L  + Q+ +DRI++F F      +
Sbjct: 57  IGKYKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLVKIEQVNFDRIVIFHFETKEGIY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ +GN +  ++E  ++  LR  R   + +     +++P
Sbjct: 116 KLVVELFGEGNAIFLNNENVIIAPLRVERWSTRKIVPKEEYKFP 159


>gi|328909421|gb|AEB61378.1| serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 302

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 89/238 (37%), Positives = 138/238 (57%), Gaps = 11/238 (4%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 45  LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 100

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    TQ    Y+EF P L +Q     +++FE+FD 
Sbjct: 101 TSNFSGKGYIIQKREM----KPSLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDK 156

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D E+R+  L+Q  +      ELIE N
Sbjct: 157 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMN 216

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 217 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRN 274


>gi|73669087|ref|YP_305102.1| hypothetical protein Mbar_A1575 [Methanosarcina barkeri str.
           Fusaro]
 gi|72396249|gb|AAZ70522.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 797

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 197/407 (48%), Gaps = 22/407 (5%)

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           I PE  +  +  +L   H   E     + +D   P  L ++   E   F++F+ ALDEF+
Sbjct: 278 IKPEVGVEGEAPNLRPQHVKKEIKGKLETFD-VLPFDLTRYSGFEKEYFDSFNTALDEFF 336

Query: 363 SKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
            K    Q  E +   K++       +  + QE  +   ++E++++  +AE +  N + ++
Sbjct: 337 GKKALEQIEEVKAAKKKEKTLGVYERRLLQQEGSLKKFEKEIEKNNTLAETVYANYQGIE 396

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +  +  A +   SW+++  ++K+ +K   P A  I  +      +++    N D    
Sbjct: 397 ELLSVLNGARSTGYSWDEIRSILKQAKKT-VPAAQKITNIDPRTGTVTV----NFDG--- 448

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
                  + + +D+  +   NA+ +YE  KK   K++  + A     KA EKK   ++ +
Sbjct: 449 -------KSISLDIRKTVPQNAQEYYEKVKKFNKKKDGALKAIEDTRKAMEKKAVAKVAK 501

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
                  S  RK HW+++F WF+SS+ + ++ GRDA  NE I K+Y+ K D+  H    G
Sbjct: 502 AGRKLRAS--RKKHWYDRFRWFVSSDGFFIVGGRDADTNEEIFKKYLEKRDLVFHTQTPG 559

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEY 660
           A  TVIK    E  VP  TL +A  F V +S  W +   +   +WV   QVSKT  +GEY
Sbjct: 560 APLTVIKTGGEE--VPESTLQEAAQFAVSYSSLWKAGHFSGDCYWVKAEQVSKTPESGEY 617

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           +  G+F+IRG++N+    PL +  GL  + +   +G  ++  R  G+
Sbjct: 618 VKKGAFIIRGERNYFKDIPLGVAVGLELKGETRVIGGPVSAVRKHGD 664



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 70/148 (47%), Gaps = 21/148 (14%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVY-----DLSPKTYIFKLMNSSGVTESGE 52
           +K  M++ADVAA V  L    + +I  +   +Y     ++    Y+F     +       
Sbjct: 1   MKQDMSSADVAAVVAELSAGPKSIIDAKIGKIYQPANEEIRINLYVFHQGRDN------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
                L++E+G R+H + Y R     P  F + LRK++   R+  V Q  +DRI+     
Sbjct: 54  -----LVIEAGKRIHLSKYLRASPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIVKIGIE 108

Query: 113 LGMNAHYVILELYAQGNILLTDSEFTVL 140
                  +I+EL+A GNIL+ DSE  ++
Sbjct: 109 RAGVHSNLIVELFAPGNILIVDSENRII 136


>gi|448546430|ref|ZP_21626594.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|448548417|ref|ZP_21627684.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
 gi|448557611|ref|ZP_21632800.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445702883|gb|ELZ54823.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|445714168|gb|ELZ65935.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445714512|gb|ELZ66274.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
          Length = 702

 Score =  149 bits (377), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 172/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQERIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRDAREEGVPWDDIGATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +    A KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELEAVKKRRDEWEADDDEDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL+
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLH 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|448306550|ref|ZP_21496454.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
 gi|445597848|gb|ELY51920.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
          Length = 710

 Score =  149 bits (377), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 164/669 (24%), Positives = 278/669 (41%), Gaps = 121/669 (18%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPD------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+L+  LT S+E                               +FDL    +    
Sbjct: 162 ----SRLNP-LTVSRE-------------------------------AFDLEMEDSDTD- 184

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDN------AIQVL 284
                    +   L   L +G   +E I    G+   M +++ +  ED+      AI+ L
Sbjct: 185 ---------VVRTLATQLNFGGLYAEEICTRAGIEKGMDIADAD--EDDYDRLYEAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR 344
            L          D+ + +  P  Y+              +   S ++ D   P  L +  
Sbjct: 234 AL----------DLRNANFEPRLYLEDGEDG-------DDDDESARVVDA-TPFPLEEHA 275

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQENRVHTLK 400
                 +++F AALD+++ ++E    E+     +   F     K  +I   Q+  +   +
Sbjct: 276 ELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIIDQQQGAIEGFE 335

Query: 401 QEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKE--ER--KAGNPV 454
           Q+ D   + AEL+  EY L  VD  +  ++ A A    W+++    +E  ER  +A   V
Sbjct: 336 QQADELREQAELLYAEYGL--VDDILSTIQDARAQDRPWDEIEARFEEGAERGIEAAEAV 393

Query: 455 AGL-----IDKLYLERNCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
            G+     I  + ++ + + L+    +  N D +  E K +  +K     AL+A  + R 
Sbjct: 394 VGIDSSEGIVTVDIDGDRIDLVAHDGVEQNADRLYTEAKRVAEKKAG---ALAAIEDTRE 450

Query: 506 WYE-LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
             E  K++++              + AE+K  L++       +I       WF++F WF 
Sbjct: 451 DLEDAKRRRDEWDADDEGDEEADDEEAEEKNWLEM------PSIPIRENEPWFDRFRWFH 504

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPP 618
           +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P 
Sbjct: 505 TSDGYLVIGGRNADQNEDLVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPD 564

Query: 619 LTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +   
Sbjct: 565 SSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYHRD 624

Query: 678 HPLIMGFGL 686
            P+ +  G+
Sbjct: 625 TPVGVAVGI 633


>gi|150865765|ref|XP_001385110.2| highly conserved hypothetical protein Predicted RNA-binding
           [Scheffersomyces stipitis CBS 6054]
 gi|149387021|gb|ABN67081.2| conserved hypothetical protein Predicted RNA-binding protein
           [Scheffersomyces stipitis CBS 6054]
          Length = 1038

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 89/223 (39%), Positives = 121/223 (54%), Gaps = 2/223 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANI 548
           V +DL+LS +ANAR ++E KK  ESK+EK       A K AE+K +  +    +     +
Sbjct: 521 VWIDLSLSPYANARLYFESKKSAESKKEKVEKNTEMALKNAERKIKQDLAHNLKNEHDTL 580

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  +WFEKF WF+SSE YL ++GRD  Q +MI  R+ +  D +V A++ G+    +K
Sbjct: 581 KQLRPKYWFEKFYWFVSSEGYLCLAGRDPSQTDMIYYRFFNDNDFFVSAEMEGSLKVFVK 640

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N    + VPP TL QAG F    S AW  K+ TSAW ++   VSK    G  L  G F  
Sbjct: 641 NPFKGESVPPYTLMQAGNFAKSTSTAWSGKVSTSAWVLHGSDVSKKDFDGSLLAGGEFNY 700

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           + KK FLPP  L MGFGL    DE +   +   R  +  E G 
Sbjct: 701 KSKKEFLPPTQLTMGFGLYLLGDEETAQKYTKLRVNKEVEHGF 743



 Score =  120 bits (301), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 120/469 (25%), Positives = 224/469 (47%), Gaps = 63/469 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +   R  N+Y+L  S + Y+ K         S    K +++++ G R+H T + R     
Sbjct: 21  IANYRLQNIYNLAGSNRQYVLKF--------SVPDSKKIVVLDCGNRVHLTDFDRPTTPA 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PS F  KLRKH++TRRL  ++Q+G DR+++ +F  G+   Y++LE ++ GN+LL D    
Sbjct: 73  PSNFVSKLRKHLKTRRLSGIKQVGNDRVLVLEFSDGL--FYLVLEFFSAGNVLLLDDNLK 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKLHAALTSSKEPDANEPDK-VN 196
           +L+L R+ +  +KG       +Y   EI ++F+++  S+        ++ + +E    + 
Sbjct: 131 ILSLQRNVK--EKG----ENDKYAVNEIYKMFDKSLFSEDFK--YEKRDYNVDEIKAWIK 182

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           E    V N S+E   G+K  K F + K                   +L   + +   LS 
Sbjct: 183 EQRIKVENQSQEPSSGKK-SKVFSIHK-------------------LLFVNVSH---LSS 219

Query: 257 HIIL----DTGLVPNMKLSEVNKLEDN-AIQVLVLAVAKFE-DWLQDVISGDI-VPEGYI 309
            +IL    + G+  +    E    EDN  +  +V A+ K E +++  + +GD     G+I
Sbjct: 220 DLILKNLQNAGISGSSSCFEF--AEDNEKLSTIVGALDKSEQEYISFISAGDNEQTNGFI 277

Query: 310 LMQNKHLGKDHPPTESGSSTQ---IYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSK 364
           + +   L   + P+E  S      +YDEF P           +F + E ++  LD F+S 
Sbjct: 278 VSKKNPL---YNPSEEHSDNDLEYVYDEFHPFKPFKKNLEGYKFTEIEGYNKTLDTFFSA 334

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +ES +   + + ++  A  +L     ++  ++ +L Q+ + + K  + I Y+ + V + I
Sbjct: 335 LESTKFALKIEQQKQNANKRLENARSERNKQIQSLIQQQETNSKKGDTIIYHADLVASCI 394

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL 472
            A++  L  +M W ++  +VK E+ +GN +   I   L L  N ++L+L
Sbjct: 395 SAIQKMLDKQMDWGNIEAIVKHEQSSGNEIMSTIKLPLNLNENKINLVL 443



 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 57/174 (32%), Positives = 85/174 (48%), Gaps = 33/174 (18%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG+K KLKKM +KY DQDEEER +RM  L +  +V++             KEK+  +   
Sbjct: 823  RGKKAKLKKMAQKYADQDEEERRLRMTALGTLHQVEQQ-----------QKEKEIELQKA 871

Query: 954  DAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEE 1013
             A K   K +++  + +  KE   +    +ED      DE + M+ + +           
Sbjct: 872  -AEKEKEKYRESAAVQRRKKEQQRELQRYLEDE---NEDEASAMNYLEI----------- 916

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
                   +D     P P+D    ++PV GP+SA+Q  KY+VKI PG+ KKGK I
Sbjct: 917  -------LDSFLAKPQPNDKFSAIVPVFGPWSALQKLKYKVKIQPGSGKKGKCI 963


>gi|448622787|ref|ZP_21669436.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
 gi|445753295|gb|EMA04712.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
          Length = 701

 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 170/364 (46%), Gaps = 42/364 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +++     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEKQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   + E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL V       ++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSEN 568
           +   AA KK R +   +                   + ++      HWFE+F WF +S  
Sbjct: 441 REELAAVKKRRDEWEADDDDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSG 500

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQ 623
           YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +
Sbjct: 501 YLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLRE 560

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  +
Sbjct: 561 AAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKV 620

Query: 683 GFGL 686
             G+
Sbjct: 621 AVGI 624



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|448602394|ref|ZP_21656450.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445747909|gb|ELZ99363.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 702

 Score =  149 bits (376), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 103/365 (28%), Positives = 170/365 (46%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +++     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPNFEEQIAKQQRIIDQQEGAIEGFEKQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   + E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL V       ++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDDDEDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|289192132|ref|YP_003458073.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
 gi|288938582|gb|ADC69337.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
          Length = 671

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 190/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E   + +F  A+D++++K   +   ++ K+K +    +   I   
Sbjct: 255 YFDVVPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLVKVEVKKEKSKFEREIERQENILKR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++++E ++  
Sbjct: 315 QLGTLKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQA-REKMDWARIKKIIRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENVGEIVVRLKSEVDDNVIEER------VSLDIRKNAFENAESYYEKAK 425

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH----WFEKFNWFISSE 567
           K  +K E    A     K  ++  +    + K   +I   +KV     W+EKF W + + 
Sbjct: 426 KLRNKIEGIENAIELTKKKIDELKKKGEEELKEKESIQMKKKVRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK +  E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTYGRE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 77/163 (47%), Gaps = 2/163 (1%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   DV   V  L+ LI  R    + L       +L+    V E G  E V+ +  
Sbjct: 1   MKSEITNVDVCCVVDELQNLINGRLDKAF-LIDNEQNRELILKIHVPEGGSRELVISIGR 59

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
               +  T Y R+K   P  F + LRK+++  +L  + Q+ +DRI++F F      + ++
Sbjct: 60  YKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLIKIEQVNFDRIVIFHFETRDGIYKLV 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GNI+  ++E  ++  LR  R   + +    ++++P +
Sbjct: 119 AELFGDGNIIFLNNEDIIIAPLRVERWSSRNIIPREKYKFPPQ 161


>gi|448409564|ref|ZP_21574778.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
 gi|445672910|gb|ELZ25479.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
          Length = 729

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 146/656 (22%), Positives = 252/656 (38%), Gaps = 129/656 (19%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           P  F + LR  ++   L DV Q  +DRI+   F        ++ EL+  GN+ + D    
Sbjct: 78  PPNFAMMLRNRMQGAELVDVSQFQFDRILELTFERDDETTTIVAELFGDGNVAILDGTGE 137

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           V+  L                    E  R+  RT A        S++             
Sbjct: 138 VIDCL--------------------ETVRLKSRTVAPGAQYEFPSAR------------- 164

Query: 199 GNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
            N +             G  +D  +   ++S+         L   L   L +G    E +
Sbjct: 165 FNPL-------------GVDYDAFEARMRDSD-------SDLVRTLATQLNFGGLYGEEL 204

Query: 259 ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
               G+  N+ + E     D+ ++ L  A+ +  D L D    D+ P  Y  + +     
Sbjct: 205 CTLAGVDYNVPIEEAT---DDQLRALYDALRRLADRLAD---SDLDPRVYYDLDDPD--A 256

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           + P  + G+      +  P+ L ++  R    F++F+ ALD++++    +  E    A  
Sbjct: 257 EDPTDDDGAIEGQRVDVTPIPLAEYDDRYGEPFDSFNEALDDYFTFASDEDDEGGGDAAG 316

Query: 379 --------DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
                   ++   K  +I   Q+  +   + + +R    AE +  N + VD  +  V+ A
Sbjct: 317 GDRGRPDFESEIAKHERIIEQQQGAIEDFEAQAERERANAEALYANYDLVDDILSTVQEA 376

Query: 431 LANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK 490
            A   SW+D+     E  + G P A  +  L      +++       ++D E  TL   +
Sbjct: 377 RAEDRSWDDIEERFAEGARQGIPAAEAVVSLDGSEGTVTI-------DIDGERVTLAASE 429

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV----- 545
                      NA R Y   K+ E K+E    A       A+ ++ L+ ++E+       
Sbjct: 430 -------GVEKNADRLYREAKRIEGKKEGAEEA------IAQTRSELEAVEERKAEWEAA 476

Query: 546 ----------------------------ANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
                                        +I   +  HWFE + WF +S+ +LVI GRDA
Sbjct: 477 DAGEAGSGGDESEGSDEDDDEPVDWLAEPSIPVRQSDHWFEDYRWFHTSDGFLVIGGRDA 536

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCH 631
             NE +VK+Y+ +GD + HA  HG  +T++K   P +       +P  +  +A  F V +
Sbjct: 537 DDNEDLVKKYLDRGDRFFHAQAHGGPATILKATGPSESYDDDVEIPESSKCEAAQFAVSY 596

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W D K     + V   QVSKT  +GE+L  G F IRG + +     + +  G+
Sbjct: 597 SSIWKDGKFAGDVYEVGSDQVSKTPESGEFLEKGGFAIRGDRTYYESTEVGVAVGI 652


>gi|410695646|gb|AFV74963.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis florea]
          Length = 273

 Score =  147 bits (370), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    D +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKYGFTLGCKIGKDFNIEED-MSKLILALEYANDMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  KQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175434|gb|AFJ66834.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175436|gb|AFJ66835.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175438|gb|AFJ66836.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175440|gb|AFJ66837.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175442|gb|AFJ66838.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175448|gb|AFJ66841.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175450|gb|AFJ66842.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175488|gb|AFJ66861.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175496|gb|AFJ66865.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  146 bits (369), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + KF +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKKFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|288560094|ref|YP_003423580.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
 gi|288542804|gb|ADC46688.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
          Length = 669

 Score =  146 bits (369), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 185/370 (50%), Gaps = 38/370 (10%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           ++ ++   + L+Q+ + E   F++F+ A DEFYS           +A  +    K +K  
Sbjct: 259 KVKEDVVAIRLHQYENFEEESFDSFNEACDEFYSSKVKHEITDIQEAVWNKKVGKFSKRL 318

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             QE  +   ++ ++ S K  EL+  N   V+  +  ++ A      W+++ + +K+ +K
Sbjct: 319 EKQEETLRGFEKTIEDSQKKGELLFTNYVQVENILNVIKDAREKDYGWKEIGKTLKDAKK 378

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMD---DEEKTLPVEKVEVDLALSAHANARRW 506
           +G   A + + +    N     ++ N+D +    D +K++P              NA  +
Sbjct: 379 SGMAEAQIFESMDPLGN-----ITLNIDGISIALDSKKSIP-------------DNAEVY 420

Query: 507 YELKKKQESKQEKTITA--HSKA-FKAAEKKTRLQILQEKTVANISHMRK-----VHWFE 558
           YE  KK + K +    A  ++KA  K  E+K      +EK +ANI   +K     + W+E
Sbjct: 421 YEKAKKAKRKIKGAKIAIENTKAQLKDMEEK------KEKAMANIMVPQKRVKKNLKWYE 474

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           K  WF+SS+  LV+ GRDA  NE +VK+Y+ + DVY+HAD+HGA S V K       +  
Sbjct: 475 KLRWFVSSDGTLVVCGRDAGSNEAVVKKYLEQNDVYLHADIHGAPSVVAK--ISSDKLNN 532

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
             L + G F    S AW     T   +WV P QVSKT  +GE++  G+F+IRGK+N++  
Sbjct: 533 NLLKELGIFAASFSSAWSRNYGTQDVYWVEPEQVSKTPVSGEFVPKGAFIIRGKRNYIRG 592

Query: 678 HPLIMGFGLL 687
             L +  G++
Sbjct: 593 AKLEIAIGIV 602



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 60/107 (56%), Gaps = 1/107 (0%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R+H + Y      +P  F + LRK ++   +  ++Q  +DR++  +    +  
Sbjct: 50  LVIQAGKRIHISQYPLANPQSPPSFPMLLRKRVKGANVVSIQQHNFDRVVEIKMKKDI-T 108

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           + +I+EL+A+GNI+L + E  +L  L+  +  D+ ++    + +P E
Sbjct: 109 YTLIVELFAKGNIILLNEENEILLPLKRKQWSDRDISSKKEYVFPIE 155


>gi|15790499|ref|NP_280323.1| hypothetical protein VNG1508C [Halobacterium sp. NRC-1]
 gi|169236235|ref|YP_001689435.1| hypothetical protein OE3153R [Halobacterium salinarum R1]
 gi|10580999|gb|AAG19803.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
 gi|167727301|emb|CAP14087.1| conserved hypothetical protein [Halobacterium salinarum R1]
          Length = 703

 Score =  146 bits (368), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 160/652 (24%), Positives = 259/652 (39%), Gaps = 116/652 (17%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H     +  D    P  F   LR  +       VRQ G+DRI+ F+
Sbjct: 49  RVELLVEVGETKRAHVADPTHVPDAPGRPPNFAKMLRNRLSGADFHAVRQHGFDRILEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        ++ EL+  GNI + D +  V+  L                    +  R+  
Sbjct: 109 FRREDADTTIVAELFGDGNIAVLDPQREVVDSL--------------------DTVRLQS 148

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A            PDA    +VN                       DLS  +     
Sbjct: 149 RTVAPGRDYGF-----PDA----RVN---------------------PLDLSYEAFAEQ- 177

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
              R     L   L   L +G   +E +    G+    K + V    ++ ++ L  A   
Sbjct: 178 --MRDSDTDLVRTLATQLNFGGLYAEELCSRAGV---EKTTPVADAPESTLEALFDAS-- 230

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVK 350
            E  L ++ +GD+ P+ Y           + PT+         +  P+ L++        
Sbjct: 231 -ETLLGNISAGDLDPQVY-----------YEPTDDEDEQGARVDVTPIALDERADLPSDA 278

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           FE+F+ ALD++++ +++   E   +  +   F     K  +I   QE  +   + + +  
Sbjct: 279 FESFNDALDDYFTNLDTSEDEDSGETVDRPDFENEIEKQQRIIEQQEQAIEDFEAQAEAE 338

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AE +  + + VD  + AVR A      W+ +A    +   A   V G   + ++  N
Sbjct: 339 REKAESLYGHYDLVDGLLSAVRQAREAGHGWQQIADTFDD---AAGDVPGA--EAFVGVN 393

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--H 524
             + ++   +D+            V +D +     NA R Y   K+ E K+     A  +
Sbjct: 394 ESAGMIRARIDD----------HTVTLDPSAGVEKNADRLYTEAKRIEEKKAGARAAIEN 443

Query: 525 SKAFKAAEKKTRLQILQEK---------------TVANISHMRKVHWFEKFNWFISSENY 569
           ++A   A K+ R +   E                + ++I    +  W+E+F WF +SE +
Sbjct: 444 TRADLDAVKQRRDEWEAEPESEHEDDADDEVAWLSRSSIPIRHQEQWYERFRWFRTSEGF 503

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA QNE +VK+YM + D + H+  HG   TV+K   P +P     VP     QA
Sbjct: 504 LVIGGRDAGQNEELVKKYMDRYDRFFHSQAHGGPITVLKTSAPSEPSNDIEVPERDARQA 563

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             F V  S  W D +    A+ V P QVSKT  +GEYL  G F +RG + + 
Sbjct: 564 ARFAVACSSVWKDGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAVRGDRTYF 615


>gi|448659123|ref|ZP_21683091.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
 gi|445760625|gb|EMA11882.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
          Length = 717

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 160/663 (24%), Positives = 263/663 (39%), Gaps = 103/663 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P  L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRVD-VTPTPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F+ ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNPALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQERIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-L 509
               ++L +               N DE+  E K +  +K   + AL+A  N R   E +
Sbjct: 406 SEGTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAV 462

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +
Sbjct: 463 KERREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGF 517

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA
Sbjct: 518 LVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQA 577

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
             F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  + 
Sbjct: 578 AQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVA 637

Query: 684 FGL 686
            G+
Sbjct: 638 VGI 640


>gi|354610742|ref|ZP_09028698.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
 gi|353195562|gb|EHB61064.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
          Length = 745

 Score =  145 bits (367), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 176/383 (45%), Gaps = 46/383 (12%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           F+ F+ ALD++++ +++   E+  +A      +A   K  +I   Q+  +   +Q+ +  
Sbjct: 320 FDRFNDALDDYFTNLDTTEEEESGEAVSRPDFEAEIEKQKRIIEQQQQAIDDFEQQAEAE 379

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN-PVAGLIDKLYLER 465
            + AEL+  N + VD  I  V  A      W+D+A   +E   AG+ P A +   +    
Sbjct: 380 REKAELLYGNYDLVDELIGVVADARGAGHGWQDIAERFEE--AAGDVPGADVFVGVNESE 437

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA-- 523
             + + + ++  E+D E     VEK           NA R Y   K+ E KQE    A  
Sbjct: 438 GTVRVRIDDHTIELDPESG---VEK-----------NADRIYTEAKRIEEKQEGARAAIE 483

Query: 524 HSKAFKAAEKKTRLQILQE---------KTVANISHMRKV--------HWFEKFNWFISS 566
           +++    + K+ R +   E           +A++  + +          W+E+F WF +S
Sbjct: 484 NTRGDLESAKQRREEWEAEPDEQESEADDELADVDWLSRSSIPIRNQEQWYERFRWFRTS 543

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTL 621
           E +LV+ GRDA QNE +VK+YM + D + H+  HG   TV+K   P +P     VP    
Sbjct: 544 EGFLVLGGRDADQNEELVKKYMDRYDRFFHSQAHGGPITVLKTSAPSEPSNEIEVPETDK 603

Query: 622 NQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            QA  F VC S  W D +    A+ V P QVSKT  +GEYL  G F IRG + +    P 
Sbjct: 604 RQAAQFAVCCSSVWKDGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLPA 663

Query: 681 IMGFGLLFRLDESSLGSHLNERR 703
               G+    +   LG  ++  R
Sbjct: 664 EWAVGIACEPNTRVLGGPIDAVR 686



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H  A  +  D    P  F   LR  +      +VRQ G+DRI+ F+
Sbjct: 49  RVELLLEVGETKRAHVAAPEHVPDAPGRPPNFAKMLRNRLSGADFHEVRQHGFDRILEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F        +++EL+  GN+ + D    V+  L + R   + VA  +++ +P+
Sbjct: 109 FRREDQDTTIVVELFGDGNVAVLDQNGEVVDCLETVRLKSRTVAAGAQYGFPS 161


>gi|257388236|ref|YP_003178009.1| fibronectin-binding A domain-containing protein [Halomicrobium
           mukohataei DSM 12286]
 gi|257170543|gb|ACV48302.1| Fibronectin-binding A domain protein [Halomicrobium mukohataei DSM
           12286]
          Length = 708

 Score =  145 bits (367), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 178/382 (46%), Gaps = 45/382 (11%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQ-RAEQQHKAKED-----AAFHKLNKIH 389
            P+ L ++   E   FETF  ALDE++ ++E +  AE+   A  D     +   K  +I 
Sbjct: 265 TPIPLEEYDDVESRAFETFTEALDEYFYEVEREDTAEEIADAGVDRPDFESEIEKYERII 324

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             Q++ +   + + +   + AEL+    + VD  +  ++ A      W+++    +E ++
Sbjct: 325 QQQQSAIEDFESDAEAEREKAELLYARYDLVDEILSTIQGARTQDTPWDEIEATFEEGKE 384

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
            G   A  ++ L      ++L    ++D++          +V +D  +    NA + Y+ 
Sbjct: 385 QGIAAAEAVEGLDGSEGTVTL----SIDDV----------RVTIDATMGVEKNADQLYQA 430

Query: 510 KKKQESKQE---KTITAHSKAFKAAEKK-----------TRLQILQEKTV-----ANISH 550
            K+ E K+E     I    +  +A E++           T+ Q  +   V     A+I  
Sbjct: 431 AKRIEEKKEGAQAAIEDTREDLEAVERRRENWEAEDTTETQEQTAEADDVDWLSRASIPV 490

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            R+  W+++F WF +S  +LVI GR+A QNE +VK+Y+ +GD + HA  HG   TV+K  
Sbjct: 491 RRQEPWYDRFRWFRTSNGFLVIGGRNADQNEELVKKYLDRGDKFFHAQAHGGPVTVLKAT 550

Query: 611 RPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            P +      +P     +A  F V +S  W D K    A+ V P QVSKT  +GEYL  G
Sbjct: 551 GPSESSRDVDIPDQDKREAATFAVAYSSVWKDGKYAGDAYMVDPDQVSKTPESGEYLEKG 610

Query: 665 SFMIRGKKNFLPPHPLIMGFGL 686
            F IRG + +     + +  G+
Sbjct: 611 GFAIRGDRTYFRDLEVDVAVGI 632



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H     +  D    P  F + LR  I    L DVRQ  +DRI+ F+
Sbjct: 49  RVELLIEVGENKRAHVVDADHVPDAPGRPPNFAMMLRNRISGGELADVRQFEFDRIMEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F     +  V+ EL+  GN+ + D    V+  L + R   + VA  S++ +P+
Sbjct: 109 FDRPDASTTVVAELFGDGNVAVLDEHGEVVDCLETVRLKSRTVAPGSQYEFPS 161


>gi|387175444|gb|AFJ66839.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175452|gb|AFJ66843.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175454|gb|AFJ66844.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175462|gb|AFJ66848.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175464|gb|AFJ66849.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175466|gb|AFJ66850.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175470|gb|AFJ66852.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175474|gb|AFJ66854.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175476|gb|AFJ66855.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175478|gb|AFJ66856.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175480|gb|AFJ66857.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175482|gb|AFJ66858.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175484|gb|AFJ66859.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175486|gb|AFJ66860.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175494|gb|AFJ66864.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175498|gb|AFJ66866.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175506|gb|AFJ66870.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175508|gb|AFJ66871.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175510|gb|AFJ66872.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175514|gb|AFJ66874.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175516|gb|AFJ66875.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175518|gb|AFJ66876.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175520|gb|AFJ66877.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175522|gb|AFJ66878.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175528|gb|AFJ66881.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  145 bits (365), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|410695644|gb|AFV74962.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis cerana]
          Length = 273

 Score =  145 bits (365), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+     +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGRDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKALLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175512|gb|AFJ66873.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175524|gb|AFJ66879.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175526|gb|AFJ66880.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|15669822|ref|NP_248636.1| hypothetical protein MJ_1625 [Methanocaldococcus jannaschii DSM
           2661]
 gi|42559938|sp|Q59020.1|Y1625_METJA RecName: Full=Uncharacterized protein MJ1625
 gi|1592339|gb|AAB99643.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
           2661]
          Length = 671

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 191/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L +++  E   + +F  A+D++++K  ++   ++ K+K +    +   I   
Sbjct: 255 YFDVVPIDLKKYKGLEKKYYNSFLEAVDDYFAKFLTKVVVKKEKSKIEKEIERQENILRR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++++E ++  
Sbjct: 315 QLETLKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQA-REKMDWARIKKIIRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENIGEIIIRLKSEVDDKVIEER------VSLDIRKNAFENAESYYEKAK 425

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH----WFEKFNWFISSE 567
           K  +K E    A     K  E+  +    + K   ++   +K+     W+EKF W + + 
Sbjct: 426 KLRNKIEGIENAIELTKKKIEELKKKGEEELKEKESMQMKKKIRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK    E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTQGKE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGVGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 79/163 (48%), Gaps = 2/163 (1%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   DV   V  L+ LI  R    + L       +L+    V E G  E V+ + +
Sbjct: 1   MKSEITNVDVCCVVDELQNLINGRLDKAF-LIDNEQNRELILKIHVPEGGSRELVISIGK 59

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
               +  T Y R+K   P  F + LRK+++  +L  + Q+ +DR+++F F      + ++
Sbjct: 60  YKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLIKIEQVNFDRVVIFHFETRDGIYKLV 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GNI+  ++E T++  LR  R   + +    ++++P +
Sbjct: 119 AELFGDGNIIFLNNEDTIIAPLRVERWSTRNIVPKEKYKFPPQ 161


>gi|227828200|ref|YP_002829980.1| hypothetical protein M1425_1938 [Sulfolobus islandicus M.14.25]
 gi|229585429|ref|YP_002843931.1| hypothetical protein M1627_2016 [Sulfolobus islandicus M.16.27]
 gi|227459996|gb|ACP38682.1| protein of unknown function DUF814 [Sulfolobus islandicus M.14.25]
 gi|228020479|gb|ACP55886.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.27]
          Length = 609

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 172/337 (51%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I   +    +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII-AQENNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|395645660|ref|ZP_10433520.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
 gi|395442400|gb|EJG07157.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
          Length = 635

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 170/348 (48%), Gaps = 39/348 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+T++AAL+ FY ++ +   +++ K  +     +   I + QE  +   + ++ R+ K  
Sbjct: 255 FDTYNAALESFYPEVPASVTKEEEKRPK---LTREEVIRLQQETAIKKFESKIARAEKAV 311

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E I  N   V   I  ++ A +  MSW+++ +++K               L   +  +S+
Sbjct: 312 EAIYTNYPLVQEVITTLQRA-SRSMSWQEIEKILKS------------SDLPAAKAVVSV 358

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             ++   ++D   +      V + +  S  AN  R+Y+  KK   K+E  + A  +    
Sbjct: 359 HPADAAVDVDVGMQ------VTIHVHESVEANVERYYDQIKKFRKKKEGALAAMERGVPK 412

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
            ++K +  +          H+ K  WF +F WF +++  LV+ GRDA QNE +VKRYM  
Sbjct: 413 QKEKPKETL----------HLLKKKWFHRFRWFYTTDGTLVLGGRDASQNEELVKRYMEG 462

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS-KMVTSAWWVYPH 649
            D +VHAD+HG S  ++K      P   L  ++  CF   +S AW +       +   P 
Sbjct: 463 KDTFVHADVHGGSVVIVKG-----PTEHLE-DEVACFAASYSNAWKAGHFAADVYIARPD 516

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           QVSKT  +GEY++ G+F++RG++ ++   PL +  G+  + D + +G 
Sbjct: 517 QVSKTPESGEYVSRGAFIVRGERQYVRDVPLGVAIGVQLKPDVTVIGG 564



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  DV A V  L   + +    +Y    KT   +L    GV       K   L+E+G R
Sbjct: 7   MSGIDVRAMVTELCGHLPLWIGKIYQYDTKTLGIRLNGEGGV-------KHQFLIETGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H      +   TP G+ + LRKH+   R+  + Q G  RI     G       +++EL+
Sbjct: 60  AHLVRSLPESPKTPLGYAMFLRKHLEGGRVRAIGQYGLQRIFYIDIGKKTGVLRLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN +L D    +L  L  HR  D+ V
Sbjct: 120 DEGNAVLLDEGGVILKPLWHHRFKDRAV 147


>gi|238620391|ref|YP_002915217.1| hypothetical protein M164_1946 [Sulfolobus islandicus M.16.4]
 gi|238381461|gb|ACR42549.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.4]
          Length = 609

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 171/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|387175446|gb|AFJ66840.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175458|gb|AFJ66846.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175460|gb|AFJ66847.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175468|gb|AFJ66851.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175472|gb|AFJ66853.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175490|gb|AFJ66862.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175492|gb|AFJ66863.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175500|gb|AFJ66867.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175504|gb|AFJ66869.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (363), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   +  F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKXFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175502|gb|AFJ66868.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+    AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKX--KAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|325958497|ref|YP_004289963.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. AL-21]
 gi|325329929|gb|ADZ08991.1| Fibronectin-binding A domain protein [Methanobacterium sp. AL-21]
          Length = 661

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 185/361 (51%), Gaps = 27/361 (7%)

Query: 333 DEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQ---RAEQQHKAKEDAAFHKLNKIH 389
           ++  PL L  ++  E   FE+F+ A DEFYS I  +      ++  + E   F K   I 
Sbjct: 245 EDVLPLDLLMYKDFEKESFESFNDAADEFYSSIVGEDIVNVNEEVWSGEVGKFEKRLNIQ 304

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
           ++    +   ++ V  S    E I  + + ++  IL +  +     SW ++   VK+ +K
Sbjct: 305 LET---LEKFEKTVKDSKIKGEAIYSDYQAIEN-ILNIIHSARETNSWLEIIATVKKAKK 360

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
              P   +I+ +    + M +L + NLD +          +V +D ++    NA  +Y  
Sbjct: 361 DKVPGLEIIESI----DKMGVL-TLNLDGV----------RVNIDSSMGIPENAEIYYNK 405

Query: 510 KKKQESKQEKTITAHSKAFKAAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSE 567
            KK + K +    A  K  K  +K K + +I  EK +     ++K + W+EK  WF++S+
Sbjct: 406 GKKAKRKIKGVHIAIEKTRKEIDKAKNKREIEMEKVLVPQKRVKKDLKWYEKLRWFVTSD 465

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
             L I GRDA  NEM+VK++M   D+Y H+D+HGASS ++K    E  +P  ++N+   F
Sbjct: 466 GLLAIGGRDATTNEMVVKKHMENRDIYFHSDIHGASSVILKAGEGE--IPERSINETAAF 523

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             C S AW   +  T  +WV+P QVSKT  +GE++  G+F+IRG +N++   PL +  G+
Sbjct: 524 AACFSSAWSKGLGSTDVYWVHPEQVSKTPQSGEFVAKGAFIIRGSRNYMRGLPLTLSLGI 583

Query: 687 L 687
           +
Sbjct: 584 V 584



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 59/110 (53%), Gaps = 1/110 (0%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V ++ ++G R+HTT Y       P  F + LRK+I+   +  V+Q  +DRI+       
Sbjct: 47  RVDVVFQAGFRVHTTQYPPQNPKIPPNFPMLLRKYIKGGTVTAVKQHNFDRIMRIDIQ-K 105

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
                +++EL+A+GNI+L D E  ++  L+     D+ ++    ++YP E
Sbjct: 106 EEKFSLVVELFAKGNIILLDHEDKIILPLKRKVWQDRKISSKEEYKYPPE 155


>gi|218883339|ref|YP_002427721.1| hypothetical protein DKAM_0025 [Desulfurococcus kamchatkensis
           1221n]
 gi|218764955|gb|ACL10354.1| protein of unknown function DUF814 [Desulfurococcus kamchatkensis
           1221n]
          Length = 659

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 174/705 (24%), Positives = 312/705 (44%), Gaps = 154/705 (21%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           ++K  M+  D+ + V     ++ G    N Y      +I KL    GV         ++ 
Sbjct: 5   LLKKAMDILDIYSWVNKYSSVVTGCLIDNAYHYK-SYWILKLRCREGVY--------IVK 55

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL--GMNA 117
           +E GVR+H +    ++K+   GFT  LR  IR  R+  ++Q  ++RIILF+  +   +  
Sbjct: 56  IEPGVRMHLSQSHPEEKDI-DGFTRFLRSRIRDSRITSIKQPWWERIILFETSIHDKILR 114

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           HYV  EL  +G  ++TD    ++   R  +  D+ +        P+E+            
Sbjct: 115 HYV--ELLPRGQWIITDQSDKIVYASRFMKYRDRSIK-------PSEVY----------- 154

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAK 236
                      +  P K      N+S + K+ L    KGG+                   
Sbjct: 155 -----------SPPPLK------NLSPSDKDALLNVVKGGRDL----------------- 180

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGL--VPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
              ++T++  A G    ++E  I   GL  V N  +SE+        Q L   V ++   
Sbjct: 181 ---VRTIIS-AWGIPGHIAEEAIHRAGLYGVKNKGVSEI------PYQDLEKLVDEYRRI 230

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
           +++V++G    +GY++  ++            +  +IY  + P L ++   +     +  
Sbjct: 231 VEEVLNG----KGYLVYGDE------------NKLEIYTSYEPRLFSEVYDKTVKPLDDI 274

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           + A+D ++++ E   A   ++A+ +    KL +I    E R+   +QE        E+I 
Sbjct: 275 NTAIDVYFTEYE---AYLDYQARMEEVTEKLREI----EARIK--RQE--------EIIA 317

Query: 414 EYN--LEDVDAAILAVRVALANRMSWEDLARMVKE--ERKAGNPVAGLIDKLYLERNCMS 469
           EYN  +E++++ +  +    +N    E++    +E  E+K    +A           C  
Sbjct: 318 EYNNEIENIESILQTI---YSNYHVAEEILECARETREKKGWEHIA---------EEC-- 363

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN-ARRWYELKKKQESKQEKTITAHSKAF 528
               N++ E+  ++  + V+  E  L LS   + +R+  EL++K      KT +A     
Sbjct: 364 ----NSVIEIRKDKGMIVVKLGEKTLELSIREDLSRQVIELERKHGELVRKTESAKKVLE 419

Query: 529 KAAEKKTRLQI---LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           +  ++   + I    +EKT+   S      W+E+F+W  +   +L I GRD  QNE++V+
Sbjct: 420 EMHQQLNTISISMNTEEKTIRKPS---PTFWYERFHWLFTRNGFLAIGGRDQSQNELVVR 476

Query: 586 RYMSKGDVYVHADLHGASSTVIKN---HRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VT 641
           +Y+ + DV++HAD+HG S+ V+K+   H  E  V       A     C+S+AW +     
Sbjct: 477 KYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV------DASYLAACYSKAWKAGFSYI 530

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +WV   QVSKT P GEYL  G+FM+ G KN+L   PL +G G+
Sbjct: 531 EVYWVPGRQVSKTPPPGEYLPRGAFMVYGSKNYLQV-PLRLGIGI 574


>gi|154304164|ref|XP_001552487.1| hypothetical protein BC1G_08352 [Botryotinia fuckeliana B05.10]
          Length = 484

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 68/131 (51%), Positives = 89/131 (67%), Gaps = 9/131 (6%)

Query: 590 KGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           KGDVY+HAD+ GA+S +++N+   P+ P+PP TL+QAG   V  S AWDSK   SAWWV 
Sbjct: 2   KGDVYLHADIRGAASVIVRNNPKTPDAPIPPQTLSQAGTLVVVTSSAWDSKAGMSAWWVT 61

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
             QVSK+APTGE+L  GSF   GKKNFLPP  L++GFG+LF++ + S   H N+ R++  
Sbjct: 62  ADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKARH-NKHRLQ-- 118

Query: 708 EEGMDDFEDSG 718
               DD   SG
Sbjct: 119 ----DDSPSSG 125



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 76/187 (40%), Gaps = 45/187 (24%)

Query: 907  YGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENAST----HKEKKPAISPVDAPKVCYK 961
            Y DQDEE+R     ++ A+AG+ +                  KE+               
Sbjct: 269  YKDQDEEDRIAAQEIIGAAAGQEKAEAEAKAKAAREAELAFQKER--------------- 313

Query: 962  CKKAGH--LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEE-DIHEIGEEEKGRL 1018
             ++A H    K+  EH                    EM K+ +E+  D HE  E E   +
Sbjct: 314  -RRAQHQRTQKETAEH-------------------EEMRKLMLEDGIDTHEDNEIE--TM 351

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLML 1078
              +D   G PLP D +L  IPVC P++A+  YKY+ KI PG  KKGK ++      +   
Sbjct: 352  TSLDSFVGLPLPGDEILEAIPVCAPWAAMGKYKYKAKIQPGAQKKGKAVREILGKWMAAS 411

Query: 1079 SLTPVFD 1085
            +   V D
Sbjct: 412  TAKGVLD 418


>gi|387175456|gb|AFJ66845.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREAXKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|227830959|ref|YP_002832739.1| hypothetical protein LS215_2101 [Sulfolobus islandicus L.S.2.15]
 gi|227457407|gb|ACP36094.1| protein of unknown function DUF814 [Sulfolobus islandicus L.S.2.15]
          Length = 609

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|229579837|ref|YP_002838236.1| hypothetical protein YG5714_2060 [Sulfolobus islandicus Y.G.57.14]
 gi|228010552|gb|ACP46314.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 609

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|344211873|ref|YP_004796193.1| fibronectin-binding A domain-containing protein [Haloarcula
           hispanica ATCC 33960]
 gi|343783228|gb|AEM57205.1| fibronectin-binding A domain protein [Haloarcula hispanica ATCC
           33960]
          Length = 717

 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHAADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|284998447|ref|YP_003420215.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446343|gb|ADB87845.1| protein of unknown function DUF814 [Sulfolobus islandicus L.D.8.5]
          Length = 609

 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|53136750|emb|CAG32704.1| hypothetical protein RCJMB04_33f3 [Gallus gallus]
          Length = 198

 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 76/164 (46%), Positives = 103/164 (62%), Gaps = 9/164 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A V  LR  L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDIRALVAELRLSLLGMRVNNVYDVDSKTYLIRLQKPDC--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +
Sbjct: 113 IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVRERYPVD 156


>gi|448639710|ref|ZP_21676858.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
 gi|445762237|gb|EMA13458.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
          Length = 717

 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQ--PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+    L   L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|55377795|ref|YP_135645.1| hypothetical protein rrnAC0969 [Haloarcula marismortui ATCC 43049]
 gi|55230520|gb|AAV45939.1| unknown [Haloarcula marismortui ATCC 43049]
          Length = 717

 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQ--PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+    L   L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|269864556|ref|XP_002651614.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064197|gb|EED42442.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 320

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 85/260 (32%), Positives = 133/260 (51%), Gaps = 37/260 (14%)

Query: 436 SWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDL 495
            W   A   K E++ GNP A  I+   L+     + L +              E +++DL
Sbjct: 1   GWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEAIIKLGD--------------ENIKLDL 46

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM---- 551
             +   N    Y+ +++   K EKT             K  ++ +Q K      H+    
Sbjct: 47  RKTIDRNIEDIYKTRRRMREKAEKT-------------KIAMRDIQAKLKPRKEHIKVQD 93

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R  +WFEKF++FIS  N ++I G++AQQN+ IV +YM   D+Y H D+ GASS + K   
Sbjct: 94  RVSYWFEKFHFFISENNCVIIGGKNAQQNDQIVNKYMEDRDLYFHCDVKGASSVICKGSA 153

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
                    +  A  F + +S+AWD +++   ++V   QVSKTAP+GE+L  GSFMI+GK
Sbjct: 154 DR------NIEDATYFALVYSKAWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGK 207

Query: 672 KNFLPPHPLIMGFGLLFRLD 691
           KN + P+ L  G G++FR++
Sbjct: 208 KNMVYPYRLEYGVGVVFRIN 227



 Score = 42.0 bits (97), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/83 (22%), Positives = 41/83 (49%), Gaps = 13/83 (15%)

Query: 1027 NPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIF-------------YSL 1073
            NP   D +L+ + + GP+ +++ Y+Y V+I+PG  KK +  Q               +++
Sbjct: 238  NPDCDDEILHAMAIAGPWVSLKKYRYAVRIVPGNEKKQQVAQTILDRFDKQSTENPRHNM 297

Query: 1074 LLLMLSLTPVFDIFPFQCLCSRK 1096
             +  + +  + D+ P +C   +K
Sbjct: 298  WICAVRIQELIDVLPGKCKIPKK 320


>gi|424813826|ref|ZP_18239004.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
 gi|339757442|gb|EGQ42699.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
          Length = 632

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 96/345 (27%), Positives = 170/345 (49%), Gaps = 30/345 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P  L ++   E + F+TF  A+DE+Y + ++ + +++ +         + +    QE +
Sbjct: 234 SPFPLERYADDESIDFDTFSEAIDEYYYRKKALKEKKEKEEAYQEKKQGIERQKQQQERK 293

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM---SWEDL-ARMVKEERKAG 451
           +  L++  +++ + AE I  N +     +  ++  + N +    WE    ++ K E +  
Sbjct: 294 IQGLEKSAEQNREKAERIYENYQ----LLQRIKRQIENSLDEDGWEQTRQKLEKSESEDA 349

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           + VA L        N     +S +  E          E ++V L     A A ++Y+  K
Sbjct: 350 DKVASL--------NKQEDFISVDTGE----------ENLKVYLFQDLEATASQYYDKAK 391

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
             E K E    A  +  K  E   + +I  ++ + + +  RK  WFEK+ WF SSE+YLV
Sbjct: 392 NSEEKIESAKEALKETKKELEDLKKEEINTDEVLEDKTQKRKKKWFEKYRWFYSSEDYLV 451

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           + GRDAQ N+M+VK++M   D+Y HAD  GA S VIK+    Q     T  +A    +  
Sbjct: 452 LCGRDAQTNDMLVKKHMESNDLYFHADFDGAPSVVIKDG---QEAGEQTRKEAAKAAITF 508

Query: 632 SQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           S+ W + +   +A++V P QV++   +GEYL  G+F+IRG + ++
Sbjct: 509 SKTWKAGIGADTAYYVEPGQVTQNPESGEYLQKGAFVIRGDREYM 553



 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 9/112 (8%)

Query: 51  GESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           GE ++ LL+     R   T Y RD    P GF ++LRKH+    +E+++Q G+DRI+  +
Sbjct: 41  GEDKERLLIGTD--RAFITKYKRDNPTRPPGFCMELRKHL--GHVEEIKQRGFDRILEIK 96

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            G       +I EL+ +GN +LT  +  ++  LR  +  D+ + +   ++YP
Sbjct: 97  SG----DTKLICELFGKGNFILT-KKGKIIGALREEKWADREIRVGLEYQYP 143


>gi|385773877|ref|YP_005646444.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323477992|gb|ADX83230.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 609

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     N    D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNVLKTDIEDKIPGKSKIVKTSI 605


>gi|448633897|ref|ZP_21674396.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
 gi|445750588|gb|EMA02026.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
          Length = 717

 Score =  142 bits (359), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 166/382 (43%), Gaps = 47/382 (12%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHM 390
            P+ L ++       F  F+ ALD+++     QR E+       +   +A   K  +I  
Sbjct: 275 TPIPLAEYEELYSESFTEFNTALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQ 332

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            QE  +   + + +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    
Sbjct: 333 QQEQAIEDFEADAEAEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDEGADR 392

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           G   A  +  L      ++L +                 +V VD       NA   Y+  
Sbjct: 393 GIEAAEAVVSLDGSEGTVTLDIEGT--------------RVTVDAFTGVEKNADELYKEA 438

Query: 511 KKQESKQEKTITA--HSKAFKAAEKKTRLQILQEK------------------TVANISH 550
           K+ E K+E  + A  +++    A K+ R +   +                   ++ +I  
Sbjct: 439 KRIEEKKEGALAAIENTREDLEAVKERRDEWEADDGEDEVDEDGSEDEPTDWLSIQSIPT 498

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
                W+E+F WF +S+ +LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K  
Sbjct: 499 RSTERWYEQFRWFHTSDGFLVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKAT 558

Query: 611 RPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            P +P      P  +L+QA  F V +S  W D K     + V P QVSKT  +GEYL  G
Sbjct: 559 GPSEPSKEVDFPQSSLDQAAQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKG 618

Query: 665 SFMIRGKKNFLPPHPLIMGFGL 686
            F +RG + +    P+ +  G+
Sbjct: 619 GFAVRGDRTYFEGTPVGVAVGI 640



 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F     +  ++ EL+  GN+ + D    V+  L + R   + VA  + + +PT
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEHGEVIDCLETVRLKSRTVAPGTPYEFPT 162


>gi|448730186|ref|ZP_21712496.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
 gi|445793917|gb|EMA44482.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
          Length = 699

 Score =  142 bits (359), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 163/724 (22%), Positives = 267/724 (36%), Gaps = 143/724 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P GF   LR  +       V Q G+DR++ F+F       
Sbjct: 57  GETKRAHVVDPDNVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ +GN+ + D+   V+  L +                     R+  RT A    
Sbjct: 117 KIVAELFGEGNVAVLDANDEVVDCLNT--------------------VRLQSRTVAPGAT 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+      P  V+ DG     A                   SN +          
Sbjct: 157 YEFPSSR----FNPLAVDSDGFAARMA------------------ESNTD---------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G   +E +    G+     + + ++ E +A   L  A  +  D L   
Sbjct: 185 -LVRTLATQLNFGGLYAEELCTRAGVEKERAIEDSDEEEFSA---LYEATERLTDQLS-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG   P  Y         +D  P +            P  L +    +   F++F AAL
Sbjct: 239 -SGAFEPRLYR--------EDDQPVD----------VTPFPLEERADLDSEGFDSFTAAL 279

Query: 359 DEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++  +++   E+   + +   +    +  +I   QE  +   + + D     AE +  
Sbjct: 280 DAYFVALDTTEDEEGGGRERPDFEDDIERQQRIIEQQEGAIEDFEDQADAERAKAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           + + VD  +  VR A      W+D+     E    G   A  +D +      +++ +   
Sbjct: 340 HYDLVDEILSTVRNAREQGTGWDDIEERFAEGADRGIAAAEAVDGVTPSEGTVTVDIDGR 399

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
             E+D      P + VE         NA R Y+  K+   K+E    A       AE + 
Sbjct: 400 SVELD------PRDGVE--------QNADRLYKEAKRVVGKKEGAEEA------VAETRA 439

Query: 536 RLQILQEK--------------------------TVANISHMRKVHWFEKFNWFISSENY 569
            L+ LQ +                          T  +I   +   W+E+F WF +S+ +
Sbjct: 440 ELEALQRRRDEWESADENETESTDTDEDEDIDWLTRRSIPVRQNEQWYERFRWFRTSDGF 499

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQA 624
           LV+ GR A QNE +VK+Y+ +GD + H    G   TV+K   P +P   +     TL + 
Sbjct: 500 LVLGGRSADQNEDLVKKYLERGDRFFHTQARGGPVTVLKATGPSEPTEEVEFSESTLEET 559

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
             F V +S  W + +    A+   P QVSKT  +GEYL  G F IRG + +     + + 
Sbjct: 560 AQFAVSYSSVWKNGRFAGDAYMASPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVA 619

Query: 684 FGLL 687
            G++
Sbjct: 620 VGIV 623


>gi|432328279|ref|YP_007246423.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
 gi|432134988|gb|AGB04257.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
          Length = 596

 Score =  142 bits (359), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 112/356 (31%), Positives = 176/356 (49%), Gaps = 61/356 (17%)

Query: 333 DEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ 392
           D F P+ L  + S    +F+TF+ AL  +   ++S+RA       E     ++ +   + 
Sbjct: 223 DFFSPIPLKMYPS-SIARFDTFNEALVNY---LKSERA------VESPEVLRIKRRIREI 272

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E  +    +E +RS K+ ELI  +  DV+ A+   + A    +S+          R  G 
Sbjct: 273 EETIEKFTREEERSRKIGELIYAHFGDVERALSEAKGA---EISY----------RARGK 319

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLAL--SAHANARRWYELK 510
            +                               L +E V V+L +  S   NA  +YE  
Sbjct: 320 TM------------------------------VLDIEGVPVELRVDKSVGENASLYYEKA 349

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           KK    +EK   A     KA E+   ++ ++EK    I   R+  WFEK+ WFISSE+ L
Sbjct: 350 KKM---REKIKGAQQALEKAKEELKSVKKMEEKKKREIRKSRRRFWFEKYRWFISSEDIL 406

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI+GRDA+ NE +VK+++   D+Y+HAD+HGA S VIK+   E  +   TL +A  F V 
Sbjct: 407 VIAGRDAKTNEEVVKKHLGDKDLYMHADIHGAPSVVIKSEGKE--IGEKTLYEAAQFAVS 464

Query: 631 HSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
            S+AW++     SA+WVYP QVSK   +GEY+  G++++ G++N++   PL +  G
Sbjct: 465 MSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVVHGRRNYIHKVPLRLAVG 520



 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 30/129 (23%), Positives = 61/129 (47%), Gaps = 15/129 (11%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A ++  R  I G     +Y +  + ++FK+         GE+  + + +   +
Sbjct: 1   MLSLDIHAWIEENREKIEGGFFKKIYQVGEREFLFKIYK-------GETRPLYVNLRGWI 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
                   R+    PS F + LRK    R++    QL +DRI++F+     + + ++LEL
Sbjct: 54  FFQ----GRETPMEPSMFVMFLRKRFSGRKILRFYQLNFDRIVVFE---TQDGYQLVLEL 106

Query: 125 YAQGNILLT 133
           +  GN+++ 
Sbjct: 107 FGDGNVVVV 115


>gi|399576519|ref|ZP_10770274.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
 gi|399237963|gb|EJN58892.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
          Length = 706

 Score =  142 bits (358), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 178/395 (45%), Gaps = 47/395 (11%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE------DAAFHKLNKI 388
            P  L ++   +   F++F+AALD+++ +++ S  AE+     E           K  +I
Sbjct: 257 TPFPLEEYEGLDSAAFDSFNAALDDYFFRLDLSDEAEKGGGGAEANRPDFQEEIEKQKRI 316

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              QE  +   +++     + AEL+  N E  D  +  VR A    + W D+A  + E  
Sbjct: 317 IQQQEGAIEGFEEQAQEEREKAELLYANYELADEVLSTVRGAREENIPWADIADTLAEGA 376

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           + G P A  ++ +      +++              T+  +++++D+++    NA R Y 
Sbjct: 377 EQGIPAAEAVEDVDGSTGTVTI--------------TIDGQRIDLDVSMGVEKNADRIYT 422

Query: 509 LKKKQESKQEKTITA---HSKAFKAAEKK-----------------TRLQILQEKTVANI 548
             K+ E K+   + A     +  +A EK+                      +   +  +I
Sbjct: 423 EAKRVEEKKAGALEAIENTREKLEAVEKRRDEWEASDDEPDEDEDDEEKPDIDWLSRNSI 482

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
               +  W+++F WF +S+ +LVI GR+A QNE IVK+Y++K D++ H   HG   T++K
Sbjct: 483 PIRNQDKWYDRFRWFETSDGFLVIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTILK 542

Query: 609 NHRPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P +P     +P  +  +A  F V +S  W + +    A+ V   QVSKT  +GEY+ 
Sbjct: 543 ATGPSEPARDVDIPEQSREEAAQFAVAYSSIWKEGRFADDAYMVSADQVSKTPESGEYVE 602

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
            GSF++RG + +       +  GL    D   +G 
Sbjct: 603 KGSFVVRGDRTYYEDVAAEVAVGLRCEPDTRVVGG 637



 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L +E 
Sbjct: 4   KQELSSIDLAALVTELGRYEGAKVDKAYLYGDDLLRLKLRDF-------DRGRVDLFIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F+F  G    
Sbjct: 57  GDIKRAHVVAPEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQFEFDRILTFKFERGDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++ EL+ QGN+ + D    V++ L + R   + VA  S++ +P
Sbjct: 117 EIVAELFGQGNLAVLDENREVVSSLETVRLKSRTVAPGSQYEFP 160


>gi|448361523|ref|ZP_21550140.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
 gi|445650542|gb|ELZ03465.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
          Length = 720

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 161/726 (22%), Positives = 285/726 (39%), Gaps = 128/726 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         KL +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDDLVRLKLRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVSQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +                 
Sbjct: 117 RIIVELFGQGNVAVTDGEYKVIDCLETVRLKSRTVVPGSRYEF----------------- 159

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                   PD            N    S+E  G +      D+ +               
Sbjct: 160 --------PDTR---------TNPLTISREAFGHEMEDSDTDVVR--------------- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L +G   +E +    G+   M +++ ++   + +   +  +A       D 
Sbjct: 188 TLAT----QLNFGGLYAEELCTRAGVEKAMDIADADEETYDGLYEAIERLA------LDT 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF--VKFETFDA 356
            +G+     Y+   ++   +D    + GS+ ++ D   P  L +    +     ++TF  
Sbjct: 238 RNGNFDSRLYLDTGDEDRTEDGD-GDDGSAARVVD-VTPFPLEEHEQDDLDGEPYDTFLE 295

Query: 357 ALDEFYSKIESQR------AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           ALD+++ ++E +        +Q+   +E+ A H+  +I   Q+  +   +Q+     + A
Sbjct: 296 ALDDYFFRLELEDEEEPDPTDQRPDFEEEIAKHE--RIIEQQQGAIEGFEQDAQNLRENA 353

Query: 411 ELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
           E +  EY L  VD  +  ++ A      W+++     E  + G   A  +    ++ +  
Sbjct: 354 ESLYAEYGL--VDEILSTIQEAREQDRPWDEIEERFAEGAEQGIDAAEAV----VDVDGS 407

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
             L++ ++D           E +E++       NA R Y   K+   K+E  + A     
Sbjct: 408 EGLVTVDVD----------GEYIELEAHDGVEQNADRLYTEAKRVAEKKEGALAAIEDTR 457

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVH---------------------WFEKFNWFISSE 567
           +  E+  R +   E     ++                           WF++F WF +S+
Sbjct: 458 EDLEEAKRRRDEWEAADGEVADDEAAEDEGEDHDWLADPSIPIRENEPWFDRFRWFHTSD 517

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTL 621
            YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++
Sbjct: 518 GYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESSI 577

Query: 622 NQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            +A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG + +    P+
Sbjct: 578 EEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPV 637

Query: 681 IMGFGL 686
               G+
Sbjct: 638 GAAVGI 643


>gi|385776519|ref|YP_005649087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475267|gb|ADX85873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 609

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 169/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     N    D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNVLKTDIEDKIPGKSKIVKTSI 605


>gi|334310399|ref|XP_001370312.2| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Monodelphis domestica]
          Length = 1094

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +      L+GMR  N+YD+  KTY+ +L             KV LL+
Sbjct: 1   MKTRFSSVDICAILSEFNASLLGMRVHNIYDVDNKTYLIRLQKPDF--------KVTLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LT+ E+ +L +LR   D+   V    R +YP +  RVFE
Sbjct: 113 IIELYDKGNIVLTNYEYLILNILRFRSDEADDVKFAVREKYPVDHARVFE 162



 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 116/234 (49%), Gaps = 33/234 (14%)

Query: 842  YISKAERRKLKKG-QGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGK-ISRGQKGK 899
            ++S  ERR++KK  Q +   D ++  EKE        P +      + G + + RGQK K
Sbjct: 834  HLSAKERREMKKKRQSNDSTDLEILEEKENTLKTEVSPNT---SKNVPGPQPMKRGQKSK 890

Query: 900  LKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVC 959
            +KKMKEKY DQDEE+R + M LL SAG         + +     K K          K  
Sbjct: 891  IKKMKEKYKDQDEEDRELIMKLLGSAG------SSKEEKGKKGKKGKTGKTKEEAVKKQP 944

Query: 960  YKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAM-------EEEDIHEIGE 1012
             K K    L+   K+          + P +G+  T E+ ++AM       EE+D  + G 
Sbjct: 945  QKFKSELRLADRIKK----------ETPFLGV-VTHELQELAMDDQPDDKEEQDTDQQGN 993

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            EE    N +D LTG P   D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 994  EE----NLLDSLTGQPHSEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 1043


>gi|11499620|ref|NP_070862.1| hypothetical protein AF2038 [Archaeoglobus fulgidus DSM 4304]
 gi|2648497|gb|AAB89216.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 627

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 172/348 (49%), Gaps = 33/348 (9%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L  + + E   FE+F+ ALD+++SK  ++  E +    E+    KL K    
Sbjct: 220 YLDVVPMDLLYYSNYEKKYFESFNDALDDYFSKKLAEMDELESMKSEE--LEKLKKRLEI 277

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q+  +   + E +   K+ + I  N + V+  I A R A   R SW+++  +V  + K  
Sbjct: 278 QKESLRKFEDEAESFRKIGDAIYENYQMVEKIIEAFRAA-RERKSWDEIKEIVARDEK-- 334

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             +  L+  +  E+N + + +  + D             VE+++  S H NA  +YE  K
Sbjct: 335 --LKKLVKAIKPEKNAIVVKV-GDFD-------------VELEIKKSIHENADLYYEKAK 378

Query: 512 KQESKQE---KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E   + I A  +  +  E+K     L++K V +I   RK  W+E + WF +SE 
Sbjct: 379 KAREKAEGVKRAIEATLREMERVEEK-----LEKKLVTSIKVRRKKEWYENYRWFFTSEG 433

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GR A+ NE IV +++   D++ H    GA + ++K     Q     ++ +A  F 
Sbjct: 434 FLVIGGRTAEMNEEIVAKHLESLDLFFHTQTPGAPAVILKRG---QEAGEESIREAAEFA 490

Query: 629 VCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +S  W + K     ++V P QVSK+A  GEYL  GSF I GK+N+L
Sbjct: 491 ATYSALWKEGKHAGEVYYVLPEQVSKSAKAGEYLPKGSFYITGKRNYL 538



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 87/184 (47%), Gaps = 24/184 (13%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           ++++ D+ A V+ L+ L G +   VY   P     ++             KV L++E+G 
Sbjct: 3   QLSSFDIKACVRELKELEGGKVEKVYHHPPDEIRIRIYAGR---------KVDLVIEAGR 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + +     PS F + LRKH+   R++ + Q  +DR+++ +F        ++ EL
Sbjct: 54  RIHLTKFPKQAPRFPSAFAMLLRKHLEGARIKKIEQYDFDRVVVIEFERFGEIRRIVAEL 113

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP---------TEICRVFERTTAS 175
           +++GN++L + E  V+  L+        + +   +R+P          E+ RV   +   
Sbjct: 114 FSKGNVVLLNEENRVIMPLKH------TIKVGELYRFPEQRERKDEDREVVRVLAMSGLG 167

Query: 176 KLHA 179
            L+A
Sbjct: 168 GLYA 171


>gi|126178886|ref|YP_001046851.1| hypothetical protein Memar_0936 [Methanoculleus marisnigri JR1]
 gi|125861680|gb|ABN56869.1| protein of unknown function DUF814 [Methanoculleus marisnigri JR1]
          Length = 632

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 173/362 (47%), Gaps = 41/362 (11%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P++L     RE  +F TF  ALD FY K    + E    A       +   I   Q   +
Sbjct: 243 PVVLAGDEVRE--RFATFSEALDAFYPKTVGGKEEA---AAGKPRLSQAEVIRRRQAEAI 297

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
              +++++R+ ++ E+I  N   V   I  +  A  NR SW+++ +++KE     NP A 
Sbjct: 298 KGFEKKIERNQRIVEVIYENYTAVAGIIATLDEASKNR-SWQEIEKILKE--NGDNPAAK 354

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
           ++  ++     + + LS               E+V++ +  +   N  R+Y+  KK + K
Sbjct: 355 MVRAIHPADAAVDVDLSG--------------ERVKIYVHETIEQNLGRYYDQIKKFKKK 400

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +   + A  +      +  R   LQ+K            W+ +F WF +S+  LVI GRD
Sbjct: 401 KTGALAAMERTVPEKPRTKRNLPLQKK-----------RWYHRFRWFTTSDGTLVIGGRD 449

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE +VK+YM  GD++VHAD+HG S  ++K            +++A  F   +S AW 
Sbjct: 450 ASQNEELVKKYMEGGDLFVHADVHGGSVVIVKGTTEH-------MDEAVRFAASYSNAWK 502

Query: 637 SKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           +   T+  +   P QVSKTA +GEY+  G+F++RG++ +    PL +  GL    + + +
Sbjct: 503 AGHFTADVYAARPDQVSKTAESGEYVARGAFIVRGERQYFRNAPLGVAIGLQMAPEVAVI 562

Query: 696 GS 697
           G 
Sbjct: 563 GG 564



 Score = 66.6 bits (161), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 11/144 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  D+ A V      + +    +Y    KT         G+  +GE   K L L+E+G 
Sbjct: 7   MSGVDLRALVAEAADRLPLWVGKIYQFDAKTL--------GIRLNGEDRAKYLFLIETGR 58

Query: 65  RLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           R H TA +    KN PS F + LRKH+   ++  +RQLG +R +    G     +++I E
Sbjct: 59  RAHFTAEFPVPPKNPPS-FAMLLRKHLEGGKVLGIRQLGLERTMSLDIGKRDTTYHLIFE 117

Query: 124 LYAQGNILLTDSEFTVLTLLRSHR 147
           L+ +GN +L D  +T++  L  HR
Sbjct: 118 LFDEGNAVLCDEGYTIIKPLWHHR 141


>gi|254583608|ref|XP_002497372.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
 gi|238940265|emb|CAR28439.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
          Length = 1024

 Score =  142 bits (357), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 125/216 (57%), Gaps = 15/216 (6%)

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           EEK L   KV +DL LSA+ANA  ++ +KK    KQ+K      KAFK  E+K   Q+ Q
Sbjct: 513 EEKGL---KVSIDLGLSAYANASYYFNIKKNNAEKQKKVEKNVEKAFKNIEEKVGRQLKQ 569

Query: 542 E-KTVANISHMRKV---HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           + K   N+  +RKV   ++FEK +WFISSE +LV+ G+   + ++I  +Y+   DVY+  
Sbjct: 570 KLKETHNV--LRKVRTPYFFEKHHWFISSEGFLVLMGKSDSETDLIYSKYIEDDDVYLFN 627

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
                +   IKN    + VPP TL QAG   +  S+AW  K+ +S WW +   +SK  P+
Sbjct: 628 TF--GTQVWIKNPDSTE-VPPNTLMQAGILCMSASEAWSKKISSSPWWCFAKNISKFEPS 684

Query: 658 -GEYLTVGSFMIRGK--KNFLPPHPLIMGFGLLFRL 690
               L  G F+++ +  KNF+PP  L+MGFG L+++
Sbjct: 685 DNSVLPPGRFLLKNENNKNFMPPAQLVMGFGFLWKV 720



 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 116/483 (24%), Positives = 218/483 (45%), Gaps = 47/483 (9%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+    + LR  L   R +N+Y++  S + ++ K         +    K  +
Sbjct: 1   MKQRISALDLQLLAEELRENLESYRLNNIYNIADSNRQFLLKF--------NKPDSKFSV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T Y R     PSGF +KLRKH++++RL  +RQ+  DRI++ QF  G+  +
Sbjct: 53  VVDCGLRIHLTDYDRPTPPGPSGFVIKLRKHLKSKRLTALRQVHDDRILVLQFADGL--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R          I+  H       +V E+ T     
Sbjct: 111 YLVLEFFSAGNVILLDENKKILSLQR----------IVQEHE-----NKVGEQYTMFD-D 154

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKE-NLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
           +  +++++ +A EP+  NE+   V    +E     +   K  +    S K   DG R K 
Sbjct: 155 SIFSNNEKTNAREPETYNEE--TVKQWLREAQTKFETESKILNEVVPSGK-KKDGQRKKI 211

Query: 238 PTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
             +  +    L   P LS  ++       G  P+    +    E   + +L     +++ 
Sbjct: 212 KVM-AIHRLLLSREPHLSSDLLSKNLQMQGFSPSASCLDFVGQESAIVDLLNNTEKEYQS 270

Query: 294 WLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQ-FRSREFVKF 351
            L D         GYIL + N +   +    +     + +  F P +  Q       +K 
Sbjct: 271 LLSDSERS-----GYILAKRNVNFNSERDEKDLEFVYETFHPFEPFVAPQNVGDTRTIKI 325

Query: 352 E-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           E  ++  LD F+S IES +   + + +E  A  +L    +D + ++  L      + +  
Sbjct: 326 EGGYNKVLDSFFSTIESSKYALRIQQQEQQATKRLEAARLDNQKKIQALVDAQSFNEEKG 385

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N + V+    AV+  +  +M W  + ++++ E+K GN +A LI   L L+ N ++
Sbjct: 386 HSIIANADLVEQTKSAVQGYVDQQMDWSTIEKLIQVEQKRGNKIAQLIQLPLNLQENKIA 445

Query: 470 LLL 472
           + L
Sbjct: 446 IRL 448



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 27/38 (71%)

Query: 1030 PSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            P D +L +IPVC P+ A+  YKY+VKI PG AKK K +
Sbjct: 912  PRDEILDIIPVCAPWPALLKYKYKVKIQPGNAKKTKTM 949



 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 30/37 (81%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           RG+KGKLKKM+ KYGDQDEEER +R+ +L +   ++K
Sbjct: 820 RGKKGKLKKMQRKYGDQDEEERQMRLNMLGTLKGMKK 856


>gi|159906014|ref|YP_001549676.1| hypothetical protein MmarC6_1632 [Methanococcus maripaludis C6]
 gi|159887507|gb|ABX02444.1| protein of unknown function DUF814 [Methanococcus maripaludis C6]
          Length = 680

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 177/356 (49%), Gaps = 25/356 (7%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  +++A   +M WE +  ++KE +   +PV   I  + 
Sbjct: 331 SRSNHKRGDLIYANYSFVDEIVSTIKLA-REKMGWEGIKNVIKENK--THPVLSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   + L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELMLKLSA------DYGNGLIEDNVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRK----------VHWFEKFNWFISSENYLVI 572
              +A K +EKK      +EK  + +   ++          + W+EK  W +    YL++
Sbjct: 441 ---EALKISEKKLAELKDKEKLDSEVLKEKEENIKKKERKVLKWYEKLKWTVIG-GYLIV 496

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   HS
Sbjct: 497 AGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFASSHS 556

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+L
Sbjct: 557 RAWKLGVGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 81/164 (49%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITITEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFP 159


>gi|219852170|ref|YP_002466602.1| hypothetical protein Mpal_1566 [Methanosphaerula palustris E1-9c]
 gi|219546429|gb|ACL16879.1| protein of unknown function DUF814 [Methanosphaerula palustris
           E1-9c]
          Length = 629

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 169/350 (48%), Gaps = 44/350 (12%)

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF  AL+  Y  +      Q+      A   +  +I + QE  + +  +++  +  + +L
Sbjct: 257 TFSEALEAIYPLVTRHEGPQK-----KAPIPREERIRLQQEAALKSFDKKIVLNKAIVDL 311

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           I  N   V   I  +  A +  +SW+++  M+KE   + N VA  I  ++     + LLL
Sbjct: 312 IYENYTLVTDVIKTLDAA-SKTLSWQEIGSMLKE---SDNDVARQIAGVHPAEAAVDLLL 367

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF-KAA 531
                           +KV + +  S   N  R+Y   KK + K++  ++A  +   K A
Sbjct: 368 DG--------------KKVLIHVHESIEVNLERYYAQVKKFKKKRDGAVSAMERPVAKKA 413

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
             K  L  L+++            W+ +F WF +S+N LV+ GRDA QNE +VKRYM  G
Sbjct: 414 TSKVHLTPLKKR------------WYHRFRWFFTSDNCLVLGGRDAGQNEELVKRYMEGG 461

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQ 650
           D +VHAD+HGAS  ++K  + EQ      +++   F   +S AW S   ++  + V P Q
Sbjct: 462 DTFVHADVHGASVVIVKG-KTEQ------MDEVAQFAASYSGAWRSGHFSADVYAVRPDQ 514

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
           VSKT   GE+++ GSF++RG++ +    PL +  G     + + +G  +N
Sbjct: 515 VSKTPEAGEFVSRGSFIVRGERTYFKSVPLGVAIGYQTEPNAAVIGGPVN 564



 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 71/148 (47%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  D+ A    LR  + +  + +Y    K    +L          E  K  LL+ESG R
Sbjct: 7   MSGVDLLAVTAELREHLPLWINKIYQYDNKMLSIRLNGE-------EHAKYHLLIESGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H      +    P  F + LRK++   R+ ++RQ G  R++ F  G      ++++EL+
Sbjct: 60  IHLATVLPNPPKNPPSFAMLLRKYLEGGRVLEIRQQGLQRVVTFVIGKRDTTLHLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN++L D + T++  L  HR  D+ V
Sbjct: 120 DEGNVILCDDQMTIIKPLWHHRFKDREV 147


>gi|320100405|ref|YP_004175997.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
 gi|319752757|gb|ADV64515.1| protein of unknown function DUF814 [Desulfurococcus mucosus DSM
           2162]
          Length = 665

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 180/375 (48%), Gaps = 45/375 (12%)

Query: 325 SGSST-QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
           SG +T  IY  + PLL     +    + E  + A+D ++++ E++   Q+   +  AA  
Sbjct: 240 SGENTLDIYTSYNPLLFRDVYNNSVKQVEDINTAIDAYFTEYEAELERQRRLDELAAAVK 299

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           ++      QE  +   ++EV++  ++ +LI  N   V+ A+   R   A +  WE +A+ 
Sbjct: 300 EIEARIKRQEEVIRGYREEVEKIGRILQLIYGNYASVNEALECARSTRAVK-GWEHIAK- 357

Query: 444 VKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
                       G++  +Y ++  + L ++  + E+    K L  + VE++         
Sbjct: 358 ---------DCPGVVG-VYKDKGIVVLRVNGEVLELSIR-KGLDKQVVELE--------- 397

Query: 504 RRWYELKKKQE--SKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
                 KK+ E   K E  +    +  +   + +    +++KTV  +S      W+E+F+
Sbjct: 398 ------KKRGELVGKIESAVKVLEEMRRQLNEASSTMSIEDKTVRRLS---PTLWYERFH 448

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN---HRPEQPVPP 618
           W  +   +L I GRD  QNEM+V++Y+   DV++HAD+HG S+ V+K+   H  E  V  
Sbjct: 449 WLFTRNGFLAIGGRDQSQNEMVVRKYLGDNDVFIHADIHGGSAVVLKSRGLHSVEDVV-- 506

Query: 619 LTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
                A     C+S+AW +       +WV   QVSKT P+GEYL  G+FMI G KNFL  
Sbjct: 507 ----DASYLAACYSRAWRAGFSFIEVFWVPGSQVSKTPPSGEYLPRGAFMIYGSKNFLSI 562

Query: 678 HPLIMGFGLLFRLDE 692
            PL +  G  F  D+
Sbjct: 563 -PLRLAVGARFFSDD 576



 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 63/137 (45%), Gaps = 15/137 (10%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  DV A V +    L      N Y      +I KL   SGVT         L 
Sbjct: 1   MLKKAMDILDVYAWVGRHGASLTSCFVDNAYHCK-SYWILKLRCPSGVTH--------LK 51

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA-- 117
           +E  VR+H +    ++K+   GFT  LR  +R  R+  VRQ  ++RI++ + G       
Sbjct: 52  IEPAVRIHLSQSIPEEKDI-DGFTRFLRSRVRDSRILSVRQPWWERIVVLETGAREKPLR 110

Query: 118 HYVILELYAQGNILLTD 134
           HY+  E+  +G  ++ D
Sbjct: 111 HYI--EVVPRGQWVVAD 125


>gi|336122066|ref|YP_004576841.1| Fibronectin-binding A domain-containing protein
           [Methanothermococcus okinawensis IH1]
 gi|334856587|gb|AEH07063.1| Fibronectin-binding A domain protein [Methanothermococcus
           okinawensis IH1]
          Length = 684

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 175/354 (49%), Gaps = 33/354 (9%)

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E F  ALD+++S+   ++  ++ + K      K  +I  +Q   +   +++   +    +
Sbjct: 279 EEFLTALDDYFSRFILKKEIKKEETKLQKMVKKQERILNNQIESLKKYEKQAKENQIKGD 338

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LI  N   VD  I  ++ A   +M W  + ++VKE +   NP+   I  +  +   ++L 
Sbjct: 339 LIYANYALVDEIITTLKSA-REKMDWSSIKKIVKENK--DNPILSKIVYINEKNGEITLK 395

Query: 472 LS----NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA---- 523
           LS    N L E D          V +D+  +A  NA  +Y   KK ++K E   TA    
Sbjct: 396 LSADYGNGLIEKD----------VSLDIRKNAFENADNYYSKSKKFKNKIEGVKTAINLS 445

Query: 524 ----HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
                    K   +   L+  +EKT+      +K  W+EKF W + + NYL+I+G+DA  
Sbjct: 446 KEKLEKLKKKEEIEMESLKEREEKTMEK-KERKKRKWYEKFKWTVIN-NYLIIAGKDATT 503

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNH-----RPEQPVPPLTLNQAGCFTVCHSQA 634
           NEM++KRY  K D+  H  + GA  TVIK +        +      LN+   F   HS+A
Sbjct: 504 NEMLIKRYTEKDDIVFHTLMEGAPFTVIKMNGKNIDELNEDEREFLLNETAKFAASHSKA 563

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           W   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+   PL +G G++
Sbjct: 564 WRLGLGSADVYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRSVPLELGIGIV 617



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 82/166 (49%), Gaps = 12/166 (7%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  +   D+   VK L+++I  +    + +     K  I KL     + E G  E   L
Sbjct: 1   MKTELTNVDIHVAVKELQKIINGKLDKAFLVDSQDGKELILKLH----IPEIGTRE---L 53

Query: 59  LMESGVR--LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            + +G    +  T Y+R+K   P  F + LRKH++  ++  + Q  +DRI+ F F  G  
Sbjct: 54  AIGTGKYKYITLTEYSREKPKNPPSFAMLLRKHLKNIKITSIEQHNFDRIVKFTFQWGEI 113

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           ++ +++EL+  GNI+L D+E  ++  L+  +   + +     +++P
Sbjct: 114 SYKLVVELFGDGNIILLDNEDKIILPLKIEKWSTRRIIPKEIYKFP 159


>gi|409721207|ref|ZP_11269418.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
 gi|448724851|ref|ZP_21707356.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
 gi|445785060|gb|EMA35856.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
          Length = 697

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 163/711 (22%), Positives = 273/711 (38%), Gaps = 143/711 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELTSVDLAALVTELGRYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELMVEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P  F   LR  +         Q G+DR++ F+F       
Sbjct: 57  GETKRAHVVSPDHVPDAPGRPPDFAKMLRNRLSGADFAGASQFGFDRVLTFEFEREDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ +GN+ + DS   V+  L +                     R+  RT A    
Sbjct: 117 RIVAELFGEGNVAVLDSTGEVVDCLNT--------------------VRLQSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  V+ +G                      +    +++ D       
Sbjct: 157 YEFPSSR----FDPLAVDYEG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G   +E +    G+     + +  + E +A   L  A+ +  + L D 
Sbjct: 185 -LVRTLATQLNFGGLYAEELCTRAGVEKEQAIEDSGEEEYSA---LFDALTRLSERLSD- 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
             GD  P  Y         +D  P +            P  L +    +   FE+F  AL
Sbjct: 240 --GDFDPRIYR--------EDDEPVD----------VTPFPLEENADLDSEGFESFTEAL 279

Query: 359 DEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++  +E+   E+   + K   +    +  +I   QE  +   +++ +     AE +  
Sbjct: 280 DAYFVDLETTENEEGGGREKPDFEEEIERQQRIIDQQEGAIQGFEEQAEAERAKAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N   VD  +  VR A      WE++    +E ++ G P A  +  +      +S+     
Sbjct: 340 NYGLVDEILSTVRTARERDTPWEEIEERFEEGKEQGIPAAEAVAGVEASEGTVSV----- 394

Query: 476 LDEMDDEEKTL-PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
             E+D E  TL P E VE         NA R Y   K+   K+E    A       A+ +
Sbjct: 395 --EVDGETITLDPREGVE--------QNADRLYREAKRVVGKKEGAEEA------IADTR 438

Query: 535 TRLQILQEKTVA------------------------NISHMRKVHWFEKFNWFISSENYL 570
             L+ L+++                           +I       W+E+F WF +S+ +L
Sbjct: 439 AELEALEQRREEWEAGGADATDADDDSEDIDWLDRRSIPIRTNEQWYERFRWFHTSDGFL 498

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           V+ GR+A QNE +VK+Y+ +GD ++H    G   TV+K   P +P     +P  TL++A 
Sbjct: 499 VLGGRNADQNEDLVKKYLDRGDRFLHTQARGGPVTVLKATGPSEPTREIDLPQGTLDEAA 558

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            F V +S  W D +     +   P QVSKT  +GEYL  G+F +RG + + 
Sbjct: 559 KFAVSYSSVWKDGRFAGDVYMADPDQVSKTPESGEYLEKGAFTVRGDRTYF 609


>gi|300711181|ref|YP_003736995.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|448296718|ref|ZP_21486771.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
 gi|299124864|gb|ADJ15203.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|445580850|gb|ELY35220.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
          Length = 694

 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 172/369 (46%), Gaps = 39/369 (10%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE-DAAFHKLNKI 388
           QI D   P+ L++  + E   ++ F+ ALD+++ ++++   E+   + E D    +  +I
Sbjct: 252 QIVD-VTPIALDEHAALEGDSYDRFNEALDDYFFELDTSEDEETDTSPEFDEEIERKKRI 310

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              QE  +   +QE     + AEL+  N + VD  +  VR AL     WE++    ++  
Sbjct: 311 IDQQEGAIEGFEQEATEERERAELVYANYDTVDEVLTTVRGALEEGRGWEEIEATFEQGA 370

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           + G   A  +     E   +S+ L          E T+ +E     +      NA R Y 
Sbjct: 371 EQGIDAAERVTGFDPENGMVSVDLG---------EATVSLE-----VRSGVEKNADRIYT 416

Query: 509 LKKKQESKQ---EKTITAHSKAFKAAEKKTRLQILQEKTV--------------ANISHM 551
             K+ E K+   E+ I    +   A  ++ R    +++T               A+I   
Sbjct: 417 EAKRIEEKKAGAEEAIADTREELDALRERKRQWETRDETQDDGGEPEEIDWLSRASIPVR 476

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           +   W+E F WF +S+ YLVI GR+A +NE +VK+Y+ +GD + H   HG   TV+K   
Sbjct: 477 KSEEWYEDFRWFHTSDGYLVIGGRNADENEDLVKKYLDRGDRFFHTQAHGGPVTVLKATG 536

Query: 612 PEQPV-----PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +P      P  ++ +A  F V +S  W + +    A+ V P QVSKT  +GEY+  G 
Sbjct: 537 PSEPAKDVEFPESSIQEAAQFAVSYSSVWKEGRFADDAYSVSPDQVSKTPESGEYIEKGG 596

Query: 666 FMIRGKKNF 674
           F+IRG + +
Sbjct: 597 FVIRGDRTY 605



 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 68/164 (41%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSIDLAALVGELNEYAGAKVDKAYLYGEDFLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVAAPEHVPDAPGRPPDFAKMLRNRLSGADFTGVSQYEFDRILSFEFEREDGNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I EL+ +GN+ + D    V+  L + R   + VA  +R+++P
Sbjct: 117 TIIAELFGEGNVAVCDETRHVIDSLETVRLKSRTVAPGARYQFP 160


>gi|354507679|ref|XP_003515882.1| PREDICTED: nuclear export mediator factor NEMF-like, partial
           [Cricetulus griseus]
          Length = 220

 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 97/277 (35%), Positives = 134/277 (48%), Gaps = 60/277 (21%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A+K    
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHAR------AAKPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEVIASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLE 277
           K VL   L YGPAL EH +++ G   N+K+ E  KLE
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLE 218


>gi|70606588|ref|YP_255458.1| hypothetical protein Saci_0795 [Sulfolobus acidocaldarius DSM 639]
 gi|68567236|gb|AAY80165.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
          Length = 594

 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 126/222 (56%), Gaps = 23/222 (10%)

Query: 485 TLPVEKVEVDL--ALSAHANARRWYELKKKQESKQEKTITAHSK--------AFKAAEKK 534
           TL +  + +D+   L+ + NA ++Y+L K+   K +K      +         FK  E+K
Sbjct: 318 TLKINNISIDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERK 377

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
             ++I           +RK  W+EK++W I+   ++VI+GRD+ QNE IV++ + + D++
Sbjct: 378 EEIRI----------SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIF 427

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSK 653
           +HAD+ GA++TV+K +  +  V    +  A     C+S+AW + +     +WVY +QVSK
Sbjct: 428 LHADIQGAAATVLKANSGQ--VSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSK 485

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           + P+GEYL  GSFMI G+KNF+    L +  G++ + DE  L
Sbjct: 486 SPPSGEYLAKGSFMIYGRKNFIKNVKLELAIGIMNQNDEVGL 527



 Score = 41.6 bits (96), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 74/147 (50%), Gaps = 18/147 (12%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSP-KTYIFKLMNSSGVTESGESEKVLLLMESG 63
           M+  D+ A +   + +I G R  NVY +S  + Y+FKL        S  ++K  L++E G
Sbjct: 7   MSYIDLLAWITENKSIIEGSRIDNVYKISGIQAYLFKL-------HSKNTDK-FLVVEPG 58

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R+H T Y R+K  +  G    +R+ ++ + ++ +  LG +RI      + +    + +E
Sbjct: 59  KRIHFTKYDREK--SSEGEVRLIRELVKEKIIKSINILGNERIA----KIDLIDRKIYIE 112

Query: 124 LYAQGNILLTDSEFTVL--TLLRSHRD 148
           L  +G +++TD    VL  T  +  RD
Sbjct: 113 LLPRGLLVITDGNNKVLFSTEYKEFRD 139


>gi|229581503|ref|YP_002839902.1| hypothetical protein YN1551_0858 [Sulfolobus islandicus Y.N.15.51]
 gi|228012219|gb|ACP47980.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 609

 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  +K  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLKKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|449066809|ref|YP_007433891.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449069082|ref|YP_007436163.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|449035317|gb|AGE70743.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449037590|gb|AGE73015.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 588

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 126/222 (56%), Gaps = 23/222 (10%)

Query: 485 TLPVEKVEVDL--ALSAHANARRWYELKKKQESKQEKTITAHSK--------AFKAAEKK 534
           TL +  + +D+   L+ + NA ++Y+L K+   K +K      +         FK  E+K
Sbjct: 312 TLKINNISIDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERK 371

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
             ++I           +RK  W+EK++W I+   ++VI+GRD+ QNE IV++ + + D++
Sbjct: 372 EEIRI----------SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIF 421

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSK 653
           +HAD+ GA++TV+K +  +  V    +  A     C+S+AW + +     +WVY +QVSK
Sbjct: 422 LHADIQGAAATVLKANSGQ--VSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSK 479

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           + P+GEYL  GSFMI G+KNF+    L +  G++ + DE  L
Sbjct: 480 SPPSGEYLAKGSFMIYGRKNFIKNVKLELAIGIMNQNDEVGL 521



 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 74/147 (50%), Gaps = 18/147 (12%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSP-KTYIFKLMNSSGVTESGESEKVLLLMESG 63
           M+  D+ A +   + +I G R  NVY +S  + Y+FKL        S  ++K  L++E G
Sbjct: 1   MSYIDLLAWITENKSIIEGSRIDNVYKISGIQAYLFKL-------HSKNTDK-FLVVEPG 52

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R+H T Y R+K  +  G    +R+ ++ + ++ +  LG +RI      + +    + +E
Sbjct: 53  KRIHFTKYDREK--SSEGEVRLIRELVKEKIIKSINILGNERIA----KIDLIDRKIYIE 106

Query: 124 LYAQGNILLTDSEFTVL--TLLRSHRD 148
           L  +G +++TD    VL  T  +  RD
Sbjct: 107 LLPRGLLVITDGNNKVLFSTEYKEFRD 133


>gi|150399105|ref|YP_001322872.1| hypothetical protein Mevan_0351 [Methanococcus vannielii SB]
 gi|150011808|gb|ABR54260.1| protein of unknown function DUF814 [Methanococcus vannielii SB]
          Length = 680

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 182/356 (51%), Gaps = 33/356 (9%)

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ-ENRVHTLKQEVDR 405
           E   +E+F  ALDE++S+   ++  +Q + K +    K  +I   Q E +    KQ V  
Sbjct: 275 EIKNYESFLVALDEYFSRFIIKKEIKQAETKINKLVKKQERILNSQLETKEKYEKQSVLN 334

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
             K  +LI  N  DVD  +  +R A   +M W  +  ++ + +   + + G I  +  + 
Sbjct: 335 QEK-GDLIYANYMDVDEILSTIRSA-REKMDWNAIKEVINKNK--DHQILGKIISVNEKN 390

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEK-VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
             +SL LS  LD  +       +EK V +DL  +A  +A  +Y+  KK ++K    ++  
Sbjct: 391 AEISLKLS--LDYGNG-----IIEKNVVLDLRKNAFESADDFYQKSKKFKNK----VSGV 439

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVH------------WFEKFNWFISSENYLVI 572
            +A K +EKK  L  L+EK   +   +R+              W+EK  W +  + YL++
Sbjct: 440 IEALKISEKK--LNELKEKEKTDSEVLREKEENIKKKEKKLLKWYEKLKWTLI-DGYLIV 496

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NEMI+KRY+ K D+  H  + GA  TVIK    E+     TL +   F   HS
Sbjct: 497 AGKDATTNEMIIKRYVEKNDIVFHTLMDGAPFTVIKMKDSEKAPEEKTLFEVSKFAASHS 556

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+ 
Sbjct: 557 RAWKLGVGSADVYWVMPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALDLGVGIF 612



 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 79/164 (48%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++  V  L+ LIG +    + LS    K  + K+     + E G S+++ +
Sbjct: 1   MKTEMTNVDISVAVNELQSLIGAKFDKAFLLSGSDGKELVLKV----HLPEVG-SKEIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRK++   ++  + Q  +DRI+LF F      +
Sbjct: 56  GLGKYKYITITEYEREKPKNPPSFAMLLRKNLNNIKITSIEQHNFDRIVLFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ +GN +L D    ++  L+  R   + V     +++P
Sbjct: 116 KLIIELFGEGNAILLDKNDVIILPLKIERWSTRNVVPKEIYKFP 159


>gi|302761992|ref|XP_002964418.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
 gi|300168147|gb|EFJ34751.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
          Length = 382

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           ++TSAWWVY HQVSK APTGEYLTVGS MIRGKKNFLPP+PL+MGFGL FRLD+SS+ +H
Sbjct: 174 IITSAWWVYDHQVSKNAPTGEYLTVGSLMIRGKKNFLPPYPLVMGFGLFFRLDKSSIPAH 233

Query: 699 LNERRVRG-------EEEGMDD--FEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAH 749
            NERR+R        E E  DD   +D+       ++   K+  D     E  SV  +  
Sbjct: 234 FNERRIRAKGDNEEPEAEIQDDEEIDDASVEDSQDNVHERKESGDGGSTIEKASVMEAEE 293

Query: 750 PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS-- 807
                  +    + E            ++   D     A      ++ L+D+AL L S  
Sbjct: 294 ARSEEAESEEARALE-----------TENAAMDEHEEQAPQSDSDIDSLLDKALELKSVL 342

Query: 808 -ASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
            + + + K+G+   Q +    D  V+ T   R+K YISKAE
Sbjct: 343 PSQVDTNKYGLGEVQTE-DHVDDAVQETKVAREKQYISKAE 382



 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 68/144 (47%), Positives = 88/144 (61%), Gaps = 22/144 (15%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLL 340
           +  L+ A+ +FEDWL+ V +GD +PEGYI          HP   +   T    E      
Sbjct: 17  LHSLLEAIKRFEDWLESVTTGDFMPEGYITF--------HPNKTAKKKTAESAE------ 62

Query: 341 NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
                    KF+TFDA LDEF+SKIE QR +QQ K +ED+A+ KL KI +DQ +RV +LK
Sbjct: 63  --------EKFDTFDAVLDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRVDQRSRVESLK 114

Query: 401 QEVDRSVKMAELIEYNLEDVDAAI 424
           +EVD++V  AELIEYNL DVD AI
Sbjct: 115 REVDQAVHTAELIEYNLADVDLAI 138


>gi|161527567|ref|YP_001581393.1| hypothetical protein Nmar_0055 [Nitrosopumilus maritimus SCM1]
 gi|160338868|gb|ABX11955.1| protein of unknown function DUF814 [Nitrosopumilus maritimus SCM1]
          Length = 652

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 169/368 (45%), Gaps = 51/368 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E  P+ L +    E  K  +F   LD  +++    + +    +  D    +L     +QE
Sbjct: 247 EVLPIQLGKIEG-EITKVNSFIEGLDTVFTQNIVDKGKSIQTSGSDKKIKELETQISEQE 305

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             + T+K+   RS  +  +     + +   IL++  + A  +   + A+++ E+   G P
Sbjct: 306 KAIQTVKE---RSKNITNVANSLYDMISKGILSIEDSSAQEIMTANNAKLISEK---GIP 359

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
           +  + D                             EK++VD   S  + A   +   KKQ
Sbjct: 360 LIVIQD-----------------------------EKIKVDTKASLQSIASALFNEAKKQ 390

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN-----ISHMRKVHWFEKFNWFISSEN 568
                      SK  K  EK      LQ KT +      +S +RK +W+E++ WF +S+ 
Sbjct: 391 SGAISSIEEIKSKTLKKLEK------LQNKTESEKDTILVSEIRKKNWYERYRWFYTSDG 444

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GRDA  N  +V++++ K D   H D+ G+   +IK+    Q VP  ++N+    T
Sbjct: 445 FLVIGGRDAASNSAVVRKHLDKNDKIFHGDIFGSPFFIIKDA---QNVPDTSMNEVSHAT 501

Query: 629 VCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           VC S+AW   M   SA+WV P QV K+AP+GE+L  GSF I G++NF+    L +  G++
Sbjct: 502 VCFSRAWREGMYGVSAYWVNPDQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561

Query: 688 FRLDESSL 695
            + D  +L
Sbjct: 562 PQEDGYAL 569



 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  +++  SGV L      +  +  P+    +
Sbjct: 27  VSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWL---TEVKIDQVEPNKLLKR 77

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +L+ + Q+G +RI  F+F  G    +V++ E +  GNILL ++E  +L L  
Sbjct: 78  LRSDLLRLKLKKIEQIGAERIAYFRFE-GFGKEFVLVGEFFGDGNILLCNNEMKILALQH 136

Query: 145 S 145
           S
Sbjct: 137 S 137


>gi|261403479|ref|YP_003247703.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           vulcanius M7]
 gi|261370472|gb|ACX73221.1| Fibronectin-binding A domain protein [Methanocaldococcus vulcanius
           M7]
          Length = 670

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 190/360 (52%), Gaps = 17/360 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           YD   P+ L ++   E   +E+F  A+D++++K  +    ++ K+K +    +   I   
Sbjct: 257 YD-VVPVNLKKYEDLEKKYYESFLDAVDDYFAKFLTNVEVKKKKSKIEKEIERQENILKR 315

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A++ A   +M W  + ++VKE +   
Sbjct: 316 QLETLERYKKDAEKNQIKGDLIYANYQIVENLLSAIKQA-REKMDWARIKKIVKENK--D 372

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+  L++ +    N   +++    D  D   KT+  E++ +D+  +A  NA R+YE  K
Sbjct: 373 HPILDLVEDI--RENIGEIIVRLKADVGD---KTIE-ERIPLDIRKNASENAERFYEKAK 426

Query: 512 KQESKQEKTITA---HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K + K E   TA     K  +  +KK    + +E         ++  W+EKF W + +  
Sbjct: 427 KLKHKVEGIKTAIELTKKKIEELKKKEEKTLGEEIPEMKKKKRKERKWYEKFKWTVIN-G 485

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI+G+DA  NE+++K+Y  K D+  HA++ GA  TVIK     + V   TL +   F+
Sbjct: 486 FLVIAGKDAITNEILIKKYTDKDDIVFHANIQGAPFTVIKTQG--RDVDEETLEEVAKFS 543

Query: 629 VCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+L
Sbjct: 544 VSHSKAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVL 603



 Score = 70.5 bits (171), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 82/164 (50%), Gaps = 2/164 (1%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M+K  M   DV   +  L++L+  R    + L  +    +L+    V E G  E V+ + 
Sbjct: 1   MMKTEMTNVDVCGVILELQKLVNSRLDKAF-LVERDNNRELILKLHVPEGGSRELVISVG 59

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +    +  T Y RDK   P  F + LRK+++  +L  + Q+ +DRI +  F      + +
Sbjct: 60  KYKY-ITLTNYERDKPKIPPSFAMLLRKYLKNAKLVKIEQVNFDRIAILHFETREGIYKL 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           I+EL+ +GN++  +SE T+++ LR      + +    ++++P +
Sbjct: 119 IVELFGEGNVIFLNSEDTIISPLRVEIWSSRKIVPKEKYQFPPQ 162


>gi|254166596|ref|ZP_04873450.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197624206|gb|EDY36767.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 593

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 123/204 (60%), Gaps = 6/204 (2%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           E  L  EK+++ +  S   NA  +Y+  KK    +EK   A     KA E+  +++  +E
Sbjct: 325 EIELEGEKIKLYVDKSVGENAGIYYDRSKKM---REKIKGAREALEKAKEELKKVKKKEE 381

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K    I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA
Sbjct: 382 KKKKEIRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGA 441

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+
Sbjct: 442 PSVVIKSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYV 499

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFG 685
             G++++ GK+N++   PL +  G
Sbjct: 500 ARGAWVVHGKRNYIHKVPLQLAVG 523



 Score = 48.9 bits (115), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K  R  I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 5   MLSLDIYAWLKENREFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 51

Query: 65  RLHTTAYARDKKN--TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    Q  +DRII+F+     N + +I+
Sbjct: 52  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQFNFDRIIIFEVP---NGYSLII 108

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 109 ELFGDGNIIVT 119


>gi|260803886|ref|XP_002596820.1| hypothetical protein BRAFLDRAFT_116214 [Branchiostoma floridae]
 gi|229282080|gb|EEN52832.1| hypothetical protein BRAFLDRAFT_116214 [Branchiostoma floridae]
          Length = 168

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/167 (44%), Positives = 111/167 (66%), Gaps = 10/167 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  ++  ++GMR +NVYD+  KTY+ KL+ +         EK +LL+
Sbjct: 1   MKGRFSTVDLRAILTEIKDSVLGMRVANVYDIDNKTYLIKLVKTD--------EKKMLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RL+ T++   K   PSGF++KLRKH+RTRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRLYATSFDWPKNMMPSGFSMKLRKHLRTRRLISIQQLGSDRIVDMQFGENEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR 167
           I+ELY +GN++LTD E+T+L LLR+  + D  V    R +YP E+ R
Sbjct: 113 IVELYDRGNLILTDYEYTILNLLRTRTEGD-DVRFAVREKYPLELAR 158


>gi|289596339|ref|YP_003483035.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
 gi|289534126|gb|ADD08473.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
          Length = 589

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 123/204 (60%), Gaps = 6/204 (2%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           E  L  EK+++ +  S   NA  +Y+  KK    +EK   A     KA E+  +++  +E
Sbjct: 321 EIELEGEKIKLYVDKSVGENAGIYYDRSKKM---REKIKGAREALEKAKEELKKVKKKEE 377

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K    I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA
Sbjct: 378 KKKKEIRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGA 437

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+
Sbjct: 438 PSVVIKSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYV 495

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFG 685
             G++++ GK+N++   PL +  G
Sbjct: 496 ARGAWVVHGKRNYIHKVPLQLAVG 519



 Score = 48.9 bits (115), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K  R  I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 1   MLSLDIYAWLKENREFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 47

Query: 65  RLHTTAYARDKKN--TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    Q  +DRII+F+     N + +I+
Sbjct: 48  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQFNFDRIIIFEVP---NGYSLII 104

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 105 ELFGDGNIIVT 115


>gi|45358591|ref|NP_988148.1| hypothetical protein MMP1028 [Methanococcus maripaludis S2]
 gi|44921349|emb|CAF30584.1| unnamed protein product [Methanococcus maripaludis S2]
          Length = 680

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 176/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q +     +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLDTKDKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  +  ++KE +   +P+   I  + 
Sbjct: 331 SVSNHKRGDLIYANYSLVDEIVSTIKDA-REKMDWNGIKNVIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDSVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G++
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 80/164 (48%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++  V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISVAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYMTLTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFP 159


>gi|284174391|ref|ZP_06388360.1| hypothetical protein Ssol98_07002 [Sulfolobus solfataricus 98/2]
 gi|384433658|ref|YP_005643016.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|261601812|gb|ACX91415.1| protein of unknown function DUF814 [Sulfolobus solfataricus 98/2]
          Length = 609

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 17/245 (6%)

Query: 448 RKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           R+ GN +   A  ID+L L     S  +  N+D ++          +E+D +LSA  NA 
Sbjct: 299 RQLGNFILSKAYEIDQLLLNNRAKSKKVKLNVDGVE----------IELDTSLSATKNAM 348

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           R+++  K+ + K E+ + +  +  +   K  + +I ++  +     +RK  W+EK+ W I
Sbjct: 349 RFFDEAKEYKRKIERALKSLEELKEKLAKIEKQEIEKQNEIK--LTLRKKEWYEKYRWSI 406

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           S   YL+I GRDA QNE IVK+Y+   D+++HAD+ GA +T+I      + +    +  A
Sbjct: 407 SRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIITQ-DNKTISEEDIYDA 465

Query: 625 GCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
                 +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L + 
Sbjct: 466 AVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFIKNVKLQLA 525

Query: 684 FGLLF 688
            GL+ 
Sbjct: 526 IGLIL 530


>gi|15897146|ref|NP_341751.1| hypothetical protein SSO0195 [Sulfolobus solfataricus P2]
 gi|13813331|gb|AAK40541.1| Membrane conserved hypothetical protein [Sulfolobus solfataricus
           P2]
          Length = 609

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 17/245 (6%)

Query: 448 RKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           R+ GN +   A  ID+L L     S  +  N+D ++          +E+D +LSA  NA 
Sbjct: 299 RQLGNFILSKAYEIDQLLLNNRAKSKKVKLNVDGVE----------IELDTSLSATKNAM 348

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           R+++  K+ + K E+ + +  +  +   K  + +I ++  +     +RK  W+EK+ W I
Sbjct: 349 RFFDEAKEYKRKIERALKSLEELKEKLAKIEKQEIEKQNEIK--LTLRKKEWYEKYRWSI 406

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           S   YL+I GRDA QNE IVK+Y+   D+++HAD+ GA +T+I      + +    +  A
Sbjct: 407 SRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIITQ-DNKTISEEDIYDA 465

Query: 625 GCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
                 +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L + 
Sbjct: 466 AVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFIKNVKLQLA 525

Query: 684 FGLLF 688
            GL+ 
Sbjct: 526 IGLIL 530


>gi|300176455|emb|CBK23766.2| unnamed protein product [Blastocystis hominis]
          Length = 159

 Score =  138 bits (347), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 74/162 (45%), Positives = 104/162 (64%), Gaps = 10/162 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K RM   DV A V  L+ ++ G + +NVYD+S K YI KLM            +  L+
Sbjct: 1   MPKTRMTALDVRACVNELKGIVLGAKLANVYDVSNKVYILKLMKGGA--------QYNLV 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +ESGVR+H T Y R+K   P+ F+ KLRKHIR RR+E VRQ+G+DR++   FG G   ++
Sbjct: 53  IESGVRVHLTKYLREKNQFPNTFSQKLRKHIRNRRIEAVRQIGFDRVVDLVFGNGETTYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           VI+ELY+ GNI+LT+ EF V+ LLRS+  +D G  +  +H+Y
Sbjct: 113 VIVELYSGGNIILTNYEFEVMFLLRSYTLND-GTQVDVKHQY 153


>gi|150402208|ref|YP_001329502.1| hypothetical protein MmarC7_0281 [Methanococcus maripaludis C7]
 gi|150033238|gb|ABR65351.1| protein of unknown function DUF814 [Methanococcus maripaludis C7]
          Length = 680

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 178/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  +++A   +M W  +  ++KE +   +PV   I  + 
Sbjct: 331 SILNHKRGDLIYANYSLVDEIVSTIKLA-REKMDWNGIKNVIKENK--THPVLSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K    I 
Sbjct: 388 EKNAELTLNLSA------DYGNGLIEDTVPVDLRKNAFENADIVYQKSKKFKNKVHGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRK------------VHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK   +   +++            + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVLKEKEENIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GE+L  G+F+IRGK+NF+    L +G G+L
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEFLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 82/164 (50%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     + TT Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITTTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFP 159


>gi|254167318|ref|ZP_04874170.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197623581|gb|EDY36144.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 589

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 93/139 (66%), Gaps = 3/139 (2%)

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA S VI
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+  G++
Sbjct: 443 KSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAW 500

Query: 667 MIRGKKNFLPPHPLIMGFG 685
           ++ GK+N++   PL +  G
Sbjct: 501 VVHGKRNYIHKVPLQLAVG 519



 Score = 47.0 bits (110), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K     I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 1   MLSLDIYAWLKENIEFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 47

Query: 65  RLHTTAYARDKKN--TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    QL +DRII+F+     N + +I+
Sbjct: 48  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQLNFDRIIIFEVP---NGYSLII 104

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 105 ELFGDGNIIVT 115


>gi|340624350|ref|YP_004742803.1| fibronectin-binding A domain-containing protein [Methanococcus
           maripaludis X1]
 gi|339904618|gb|AEK20060.1| Fibronectin-binding A domain protein [Methanococcus maripaludis X1]
          Length = 680

 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 176/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q +     +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLDTKDKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  +  ++KE +   +P+   I  + 
Sbjct: 331 SISNHKRGDLIYANYSLVDEIVSTIKDA-REKMDWNGIKNVIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDSVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      + +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENIMFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G++
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 81/164 (49%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITLTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKELYKFP 159


>gi|159040762|ref|YP_001540014.1| hypothetical protein Cmaq_0175 [Caldivirga maquilingensis IC-167]
 gi|157919597|gb|ABW01024.1| protein of unknown function DUF814 [Caldivirga maquilingensis
           IC-167]
          Length = 650

 Score =  137 bits (344), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 132/437 (30%), Positives = 215/437 (49%), Gaps = 49/437 (11%)

Query: 268 MKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI-----SGDIVPEGYI--LMQNKHLGKDH 320
           +KL E + L D A   L   +    DW ++V      S  ++  G I  +++  HLG+  
Sbjct: 160 LKLIEDSGLSDEA---LAKGLGLGTDWAREVCTRSGCSDPVLVWGSIRGILEVLHLGRLK 216

Query: 321 PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQR-AEQQHKAKED 379
           P   +  S        P+ L+  +  EF + E+F+ A+D++++ IE +R AE++ K  ED
Sbjct: 217 PVIYASPSY-----VSPIPLSSIKG-EFKEVESFNKAVDDYFTSIEVERVAEERVKGIED 270

Query: 380 AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE---YNLEDVDAAILAVRVALANRMS 436
               +L     + E+ V    +E +   +  ELI    Y   ++  A+L  R  +A++ S
Sbjct: 271 E-IARLESSIKELEDTVGGYLREAENLRRRGELIMGRLYEFSELHEALL--RAYMADKDS 327

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           ++     VK     G  V   ID   L R  + + ++NN              +VE+ L 
Sbjct: 328 FKA---KVKGIEYGGIKV---IDYDPL-RKTVKVTVNNN--------------EVELTLG 366

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKA-FKAAEKKTRLQILQEKTVANISHMRKVH 555
            S    A +++E  K+ E K +      ++   K  E ++R+    E+T A +  +    
Sbjct: 367 ESPGETAAKYFEEAKRLEKKAKAAEAKLTELRGKVNELRSRVNEATEETRAAVRFVASRE 426

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFE+F WFI+S    V++G+DA QNE IVKRYM+  D+++HAD+ G   TVIK  R  Q 
Sbjct: 427 WFERFRWFITSGGSPVLAGKDAGQNEAIVKRYMNPWDLFLHADVQGGPVTVIKVTR-GQE 485

Query: 616 VPPLTLNQAGCFTVCHSQAWD-SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           V    L +A  +   +S+AW         ++V   QVSK AP+GEYL+ G FMI G++ +
Sbjct: 486 VKQQDLIEAAQYAAAYSKAWKLGANSIDVYYVKGEQVSKKAPSGEYLSKGGFMIYGQRGW 545

Query: 675 LPPHPLIMGFGLLFRLD 691
           +    LI+  GL  R+D
Sbjct: 546 VRGVELIISVGL--RID 560


>gi|134045609|ref|YP_001097095.1| hypothetical protein MmarC5_0566 [Methanococcus maripaludis C5]
 gi|132663234|gb|ABO34880.1| protein of unknown function DUF814 [Methanococcus maripaludis C5]
          Length = 680

 Score =  137 bits (344), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 175/358 (48%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  + +++KE +   +P+   I  + 
Sbjct: 331 SLSNHKRGDLIYANYSLVDEIVGTIKDA-REKMDWNGIKKIIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK + K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDTVPVDLRKNAFENADIVYQKSKKFKHKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L++K                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKDKEKLDSEILKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENVLFEVAKFASS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+L
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 81/164 (49%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITITEYEREKPRNPHSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFP 159


>gi|257053989|ref|YP_003131822.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
 gi|256692752|gb|ACV13089.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
          Length = 707

 Score =  135 bits (341), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 115/482 (23%), Positives = 202/482 (41%), Gaps = 77/482 (15%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
            L   L +G    E +    G+  N  + E     D   + L  AV      L++   GD
Sbjct: 188 TLATQLNFGGLYGEELCSRAGVPYNQAIGETT---DAEFEALYDAVNDLSTRLRE---GD 241

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           + P  Y     +    D                 P+ L ++       F++F+ AL+ ++
Sbjct: 242 LDPRLYFETDEQETPVD---------------VTPVPLVEYEDTPGESFDSFNDALEAYF 286

Query: 363 SKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
             +E +  E++   ++   +A   K  +I   QE  +   +++ +   + AEL+  N + 
Sbjct: 287 LGLEQEPDEEETGSNRPDFEAEIEKQKRIIQQQEGAIEDFEEDAEAEREKAELLYANYDL 346

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  +  V+ A A    W+++   +   +  G P A                    + ++
Sbjct: 347 VDEVLSTVQDARAAETPWDEIEATLSAGKDQGIPAA------------------EAVRDV 388

Query: 480 DDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESK-------------QEKTIT 522
           D  E T+ V+     +E+D       NA R Y+  K+ E K             Q + + 
Sbjct: 389 DGSEGTVTVQIDDHHIELDADTGVEKNADRLYQEAKRIEGKKAGAEEAIANTREQLEAVK 448

Query: 523 AHSKAFKAAEKKTRLQILQEK-----------TVANISHMRKVHWFEKFNWFISSENYLV 571
              +A++A++         E            T  +I       W+E+F WF +S+ +LV
Sbjct: 449 QRREAWEASDGDDGGDGSGETHEDDQEDVDWLTRESIPIRTSEEWYERFRWFTTSDGFLV 508

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP-EQP-----VPPLTLNQAG 625
           I GR+A QNE +VK+Y+ +GD++ H   HGA +T++K   P E P     +P  +  +A 
Sbjct: 509 IGGRNADQNEELVKKYLDRGDLFFHTQAHGAPATILKATGPSEAPPDDISIPESSREEAA 568

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            F + +S  W + K     + V P QV+KT  +GEYL  GSF IRG + +    P+ +  
Sbjct: 569 QFAISYSTLWKEGKYAGDVYCVGPDQVTKTPESGEYLEKGSFAIRGDRTYYDDTPVGVAV 628

Query: 685 GL 686
           G+
Sbjct: 629 GI 630



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 70/164 (42%), Gaps = 9/164 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSS-GVTESGESEKVLLLME 61
           K  + + D AA    LR  +G      Y         KL   + G  E      +L+ ++
Sbjct: 4   KRELTSVDCAALAGELRAFVGAYHEKSYLYDDDLLRLKLSGPNFGRIE------LLIEVD 57

Query: 62  SGVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
              R+HT    R  +    P  F + LR  +   +LE V Q  +DRI+  +F    +   
Sbjct: 58  DPKRVHTITPDRVPNAPERPPNFAMMLRNRLEGAQLESVEQFEFDRILQLRFERSDDHTT 117

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +I EL+  GN+ + D   TV+  L + R   + V   S++ +P+
Sbjct: 118 IIAELFGDGNLAVLDETDTVIDSLETVRLQSRTVTPGSQYEFPS 161


>gi|193787557|dbj|BAG52763.1| unnamed protein product [Homo sapiens]
          Length = 481

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 56/86 (65%), Positives = 70/86 (81%)

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 1   MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 60

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 61  LFKVDESCVWRHQGERKVRVQDEDME 86


>gi|374635672|ref|ZP_09707266.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
 gi|373561525|gb|EHP87758.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
          Length = 673

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 191/377 (50%), Gaps = 23/377 (6%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E  ++  F  ALD+++++   +   ++ ++K      K  +I   
Sbjct: 256 YVDVVPINLKKYGDFEKKEYGEFLEALDDYFAQFMVKVEVKKEESKLQKLIKKQERILKT 315

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   ++++  + +  +LI  N   VD  +  +R A   +M W  + +++KE +   
Sbjct: 316 QWETLEKYEKDMQENQEKGDLIYANYMLVDEILNTLRNA-REKMDWYKIKKIIKEHK--D 372

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +PV GLI  +  +   + + LS +  +   E+       V +D+  +A  NA  +Y   K
Sbjct: 373 HPVLGLIQNINEKNGEIVIKLSADYGDRKIEKN------VSLDIRKNAFENAETYYTKSK 426

Query: 512 KQESKQEKT-----ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           K + K E       +T         +++  L+ L+EK        ++  W+EKF W + +
Sbjct: 427 KLKGKLEGIKEAIKLTEKKIEELKEKEEIELKELKEKEKIKKKERKERKWYEKFKWTVIN 486

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
             +LVI+G+DA  NE+++K+Y    D+  HA + GA  TVIK ++  + V   TLN+   
Sbjct: 487 -GFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTNK--RIVDEETLNEVAK 543

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F+V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRGK+NF+   PL +G G
Sbjct: 544 FSVAHSRAWKLGWGALDTYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRNVPLELGIG 603

Query: 686 LL-----FRLDESSLGS 697
           ++      RL  S L +
Sbjct: 604 VIEYDDALRLTTSPLNT 620



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/97 (24%), Positives = 51/97 (52%)

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +  T Y R K   P  F + LRK+++  ++  + Q+ +DRI++  F      + +++EL+
Sbjct: 64  ITMTNYERKKPKNPPSFAMLLRKYLKNIKITKIEQVDFDRIVIITFEWNETVYKLVVELF 123

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
             GN++L D E  ++  L+  R   + +     +++P
Sbjct: 124 GDGNVVLLDKEDRIIMPLKMGRWSTRNIIPKEFYKFP 160


>gi|407461558|ref|YP_006772875.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407045180|gb|AFS79933.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 651

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 128/236 (54%), Gaps = 21/236 (8%)

Query: 471 LLSNNLDEMDDEEKTLPV-----EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
           +LSNN  ++  E K +P+     EK+++++     + A   +   KKQ           S
Sbjct: 344 ILSNNNAKLITE-KGIPLIVIQDEKIKINIKAPLQSIASTLFNEAKKQSGAISSIEEIKS 402

Query: 526 KAFKAAEKKTRLQILQEKTVAN-----ISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           K  K  EK      LQ KT +      +S +RK +W+E++ WF +S+ +LVI GRDA  N
Sbjct: 403 KTLKKLEK------LQNKTDSEKDSVLVSEIRKKNWYERYRWFYTSDGFLVIGGRDAASN 456

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
             +V+++++K D   H D+ G+   +IK+    Q  P  ++N+    TVC S+AW   M 
Sbjct: 457 SAVVRKHLAKNDKIFHGDIFGSPFFIIKDA---QNAPDTSMNEVAHATVCFSRAWREGMY 513

Query: 641 -TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
             SA+WV P QV K+AP+GE+L  GSF I G++NF+    L +  G++ + D  +L
Sbjct: 514 GVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGIIPQEDGYAL 569



 Score = 43.5 bits (101), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 79/164 (48%), Gaps = 19/164 (11%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  +++  SGV L      +  +  P+    +
Sbjct: 27  VSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWL---TEVKIDQVEPNKLLKR 77

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +L+ ++Q+G +RI  F F  G    +V++ E +  GNILL + E  +L L  
Sbjct: 78  LRSDLLRLKLKKIKQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCNDEMKILALQH 136

Query: 145 S----HRDDDKGVAIMSRHRYPTEICRV----FERTTASKLHAA 180
           S    HR    G+  ++  +   +I  +    FE    ++L AA
Sbjct: 137 SIDVRHRKLSVGLEYVTPPQSGLDIFNLSESDFEDIKTTELVAA 180


>gi|407465827|ref|YP_006776709.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
 gi|407049015|gb|AFS83767.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
          Length = 648

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 172/362 (47%), Gaps = 55/362 (15%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E  P+ L +    E  +  +F   LD  +++   ++ +    +  D    +L     +QE
Sbjct: 244 EVLPIRLGKLEG-EITQVNSFIEGLDTVFTENIIEKGKSVQSSGSDKKIKELQTQISEQE 302

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             + T+K+   RS  +  +     E V   I+++   LA  +  ++ A+++ E+      
Sbjct: 303 KAIETVKE---RSKNITNVANSLFEMVSKGIISIEDNLAQEILAKNNAKLINEK------ 353

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
                         +SL++  +             EK++++      + A   ++  KKQ
Sbjct: 354 -------------GISLIVVQD-------------EKIKINTQSPLQSIASVLFDEAKKQ 387

Query: 514 ESKQEKTITAHSKAFKAAEKKT--RLQILQEKT-----VANISHMRKVHWFEKFNWFISS 566
            S           + KA ++KT  RL+  Q KT     +  +S +RK +W+E++ WF ++
Sbjct: 388 SSA--------IFSIKAIKEKTEKRLEKFQSKTDSEKDLIVVSEIRKKNWYERYRWFFTT 439

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           + +L I GRDA  N  ++++++ K D   H D+ G+   ++K+    Q  P  ++N+   
Sbjct: 440 DGFLTIGGRDAASNSAVIRKHLDKNDKIFHGDIFGSPFFILKDS---QNAPDTSMNEVAH 496

Query: 627 FTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
            TVC S+AW   M   SA+WVYP Q+ K+AP+GE+L  GSF I G++NF+    L +  G
Sbjct: 497 ATVCFSRAWREGMYGVSAYWVYPDQIKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVG 556

Query: 686 LL 687
           ++
Sbjct: 557 IM 558



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 11/124 (8%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           G   SN+Y ++  + +FKL ++       +S+  +++  SGV L +    +  +  P+  
Sbjct: 21  GYYVSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWLTS---VKIDQMEPNRL 71

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLT 141
             +LR  +   +L+ + Q+G +RI  F F  G    +V++ E +  GNILL ++E  +L 
Sbjct: 72  LKRLRSDLLRLKLKKIEQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCNNEMKILA 130

Query: 142 LLRS 145
           L  S
Sbjct: 131 LQHS 134


>gi|284161856|ref|YP_003400479.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           profundus DSM 5631]
 gi|284011853|gb|ADB57806.1| Fibronectin-binding A domain protein [Archaeoglobus profundus DSM
           5631]
          Length = 626

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 66/437 (15%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
           +L    G G   +E   L  G+  N    +++  E   I   ++++       + V  GD
Sbjct: 161 LLAVKCGLGGLFAEETCLRAGIDKNKLGKDLSDEEFERIYRAMMSI------FEPVFKGD 214

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           I P              H   + G     Y +  P+ L  +R  E   FE+F+ ALDEFY
Sbjct: 215 IKP--------------HIVIKDGE----YIDVLPIELEYYRDYEKKYFESFNKALDEFY 256

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           SK  ++  E++          KL K    Q      L++E ++   + + I  N   ++ 
Sbjct: 257 SKTIAETEEEES-----EELKKLRKRLEIQLESKRKLEEEAEKFKSLGDFIYENYATIEK 311

Query: 423 AILAVRVALANRMSWEDL---ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           A+ A R A   +MS+E+    A+ +K  +  G             ++ + ++L+      
Sbjct: 312 ALNAFRQA-KEKMSFEEFKAKAKSLKFVKDVG-------------KDYVVIVLNGK---- 353

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
                     ++ +DL    H  A  +YE  KK   K E  + A  K  K  E+  R + 
Sbjct: 354 ----------EIRLDLDKDIHGIAESYYEKAKKAREKLEGLLIAIEKTKKEIEEAERKEK 403

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           L  K  A I  +RK  WFE+F WFI+S+ +L I GR+AQ NE IV +Y+   D++ H   
Sbjct: 404 L--KYTAPIRIVRKREWFERFRWFITSDGFLAIGGRNAQMNEEIVSKYLEPKDLFFHTQT 461

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTG 658
            GA + V+K        P +++ +   F   +S  W   + +   ++V   QV K+A  G
Sbjct: 462 PGAPAVVLKKG---LEAPEISIVETAQFAAIYSSLWKQGLHSGEVYYVTADQVKKSAKAG 518

Query: 659 EYLTVGSFMIRGKKNFL 675
           EYL  GSF I GK+N++
Sbjct: 519 EYLPKGSFYIVGKRNYI 535



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 74/142 (52%), Gaps = 13/142 (9%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++ D+   V+ L+ LIG +   +Y   P     K      +   G  +   L++E+G R
Sbjct: 1   MSSLDIYVCVRELQELIGGKVEKIYHYPPNEIRIK------IYAKGRKD---LIIEAGRR 51

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T + ++    PS F + LRKH+  +R+E + Q  +DR+++  FG       ++ EL+
Sbjct: 52  IHLTIFPKESPKFPSPFAMLLRKHLEGKRIEKIWQHDFDRVVVIDFG----DRKIVAELF 107

Query: 126 AQGNILLTDSEFTVLTLLRSHR 147
           A+GN+ LTD  F V+  +   R
Sbjct: 108 AKGNVALTDENFDVIMDIHGKR 129


>gi|48478297|ref|YP_024003.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
 gi|48430945|gb|AAT43810.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
          Length = 611

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 124/205 (60%), Gaps = 18/205 (8%)

Query: 482 EEKTLPVEK----VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK-KTR 536
           E+KT  ++     + ++   SA  N    ++  K  ++K E    A  ++ +  EK K R
Sbjct: 344 EKKTFEIKMDDDLIRINYTKSAGENLNIIFDTAKDYKNKIEGAKRAIEESMRLYEKEKNR 403

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            ++ +          R  +WFE ++WF SS N++V++GRDA+ NE ++K++M + D+YVH
Sbjct: 404 TEVKK----------RPRYWFETYHWFFSSNNFMVLAGRDAKTNESLIKKHMEENDIYVH 453

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTA 655
           ADL+GA ST+IK+      +   T+ +A  F +  S+AW + + + +A+WVYP QVSKT 
Sbjct: 454 ADLYGAPSTLIKSEG--NTIDERTIREACIFAISFSRAWPAGIGSGTAYWVYPSQVSKTP 511

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPL 680
            +GE+++ GS++IRGK+N++   PL
Sbjct: 512 ESGEFISKGSWVIRGKRNYIFDLPL 536


>gi|327401161|ref|YP_004342000.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           veneficus SNP6]
 gi|327316669|gb|AEA47285.1| Fibronectin-binding A domain protein [Archaeoglobus veneficus SNP6]
          Length = 637

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/349 (29%), Positives = 174/349 (49%), Gaps = 39/349 (11%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L  +   E   F TF+ ALDE+Y++  S+  +++ +        +L K+   
Sbjct: 236 YIDVLPIELQIYDGLERKYFPTFNEALDEYYARRISEVKQEESE--------ELKKLKAR 287

Query: 392 QENRVHTLKQ---EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
            E ++ T K+   E++R     + +  N + ++  + A R A   + SW+++ ++V+   
Sbjct: 288 LEKQLETKKEFENEMERYRAAGDAVYENYQLLEQILEAFRQARQQK-SWDEIKKIVR--- 343

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
            A   ++ L+ +++ E+N + + +   ++   D  K LP               A  +YE
Sbjct: 344 -AHPKLSKLVVEIHPEKNSVVVNIGPKIELALD--KNLP-------------QIADVYYE 387

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ-EKTVANISHMRKVHWFEKFNWFISSE 567
             KK   K E  + A  K     E+  R++ L+ +K V  +   RK  WFE+F WFI+S+
Sbjct: 388 RAKKVRQKLEGLLKAIEKT---KEEMQRVEELEAKKYVKGLRVARKREWFERFRWFITSD 444

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI GR+A  NE IV +YM   D++ H    GA +TV+K     Q  P  ++ +A  F
Sbjct: 445 GFLVIGGRNAAMNEEIVSKYMEPKDLFFHTQTPGAPATVLKLG---QEAPETSIIEAAQF 501

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
              +S  W + K     ++V P QV + A  GEYL  GSF I GK+N+L
Sbjct: 502 AATYSALWKEGKYSGEVYYVKPEQVKRAAKHGEYLARGSFYIEGKRNYL 550



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 77/139 (55%), Gaps = 10/139 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++AD+AA V  L++L+G +   +Y   P     K      +   G  +   L++E+G R
Sbjct: 4   MSSADIAACVSELQQLVGGKVEKIYHHPPDEIRVK------IYAGGRKD---LILEAGRR 54

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T + R+    PS F + LRKH+   R+  + Q  +DR+++ +       +++I+EL+
Sbjct: 55  IHLTKFPRESPRIPSSFAMLLRKHLEGGRVRKIEQHDFDRVVVIEVE-REKRNFIIVELF 113

Query: 126 AQGNILLTDSEFTVLTLLR 144
           ++GN++L D  F ++  L+
Sbjct: 114 SKGNVILADESFRIIMPLK 132


>gi|330506586|ref|YP_004383014.1| hypothetical protein MCON_0325 [Methanosaeta concilii GP6]
 gi|328927394|gb|AEB67196.1| protein of unknown function (DUF814) [Methanosaeta concilii GP6]
          Length = 641

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 166/360 (46%), Gaps = 45/360 (12%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           +  P  L  +   E  +F TF  ALD F+ + E +   Q      D   H++      Q 
Sbjct: 247 DVLPRPLKLYSGLEKKRFVTFSEALDAFFVEREKETTRQ------DPLEHRIEL----QR 296

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             +   + +    V+  ELI      V+  +  +  A A   S+  +      ER +G+ 
Sbjct: 297 KAIEEFRSQEAELVRKGELIYQLYGSVEQILTLMNDARARGFSYNQIW-----ERISGSG 351

Query: 454 VAGLIDKLYLE-RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE---- 508
           +      L L+ R  M + L                E++E++  L+   NA+R+Y+    
Sbjct: 352 LPQAKTILSLDGRGEMRVFLDG--------------EELELNAELAVPQNAQRYYDKAKD 397

Query: 509 -LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
            ++K + ++    IT   KA K A KKTR        V++    RK  W+E+F WF SS+
Sbjct: 398 MVRKARGAQSALAITEELKAGKVAPKKTR-------AVSSYYRRRKPKWYERFRWFYSSD 450

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LV+ GRDA  NE I  +Y+ + D+ +H D  GA  T IK    E  VP  TL +A  F
Sbjct: 451 GFLVLGGRDADSNEEIYAKYLERRDLAMHTDAPGAPLTAIKTEGKE--VPESTLQEAAGF 508

Query: 628 TVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V +S  W S +  +  + V   QVSKT  +GE+L  G+F+IRG++ +    PL +  G+
Sbjct: 509 AVSYSSLWKSGLAAADCYLVKGDQVSKTPESGEFLKKGAFVIRGERRYFRDVPLGIALGI 568



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 74/144 (51%), Gaps = 8/144 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M+  DVAA VK L+ R++G      Y  S       +       +S    ++ LL+
Sbjct: 1   MKKAMSNVDVAAMVKELQDRILGGFMGKAYQQSSDRIWLSV-------QSPAEGRLDLLL 53

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T   R    TP  F   LR H+   R+ D+RQ  +DR++  +      A Y+
Sbjct: 54  ETGRRVHITKAERPASKTPPQFPTMLRSHLSGGRIVDIRQHQFDRVLEIKVERSGTARYL 113

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+EL+ +G+++L D    +L++LR
Sbjct: 114 IVELFPKGSMILLDESRNILSMLR 137


>gi|310752298|gb|ADP09459.1| FbpA and DUF814 domain protein [uncultured marine crenarchaeote
           E48-1C]
          Length = 608

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 178/368 (48%), Gaps = 35/368 (9%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQ 392
           P  L  +   E   +E+F+  LDEFY ++ +         +E  +      +L +I   Q
Sbjct: 180 PFRLKCYADFEHKCYESFNETLDEFYVRVGAIEKALTVATEEVGSLKQEMERLKRIIEMQ 239

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E    T K  +  + +M ++I  +  +++A +           +W+++   V  E+K G 
Sbjct: 240 EEACATAKTNMQENKRMGDIIHVHAGELEALLHRFLAGREEGKAWDEIVSEVLAEKKTGV 299

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
             +G +    +  +   L++   LD +          +  + L  S   NA R+Y   K+
Sbjct: 300 KSSGFL----VSFDDKHLVVDVCLDGL----------QFGLSLRRSLFDNAARFYRRYKR 345

Query: 513 QESKQEKTITAHSKAFKAAEK-KTRLQILQEKTVANISHM--------RKVH---WFEKF 560
            + K +    A  ++ +  E+ + RL+  + +   ++S +        RK+    WFEKF
Sbjct: 346 AKQKLDGAKIAMEESHRKLEEVEARLE--KAEAAGSVSPVEVIEEVAERKIERKKWFEKF 403

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SS+  LV++G+DA  NE++V +Y + GD+  HAD+ GA   V+K +  E+P     
Sbjct: 404 RWFVSSDGVLVVAGKDAVSNEVLVNKYATDGDIVFHADVVGAPFVVVKMN-GEKPSEE-C 461

Query: 621 LNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           L QAG F    S+ W     +   +WV P Q+ K+A +G+Y+  G F++RGK+N++   P
Sbjct: 462 LRQAGVFAASFSRGWREGFASVDVYWVKPDQLDKSAKSGQYVPKGGFVVRGKRNWMRGSP 521

Query: 680 LIMGFGLL 687
           L +  G++
Sbjct: 522 LRLAVGIV 529



 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           + LRK++R  RL +V Q  ++R+++F F        + LEL+  GN +L D + T+L  L
Sbjct: 1   MGLRKYLRNCRLANVEQSDFERVVIFTFETWAGEMRLYLELFGGGNAILVDEKGTILQAL 60

Query: 144 RSHRDDDKGVAIMSRHRY-------PTEICR 167
              R  D+ +      R+       P  +CR
Sbjct: 61  TYKRMRDRNIIRDQIFRFAPPVGKNPFRVCR 91


>gi|389860344|ref|YP_006362583.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
 gi|388525247|gb|AFK50445.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
          Length = 644

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 186/375 (49%), Gaps = 54/375 (14%)

Query: 331 IYDEFCP-LLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ--HKAKEDAAFHKLNK 387
           +Y  F P +L+ +++    VK   F+ A+D F+   E + A +    +A E AA  +L K
Sbjct: 241 LYTSFKPSVLIEEYKLS--VKGVDFNTAVDTFFGHYERRVARETTLRRAGEKAA--ELKK 296

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              + + R+   ++++D    +   I  N   V+  +L  +  +     WE +      E
Sbjct: 297 AIDEIQQRISAFQKDLDGYRSILNTIYENYAQVEQVLLCAQ-EVRRAAGWESVP-----E 350

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
           R +G      ++    ++  + + + ++   +D          + +DL        R   
Sbjct: 351 RCSG------VESYQADKGLVLVKVGDSTVWLD----------IRLDLK-------RNVI 387

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWFIS 565
           E+KKK    + K  TA +K  +  E+  ++    L+E  +     +R   W+E+F+W I+
Sbjct: 388 EIKKKIGELERKLETALNKKREMEEELKQIGEASLEEPRLV----IRPREWYERFHWTIT 443

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S  +L I GRDA QNE I ++YM + D+++HAD+HGA   V+K     + VP   + +A 
Sbjct: 444 SNGFLAIGGRDADQNETIYRKYMEESDIFLHADVHGAPVVVVKTR--GEDVPETDIREAA 501

Query: 626 CFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP-PHPLIMG 683
             T C+S+AW + + +   +WV   QVSK+ P+GEYL+ GSFM+ GK+N+L  P  L +G
Sbjct: 502 YLTACYSRAWKAGLASIEVFWVRGGQVSKSPPSGEYLSKGSFMVYGKRNYLSIPLELALG 561

Query: 684 --------FGLLFRL 690
                   +G+ +RL
Sbjct: 562 VEKVESSVYGVYYRL 576



 Score = 41.2 bits (95), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 22/82 (26%), Positives = 45/82 (54%), Gaps = 3/82 (3%)

Query: 60  MESGVRLH-TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +E GVR H +     +KK  P    + +RKH+   ++  VRQ+G++R++  +   G   +
Sbjct: 47  LEPGVRFHLSNIVPSEKKVDP--LAIFVRKHLDNVKVLGVRQVGWERVLRVELARGSEKY 104

Query: 119 YVILELYAQGNILLTDSEFTVL 140
            + +EL  +G +++ + E  +L
Sbjct: 105 SMFIELLPRGVVVIANYEERIL 126


>gi|20093528|ref|NP_613375.1| RNA-binding protein snRNP [Methanopyrus kandleri AV19]
 gi|19886366|gb|AAM01305.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Methanopyrus kandleri AV19]
          Length = 671

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 162/684 (23%), Positives = 279/684 (40%), Gaps = 100/684 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + DV A  + L  L+ G     +Y +  +    K ++  GV          L+ E G+
Sbjct: 9   MTSFDVRATARELDSLLEGALIDKIYQVGERELKVK-VHVPGVGSH------YLVWEPGM 61

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T   +   + P+  +  LR  +   R+E V QLG+DRI+ F    G   H   +EL
Sbjct: 62  RVHLTWRPKPSPDQPTSVSQALRNTLSGDRIERVTQLGFDRILRFDLRSGRRVH---VEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
             +G + +TD    +     + R  ++ V        P E+                   
Sbjct: 119 LPKGTLAVTDENNVIERAFPARRFRNRAVV-------PGEVY------------------ 153

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
            EP    PD    D +                   +L   ++++           L   L
Sbjct: 154 -EPPEGPPDPYELDRDAF----------------LELLLEADRD-----------LVRTL 185

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
              +G G   +E ++L  GL    +    +   +     L        D L+ +  GD+ 
Sbjct: 186 AVDVGLGGLYAEEVLLRAGLYERRE----SHASEFEEDELEELYETLRDLLEQISEGDLR 241

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY-S 363
           P  Y   +  ++     P E  S     DE            E  + +TF  ALDE+Y +
Sbjct: 242 PTLYRTTERDYVDVTPVPLERYS-----DEL-----------EMEEQDTFQRALDEYYVT 285

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
           K  +++  +  +  E         I   Q + +  L+ + ++    A  +  N   VD  
Sbjct: 286 KFLAEKEREVREEWEREKRRLERTIER-QRSSIEQLRTKAEKLRGRANALYLNYNLVDGI 344

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEE 483
           +  +R A     S +++ R ++E + +G      I  + +E   + L L       ++ E
Sbjct: 345 LSELRKAERKGYSLDEIKRRIQEAKGSGIEEVERIADIDVENRRVILRLPG-----ENGE 399

Query: 484 KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEK 543
            T+PV  ++ D+  +A     R  EL++K E  QE  +    +  +   ++   ++  E+
Sbjct: 400 VTVPV-PIDSDVHSTASKLFDRAKELERKAERAQE-VLREQERELEKLLEEGPPEVELEE 457

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
               ++  RK  W+E+F WFISS+ ++VI G DA  NE+I++RY+ + D+ VHA +HGA 
Sbjct: 458 LTVELTKRRKKDWYERFRWFISSDGFVVIGGSDAHTNEIILRRYLEEHDILVHAHVHGAP 517

Query: 604 STVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLT 662
             VIK    E  VP  TL +A  F   +S+AW   +     +WV   QV K+A       
Sbjct: 518 HVVIKTEGEE--VPETTLREAAIFAASYSRAWRWGLKAADVYWVTADQVDKSAEAPH--- 572

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            G  +IRGK+N+     L +  G+
Sbjct: 573 -GGAIIRGKRNWFRRTELKVAIGV 595


>gi|150400994|ref|YP_001324760.1| hypothetical protein Maeo_0563 [Methanococcus aeolicus Nankai-3]
 gi|150013697|gb|ABR56148.1| protein of unknown function DUF814 [Methanococcus aeolicus
           Nankai-3]
          Length = 686

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 198/375 (52%), Gaps = 30/375 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y    P+ L ++ + +   ++ F  A+D+++S    +   ++ + K     ++  +I   
Sbjct: 257 YFSISPIELLKYANYDKKYYDNFLTAMDDYFSIFILKTEIKKQETKIQKMVNRQERILNS 316

Query: 392 Q-ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           Q E+     KQ+++  +K  +LI  N   VD  IL   ++   ++ W+++ ++VK+ +  
Sbjct: 317 QIESLKKYEKQDIENKLK-GDLIYANYAMVDE-ILNTIISAREKLEWKEIKKIVKQNK-- 372

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK-VEVDLALSAHANARRWYEL 509
            NP+ G I  +  E+N   ++L+  +D  D      P+ K V +D+  +A  NA  +Y  
Sbjct: 373 DNPILGKIVSIN-EKNG-EIILNLTVDYGDGA----PITKNVILDIRKNAFENADNYYGK 426

Query: 510 KKKQESKQEKTITAHSKAFKAAEK-----KTRLQILQEK--TVANISHMRKVHWFEKFNW 562
            KK + K +   TA   + K  +K     ++ ++ L+EK  T       +K  W+EKF W
Sbjct: 427 SKKFKHKIKGVHTAIEISEKKLKKLKIQEESEMETLKEKEETTMVKKERKKRKWYEKFKW 486

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--NHRPEQPVPPLT 620
            + ++ YLVI+G+DA  NE ++KRY  K D+  H  + GA  TVIK    +  + +  L+
Sbjct: 487 TVIND-YLVIAGKDASTNESLIKRYTEKDDIVFHTQMAGAPFTVIKVDKSKGNKTIEELS 545

Query: 621 -------LNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
                  +++   + V HS+AW   + ++  +WV P Q+SKTA +GEYL+ G+FM+RGK+
Sbjct: 546 EEERNHLISETAKYAVSHSKAWKLGLGSADVYWVKPDQISKTAESGEYLSKGAFMVRGKR 605

Query: 673 NFLPPHPLIMGFGLL 687
           NF+    L +G G++
Sbjct: 606 NFIRSAILDLGIGII 620



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 81/168 (48%), Gaps = 16/168 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  +   D+   V+ L+++I  +    + ++    K  I K+     + E G  E V+ 
Sbjct: 1   MKTELTNVDIFVAVQELQQIINGKLDKAFLVNSQQGKELILKI----HIPEIGTREIVV- 55

Query: 59  LMESGVRLH----TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
               GV  H     T Y+RDK   P  F + LRKH++  ++  V Q  +DRII  +F   
Sbjct: 56  ----GVGKHKYITLTEYSRDKPRNPPSFAMLLRKHLKNIKIVSVEQHNFDRIIKIKFQWN 111

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
              + +++EL+  GN++L D E T++  L+  R   + +     +++P
Sbjct: 112 EIEYILVIELFGDGNVILLDKENTIILPLKIERWSTRKIVPKEIYKFP 159


>gi|452206612|ref|YP_007486734.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
 gi|452082712|emb|CCQ35980.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
          Length = 703

 Score =  134 bits (336), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 175/384 (45%), Gaps = 54/384 (14%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQ-RAEQQHKAKED-----AAFHKLNKIHM 390
           PL  ++    +   FE+F+ A+DE++ ++E++   E+Q  A  D     +   K  +I  
Sbjct: 261 PLREHETEGYDATAFESFNGAIDEYFYRLETESETEEQAGAGTDRPDFESEIEKYERIIE 320

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            QE  + +  ++ D   + AE +  N + +D     VR A  + + W +    ++E  +A
Sbjct: 321 QQEGAIESYDEQADEEQRKAESLYGNYDLIDEICSTVRAAREDGVPWAE----IEETFEA 376

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRW 506
           G            ER   +   +  +  +D  E T+ V+     +E+D  +    NA R 
Sbjct: 377 G-----------AERGIEA---AEAVVSVDGAEGTVTVDLGDGPIELDPTVGVERNADRL 422

Query: 507 YELKKKQESKQE---KTITAHSKAFKAAEKKTRLQILQEK---------------TVANI 548
           Y   K+   K+E     I    +   A E++      ++                 V+++
Sbjct: 423 YTEAKRVRGKKEGAQAAIEDTREDLAAVERRREAWEAEDADEGEDEDDDAETDYLAVSSV 482

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  W E+F WF +S+ +LVI GR+A QNE +VK+YM   D + HA  HGA  T++K
Sbjct: 483 PVRYDEKWHERFRWFRTSDGFLVIGGRNADQNEELVKKYMDPSDRFFHAQAHGAPVTILK 542

Query: 609 NHRPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P++P     +P  +  +A  F V +S  W D K     + V   QVSKT  +GEY+ 
Sbjct: 543 ATEPDEPARDVDIPETSKREAARFAVSYSSVWKDGKFEGDVYEVDADQVSKTPESGEYVE 602

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            GSF+IRG + +   H + +G  +
Sbjct: 603 KGSFVIRGDREYY--HDVSVGVSV 624



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 70/163 (42%), Gaps = 11/163 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-- 63
           M + D+AA V  LR   G      Y         K+ +        +  ++ L++E+G  
Sbjct: 7   MTSVDLAALVGELREYTGAVVDKAYLYGDDFVRLKMRDY-------DRGRIELVVETGDP 59

Query: 64  VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            R H     +  D    P  F + LR  I     E V Q G+DRI+ F+F        V+
Sbjct: 60  KRAHVAVPDHVADAPGRPPNFAMMLRNRIAGANFEGVEQYGFDRILTFRFEREDATTLVV 119

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GN+ + + +  V+  L + R   + VA  S++ YP E
Sbjct: 120 AELFGDGNVAVMNEDREVIDSLDTVRLTARTVAPGSQYGYPDE 162


>gi|269865041|ref|XP_002651784.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063882|gb|EED42272.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 243

 Score =  134 bits (336), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVVCKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|76801680|ref|YP_326688.1| hypothetical protein NP2070A [Natronomonas pharaonis DSM 2160]
 gi|76557545|emb|CAI49126.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
          Length = 699

 Score =  134 bits (336), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 180/393 (45%), Gaps = 46/393 (11%)

Query: 336 CPLLLNQFRSREF--VKFETFDAALDEFYSKIESQRAEQQHKAKED-----AAFHKLNKI 388
            PL L +  +  +    FE F+ A+D ++ +++++ AE +    +D     +   K  +I
Sbjct: 257 TPLPLEEHTAEGYDATAFEHFNGAIDAYFHRLQAE-AETETDTGDDKPDFESEIAKFERI 315

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              Q+  +   +++ +   + AEL+  N + VD     V+ A    + W+++    +E  
Sbjct: 316 IEQQQGAIEEYEKQAEVEQQKAELLYGNYDLVDEICSTVQSAREEGVPWDEIETTFEEGA 375

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           + G   A  +  +      +++       ++DD+E       +++D  +    NA R Y+
Sbjct: 376 ERGIDAAAAVVGVDAAEGTVTI-------DLDDKE-------IDLDPTMGVEKNADRLYQ 421

Query: 509 LKKKQESKQEKTITAHSKAFKAAE--KKTRLQILQEK----------------TVANISH 550
             K+   K+E    A     +  E  K+ R Q   +                 ++A++  
Sbjct: 422 EAKRVRGKKEGAQAAIEDTREDLEDVKERRRQWEADDDEDDDADEESPDRDYLSMASVPV 481

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
                W+E+F WF +S+++LVI GRDA QNE +VK+YM   D + HA  HG   T++K  
Sbjct: 482 RYDEKWYEQFRWFRTSDDFLVIGGRDADQNEALVKKYMDPSDRFFHAQAHGGPVTILKAT 541

Query: 611 RPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            P++P     +P  +  +A  F V +S  W D K     + V P QVSKT  +GEY+  G
Sbjct: 542 APDEPAREVDIPDTSKREAAQFAVSYSSVWKDGKFEGDVYEVDPDQVSKTPESGEYIEKG 601

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
            F+IRG +N+     + +  G+    D   +G 
Sbjct: 602 GFVIRGDRNYYRDMQVGVAVGIKCEPDTRVIGG 634



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 70/166 (42%), Gaps = 11/166 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  M + D+AA V  LR   G      Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRAMTSVDLAALVGELRDYTGAVVDKAYLYGDDFVRLKMRDY-------DRGRIELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F + LR  I     E V Q G+DRI+ F+F       
Sbjct: 57  GDPKRAHVAVPEHVPDAPGRPPNFAMMLRNRIAGANFEGVEQYGFDRILTFRFEREDQTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            ++ EL+  GNI + + +  V+  L + R   + VA  +++ YP E
Sbjct: 117 LIVAELFGDGNIAVLNEDHEVIDCLDTVRLSARTVAPGAQYGYPDE 162


>gi|269863594|ref|XP_002651278.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064823|gb|EED42778.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 262

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVVCKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|269867209|ref|XP_002652521.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062310|gb|EED41535.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 265

 Score =  133 bits (335), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 24  KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 83

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 84  NKYMEDRDLYFHCDVKGASSVICKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 137

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 138 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 184


>gi|124027973|ref|YP_001013293.1| hypothetical protein Hbut_1105 [Hyperthermus butylicus DSM 5456]
 gi|123978667|gb|ABM80948.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 672

 Score =  133 bits (335), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 172/356 (48%), Gaps = 42/356 (11%)

Query: 331 IYDEFCPLLLNQFRSR--------EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
           +YD+  PL +  F  R        E+  F     A DE++  +  + A     A E  A 
Sbjct: 239 VYDKGVPLTVTCFEPRGLAARYGFEYRAFNDPSTAYDEYFLTVAREAAGASTVAAEIEAE 298

Query: 383 HK--LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
            K  L  +   + N  H L++++    ++AE++  N+ DV  A+   R  +     WE +
Sbjct: 299 RKKLLASLEAARRNLEH-LRKKLRELEELAEIVSTNIADVYDAVECAR-KMRETAGWEQI 356

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH 500
                     GN   G++D +   +  + + +  N+  +D     +   ++ VDL     
Sbjct: 357 P---------GN-CPGVVD-VEPNKGIIKISIVGNIVPIDIR---MEPGRLVVDLY---- 398

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKF 560
              +R  E++ K E + EK +          E+K R ++L+ + +     +R+  W+EK+
Sbjct: 399 ---KRIGEVRAKIE-RGEKAVKDIEARLAELEEKVRQRLLRARAM-----VRRKEWYEKY 449

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
           +W I+S  YL I GRDA QNE +VKRY++   +++HAD+HGA + V       Q  P   
Sbjct: 450 HWVITSHGYLAIGGRDASQNESVVKRYLNDKRIFMHADIHGAPAVVFFAE--GQTPPEQD 507

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           L +A      +S+AW + +     +WV+  QVSK AP GEYL  G+FM+ GK+N++
Sbjct: 508 LREAAAIAAAYSKAWKAGIGSVDVYWVWGSQVSKAAPAGEYLAKGAFMVYGKRNYI 563



 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 71/138 (51%), Gaps = 10/138 (7%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  M   DVAA V+ L  L G R +N+Y            N   +     +E   +++  
Sbjct: 5   KTSMTAFDVAAVVRQLSGLQGSRLANIYA----------YNGGFLLRFKGAEDARVVVVP 54

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            VRLH T Y   ++ TP    + LRK+IR  RLE V Q G+DRI +F+F  G  ++ ++ 
Sbjct: 55  AVRLHATRYEPAERGTPPPLVMGLRKYIRGARLESVEQHGFDRIAVFRFSRGNGSYVLVT 114

Query: 123 ELYAQGNILLTDSEFTVL 140
           EL  +G ++L DS + +L
Sbjct: 115 ELLPRGVVVLADSSWKIL 132


>gi|146302942|ref|YP_001190258.1| hypothetical protein Msed_0157 [Metallosphaera sedula DSM 5348]
 gi|145701192|gb|ABP94334.1| protein of unknown function DUF814 [Metallosphaera sedula DSM 5348]
          Length = 601

 Score =  133 bits (335), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 76/204 (37%), Positives = 119/204 (58%), Gaps = 18/204 (8%)

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS 549
           K+E+D  +SA  NA +++E  K+ ++K  +T     +  +  EKK   Q ++ K+   I 
Sbjct: 322 KIEIDPKISASKNASQYFEKAKELDAKIRRT----RETIEELEKKK--QEIKAKSKETIE 375

Query: 550 H----MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
                +RK  W+E+++W I+S  ++VI+GRD  QNE IV++ +   D+++HAD+ GA +T
Sbjct: 376 GSKILVRKKEWYERYHWTITSNGFIVIAGRDIDQNESIVRKMLEDKDIFLHADIQGAPAT 435

Query: 606 VIKNHRPEQPV--PPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLT 662
           VIKN     PV      L  A     C+S+AW   + +   +WVY  QVSK+ P+GEYL 
Sbjct: 436 VIKN-----PVGIGEQDLMDAAVLAGCYSKAWKLGLASIDVFWVYGEQVSKSPPSGEYLP 490

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            GSFMI GKKN++    L +  G+
Sbjct: 491 KGSFMIYGKKNYIKNVKLELTIGV 514


>gi|15920412|ref|NP_376081.1| hypothetical protein ST0231 [Sulfolobus tokodaii str. 7]
 gi|15621194|dbj|BAB65190.1| hypothetical protein STK_02310 [Sulfolobus tokodaii str. 7]
          Length = 595

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 112/200 (56%), Gaps = 11/200 (5%)

Query: 491 VEVDLALSAHANARRWYELKKK---QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           +E+D  LS + NA +++++ K+   +  K E+T+    +  K  +K+     ++E+T   
Sbjct: 326 IELDPKLSVYKNASKYFDIAKEYAEKRKKAEETLNNLKQKLKELDKQ-----IEERTEEI 380

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
              +RK  W+EK+ W  +   YLVI+GRD  QNE +V++ +   D+++HAD+ GA +T+I
Sbjct: 381 RISLRKREWYEKYRWSFTRNGYLVIAGRDIDQNESLVRKLLEPKDIFLHADIQGAPATII 440

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           K       V    +  A     C+S+AW   M     +WV   QVSK+ P+GEYL  GSF
Sbjct: 441 KTQG--NNVTEDDIRDAAVIAACYSKAWKVGMGAIDVFWVNGDQVSKSPPSGEYLKKGSF 498

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           MI GKKNF+    + +  GL
Sbjct: 499 MIYGKKNFINNVKMQLFLGL 518



 Score = 40.8 bits (94), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 56/115 (48%), Gaps = 15/115 (13%)

Query: 21  LIGMRCSNVYDLS-PKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +I  R  NVY +S  + Y  KL           S+K L++ E G R+H T Y R K+   
Sbjct: 23  IISCRVDNVYKISGTQAYFLKL-------HCKNSDKNLVI-EPGKRIHFTKYDRQKE--I 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTD 134
           S     +R HI+ + + ++  LG +RII   F        + +EL  +G +++TD
Sbjct: 73  SNEVSLIRAHIKDKIINNIELLGKERIIKLTFM----DRLMYIELLPRGLLVITD 123


>gi|269864527|ref|XP_002651604.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064216|gb|EED42452.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 257

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVICKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164



 Score = 41.6 bits (96), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/96 (22%), Positives = 46/96 (47%), Gaps = 16/96 (16%)

Query: 1017 RLNDVDY---LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIF--- 1070
            R+N+ D       NP   D +L+ + + GP+ +++ Y+Y V+I+PG  KK +  Q     
Sbjct: 162  RINNKDKEWEFRDNPDCDDEILHAMAIAGPWVSLKKYRYAVRIVPGNEKKQQVAQTILDR 221

Query: 1071 ----------YSLLLLMLSLTPVFDIFPFQCLCSRK 1096
                      +++ +  + +  + D+ P +C   +K
Sbjct: 222  FDKQSTENPRHNMWICAVRIQELIDVLPGKCKIPKK 257


>gi|397781041|ref|YP_006545514.1| hypothetical protein BN140_1875 [Methanoculleus bourgensis MS2]
 gi|396939543|emb|CCJ36798.1| putative protein MJ1625 [Methanoculleus bourgensis MS2]
          Length = 659

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 180/378 (47%), Gaps = 49/378 (12%)

Query: 310 LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQR 369
           LM +    +D   T+SG          P++L     RE  +FETF  ALD FY K+  ++
Sbjct: 251 LMADVGRRRDPVITQSGC--------WPVVLAGEEVRE--RFETFSEALDAFYPKVAGEK 300

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
            E   +        +   I   Q   +   ++++ R  ++ E++  N   V   I  +  
Sbjct: 301 EEAAAEKPR---LSREEVIRQRQAEAIKGFEKKIRRYERVVEVLYENYTAVTGVITTLDA 357

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE 489
           A  +R SW+++ +++K    + N  A +I  ++     + L L+               E
Sbjct: 358 ASRDR-SWQEIEQILKS--NSDNAAAKMIRAVHPAEAAVELDLAG--------------E 400

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS 549
           +V+V +  +   N  R+Y+  KK + K+   + A  +A     ++ +  + Q+K      
Sbjct: 401 RVKVYVHETIEQNIGRYYDQIKKFKKKKAGALAAMERAITVKPRRKQHLVFQKK------ 454

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                 W+ +F WF +S+  LVI GRDA QNE +VK+YM  GD+++HAD+HG S  ++K 
Sbjct: 455 -----RWYHRFRWFSTSDGVLVIGGRDASQNEELVKKYMEGGDLFIHADVHGGSVVIVKG 509

Query: 610 HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMI 668
                      L++A  F   +S AW +   ++  +   P QVSKTA +GEY+  G+F++
Sbjct: 510 ATEH-------LDEAAQFAASYSNAWKAGHFSADVYAARPDQVSKTAESGEYVARGAFIV 562

Query: 669 RGKKNFLPPHPLIMGFGL 686
           RG++ +    P+ +  GL
Sbjct: 563 RGERQYFRNVPVGVAIGL 580



 Score = 76.6 bits (187), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 79/150 (52%), Gaps = 11/150 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  D+ A V      + +    +Y    KT         G+  +GE   K LLL+E+G 
Sbjct: 34  MSGVDLRALVAEAADRLPLWVGKIYQFDAKTL--------GIRLNGEDRAKYLLLVETGR 85

Query: 65  RLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           R+H TA + +  KN PS F + LRKH+   ++ D+RQLG +R +    G     +++I E
Sbjct: 86  RIHFTAEFPKPPKNPPS-FAMLLRKHLEGGKVLDIRQLGIERTMSIDIGKRDTTYHLIFE 144

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           L+ +GN +L D E+T++  L  HR  ++ V
Sbjct: 145 LFDEGNAILCDEEYTIIKPLWHHRFKNRDV 174


>gi|257077022|ref|ZP_05571383.1| hypothetical protein Faci_08161 [Ferroplasma acidarmanus fer1]
          Length = 615

 Score =  132 bits (332), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 60/130 (46%), Positives = 95/130 (73%), Gaps = 3/130 (2%)

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R  +WFE ++WFISS   ++++GRDA+ NE +VK++MS  D+YVHADL+GA STVIK+  
Sbjct: 409 RVKYWFESYHWFISSSGNMIMAGRDAKTNEKLVKKHMSDDDIYVHADLYGAPSTVIKHEG 468

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            E  +   T+ +A  F++  S+AW + + + +A+WVYP QVSKT  +GE+++ GS+++RG
Sbjct: 469 IE--ITEETIKEACIFSISLSRAWPAGIGSGTAYWVYPSQVSKTPESGEFVSKGSWIVRG 526

Query: 671 KKNFLPPHPL 680
           K+N++   PL
Sbjct: 527 KRNYVLNIPL 536



 Score = 41.6 bits (96), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 4/107 (3%)

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNI 130
           Y  +K    +  ++  RK +  +R+  + Q+ +DR++      G     +ILEL+  GN+
Sbjct: 62  YDAEKPEEATQLSMLFRKQLSEKRIVGIEQINFDRVVRITLHTGQE---IILELFGGGNL 118

Query: 131 LLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           +LTD+   V   +  H    + V I   +  P  I  V +  T S +
Sbjct: 119 ILTDNGKIVFA-MEQHVYKTRKVQIGEEYIPPAVINPVADLETFSGI 164


>gi|269864419|ref|XP_002651566.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064286|gb|EED42490.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 290

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVICKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|14601515|ref|NP_148055.1| hypothetical protein APE_1611 [Aeropyrum pernix K1]
 gi|5105298|dbj|BAA80611.1| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 650

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/192 (38%), Positives = 109/192 (56%), Gaps = 13/192 (6%)

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ----ILQEKTVANISHMRKVHWFEKF 560
           R Y    + E+K E+      KAF  AE ++RL+      + +++  I   RK  WFEK+
Sbjct: 368 RLYREAGELEAKAERA----EKAF--AEARSRLEEAVRRARLRSLRRIIEGRKRFWFEKY 421

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
           +W I+   +L I GRDA QNE +VKRY+   D+++HAD+HGA +TV+   R  QP     
Sbjct: 422 HWTITRNGFLAIGGRDAGQNESVVKRYLGDDDIFLHADIHGAPATVLLTRRL-QPGDD-D 479

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           +  A      +S+AW +     S +WVY  QVSK+ P GEYL  G+FM+ GK+N++   P
Sbjct: 480 IYDAAVLAAAYSRAWKAGAGGVSVYWVYGSQVSKSPPAGEYLARGAFMVYGKRNYIHHVP 539

Query: 680 LIMGFGLLFRLD 691
           L +  G++   D
Sbjct: 540 LKLALGIVMHKD 551



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 14/152 (9%)

Query: 1   MVKVRMNTADV-AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M +  MN+ DV  A ++    L G R  N+Y    K  +  LM   G T +     V ++
Sbjct: 1   MARKSMNSLDVHIAAIQLDNMLRGARLDNIYWPPEKKGV--LMKFKGPTGT-----VNVI 53

Query: 60  MESGVRLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            E  VR+H T+  A  ++  P+GF   LRK +R  RLE VRQLG+DRI+   F  G   H
Sbjct: 54  AEPSVRIHATSRTAALREVVPTGFVAILRKRVRGSRLEGVRQLGFDRIVELSFSTG---H 110

Query: 119 YVILELYAQGNILLTDSEFTV--LTLLRSHRD 148
            + +E+  +G+++L +SE  +   T++   RD
Sbjct: 111 RLYVEIMPRGSLVLVNSEGVIEATTVVAEFRD 142


>gi|374630447|ref|ZP_09702832.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
 gi|373908560|gb|EHQ36664.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
          Length = 629

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 161/329 (48%), Gaps = 49/329 (14%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FET+  ALD ++   E   AE + K        K   I   Q+  +   ++++  + +  
Sbjct: 255 FETYSQALDSYFGLPEVSEAEVKKK------LSKAEIIRKRQQEAIVKFEEKITLASEKV 308

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N + + A I+      + +MSW+++  ++K    A NP+A +I ++Y     + +
Sbjct: 309 EIIYANYQTI-ADIVKTLSDASLKMSWQEIEDILK---NADNPMAKMIKRVYPSEAAVDI 364

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           LL           KT+ +   E         NA R+Y   KK + K+   + A  + FK 
Sbjct: 365 LLDG---------KTIKLYASE-----GVEGNAGRYYSEIKKFKKKKAGALVAMER-FKV 409

Query: 531 AE----KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
            E    K+T ++ ++ K            W+ KF WF +S++ LVI GRDA  NE IV++
Sbjct: 410 TERPERKRTDIKFIKPK------------WYHKFRWFYTSDDVLVIGGRDAGTNEDIVRK 457

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+   D ++HAD+HG S+  +K            +++A  F V +S AW S   ++  + 
Sbjct: 458 YLEGKDTFLHADIHGGSAVAVKGETE-------CMDEAAVFAVSYSNAWKSGFYSADVYA 510

Query: 647 YPH-QVSKTAPTGEYLTVGSFMIRGKKNF 674
            P  QVSKTA +GE L  G+F+IRG++ +
Sbjct: 511 VPRDQVSKTAESGESLKRGAFVIRGERKY 539



 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 80/153 (52%), Gaps = 9/153 (5%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGES-EKVLLLM 60
           VK  M+  D+ A +  L  L+ +    +Y      + F+L        +GE  +K  ++ 
Sbjct: 3   VKKGMSGLDLRAVIAELNGLMPLWIGKIYQYDQNAFGFRL--------NGEDRQKFSIIA 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR+H T         PSG+++ LRK++   R+ ++ Q G  R++    G   + +++
Sbjct: 55  ESGVRVHLTKKLPKSPENPSGYSMYLRKYLSGGRILEINQPGIQRVLDLTIGKSESIYHL 114

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           I E + +GN +L DSE+T+L  L+ HR  D+ +
Sbjct: 115 IFEFFDEGNAILCDSEYTILNALKRHRFKDRDI 147


>gi|335438854|ref|ZP_08561586.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
 gi|334890357|gb|EGM28628.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
          Length = 707

 Score =  131 bits (329), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/482 (23%), Positives = 206/482 (42%), Gaps = 77/482 (15%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
            L   L +G    E +    G+  N  + E   +E    + L  AV+   + L++   GD
Sbjct: 188 TLATQLNFGGLYGEELCSRAGVSYNQAIEETTDVE---FEALYDAVSDLSERLRE---GD 241

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           + P  Y+   ++    D                 P+ L ++  +    F++F+ AL+E++
Sbjct: 242 LDPRLYVEADDQETPVD---------------VTPVPLVEYEDKPSEAFDSFNDALEEYF 286

Query: 363 SKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
             +E +  E++   ++   +A   K  +I   QE  +   ++E     + AEL+  N + 
Sbjct: 287 LGLEQEPDEEETGSNRPGFEAEIEKQKRIIAQQEGAIEDFEEEAAAEREKAELLYANYDL 346

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  +  ++ A A    W ++   +   +  G P A                    + ++
Sbjct: 347 VDEVLSTIQDARAADTPWAEIEETLSAGKDQGIPAA------------------EAVSDV 388

Query: 480 DDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESK-------------QEKTIT 522
           D  E T+ V+    ++E+D       NA R Y+  K+ E K             Q + + 
Sbjct: 389 DGSEGTVTVQIDDHRIELDADTGVEKNADRLYQEAKRIEDKKAGAKEAIENTREQLEAVK 448

Query: 523 AHSKAFKAAEKKTRLQILQEKTVA-----------NISHMRKVHWFEKFNWFISSENYLV 571
              +A++A++         +               +I       W+E F WF +S+ +LV
Sbjct: 449 QRREAWEASDGNDGGDGSGDTDEDDQEDIDWLARESIPIRTSEEWYEHFRWFHTSDGFLV 508

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP-EQP-----VPPLTLNQAG 625
           I GR+A QNE +VK+Y+ +GD++ H   HGA +T++K   P E P     +P  +  +A 
Sbjct: 509 IGGRNADQNEELVKKYLDRGDLFFHTQAHGAPATILKATGPSEAPPDDISIPESSREEAA 568

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            F + +S  W D K     + V   QV+KT  +GEYL  GSF IRG++ +    P+ +  
Sbjct: 569 QFAISYSTLWKDGKYAGDVYCVEHDQVTKTPESGEYLEKGSFAIRGERTYYDDTPVGVAV 628

Query: 685 GL 686
           G+
Sbjct: 629 GI 630



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 9/164 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSS-GVTESGESEKVLLLME 61
           K  + + D AA    LR  +G      Y         KL   + G  E      +L+ ++
Sbjct: 4   KRELTSVDCAALAGELRAFVGAYHEKSYLYDDDLLRLKLSGPNFGRIE------LLIEVD 57

Query: 62  SGVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
              R+HT A  R  D    P  F + LR  +   +L  V Q  +DRI+  +F    +   
Sbjct: 58  DPKRVHTVAPERVPDAPERPPNFAMMLRNRLEGAQLASVEQFEFDRILQLRFERSDDHTT 117

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +I EL+  GN+ + D   TV+  L + R   + V   SR+ +P+
Sbjct: 118 IIAELFGDGNLAVLDETDTVIDSLETVRLQSRTVTPGSRYEFPS 161


>gi|448377770|ref|ZP_21560466.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
 gi|445655714|gb|ELZ08559.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
          Length = 736

 Score =  131 bits (329), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 169/737 (22%), Positives = 282/737 (38%), Gaps = 134/737 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L  L G +    Y         KL +     + G  E  + + E+
Sbjct: 4   KRELSSVDLAAVVGELSDLEGAKVDKAYLYGDDLVRLKLRD----FDRGRVELFIEVSET 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R+HT A  R  D    P  F   LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RVHTVAQERVPDAPGRPPHFAKMLRNRLSGADFAGVSQYEFDRILEFVFEREDANTRL 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ +GN+ +TD E+ V+  L                    E  R+  RT A      
Sbjct: 119 IVELFGEGNVAVTDGEYEVVDSL--------------------ETIRLKSRTVAPGARYE 158

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              S+               N    S+E         +FD  +  +++  D  R    TL
Sbjct: 159 FPESR--------------VNPLTVSRE---------AFD--RQMDESDTDVVR----TL 189

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            T     L +G   +E +    G+   + + +  + E    + L  A+ +      DV +
Sbjct: 190 AT----QLNFGGLYAEELCTRAGVEKTIDIEDAGESE---YERLYGAIERL---AIDVRN 239

Query: 301 GDIVPEGYILMQNKHL------GKDHPPTESGSSTQIYDEF-----------CPLLLNQF 343
           G   P  Y+  +++        G D    E+G + +  DE             PL  +Q 
Sbjct: 240 GAFDPRLYLEHEDEEGETEGDSGTDD---EAGPTAETDDETEASGTPVDVTPFPLDEHQQ 296

Query: 344 RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQENRVHTL 399
              E   F++F  ALDE++ ++E    E    A +    +A   K  +I   QE  +   
Sbjct: 297 AGLEPEAFDSFTDALDEYFYRLELADEEPADAASQRPDFEAEIAKQQRIIEQQEGAIEEF 356

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           ++E +   + AEL+  N   VD  +  VR A      W ++    +   + G   A  + 
Sbjct: 357 EREAEAERERAELLYANYGFVDEILSTVRDARTEGTPWAEIEERFEAGAEQGIDAAEAVV 416

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY----ELKKKQES 515
            +      +++ L                E++ +D       NA R Y     + +K+E 
Sbjct: 417 DVDGANGRVTIELDG--------------ERIGLDADDGVEKNADRLYTEGKRIAEKKEG 462

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKT-------------------VANISHMRKVHW 556
            Q+       +     E+K   +   E +                    ++I       W
Sbjct: 463 AQQAIENTREELADVRERKAAWEADDEGSDETGGDDSDEDEPDIDWLARSSIPIRENEPW 522

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK------NH 610
           F++F W  +S+ +LVI GR+A QNE +V +Y+  GD   H   HG   TV+K      + 
Sbjct: 523 FDRFRWVQTSDGFLVIGGRNADQNEELVNKYLEPGDRVFHTQAHGGPVTVLKATDPSESS 582

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
           RP+   P  ++ QA  F V ++  W D +     + V   QV+KT  +GEYL  G F IR
Sbjct: 583 RPDMEFPEASIEQAAQFAVSYASVWKDGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIR 642

Query: 670 GKKNFLPPHPLIMGFGL 686
           G + +    P+ +  G+
Sbjct: 643 GDRTYHRDTPVDVAVGI 659


>gi|374632982|ref|ZP_09705349.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
 gi|373524466|gb|EHP69343.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
          Length = 602

 Score =  131 bits (329), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTITAHSKAFKAAEKKTRLQI 539
           E TL    VE+D  LS    A  ++E  K+ ESK    E+TI    K  +  + K R + 
Sbjct: 316 EVTLGEVTVEIDPNLSLTRVASSYFERAKELESKARRAEETIAELKKKVEELKLKLR-ET 374

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            + K++     +RK  W+EK+ W  +  NYLVI+GRD  QNE +VK+ + + ++++HAD+
Sbjct: 375 EESKSLV----IRKKEWYEKYRWSFTRNNYLVIAGRDVDQNESLVKKMLGEEEIFLHADI 430

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTG 658
            GA +T+IK+ +    V    +  A     C+S+AW   +     +WVY  QVSK+ P+G
Sbjct: 431 QGAPATIIKDSK---GVQEGDIYDAAVVAACYSKAWKLGLGSVDVFWVYGSQVSKSPPSG 487

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           EYL  GSFMI GKKNF+    L +  GL
Sbjct: 488 EYLPKGSFMIYGKKNFIKNVRLELAIGL 515



 Score = 44.3 bits (103), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 58/118 (49%), Gaps = 14/118 (11%)

Query: 20  RLIGMRCSNVYD-LSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +++G R  N+Y  L  + Y+F L    G  E+        ++E   R+H T Y R++   
Sbjct: 26  KIVGCRVDNIYSILKGRGYLFLLHCRDGDKET--------ILEPSRRIHFTRYQRER--V 75

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
                  LR+ +R   + +V  +  +RI++F      N H + LEL  +G +++TDS+
Sbjct: 76  LDNKAKMLRELVRGAVIREVDVVPGERIVVFSLS---NDHKIYLELLPKGVLVVTDSQ 130


>gi|330835774|ref|YP_004410502.1| hypothetical protein Mcup_1916 [Metallosphaera cuprina Ar-4]
 gi|329567913|gb|AEB96018.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
          Length = 508

 Score =  131 bits (329), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 120/216 (55%), Gaps = 16/216 (7%)

Query: 490 KVEVDLALSAHANARRWYELKKKQE---SKQEKTITAHSKAFKAAEKKTRLQILQEKTVA 546
           K+E+D + S   NA  +++  K+ E    K E+TI    +  +    KT+ +I   K + 
Sbjct: 236 KIEIDPSKSIAKNAALYFDKAKELEEKIKKTEETIVELERKKQDLLSKTKEEIESSKVL- 294

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
               +RK  WFEK++W I+   Y+VI+GRD  QNE +VK+++   D+++HAD+ GA +TV
Sbjct: 295 ----IRKREWFEKYHWTITKNGYIVIAGRDIDQNESLVKKFLGDDDIFLHADIQGAPATV 350

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGS 665
           IK+      +    L  A      +S+AW   + +   +WVY  QVSK+ P+GEYL  GS
Sbjct: 351 IKSP---NSISDEDLLDAATLAASYSKAWKLGLGSIDVFWVYGKQVSKSPPSGEYLPKGS 407

Query: 666 FMIRGKKNFLPPHPLIMGFGL----LFRLDESSLGS 697
           FMI GKKNF+    L +  G+     FR++  S  +
Sbjct: 408 FMIYGKKNFIKNVKLELTVGINTKEGFRIEVGSFNT 443


>gi|296241940|ref|YP_003649427.1| hypothetical protein Tagg_0195 [Thermosphaera aggregans DSM 11486]
 gi|296094524|gb|ADG90475.1| protein of unknown function DUF814 [Thermosphaera aggregans DSM
           11486]
          Length = 666

 Score =  130 bits (326), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 193/393 (49%), Gaps = 55/393 (13%)

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
           +GY+++Q ++              Q++  + P+L  +    E  + E+ D  +D +++++
Sbjct: 236 KGYLVLQEEN-------------PQLFTAYYPVLFKEEYGFEVKELESIDEVIDIYFTRL 282

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR-SVKMAELIEYNLEDVDAAI 424
           E        +++  A    LN+  + Q+  +   ++++D  S K++ +  Y   D+ +A+
Sbjct: 283 ELSLELAGKQSEMKAKLDSLNERILRQKEIISNYQRQLDEISNKLSSIYTY-FTDISSAL 341

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
                         D AR  +EE+             Y+ +NC  ++   N+ + D  E 
Sbjct: 342 --------------DCARKTREEQGWE----------YIVKNCPGII---NIHK-DKGEV 373

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK---TRLQILQ 541
            L V    + L++      ++  E++K +   + K  TA + + K  EK+   T+++ L 
Sbjct: 374 ELSVGGRTITLSIRIPLE-KQIIEMEKIKGEVKRKIDTALN-SLKEIEKEYDATKME-LD 430

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + + + +  ++   W+EKF+W  +   +LV+ GRDA QNE IVK+Y+   D+++HA++HG
Sbjct: 431 KFSASKMISIKPRSWYEKFHWLFTRNGFLVVGGRDASQNEAIVKKYLRDKDIFLHAEIHG 490

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
            S+ V+  +  E   P L+ +  A     C+S+AW + M     +W     VS + P+GE
Sbjct: 491 GSAAVLLTNGKE---PSLSDIEDAALIPACYSKAWKTGMGFIEVFWTMGSSVSLSPPSGE 547

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           YL  G+ M+ GKKN+L   PL +G GL    DE
Sbjct: 548 YLPKGAIMVYGKKNYL-KTPLRLGLGLDVVCDE 579


>gi|119719655|ref|YP_920150.1| hypothetical protein Tpen_0745 [Thermofilum pendens Hrk 5]
 gi|119524775|gb|ABL78147.1| protein of unknown function DUF814 [Thermofilum pendens Hrk 5]
          Length = 610

 Score =  130 bits (326), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 141/280 (50%), Gaps = 39/280 (13%)

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             +++ V+   + AEL+  +   VD  + A R  +A+R+ W     +V+   K   P+  
Sbjct: 283 EAIRRAVEELSRKAELLSRHSATVDEVLAAYRGLVASRLQWS----LVEARLKEAYPIVK 338

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
            +D     R+ + L       E++  E       VEVD + SA +NA  ++E   K +S 
Sbjct: 339 SVDP---ARSRLVL-------ELEGVE-------VEVDASRSALSNAASYFE---KAKSA 378

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           + K   A +             + +    A     +   W+ +F +F +S  +LV++GR 
Sbjct: 379 KRKLAEASA------------AVERSAEPAPARPAKPAAWYAQFRFFFTSNGFLVVAGRS 426

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE++V+RYM  GD+++HAD+HGA++ V+K    +QP     + +A  F  C S AW 
Sbjct: 427 AGQNELLVRRYMEPGDIFLHADIHGAAAVVLKTG-GKQP-GEADIAEAAQFAACFSSAWK 484

Query: 637 SKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +     +WV   QVSK  P+GEYL  GSFM+ GKKN++
Sbjct: 485 GGLYAVDVFWVPAEQVSKKPPSGEYLAKGSFMVYGKKNYV 524


>gi|403216659|emb|CCK71155.1| hypothetical protein KNAG_0G00970 [Kazachstania naganishii CBS
           8797]
          Length = 1006

 Score =  129 bits (325), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 180/385 (46%), Gaps = 51/385 (13%)

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           T++  +D+F+S +ES +   + + +E  A  KL +   +   R+  L     ++ +   L
Sbjct: 319 TYNRTVDKFFSTLESSKYAMKIQNQETLAGKKLEEARSENGKRIQALIDVQSQNEQKGHL 378

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERN--CMS 469
           I  + E V+ A  AV+  L  ++ W  + +++  E+K GN +A  I   L L++N   + 
Sbjct: 379 IITHAELVEDAKGAVQGLLDQQLDWNIIEKLIITEQKKGNKIAKAIKLPLKLKKNTIVLE 438

Query: 470 LLLSNNLDEMDDEE------------------------------------KTLPVEKVEV 493
           L L +N D  DD E                                    + L    V V
Sbjct: 439 LPLEDNNDTEDDTELSEEVDSSDISSSELSSDEESDQGSTQHQHRKSNRIRALKPTTVSV 498

Query: 494 D--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVANI 548
           D  L LS +ANA  ++ +KK    KQ+K      KA K  E K   Q+   L+E     +
Sbjct: 499 DIKLDLSTYANASEYFMVKKHTVEKQKKVEQNLDKAMKNIETKVNKQLNSKLKESHKV-L 557

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  ++FEK+NWFISSE +LV+ G+   + + +  +Y++  D+YV  +    S   IK
Sbjct: 558 KRLRTPYFFEKYNWFISSEGHLVLMGKSDIETDQLYSKYITPDDIYVSNEF--GSHVWIK 615

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFM 667
           N +  + VPP T+ QAG F +  S AW  K+ +S ++     VSK +A     L  G + 
Sbjct: 616 NPKKTE-VPPNTIMQAGIFAMAASVAWSKKLSSSPYFCSASNVSKFSANDNTVLPQGCYR 674

Query: 668 I--RGKKNFLPPHPLIMGFGLLFRL 690
           +    +K  LPP  L+MG G  +++
Sbjct: 675 LIDEREKVVLPPAQLVMGLGFFWKV 699



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 83/146 (56%), Gaps = 13/146 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+   D+   V  L   L   R +N+Y++  S + ++ K         +    K+ +
Sbjct: 1   MKQRLGALDIQLLVPELSTALESYRLNNIYNVADSSRQFLLKF--------NKPDSKINV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G++++ T ++RD    PSGF +KLRKH++ +RL  +RQ+  DRII+ QF  G N  
Sbjct: 53  VVDCGLKIYMTEFSRDIPPVPSGFVVKLRKHLKAKRLTALRQVLDDRIIVLQFADGKN-- 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR 144
           Y++LE ++ GN++L D    +L + R
Sbjct: 111 YLVLEFFSAGNVILLDETRKILLVQR 136



 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 27/36 (75%)

Query: 1032 DILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            D +L ++PV  P+ A+  +KY+VK++PG+AKK K +
Sbjct: 897  DEILDIVPVFAPWPALAKFKYKVKLVPGSAKKTKAM 932



 Score = 43.1 bits (100), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 2/59 (3%)

Query: 874 ASSQPESIVRKTKIEGG--KISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQK 930
           + S PE++     I G   K  RG+KGKLKKM+ KY DQDE ER +++  L +   ++K
Sbjct: 781 SGSIPENMSVAETIVGDIKKNVRGKKGKLKKMQRKYRDQDENERLLKLEALGTLKGIEK 839


>gi|355571923|ref|ZP_09043131.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
 gi|354825019|gb|EHF09254.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
          Length = 633

 Score =  129 bits (324), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 91/316 (28%), Positives = 150/316 (47%), Gaps = 44/316 (13%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           ++ KA+ D      + I   Q+  V   ++++    +  E +  +   V   + A+R A 
Sbjct: 281 KEEKARRD------DHIRSRQQEAVKKFEEKIAACERAVEALYSHYTLVSEILEALRKAR 334

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKV 491
             R SW+++  +V   R A +  A  I  +Y  R  + + L                E+V
Sbjct: 335 ETR-SWQEIEALV---RGAKSGPATRIVAVYPGRGAVDIDLG---------------ERV 375

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
            + +  S  ANA  +YE  KK   K      A  +A +  E++T      +K        
Sbjct: 376 TLTVGESIEANAAAYYEEIKKYRRKIAGAQAAMERAVQKKERRTVRAAAGKK-------- 427

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               W+ +F WFI+S+  LV+ GRDA QNE +VK+YM   D++VHAD+HGAS  ++K   
Sbjct: 428 ---RWYHRFRWFITSDGVLVVGGRDASQNEELVKKYMEGSDLFVHADVHGASVVIVKGKT 484

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSK-MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            +       +++   F   +S AW S  +    + V P QVSKT  +GEY++ GSF++RG
Sbjct: 485 GK-------MDEVATFAASYSGAWKSGHLAADVYCVAPSQVSKTPESGEYVSRGSFIVRG 537

Query: 671 KKNFLPPHPLIMGFGL 686
           ++ +    PL +  GL
Sbjct: 538 ERRYFRNVPLGIAIGL 553



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 80/159 (50%), Gaps = 7/159 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           ++  DV A V    RL+ +     Y+++P T + +           E  +  L++E  VR
Sbjct: 7   LSGIDVRALVTEWERLLPLWVDKAYEVAPGTILLRFKGK-------EHGRHALVIEPPVR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H T +      TPS F + LRK++   R+  VRQ G  RI++F  G G   +++++EL+
Sbjct: 60  AHLTWHEVAVPKTPSAFAMLLRKYLSGGRVLSVRQHGIQRIVIFDIGKGDRLYHLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +GNI+L  S++T++   R     ++ +   + +  P E
Sbjct: 120 DRGNIVLCASDWTIIQPFRRLHFREREIVAGAAYTLPPE 158


>gi|429217609|ref|YP_007175599.1| RNA-binding protein [Caldisphaera lagunensis DSM 15908]
 gi|429134138|gb|AFZ71150.1| putative RNA-binding protein, snRNP like protein [Caldisphaera
           lagunensis DSM 15908]
          Length = 669

 Score =  129 bits (324), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 92/143 (64%), Gaps = 3/143 (2%)

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           V NI   RK  W+EK++W ++  N+L I GRDA QNE +VK+Y+S+ D+Y+HAD+HG+ S
Sbjct: 437 VKNIIRSRKREWYEKYHWILTRNNFLAIGGRDADQNESVVKKYLSEKDIYIHADIHGSPS 496

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTV 663
            V+      + V    +N A    + +S+AW + M    A+WV  +QVSK+ P+GEYL  
Sbjct: 497 VVL--FANNKDVGEEDINDAAIIAIAYSKAWKAGMGSVGAYWVLGNQVSKSPPSGEYLAK 554

Query: 664 GSFMIRGKKNFLPPHPLIMGFGL 686
           GSFMI GKKNFL P  + +  G+
Sbjct: 555 GSFMIYGKKNFLKPINMELYLGI 577



 Score = 43.1 bits (100), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 55/110 (50%), Gaps = 5/110 (4%)

Query: 57  LLLMESGVRLHTTAYAR-DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           +LL+E  +R+H +   +   +     F L LRK+IR +++  V Q+G+DR+I   F    
Sbjct: 71  ILLIEPSLRIHFSNRIKPSSEFVDKQFALLLRKYIRDQKITSVEQIGFDRLIKITF---F 127

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEI 165
           N     +E+  +G + L D    ++   +  +  D+ +    ++++P  I
Sbjct: 128 NIK-TFVEILPKGVVALVDENDQIIGATKYLKFKDREIKPKIKYKFPKII 176


>gi|124485365|ref|YP_001029981.1| hypothetical protein Mlab_0540 [Methanocorpusculum labreanum Z]
 gi|124362906|gb|ABN06714.1| protein of unknown function DUF814 [Methanocorpusculum labreanum Z]
          Length = 642

 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 166/341 (48%), Gaps = 48/341 (14%)

Query: 351 FETFDAALDEFYSKIESQRA-EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           F TF  AL+ FY K  +++  EQ+ K        K  +I   QE  V    +++  + ++
Sbjct: 258 FATFSQALEAFYPKPVAEKVIEQKIK------LSKEERIRKQQEAAVVNFDKKIAEATEI 311

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           +E+I  +  +V   I  V  A + ++SW+D+A ++K   K+  P A         +  +S
Sbjct: 312 SEIIYSHYGEVQETI-DVLAAASQKLSWQDIAAVIK---KSDLPAA---------KRIIS 358

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +   N    +D +EK     KV + +  S  AN  R++ + KK  +K+   + A      
Sbjct: 359 VDPKNASVVIDLQEK----HKVTIFVHESLEANVGRYFAVVKKFRAKKAGALRAMEAGIV 414

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
            AEKK           A      K  W+ +F W  +S+  LVI GR+A QNE +VK+YM 
Sbjct: 415 HAEKKK----------AAGPGRLKPKWYHRFRWMETSDGVLVIGGRNADQNEELVKKYME 464

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW----DSKMVTSAWW 645
             D ++HAD+ GAS+ ++K            ++QA  F   +S+AW     S  V +A  
Sbjct: 465 GKDTFLHADVFGASAVIVKGVTER-------MDQAVQFAASYSRAWAGGGASVDVIAA-- 515

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             P+QVSKT  +GEY+  GSF+IRG++      PL +  G+
Sbjct: 516 -SPNQVSKTPESGEYVAHGSFVIRGERKIYKDVPLEIAIGV 555



 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 75/148 (50%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+ ADV A    L  L+ +    +Y     +  F+L          E  + LL +  G+R
Sbjct: 7   MSGADVKAMTAELAALLPLWIGKIYQYDNASLGFRLNGE-------EKARHLLYVVRGIR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H  +        PSGF++ LRK+I   ++ ++ Q   +R+I+   G G + + +I+EL+
Sbjct: 60  AHLVSELPPAPKNPSGFSMYLRKYIEGGKVLNIEQKAIERVIIITIGKGPSEYKLIIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN++LTD +FT++  L   R  D+ +
Sbjct: 120 DEGNLILTDEKFTIINALAQRRFRDRDI 147


>gi|302347972|ref|YP_003815610.1| fibronectin-binding protein [Acidilobus saccharovorans 345-15]
 gi|302328384|gb|ADL18579.1| Predicted fibronectin-binding protein [Acidilobus saccharovorans
           345-15]
          Length = 647

 Score =  127 bits (318), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 5/149 (3%)

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +SH R+  W+E+++W ++S   L + GRDA QNE +V++ +   DV++HAD+HGA + ++
Sbjct: 417 VSHRRRA-WYERYHWLVTSSGVLAVGGRDADQNESLVRKMLGPNDVFLHADIHGAPAVIL 475

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
                        +++A   T  +S+AW   M + S +W Y  QVSK+ P+GEYLT GSF
Sbjct: 476 MAA-AAGGFTETDVSEAAVLTAAYSRAWKEGMASVSVYWAYGSQVSKSPPSGEYLTKGSF 534

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           M+ GKKN+L P  L +  G+   LDE  L
Sbjct: 535 MVYGKKNYLRPLRLELYLGIA--LDEEGL 561


>gi|429192346|ref|YP_007178024.1| RNA-binding protein [Natronobacterium gregoryi SP2]
 gi|448325749|ref|ZP_21515133.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
 gi|429136564|gb|AFZ73575.1| putative RNA-binding protein, snRNP like protein [Natronobacterium
           gregoryi SP2]
 gi|445614570|gb|ELY68242.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
          Length = 710

 Score =  126 bits (316), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 167/371 (45%), Gaps = 53/371 (14%)

Query: 351 FETFDAALDEFYSKIE------SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +++F A LD+++ ++E      S   EQ+   +E+ A  K  +I   QE  +   +Q+ +
Sbjct: 281 YDSFLAVLDDYFFRLELEEEDDSDPTEQRPDFEEEIA--KYERIIEQQEGAIEGFEQQAE 338

Query: 405 RSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
           +  + AEL+  EY L  VD  +  VR A      W+++    +E ++ G   A  +  + 
Sbjct: 339 QLREKAELLYAEYGL--VDEVLSTVREAREQDRPWDEIEERFEEGKERGIEAAKAVVDVD 396

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                +++              TL  E VE+ +      NA R Y+  K  E K+E  + 
Sbjct: 397 GSEGTVTV--------------TLDGEHVELAVHDGVEQNADRLYKEAKDIEGKKEGALA 442

Query: 523 AHSKAFKAAE--KKTRLQILQEK------------------TVANISHMRKVHWFEKFNW 562
           A     +  E  K+ R Q   +                   ++ ++       W+++F W
Sbjct: 443 AIEDTREDLEEAKRRRDQWEVDDEDDGDDDEIDEADSKDWLSMPSVPIRENEPWYDRFRW 502

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------V 616
           F +S++YLVI GR+A QNE +VK+Y+  GD   H   HG   TV+K   P +       +
Sbjct: 503 FYTSDDYLVIGGRNADQNEELVKKYLEPGDKVFHTQAHGGPVTVLKATDPSEASSHDIDL 562

Query: 617 PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + + 
Sbjct: 563 PQTSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYY 622

Query: 676 PPHPLIMGFGL 686
              P+ +  G+
Sbjct: 623 DDTPVGVAVGI 633



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V L++E G   R HT A  R  D    P  F + LR  +      DV Q  +DRI+ F 
Sbjct: 49  RVELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVDVEQYEFDRILEFI 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|422293271|gb|EKU20571.1| hypothetical protein NGA_2069500, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 107

 Score =  125 bits (315), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 55/83 (66%), Positives = 62/83 (74%)

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           V P+ L +AGC  V  S AW +KMVTSAWWV   QVSKTAP GE+L  GSFM+RGKKNFL
Sbjct: 10  VSPVALQEAGCLAVSRSSAWKAKMVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFL 69

Query: 676 PPHPLIMGFGLLFRLDESSLGSH 698
            P PL MG GLLF+LDE S+G H
Sbjct: 70  APQPLEMGLGLLFKLDEGSVGRH 92


>gi|305663918|ref|YP_003860206.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304378487|gb|ADM28326.1| protein of unknown function DUF814 [Ignisphaera aggregans DSM
           17230]
          Length = 667

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 107/185 (57%), Gaps = 13/185 (7%)

Query: 513 QESKQEKTITAHSKAF-KAAEKKTRL---------QILQEKTVANISHMRKVHWFEKFNW 562
           Q ++  K I+   K+  +A E+K +L         +IL+EK    +    K  W+EK++W
Sbjct: 395 QYNELRKNISDIEKSIERALEEKVKLMQKINEMNNRILEEKQKVKVKLSLKKEWYEKYHW 454

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
            I+   +LVI GRDA QN  +++R++   D+ +HAD+HGAS+ +IK     + V   TL 
Sbjct: 455 TITPTGFLVIGGRDASQNIQLIRRFLEPNDIVLHADIHGASTVIIKTG--GRDVDEETLM 512

Query: 623 QAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A     C+S+AW S ++    +WVY  Q+S + PTGEYL  GS+M+ GKKN++    L 
Sbjct: 513 EAATIAACYSKAWKSGLLAIDVFWVYGSQISLSPPTGEYLPKGSYMVYGKKNYIKNVSLK 572

Query: 682 MGFGL 686
           +  G+
Sbjct: 573 LALGI 577


>gi|448399812|ref|ZP_21571045.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
 gi|445668265|gb|ELZ20895.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
          Length = 722

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 171/401 (42%), Gaps = 61/401 (15%)

Query: 326 GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQ----QHKAKEDAA 381
            S  Q+ D   P  L +    +   +ETF  ALD+++ ++E    E+    + +   D+ 
Sbjct: 266 ASEGQVVD-VTPFPLEEHTDLDSEPYETFLEALDDYFFQLELGEDEEPEPTEQRPDFDSE 324

Query: 382 FHKLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWED 439
             K  +I   Q+  +   +QE D   + AEL+  EY L  VD  +  ++ A      W++
Sbjct: 325 IAKYERIIEQQQGAIEGFEQEADALREQAELLYAEYGL--VDEILSTIQDARVQDRPWDE 382

Query: 440 LARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPV----EKVEVDL 495
               ++E  +AG                  +  +  + ++D  E T+ V    E++++ +
Sbjct: 383 ----IRERFEAGAE--------------QGIEAAEAVVDVDGSEGTVTVDLDGERIDLVV 424

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV---------- 545
                 NA R Y   K+ E K+E  + A     +  E   R +   E T           
Sbjct: 425 EQGVEQNADRLYTEAKRVEEKKEGALAAIEDTREDLEDAKRRRDEWEATEREDTSEDGED 484

Query: 546 -------------ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
                         +I       WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD
Sbjct: 485 EADEAEQRDWLAEPSIPIRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGD 544

Query: 593 VYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWW 645
             +H   HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + 
Sbjct: 545 KVLHTQAHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYA 604

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           V   QV+KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 605 VDADQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 645



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FEREDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|393796641|ref|ZP_10380005.1| hypothetical protein CNitlB_10052 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 638

 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 58/147 (39%), Positives = 89/147 (60%), Gaps = 4/147 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK     + +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 415 EKESVTFAEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLGKNDKIFHGDIFG 474

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K+   E P PP +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 475 SPFFILKD--TENP-PPASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 531

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           L  GSF I G++NF+    L +  GL+
Sbjct: 532 LPKGSFTIEGQRNFVKISTLKLAVGLM 558



 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       + +  +++  SGV L +    + ++  P+    +
Sbjct: 24  VSNIYGVTKDSILFKLHHTE------KPDIYMMISTSGVWLTS---VKIEQMEPNRLLKR 74

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +++ + Q+  +RI  F F  G +  +VI+ E +  GNILL ++E  +L L  
Sbjct: 75  LRSDLLRLKVKKIEQIASERIAYFTFE-GFDKEFVIVGEFFGDGNILLCNNEMKILALQH 133

Query: 145 S 145
           S
Sbjct: 134 S 134


>gi|21227916|ref|NP_633838.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
 gi|20906336|gb|AAM31510.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
          Length = 343

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 5/207 (2%)

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA+ +YE  KK   K++  I A     KA EKK   +  +       S  RK HW+++F 
Sbjct: 4   NAQEYYEKVKKFTKKKDGAIRAIEDTKKAMEKKAATKSAKAGRKLQAS--RKKHWYDRFR 61

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           WF+SS+ +LV+ GRDA  NE I K+YM K D+  H    GA  TV+K    E  VP  TL
Sbjct: 62  WFVSSDGFLVVGGRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTVVKTGGKE--VPDSTL 119

Query: 622 NQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            +   F V +S  W +   +   +W+   QV+KT  +GEYL  G+F+IRG++N+    PL
Sbjct: 120 QEVSQFAVSYSSLWKAGQFSGDCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPL 179

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGE 707
            +  GL  + +   +G   +  R  G+
Sbjct: 180 GIAVGLELKGETRIIGGPASAVRKHGD 206


>gi|67624075|ref|XP_668320.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54659500|gb|EAL38073.1| hypothetical protein Chro.50204 [Cryptosporidium hominis]
          Length = 1375

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 92/154 (59%), Gaps = 15/154 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM + D+ A V  + + L G +  N+YD++ +TY+FK          G  EK  LL
Sbjct: 1   MVKSRMTSVDICAMVHGISKDLKGQKLINIYDINSRTYLFKF---------GGEEKKFLL 51

Query: 60  MESGVRLHTTAYARDKK-----NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +ESG+R HTT + R+ +     ++ S F  KLR++IR ++L+D+ Q+G DRI+   FG G
Sbjct: 52  VESGIRFHTTQWKRENEHKTSVSSISFFNSKLRRYIRNKKLDDISQMGMDRIVKLTFGFG 111

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD 148
            N  Y+I E +  GNI+LTD  + +L +LR   D
Sbjct: 112 DNTFYLIFEFFVAGNIILTDCNYKILVILRDTND 145



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 75/180 (41%), Gaps = 54/180 (30%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RG+K KLKK+ +KYG+QD+EER I+M L  S    + ND                   
Sbjct: 1168 LPRGKKSKLKKVADKYGEQDDEERKIKMMLFGSKEMKKAND------------------- 1208

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
                               DC  +   +S+   +N    L   ++ +K   E+E + ++ 
Sbjct: 1209 -------------------DCSSNKTKNSNEFLNNQNRQL-HISQQEKRRKEQEKMEKVY 1248

Query: 1012 EEEKGRLND-------VDYLTGNPLPSDI-----LLYVIPVCGPYSAVQSYKYRVKIIPG 1059
               K R+ D         Y   + LP++      ++ VIP   P++ ++ +KY  ++ PG
Sbjct: 1249 ---KNRIVDNSTENREFQYFKDSLLPTNKDEDSEIIAVIPTFAPFTCIKDFKYCARLTPG 1305


>gi|329766254|ref|ZP_08257812.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137313|gb|EGG41591.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 590

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 90/147 (61%), Gaps = 4/147 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK    ++ +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 367 EKESVTVAEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLGKNDKIFHGDIFG 426

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K+   + P PP +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 427 SPFFILKD--VDNP-PPASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 483

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           L  GSF I G++NF+    L +  GL+
Sbjct: 484 LPKGSFTIEGQRNFVKISTLKLAVGLM 510


>gi|154150873|ref|YP_001404491.1| hypothetical protein Mboo_1330 [Methanoregula boonei 6A8]
 gi|153999425|gb|ABS55848.1| protein of unknown function DUF814 [Methanoregula boonei 6A8]
          Length = 631

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 176/362 (48%), Gaps = 42/362 (11%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P++L +   ++  +F  F  AL+ FY   ++++ +   + K      +  +I   QE  +
Sbjct: 242 PVVLAENAPQDENQFAGFSDALEVFYPMTKAEKVKVAARPK----LSEGERIRKYQEAAI 297

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
               ++V ++ ++   I  N   +   I ++  A + R+SW+++   +K+          
Sbjct: 298 KKFDEKVAKAEEVVAAIYENYPFISQVITSL-AAASKRLSWQEIEHHLKDTSSTD----- 351

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
                   +   +        E+D  +K      V++ +  +   NA  +Y+  KK + K
Sbjct: 352 -------AKRITAFFPGEAAVEVDIGKK------VKIFVHETVEQNAGHYYDQIKKFKKK 398

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +E  + A          K R ++++     +I  M+K+ W+ +F WFI+S+  +V+ GRD
Sbjct: 399 KEGALLAMKTV------KPRKKVIRH----DIVPMKKL-WYHRFRWFITSDGVVVLGGRD 447

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE +VK+YM+ GD++VHAD+HGAS  ++K    +       +++   F   +S AW 
Sbjct: 448 AGQNEELVKKYMTGGDLFVHADVHGASVVIVKGKTEK-------MDEVAQFAASYSGAWR 500

Query: 637 SKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           S   T+  +   P QVSKT   GE++  GSF++RG++ +    PL +G GL+     + +
Sbjct: 501 SGHFTADVFSAQPTQVSKTPQAGEFVARGSFIVRGERTYYRDVPLSVGIGLVLEPYAAVI 560

Query: 696 GS 697
           G 
Sbjct: 561 GG 562



 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 46  GVTESGESE-KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
           G+  +GE+  K LLL+E+G R H    A +    P  F + LRK++   ++  +RQ G +
Sbjct: 39  GIRLNGEAHAKYLLLIEAGRRAHLVKNAPEPPKNPPQFAMFLRKYLTGGKVLAIRQHGLE 98

Query: 105 RIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           RI++F  G G   + +I+EL+ +GN++L D  + ++  LR HR  D+ +
Sbjct: 99  RILIFDIGKGALTYRLIIELFDEGNVILADEAYRIIKPLRHHRFKDRDI 147


>gi|68062538|ref|XP_673276.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56491007|emb|CAH97640.1| hypothetical protein PB000420.02.0 [Plasmodium berghei]
          Length = 423

 Score =  124 bits (310), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 100/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKNSIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR+I  QFG   N ++
Sbjct: 53  VEAEKRVHITEWVREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVIDIQFGYDDNVYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LT++++ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIILTNNDYKIIFILKSNDDNKKNLKI 148


>gi|156844590|ref|XP_001645357.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156116018|gb|EDO17499.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 1019

 Score =  123 bits (309), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 115/210 (54%), Gaps = 10/210 (4%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +DL LSA+ANA +++ +KK    KQ+K      KA K  E++   Q+ ++   ++  +
Sbjct: 519 VTIDLGLSAYANASQYFSIKKTSVEKQKKVEKNAEKAMKNIEERVSQQLKKKLKESHEVL 578

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +RK ++FEK+ WFISSE +LV+ G+   + + I  +Y+   DV+    +  A  T + 
Sbjct: 579 KKIRKPYFFEKYFWFISSEGFLVMMGKSELETDQIYSKYIENDDVF----MQNAFGTQVW 634

Query: 609 NHRPEQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSF 666
              P+   +PP TL QAG F +  S+AW  K+  S  W Y   +SK  + T   L  G F
Sbjct: 635 IKNPDMTEIPPNTLMQAGIFCMSASEAWSKKIAASPRWCYARNISKFDSTTNTLLPRGRF 694

Query: 667 MIRGKKNF--LPPHPLIMGFGLLFRLDESS 694
            ++ +K+   LPP  L+MGFG  +++   S
Sbjct: 695 ALKDEKSMIHLPPAQLVMGFGFAWKVKTES 724



 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/487 (24%), Positives = 225/487 (46%), Gaps = 57/487 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+      LR+ + G R +NVY++  S + ++ K   S          K+ +
Sbjct: 1   MKQRVSALDILLLGNELRQEVEGYRLTNVYNIAESSRQFLLKFNKSDS--------KINV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R     PSGF +KLRKH++ +RL   RQ+  DRI++ QF  G+  +
Sbjct: 53  VVDCGLRIHKTDFTRPIPPAPSGFVVKLRKHLKAKRLTGFRQVKNDRILVLQFADGL--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R  ++            Y  ++   +E    S L 
Sbjct: 111 YLVLEFFSAGNVILLDENRKILSLQRIVQE------------YGNKVGEAYEMFDES-LF 157

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A + ++ E    E D + E  N +     +    +   +S  L +  NK      + K+ 
Sbjct: 158 AEIGNTTE---KELDYLKEYNNEMVREWIDEALAKFKLESSHLLQEENK-----GQHKKV 209

Query: 239 TLKTVLGEALGYGPALSEHIIL----DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
            + ++    L   P LS  +I       G+ P+    E +   D+ + +L    ++F++ 
Sbjct: 210 KVMSIAKLLLNKEPHLSSDLISKNLKKNGINPSSSSLEYSDKIDDLVNILNATTSEFKEL 269

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQF-RSREFVKFE 352
           L +         GYIL +     +++ P +    T+ IY+ F P     F  S++  K +
Sbjct: 270 LNNDEKC-----GYILAKK---NENYNPEKHSPDTEFIYETFHP--FEPFVESKDLEKTK 319

Query: 353 T------FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
                  ++  LD+F+S IES +   + + +E  A  KL+   ++ E R+  L      +
Sbjct: 320 IIEIPGDYNKTLDQFFSTIESSKYSLRIQNQELQAKKKLDDAKLENERRIQALVDVQTSN 379

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            +   LI  +   ++    AV+  +  +M W  +  ++  E+K GN +A  +   L L+ 
Sbjct: 380 EQKGHLIIAHSNLIEEVKFAVQGLIDQQMDWNTIENLIGSEQKKGNKIAQKVKLPLKLKN 439

Query: 466 NCMSLLL 472
           N + ++L
Sbjct: 440 NKIDVIL 446


>gi|50293495|ref|XP_449159.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528472|emb|CAG62129.1| unnamed protein product [Candida glabrata]
          Length = 1031

 Score =  123 bits (308), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 111/206 (53%), Gaps = 12/206 (5%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--- 547
           V +DL LSA+ANA  ++ +KK    KQ+K      KA K  E K   Q LQ+K   +   
Sbjct: 516 VAIDLGLSAYANASTYFNMKKDHAEKQKKVEKNIEKAMKNIEDKIGKQ-LQKKLKESHDV 574

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +  +RK ++FEK+ WF S+E +LV+ G+   + + I  RY+   D+++       +   I
Sbjct: 575 LKKIRKPYFFEKYFWFYSTEGFLVMLGKSNVETDQIYSRYIEDDDIFMSNSFD--TKVWI 632

Query: 608 KNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT-GEYLTVGS 665
           KN  PE+  VPP TL QAG   +  S AW  K+ +S WW +   V+K     G  L  G 
Sbjct: 633 KN--PERVEVPPNTLMQAGILCMSASPAWQKKIASSPWWCFAKNVTKFDDVDGSVLAPGV 690

Query: 666 FMIRGKK--NFLPPHPLIMGFGLLFR 689
           F +R +K  N LPP  L+MG G +++
Sbjct: 691 FRLRNEKQINMLPPAQLVMGVGFMWK 716



 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 120/490 (24%), Positives = 229/490 (46%), Gaps = 63/490 (12%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKV 56
           +K R++  D+   A E+K    L G R SN+Y++  S + ++ K         +    K 
Sbjct: 1   MKQRISALDLQILAVELKSA--LEGFRLSNIYNIADSSRQFLLKF--------NKPDSKA 50

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            ++++ G+R+H T + R    TPSGF +KLRKH++++RL  +RQ+  DRI++ +F  G+ 
Sbjct: 51  NVVVDCGLRIHLTEFNRPVPPTPSGFVVKLRKHLKSKRLTALRQVTGDRILVLEFADGL- 109

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
             Y++LE ++ GN++L D E  +L L R   + +  V          E+  +F+ TT  +
Sbjct: 110 -FYLVLEFFSAGNVILLDHERKILALQRIVHEHENKVG---------EVYNMFDETTFDE 159

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            +   T  +       + VN   N      K  L          LS+N +KN       K
Sbjct: 160 -NMNDTQDERERTYSLELVNSWMNECETKFKSELSI--------LSQNESKN-------K 203

Query: 237 QPTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           +  + ++    L   P LS  ++       G  P+    E    +D  + +L+    ++ 
Sbjct: 204 KVKVMSIHKLLLSKVPHLSSDLLSKNLRIHGFNPSSSCLEYIGKKDEILNLLLETEKEY- 262

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLL-----LNQFRSR 346
              +++++ D    GYI+ +   L K   P   G   + IY+ F P +      ++ +S+
Sbjct: 263 ---KNLLNAD-EKTGYIIAKKNPLYKIDTP---GYDLEYIYENFHPFIPHIPATDEDKSK 315

Query: 347 EFVKFE-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
             +K E  ++  LD+F+S IES +   + + +E  A  K+     + + R+  L+++   
Sbjct: 316 -VIKIEGDYNKTLDDFFSTIESSKYALKIQNQEQQAKQKIEAARQENKKRIDALREQQAS 374

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
           +     L+  N++ V+    AV   +  +M W  + ++++ E+  GN +A  +   L L+
Sbjct: 375 NETKGNLLIANVDLVEEVKSAVLGLVNQQMDWNTIEKLIQSEQNKGNKIAKHVSLPLDLK 434

Query: 465 RNCMSLLLSN 474
            N + +LL N
Sbjct: 435 NNKIKILLPN 444



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 1014 EKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
            EK R   V  LT     SDI    +PV  P+ A+  YKY+VKI PG AKK K +
Sbjct: 910  EKIRTELVPNLTKEEEISDI----VPVFAPWPAMLKYKYKVKIQPGNAKKTKTL 959


>gi|432330923|ref|YP_007249066.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
 gi|432137632|gb|AGB02559.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
          Length = 630

 Score =  123 bits (308), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 163/363 (44%), Gaps = 53/363 (14%)

Query: 340 LNQFRSREFVKFETFDAALDEFY--SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVH 397
           +N     E   + TF  AL+ FY  +K E +   +   AKED       +I   Q+  + 
Sbjct: 243 INLRTGEETTAYPTFSLALEAFYPMTKAEKKATSRPKIAKED-------RIRSHQQAAI- 294

Query: 398 TLKQEVDRSVKMAELIE---YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
              ++ DRS+  AE +    Y      A ++    A +   SW+++ + +   R A +  
Sbjct: 295 ---KKFDRSIAQAEEVVNAIYENYPFIAQVIGTLAAASKTHSWQEIEKRI---RAAPSEE 348

Query: 455 AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQE 514
              I   +     + + L   +    +E               S   NA  +Y++ KK +
Sbjct: 349 TKKITAFFPGEAAVEIDLGKRIKVFVNE---------------SVEQNAGHYYDVIKKFK 393

Query: 515 SKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
            K+   +TA        + K R  +  +K            W+ +F WFI+S+  +V+ G
Sbjct: 394 KKKAGAVTAMETVATKKQTKRREFVPLKK-----------QWYHRFRWFITSDGAVVLGG 442

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           RDA QNE +VK+YM+ GD +VHAD+HGAS  ++K            +++   F   +S A
Sbjct: 443 RDATQNEELVKKYMAGGDTFVHADVHGASVVLVKGKTER-------MDEVARFAASYSGA 495

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           W S   ++  +   P QVSKT   GE+++ GSF++RG++ +    PL  G GL+     +
Sbjct: 496 WRSGHFSADVYSALPSQVSKTPEAGEFVSRGSFIVRGERTYYRNIPLSTGIGLMLDPHAA 555

Query: 694 SLG 696
            +G
Sbjct: 556 VIG 558



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 9/149 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  DV A    L+  + +    VY    KT         G+  +GE++ K LL +ESG 
Sbjct: 7   MSGIDVRAMTCELQEKLPLWIDKVYQFDTKTL--------GIRLNGENKAKYLLFIESGR 58

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H  A   +    P  F + LRKH+   ++  +RQ G +R+++F  G G     +I+EL
Sbjct: 59  RAHLVADLPEPPKNPPHFAMLLRKHLSGGKVLSIRQHGLERVLIFAIGKGTTVFNLIIEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           +  GN++L D   T++  L  HR  D+ V
Sbjct: 119 FDNGNVILADDTMTIIKPLWHHRFKDREV 147


>gi|390937875|ref|YP_006401613.1| putative RNA-binding protein [Desulfurococcus fermentans DSM 16532]
 gi|390190982|gb|AFL66038.1| putative RNA-binding protein, snRNP like protein [Desulfurococcus
           fermentans DSM 16532]
          Length = 659

 Score =  123 bits (308), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 181/370 (48%), Gaps = 58/370 (15%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +IY  + P L ++   +     +  + A+D ++++ E   A   ++A+ +    KL +I 
Sbjct: 250 EIYTSYEPRLFSEVYDKTVKPLDDINTAIDVYFTEYE---AYLDYQARMEEVTEKLREI- 305

Query: 390 MDQENRVHTLKQEVDRSVKMAELI-EYN--LEDVDAAILAVRVALANRMSWEDLARMVKE 446
              E R+   +QE        E+I EYN  +E++++ +  +    +N    E++    +E
Sbjct: 306 ---EARIK--RQE--------EIIAEYNNEIENIESILQTI---YSNYHVAEEILECARE 349

Query: 447 --ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN-A 503
             E+K    +A           C      N + E+  ++  + V+  E  L LS   + +
Sbjct: 350 TREKKGWEHIA---------EEC------NGVIEVRKDKGVIVVKLGEKTLELSIREDLS 394

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVANISHMRKVHWFEKF 560
           R+  EL++K+     KT +A     +  ++   + I    +EKT+   S      W+E+F
Sbjct: 395 RQVIELERKRGELVRKTESAKKVLEEMHQQLNTISISMNTEEKTIRKPS---PTFWYERF 451

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN---HRPEQPVP 617
           +W  +   +L I GRD  QNE++V++Y+ + DV++HAD+HG S+ V+K+   H  E  V 
Sbjct: 452 HWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV- 510

Query: 618 PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
                 A     C+S+AW +       +WV   QVSKT P GEYL  G+FM+ G KN+L 
Sbjct: 511 -----DASYLAACYSKAWKAGFSYIEVYWVSGRQVSKTPPPGEYLPRGAFMVYGSKNYLQ 565

Query: 677 PHPLIMGFGL 686
             PL +G G+
Sbjct: 566 V-PLRLGIGV 574



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 73/156 (46%), Gaps = 15/156 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           ++K  M+  D+ + V     +I G    N Y      +I KL    GV         ++ 
Sbjct: 5   LLKKAMDILDIYSWVNKYSSVITGCLIDNAYHYK-SYWILKLRCREGVY--------IVK 55

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL--GMNA 117
           +E GVR+H +    ++K+   GFT  LR  IR  R+  ++Q  ++RIILF+  +   +  
Sbjct: 56  IEPGVRMHLSQSHPEEKDI-DGFTRFLRSRIRDSRITSIKQPWWERIILFETSIHDKILR 114

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           HYV  EL  +G  ++TD    ++   R     D+ +
Sbjct: 115 HYV--ELLPRGQWIITDQSDKIVYASRFMEYRDRSI 148


>gi|410730361|ref|XP_003671360.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
 gi|401780178|emb|CCD26117.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
          Length = 1037

 Score =  122 bits (307), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/226 (32%), Positives = 122/226 (53%), Gaps = 9/226 (3%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +DL  SA+ANA  ++  KK    KQ++      KA K  E+K   Q+ ++   ++  +
Sbjct: 529 VTIDLGFSAYANASEYFNAKKTSAEKQKRVEKNIEKAMKNIEEKVNTQLKKKLKESHEVL 588

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  ++FEK++WFISSE YLV+ G++  + + I  +Y+   DV++  +    +   IK
Sbjct: 589 KKIRTPYFFEKYHWFISSEGYLVMMGKNDAETDQIYSKYIEDDDVFMSNNF--GTKVWIK 646

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFM 667
           N    + VPP TL QAG   +  S+AW  K+ +SAWW     V+K     +  L  G F+
Sbjct: 647 NPMKHE-VPPNTLMQAGILCMSSSEAWSKKIASSAWWCNAKNVTKFDKFDKSVLPPGVFV 705

Query: 668 IRGKK--NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           ++ +K  N LP   L+MG G L+++  S  G   + +   GE+E +
Sbjct: 706 LKDEKDQNTLPASQLVMGLGFLWKVKTSDNGDE-DVKEFEGEQEEL 750



 Score =  107 bits (266), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 129/509 (25%), Positives = 232/509 (45%), Gaps = 75/509 (14%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   AAE+K    L G R +N+Y+ S     F L  +          K+ +
Sbjct: 1   MKQRISALDLQILAAELKT--SLEGYRLNNIYNASDSNRQFLLRFNKP------DSKLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R   + PSGF +KLRKH++++RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  IVDCGLRIHLTEFTRPIPSAPSGFVMKLRKHLKSKRLTALRQVKNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +++LE ++ GN++L D    +++L R              H +   I   +     S  H
Sbjct: 111 FLVLEFFSAGNVILLDENRKIMSLQR------------IVHEHENIIGETYTMFDESLFH 158

Query: 179 AALTSSKEPDANEPDKVNEDGNN--VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            A       D N  +  N+D +   V N   E         S  L  + N  S+   + K
Sbjct: 159 TA------DDTNATNITNKDFSEGLVKNWLDEVKQKYAVAASTILETSKNDKSHQKKKIK 212

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV----------NKLEDNAIQVLVL 286
             ++  +L   L   P LS  +     L  N+K+S++          N+++D  I++L  
Sbjct: 213 VMSIHKLL---LSKEPHLSSDL-----LSKNLKMSKIDPSTSALDFENRVDD-IIKLLNT 263

Query: 287 AVAKFEDWLQDVISGDIVPEGYIL-MQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFR 344
             +++   L D    +    GYIL  +NK+    +P  +S    + IY+ F P    +  
Sbjct: 264 TESEYHQLLND----NEHRVGYILDHENKNF---NPKIDSNPDLEFIYETFHPF---EPY 313

Query: 345 SREFVKFET--------FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
             E  K  +        ++  LD+F+S IES +   + + +E  A  KL++  +D + ++
Sbjct: 314 VEEKDKASSHISEIPGYYNKTLDKFFSTIESSKYALRIQNQELQAKKKLDEAKLDNQKKL 373

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             L      + +   LI  N + V+ A  A++  +  +M W  + +++K E+K    +A 
Sbjct: 374 QALIDVQSSNEEKGHLIVANADLVEEAKSAIQGLVDQQMDWNTIEKLIKSEQKKHVKIAE 433

Query: 457 LID-KLYLERNCMSLLLSNNLDEMDDEEK 484
           LI   L L+ N   + L   L   DD+E+
Sbjct: 434 LIVLPLNLKENKFKMKLP--LKTFDDDEQ 460


>gi|448346455|ref|ZP_21535340.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
 gi|445632658|gb|ELY85869.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
          Length = 715

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 166/377 (44%), Gaps = 52/377 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIH 389
           +  P  L +    E   +++F +ALD ++ ++E    E+     +   F     K  +I 
Sbjct: 266 DVTPFPLEEHDDLEGEPYDSFLSALDAYFFRLELAEEEEPDPTDQRPDFESEIAKHERII 325

Query: 390 MDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
             Q+  +   +QE     + AEL+  EY L  VD  +  ++ A     SW+D+    +E 
Sbjct: 326 EQQQGAIEGFEQEAASLREQAELLYAEYGL--VDDILSTIQGARERERSWDDIRERFEEG 383

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
            + G   A  I  +      +++       E+DDE       ++++D       NA R Y
Sbjct: 384 AEQGIDAAAAIVDIDGSDGTVTV-------EIDDE-------RIDLDAQQGVEQNADRLY 429

Query: 508 ELKKKQESKQEKTITA--HSKAFKAAEKKTRLQILQEKTV-------------------- 545
              K+ E K++  + A   ++   A  K+ R +   +++                     
Sbjct: 430 TEAKRVEEKKDGALAAIEDTRQDLADAKRRRDEWEADESGGGDDDETDEDGDDLPRDWLS 489

Query: 546 -ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
            ++I       WF++F WF +S+ +LVI GR+A QNE +VK+Y+  GD  +H   HG   
Sbjct: 490 ESSIPIRENEPWFDRFRWFNTSDGFLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPV 549

Query: 605 TVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPT 657
           TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QVSKT  +
Sbjct: 550 TVLKATDPSEASSSDIDLPESSIAEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVSKTPES 609

Query: 658 GEYLTVGSFMIRGKKNF 674
           GEYL  G F IRG + +
Sbjct: 610 GEYLEKGGFAIRGDRTY 626



 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 70/162 (43%), Gaps = 7/162 (4%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|340345857|ref|ZP_08668989.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339520998|gb|EGP94721.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 638

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 59/157 (37%), Positives = 94/157 (59%), Gaps = 4/157 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK   + + +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 415 EKDSISFTEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLEKNDKIFHGDIFG 474

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++KN   + P P  +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 475 SPFFILKN--ADNP-PTASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 531

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           L  GSF I G++NF+    L +  G++ + D+  L S
Sbjct: 532 LPKGSFTIEGQRNFVKISTLKLAVGIIPQGDDYVLTS 568



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 98/218 (44%), Gaps = 21/218 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  ++L  SGV L  T+   D+   P+    +
Sbjct: 24  VSNIYGVTKDSILFKLHHTE------KSDLFMMLSTSGVWL--TSVKIDQME-PNRLLKR 74

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +++ + Q+  +RI  F F  G +  YVI+ E + +GNILL ++E  +L L  
Sbjct: 75  LRSDLLRLKIKKIEQIASERIAYFTFA-GFDKEYVIVAEFFGEGNILLCNNEMKILALQH 133

Query: 145 S----HRDDDKGVAIMSRHRYPTEICRV----FERTTASKLHAA--LTSSKEPDANEPDK 194
           S    HR    G+          ++ +V    FE    S L AA  L  +        + 
Sbjct: 134 SIDVRHRKLGVGLVYAPPPLNGIDVIKVTENDFEELKTSDLAAAKWLGRTLGLPKKYVEG 193

Query: 195 VNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
           + E  N  S     NL  ++  K +D +KN   N   G
Sbjct: 194 IFEMSNVDSKCVGTNLTSEQIKKLYDTTKNIVTNVVTG 231


>gi|386874769|ref|ZP_10116995.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
 gi|386807392|gb|EIJ66785.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
          Length = 539

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 88/147 (59%), Gaps = 4/147 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK +  +S +RK +W+E++ WF +S+ +L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 306 EKDLIVVSEIRKKNWYERYRWFFTSDGFLAIGGRDAASNSAVVRKHLVKKDKIFHGDIFG 365

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K        P  ++N+    TVC S+AW   M   SA+WV P QV K+AP+GE+
Sbjct: 366 SPFFILKEA---DNAPDKSMNEVAHATVCFSRAWREGMYGVSAYWVNPEQVKKSAPSGEF 422

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           L  GSF I G++NF+    L +  G++
Sbjct: 423 LPKGSFTIEGQRNFIKSDTLRLAVGII 449


>gi|76156171|gb|AAX27403.2| SJCHGC07504 protein [Schistosoma japonicum]
          Length = 170

 Score =  122 bits (305), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 63/144 (43%), Positives = 94/144 (65%), Gaps = 7/144 (4%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K+   + DV   +  ++ +++G R  NVYD+  KTY+ KL ++         EK +LL+
Sbjct: 11  MKLLFTSYDVMVSISEIKNQILGHRVINVYDVDNKTYLLKLASTK------SDEKTILLL 64

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H T Y   K   PSGF++KLRKHIR +++ DV Q+G DR++  Q G   +A+++
Sbjct: 65  ESGSRIHITDYDWPKNMMPSGFSMKLRKHIRNKKIVDVCQIGADRVVDIQIGYESSAYHL 124

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ILELY +GN+LLTD  FT+L LLR
Sbjct: 125 ILELYDRGNMLLTDDTFTILHLLR 148


>gi|433590765|ref|YP_007280261.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|448331831|ref|ZP_21521081.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
 gi|433305545|gb|AGB31357.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|445628400|gb|ELY81707.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
          Length = 721

 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 162/375 (43%), Gaps = 60/375 (16%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           +++F +ALD+++ ++E    E+     +   F     K  +I   Q+  +   +QE ++ 
Sbjct: 291 YDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAEQL 350

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYL 463
            + AEL+  EY L  VD  +  V+ A     +W+++    +E    G   A  +ID    
Sbjct: 351 RERAELLYAEYGL--VDEILSTVQGAREQDRAWDEIRERFEEGADRGIAAAEAVID---- 404

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPV----EKVEVDLALSAHANARRWYELKKKQESKQEK 519
                          +D  E T+ V    E++E+        NA R Y   K+ E K+E 
Sbjct: 405 ---------------VDGSEGTVTVDLDGERIELVADRGVEQNADRLYTEAKRVEDKKEG 449

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVA---------------------NISHMRKVHWFE 558
            + A     +  E   R +   E   A                     +I       WF+
Sbjct: 450 ALAAIENTREDLEDAKRRRDEWEAKDAASDDEDEADDEGPNRDWLADPSIPIRENEPWFD 509

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP--- 615
           +F WF +S++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +    
Sbjct: 510 RFRWFHTSDDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSS 569

Query: 616 ---VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
              +P  ++ +A  F V ++  W D +     + V   QVSKT  +GEYL  G F IRG 
Sbjct: 570 DIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGD 629

Query: 672 KNFLPPHPLIMGFGL 686
           + +    P+    G+
Sbjct: 630 RTYYRDTPVGAAVGI 644



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +        +  ++ L++E 
Sbjct: 4   KRELTSVDLAALVGELGTYEGAKVDKAYLYGDDLVRLKMRDF-------DRGRLELILEV 56

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|297736764|emb|CBI25965.3| unnamed protein product [Vitis vinifera]
          Length = 1266

 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/173 (46%), Positives = 106/173 (61%), Gaps = 17/173 (9%)

Query: 713 DFEDSGHHKENSDIESEKDDTDEKPVAESLSVPN---------------SAHPAPSHTNA 757
           DFE++   K NSD ESEK++TDEK  AES S+ +               SAH   + +N 
Sbjct: 28  DFEENESLKGNSDSESEKEETDEKRTAESKSIMDPPTHQPILEGFSEISSAHNELTTSNV 87

Query: 758 SNVDSHEFPAEDKTISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHG 816
            +++  E P E++ + NG DS+ I DI+    + V PQLEDLID AL LGS + S  K+ 
Sbjct: 88  GSINLPEVPLEERNMLNGNDSEHIDDISGRHVSSVNPQLEDLIDWALELGSNTASGKKYA 147

Query: 817 IETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
           +ET+Q DL E+  H +R A VR+KPYISKAERRKLKKGQ +S  D   +  KE
Sbjct: 148 LETSQVDL-EDHNHEDRKAKVREKPYISKAERRKLKKGQKTSTSDAGGDHGKE 199


>gi|88601740|ref|YP_501918.1| hypothetical protein Mhun_0437 [Methanospirillum hungatei JF-1]
 gi|88187202|gb|ABD40199.1| protein of unknown function DUF814 [Methanospirillum hungatei JF-1]
          Length = 627

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 161/338 (47%), Gaps = 45/338 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F++F+AAL  FY       A    K +E     + ++I   QE  +   ++ + R+ ++A
Sbjct: 254 FDSFNAALAAFYPV-----APPVKKQEEKIRVSREDRIRHQQEEAIVKFEKNITRNEELA 308

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
            L+      V   I  +  A   R SW+++  ++K++                 +  + +
Sbjct: 309 ALLYEEYGFVSEIITTLSKAAETR-SWQEIEAILKKDTSGAG------------KKIIRI 355

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             +    E+D          V+V +  +   NA R+Y+  KK + K      A  +  + 
Sbjct: 356 FPAEAAVELDLGRP------VKVFVHETIDQNAGRYYDQVKKFKKKLAGAKAAMEREVQQ 409

Query: 531 AEKKTRLQILQEKTVANISHMR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           A  +TR           + + R K  WF++F WF +S+  LVI GRDA QNE ++++Y+ 
Sbjct: 410 A--RTR----------KVQYQRPKKRWFDRFRWFYTSDQVLVIGGRDAGQNEELIRKYLE 457

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYP 648
            GD +VHAD+HGAS  V+K    +       +++   F   +S AW +   ++  +   P
Sbjct: 458 GGDTFVHADVHGASVVVVKGKTKD-------MDEVARFAAAYSGAWRAGFASADVYAARP 510

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            QVSKTA +GEYL+ GSF++RG++ +    PL +  GL
Sbjct: 511 DQVSKTAESGEYLSRGSFVVRGERQWFHDVPLEVVIGL 548



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  D+      + RL+ +    VY    +  IF+L        S    KV +L+E G R
Sbjct: 7   MSGLDLITVTDEITRLLPLWVHKVYLDENRLCIFRL-------NSKNQGKVNILIEPGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H  +   +    P  F + LRK++   R++ +RQ G  R ++          ++I+E++
Sbjct: 60  FHCVSTLPEMPQIPPAFAMFLRKYLAGGRVDGIRQQGLQRTVIIDIRKSEQLFHLIVEVF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
             GNI+L   + T++  L  HR  D+ V
Sbjct: 120 DDGNIILCGEDMTIIQPLTRHRFKDRDV 147


>gi|118577090|ref|YP_876833.1| RNA-binding protein [Cenarchaeum symbiosum A]
 gi|118195611|gb|ABK78529.1| RNA-binding protein [Cenarchaeum symbiosum A]
          Length = 631

 Score =  120 bits (302), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 106/200 (53%), Gaps = 5/200 (2%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           EK+ VD   S H+ A   ++  K+Q           +KA K  +   R    Q  +V + 
Sbjct: 355 EKISVDPRSSIHSAASSLFDEAKRQSGAVPAIEKLRAKAAKELDALRRDSEEQAASV-SF 413

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           + +R+  W+E++ WF +++  L + GRD+  N  I++R++   D   HAD  G+   ++K
Sbjct: 414 TKVRRKSWYERYRWFFTTDGSLAVGGRDSSSNTSIIRRHLDANDRVFHADTFGSPFFILK 473

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           +    +P     L +A   TVC S+AW   M   SA+WV P QV K AP+G++L  GSF+
Sbjct: 474 DGADSRPA---GLEEAAHATVCFSRAWREAMYGLSAYWVLPEQVKKAAPSGQFLPKGSFV 530

Query: 668 IRGKKNFLPPHPLIMGFGLL 687
           I G++NF+    L +  GL+
Sbjct: 531 IEGRRNFVKIPTLRLAVGLV 550



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 9/125 (7%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +R  G   SN+Y +SP++ +FKL +        E E ++L++ S   L T++  R ++  
Sbjct: 17  KRTGGYYVSNIYGISPESLLFKLHHP-------EKEDIMLML-STFGLWTSS-VRIEQVG 67

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           P+    +LRK +   RLE V Q G DRI   +F        +  E +  GN++L      
Sbjct: 68  PNRLLARLRKELLRSRLESVEQPGMDRIAYLRFEGPRGTRILAGEFFGGGNMILCGDGMM 127

Query: 139 VLTLL 143
           +L LL
Sbjct: 128 ILALL 132


>gi|156938202|ref|YP_001435998.1| hypothetical protein Igni_1415 [Ignicoccus hospitalis KIN4/I]
 gi|156567186|gb|ABU82591.1| protein of unknown function DUF814 [Ignicoccus hospitalis KIN4/I]
          Length = 644

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 90/148 (60%), Gaps = 3/148 (2%)

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           ++E+    I+  R+  W+EK++W I+S   L I G+DA QNE +V+RY+   D+++HA++
Sbjct: 400 VKEEIAKEIAKSRRREWYEKYHWLITSSGLLAIGGKDASQNEAVVRRYLEDDDIFMHAEV 459

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTG 658
            GA + V+K    E  V    L +A   T C+S+AW + +     ++V   QVSK+ P G
Sbjct: 460 QGAPAVVLKTEGKE--VTEKDLREAAFLTACYSKAWKEGRGSVDVFYVKGSQVSKSPPPG 517

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +Y+  G+F+I+GK+ ++   PL +  G+
Sbjct: 518 QYVAKGAFIIKGKREYVRDVPLRLALGV 545



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 64/138 (46%), Gaps = 8/138 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  MN  DV A ++    LIG    NVY      ++ KL      T++       L+ E 
Sbjct: 4   KASMNYLDVVAWIRKNEDLIGSTVQNVYYKDGLMWM-KLKGKGSGTKA-------LIAEP 55

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
           G R+H T    +       F   LRK +++ +L  ++ +GYDR++   F  G   + +++
Sbjct: 56  GRRIHLTPSPPEAPERLHPFAGGLRKFLKSAKLTSIKTVGYDRVVEMNFSKGGEVYKLMI 115

Query: 123 ELYAQGNILLTDSEFTVL 140
           EL  +G I L D E  +L
Sbjct: 116 ELVPRGVIALLDPENKIL 133


>gi|448317278|ref|ZP_21506835.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
 gi|445604315|gb|ELY58265.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
          Length = 717

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/343 (26%), Positives = 151/343 (44%), Gaps = 46/343 (13%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRV 429
           Q+   +E+ A H+  +I   Q+  +   +Q+ +   + AEL+  EY L  VD  +  V+ 
Sbjct: 316 QRPDFEEEIAKHE--RIIEQQQGAIEGFEQQAEAQRENAELLYAEYGL--VDDILSTVQE 371

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE 489
           A A    W+++ +  +E ++ G   A  +  +      +++ L                E
Sbjct: 372 ARAQDRPWDEIEQRFEEGKERGIEAAEAVVGVDGTDGIVTVELDG--------------E 417

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT----- 544
           K+++D       NA R Y   K+ E K+E  + A     +      R +   E T     
Sbjct: 418 KIDLDAGQGVEQNADRIYTEAKRIEEKKEGALAAIEDTREDLADAKRRRDEWEATDETAD 477

Query: 545 --------------VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
                         +A+I       W+++F WF +S+ YLVI GR+A QNE +VK+Y+  
Sbjct: 478 GDEDDEHEETNWLELASIPIRENEPWYDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEP 537

Query: 591 GDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSA 643
           GD  +H   HG   TV+K   P +       +P  ++ +A  F V +S  W D +     
Sbjct: 538 GDTVLHTQAHGGPVTVLKATDPSEASSSDIELPDSSVEEAAQFAVSYSSVWKDGRYAGDV 597

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           + V   QV+KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 598 YAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 640



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|385805336|ref|YP_005841734.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
 gi|383795199|gb|AFH42282.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
          Length = 629

 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 90/149 (60%), Gaps = 7/149 (4%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM--SKGDVYVHAD 598
           +E+ V  I+  RK  W+EK+ W  +    L+I+GRDAQQNE IVK+Y+  +K  +Y HA+
Sbjct: 409 KEREVKAIA--RKRDWYEKYIWSFTRNRLLIIAGRDAQQNEAIVKKYLMKNKKSLYFHAE 466

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPT 657
           +HGA ST++      + +    +         +S+AW + + V   +WV+  QVSKT P 
Sbjct: 467 IHGAPSTILLAEN--EDIKEEDIYDTSVIAASYSKAWKASLKVVDVFWVHSDQVSKTPPA 524

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           GEYL  GSFMI G+KN++   PL +G GL
Sbjct: 525 GEYLEKGSFMIYGEKNYVRNVPLKLGIGL 553



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 89/185 (48%), Gaps = 19/185 (10%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL-SPKTYIFKLMNSSGVTESGESEKVLLL 59
           +K  M   D+ A ++ L +  I ++ SN+Y +   K  + KL +              L+
Sbjct: 3   IKESMTVIDLIAFLRELEKEKINLKVSNIYHIPQTKRILIKLKDPYFK---------FLV 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            E+  +++ + Y+      PS F L LRK++  R +  ++Q+G+DRI+  +F    N + 
Sbjct: 54  AEASKKIYFSKYSLPTPEKPSIFALSLRKYLNERVITSIKQIGFDRILKLEFD---NDYA 110

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLH 178
           + +EL  +G I+LTD    ++      +  D+ +   S++  P     +FE R TA    
Sbjct: 111 LYIELLPRGEIILTDPTERIIHASSFKKMRDRKIERNSQYILPP----IFEKRPTAEMCI 166

Query: 179 AALTS 183
            AL+S
Sbjct: 167 EALSS 171


>gi|307595006|ref|YP_003901323.1| hypothetical protein Vdis_0882 [Vulcanisaeta distributa DSM 14429]
 gi|307550207|gb|ADN50272.1| protein of unknown function DUF814 [Vulcanisaeta distributa DSM
           14429]
          Length = 668

 Score =  118 bits (296), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/182 (36%), Positives = 108/182 (59%), Gaps = 6/182 (3%)

Query: 508 ELKKKQESKQEKT--ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EL++K ++ +E    + A  +  +A  +K   + ++E ++  I   R+  WFE+F WFI+
Sbjct: 396 ELERKAKTAEESLSQLRARIEELRAESEKI-AESIREGSIRVIYGARE--WFERFRWFIT 452

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S   LVI+GRDA QNE+IV+ Y+   D++VHAD+ G ++ VI+       V    + +A 
Sbjct: 453 SGGKLVIAGRDATQNEVIVRHYLRPWDIFVHADIPGGAAVVIRLASSGDNVSDDDIKEAA 512

Query: 626 CFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            + V +S+AW   + V  A++V   QV+K AP+GEYL  GSFMI G + ++    L +G 
Sbjct: 513 QYAVSYSRAWVMGLSVLDAFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWVRNAELGLGI 572

Query: 685 GL 686
           G+
Sbjct: 573 GV 574


>gi|288932692|ref|YP_003436752.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
 gi|288894940|gb|ADC66477.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
          Length = 646

 Score =  117 bits (294), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 179/348 (51%), Gaps = 31/348 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLN---KI 388
           Y ++ P+ L ++   E   FE+F+ A+DEFY++  S   E + K K+     KL    KI
Sbjct: 236 YVDYQPIDLKKYEGYEKKYFESFNKAVDEFYTR--SALKEIEVKEKKSEVIEKLENRLKI 293

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
            ++ + R    ++E ++  ++ +LI      V+    A++ A+  +  ++++ +++ E++
Sbjct: 294 QLETKER---YERESEKLRRIGDLIYEKYPIVERIHSALKKAVELK-GFDEVKKILAEQK 349

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           KAG  +  ++D +  E+   +++LS     +DD +  L ++K       + H NA  +Y+
Sbjct: 350 KAGK-LKEILDIIPKEK---AVVLS-----IDDVKFKLFLDK-------NLHENAEYYYD 393

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             KK + K    + A  K     E +   +I  +K ++    +R+  W+EK+ W+I+SE 
Sbjct: 394 QAKKLKEKVNGIVKAIEKT--REEIRRAEEIEAKKILSEFRVVRRREWYEKYRWYITSEG 451

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GR+A+ NE IV ++    D++ H    G + T++K           ++ +A  F 
Sbjct: 452 FLVIGGRNAEMNEEIVSKHFESKDLFFHTQTPGGAVTILKRG---LEAGEKSIKEAAEFA 508

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +S  W   M +   ++V   QV + A  GEYL  GSF I GK+N+L
Sbjct: 509 AIYSALWKHGMHSGEVYYVTYEQVKRAAKPGEYLPKGSFYIVGKRNYL 556



 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 10/136 (7%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           +M++ D+ A +  L+ + GM+   VY   P  +  KL             +V  L+E+G 
Sbjct: 3   QMSSIDIRAVLNELK-IEGMKVDKVYHYPPNEFRIKLRGRG---------RVDFLVEAGK 52

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + ++    PS   + LRKH+   R+E V Q  +DRI++ +F  G     ++ EL
Sbjct: 53  RIHATEFPKESPKFPSSIAMLLRKHLENARVERVYQHDFDRIVVIEFSRGDEKKIMVAEL 112

Query: 125 YAQGNILLTDSEFTVL 140
           + +GN+LL D +F V+
Sbjct: 113 FGKGNLLLLDEDFKVI 128


>gi|21593912|gb|AAM65877.1| unknown [Arabidopsis thaliana]
          Length = 129

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 52/76 (68%), Positives = 63/76 (82%)

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
            MEE+DIHE+G+EEK +L DVDYLTGNPLP+DILLY +PVCGPY+A+QSYKYRVK IPG+ 
Sbjct: 1    MEEDDIHEVGDEEKEKLIDVDYLTGNPLPTDILLYAVPVCGPYNALQSYKYRVKAIPGSM 60

Query: 1062 KKGKGIQIFYSLLLLM 1077
            KKGK  +   +L   M
Sbjct: 61   KKGKAAKTAMNLFTHM 76


>gi|448300325|ref|ZP_21490327.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
 gi|445586054|gb|ELY40340.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
          Length = 726

 Score =  116 bits (290), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 166/371 (44%), Gaps = 52/371 (14%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           ++T+ +ALD+++ ++E +   +     +   F     K  +I   Q+  +   +QE D  
Sbjct: 296 YDTYLSALDDYFFRLELEEEGEPDPTDQRPDFEEEIAKQERIIEQQQGAIEGFEQEADML 355

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            + AE +  EY L  VD  +  ++ A A    W+++    +   + G   A  +    ++
Sbjct: 356 REQAESLYAEYGL--VDDILSTIQEARAQDRPWDEIEERFEAGAEQGIEAAEAV----ID 409

Query: 465 RNCMSLLLSNNLD-EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
            +    +++ ++D E  D E T  VE+           NA R Y   K  E K+E  ++A
Sbjct: 410 VDGSEGVVTVDVDGEYIDLETTQGVEQ-----------NADRLYTEAKAVEDKKEGALSA 458

Query: 524 HSKAFKAAE--KKTRLQILQEK-------------------TVANISHMRKVHWFEKFNW 562
                K  +  K+ R Q   +                    ++ ++       W+++F W
Sbjct: 459 IENTRKDLQEAKRRRDQWEADDGEDEGDDADEEEREDRDWLSMPSVPVRENEPWYDRFRW 518

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------V 616
           F +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +
Sbjct: 519 FYTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIEL 578

Query: 617 PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F IRG + + 
Sbjct: 579 PETSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYY 638

Query: 676 PPHPLIMGFGL 686
              P+ +  G+
Sbjct: 639 DDTPVGVAVGI 649



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RLELIIEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQFEFDRILEFT 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|383621605|ref|ZP_09948011.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|448702236|ref|ZP_21699890.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|445777606|gb|EMA28567.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
          Length = 718

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 169/376 (44%), Gaps = 62/376 (16%)

Query: 351 FETFDAALDEFYSKIESQRAE------QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +++F  ALD+++ ++E    E      Q+   +E+ A H+  +I   QE  +   +Q+ D
Sbjct: 288 YDSFLTALDDYFFRLELDEEEEPDPTEQRPDFEEEIAKHQ--RIIEQQEGAIEGFEQQAD 345

Query: 405 RSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + AE +  EY L  VD  +  +R A      W+++ +  +E ++ G           
Sbjct: 346 ELREQAESLYAEYGL--VDEVLSTIRQARKQDRPWDEIEQRFEEGKERG----------- 392

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQE 518
                  +  +  + ++D  E T+ VE    ++++ +      NA R Y   K+ E K+E
Sbjct: 393 -------IEAAETVVDLDGSEGTVTVEVDGERIDLVVDDGVEQNADRLYTEAKRVEEKKE 445

Query: 519 KTITAHSKAFKAAE--KKTRLQILQEK-------------------TVANISHMRKVHWF 557
             + A     +  E  K+ R Q   E                    ++ ++       W+
Sbjct: 446 GALAAIEDTREDLEDAKRRRDQWEAEDAAEDDEDDDDEDEEERNWLSMPSVPIRENEPWY 505

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-- 615
           ++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +   
Sbjct: 506 DRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDEVLHTQAHGGPVTVLKATDPSEASS 565

Query: 616 ----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
               +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG
Sbjct: 566 HDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRG 625

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P+ +  G+
Sbjct: 626 DRTYYRDTPVGVAVGI 641



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|428671809|gb|EKX72724.1| hypothetical protein BEWA_012830 [Babesia equi]
          Length = 1178

 Score =  115 bits (288), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 89/149 (59%), Gaps = 9/149 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M + R+N  DV   V  L+RL +     N+YD++ + ++ K         S   EKV +L
Sbjct: 1   MARERLNAIDVGVVVANLKRLALNYSLVNIYDITNRIFVLKF--------SKNEEKVYVL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+HTT + R   + PS F +KLRKH+R+R+L +V Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHTTQFLRSSDSLPSNFNVKLRKHLRSRKLRNVAQMSQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRD 148
           +I++L+  GNI LTD+ + VLT+L   +D
Sbjct: 113 LIVQLFLPGNIYLTDANYKVLTVLSGEKD 141


>gi|336253827|ref|YP_004596934.1| Fibronectin-binding A domain-containing protein [Halopiger
           xanaduensis SH-6]
 gi|335337816|gb|AEH37055.1| Fibronectin-binding A domain protein [Halopiger xanaduensis SH-6]
          Length = 718

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 163/367 (44%), Gaps = 45/367 (12%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           +++F  ALD+++ ++E +  E+    ++   F     K  +I   Q+  +   +QE ++ 
Sbjct: 289 YDSFLTALDDYFFRLELEDEEEPDPTEQRPDFEEEIAKHERIIEQQQGAIEGFEQEAEQL 348

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AEL+      VD  +  +R A      W+++    +E ++ G   A  +  +     
Sbjct: 349 REKAELLYARYGLVDDILSTIRNAREQDRPWDEIEERFEEGKERGIEAAEAVVGIDGSEG 408

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            +++ +                E+++++       NA R Y   K+ E K+E  + A   
Sbjct: 409 IVTVDIDG--------------ERIDLEARQGVEQNADRLYTEAKRVEEKKEGALAAIED 454

Query: 527 AFKAAE--KKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISS 566
             +  E  K+ R Q   E                   ++ ++       W+++F WF +S
Sbjct: 455 TREDLEEAKRRREQWEAEDAGEDDADDEDEGEDKDWLSMPSVPIRENEPWYDRFRWFHTS 514

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLT 620
           ++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +       +P  +
Sbjct: 515 DDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPDSS 574

Query: 621 LNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           + +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +    P
Sbjct: 575 IEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTP 634

Query: 680 LIMGFGL 686
           + +  G+
Sbjct: 635 VGVAVGI 641



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 13/165 (7%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMN-SSGVTESGESEKVLLLME 61
           K  + + D+AA V+ L    G +    Y         K+ +   G TE        L+ E
Sbjct: 4   KRELTSVDLAALVEELGAYEGAKVDKAYLYGDDLVRLKMRDFDRGRTE--------LIFE 55

Query: 62  SG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F      
Sbjct: 56  VGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGT 115

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
             +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 116 TRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|448348947|ref|ZP_21537792.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
 gi|445641664|gb|ELY94739.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
          Length = 720

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/392 (25%), Positives = 171/392 (43%), Gaps = 39/392 (9%)

Query: 324 ESGSSTQIYDEFCPLLLNQFRSREF--VKFETFDAALDEFYSKIESQRAEQQHKAKEDAA 381
           + GS+ ++ D   P  L +    +     ++TF  ALD+++ ++E    E+     +   
Sbjct: 262 DEGSAARVVD-VTPFPLEEHEQDDLDGEPYDTFLEALDDYFFRLELDDEEEPDPTDQRPD 320

Query: 382 FH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRM 435
           F     K  +I   Q+  +   +QE +   + AE +  EY L  VD  +  ++ A     
Sbjct: 321 FEEEIAKHERIIEQQQGAIEGFEQEAENLRENAESLYAEYGL--VDEILSTIQEAREQDR 378

Query: 436 SWEDLARMVKEERKAGNPVA----------GL----IDKLYLERNCMSLLLSNNLDEMDD 481
            W+++     E  + G   A          GL    ID  Y+E      +   N D +  
Sbjct: 379 PWDEIEERFAEGAEQGIDAAEAVVDVDGSEGLVTVDIDGEYIELVAHDGV-EQNADRLYT 437

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           E K +  +K   + AL+A  + R   E  K++  + E T    +      ++      L 
Sbjct: 438 EAKRVAEKK---EGALAAIEDTREDLEEAKRRRDEWEATDGEEADDEATEDEGEDHDWLA 494

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +   I       WF++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG
Sbjct: 495 DPS---IPIRENEPWFDRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHG 551

Query: 602 ASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKT 654
              TV+K   P +       +P  ++ +A  F V ++  W D +     + V   QV+KT
Sbjct: 552 GPVTVLKATDPSEASSADIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKT 611

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +GEYL  G F +RG + +    P+    G+
Sbjct: 612 PESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDNLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +         Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGASQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|332796292|ref|YP_004457792.1| hypothetical protein Ahos_0606 [Acidianus hospitalis W1]
 gi|332694027|gb|AEE93494.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 566

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 125/221 (56%), Gaps = 16/221 (7%)

Query: 479 MDDEEKTLPVE--KVEVDLALSAHANARRWYELKKK--QESKQEK-TITAHSKAFKAAEK 533
           + ++EK + +E  ++E+D  LS   NA  +++  K+  Q+SK+ K T+    +     E 
Sbjct: 281 IKNKEKKIKLEGKEIEIDPKLSVAKNASLYFDKAKEYVQKSKKAKETLEELKRKLNEIEI 340

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           + + +    K       +RK  W+EK+ W  ++  +LVI+G+DA QNE +V++ +   D+
Sbjct: 341 EIKKEEEGRKL-----SIRKKEWYEKYRWSFTTNGFLVIAGKDADQNESLVRKLLEDNDI 395

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVS 652
           ++HAD+ GA++T+IKN +    +    +  A      +S+AW   +     +WVY  QVS
Sbjct: 396 FLHADIQGAAATIIKNPK---NITEQDIYDAAAIAASYSKAWKLGLAAVDVFWVYGSQVS 452

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           K+ P GEYL  GSFMI GKKN++    L +  G  F++++S
Sbjct: 453 KSPPAGEYLPKGSFMIYGKKNYIKSVKLNLAIG--FKINDS 491


>gi|116754828|ref|YP_843946.1| hypothetical protein Mthe_1534 [Methanosaeta thermophila PT]
 gi|116666279|gb|ABK15306.1| protein of unknown function DUF814 [Methanosaeta thermophila PT]
          Length = 641

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 166/379 (43%), Gaps = 49/379 (12%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           +  P  L  ++  E   FE F  ALDEF+         +    K  A   +L      Q 
Sbjct: 246 DVIPFPLEVYKGLEARSFERFSDALDEFF-------VAEPEMPKLSALERRLEL----QR 294

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             +  L+ +  +   M + I     ++D+ + A+  A    +S+ D+   ++   K+   
Sbjct: 295 AAIDELRAKETQLASMGDFIYQRYSEIDSILKAIAGARERGLSYTDIWERIQSSGKSAVK 354

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
                 ++ +E + ++L                     E++  L+   NA R+YE  K+ 
Sbjct: 355 SLDYSGEMIVEIDGVTL---------------------ELNAGLTVPQNAGRYYERAKEA 393

Query: 514 ESKQEKTITAHSKA---FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
             K      A  +     +  E++ R  +L+ +         K  WFE+F WF SS+++L
Sbjct: 394 AKKAAGAEEALRRTEDLLQRGEERRRSPVLKRR--------HKPRWFERFRWFYSSDDFL 445

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI GRDA  NE I  +Y+ K D+ +H D  GA  TVIK    E  VP  T+ +A  F V 
Sbjct: 446 VIGGRDADGNEEIYLKYLEKRDLALHTDYPGAPLTVIKTEGRE--VPERTVEEAAQFAVS 503

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           +S  W   + +   + V   QV+KT   GE+L  G+F++RG++ +L   PL +   +   
Sbjct: 504 YSNLWREGVASGDCYVVRGDQVTKTPEHGEFLRKGAFVVRGERRYLRDVPLGVALAI--- 560

Query: 690 LDESSLGSHLNERRVRGEE 708
            D S +G  ++  R +  E
Sbjct: 561 ADGSLIGGPVSAVRSKSSE 579



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 11/146 (7%)

Query: 6   MNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DVAA V  L+ R+ G      Y  S       +    G        ++ +++E+G 
Sbjct: 5   MSNVDVAAIVAELQTRIAGGFFGKAYQSSGDAIWLTIQAREG--------RLDIILEAGR 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T   R    TP  F   LR  +   R+  V Q  +DR++          + +++EL
Sbjct: 57  RAHVTRKERVVGRTPPQFPAMLRSRLSGGRIVSVEQHDFDRVMEICVERSDGRYRLVVEL 116

Query: 125 YAQGNILLTDSEFTVLTLLR--SHRD 148
           + +GN+LL D E  ++  LR  S RD
Sbjct: 117 FPKGNMLLLDDEMRIILPLRPMSFRD 142


>gi|435848081|ref|YP_007310331.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
 gi|433674349|gb|AGB38541.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
          Length = 712

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 154/335 (45%), Gaps = 31/335 (9%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q+   +E+ A H+  +I   Q+  +   +Q+     + AEL+    E VD  +  ++ A 
Sbjct: 312 QRPDFEEEIAKHE--RIIEQQQGAIEGFEQQAQAQRENAELLYARYELVDDILSTIQEAR 369

Query: 432 ANRMSWEDLARMVKEERKAG----NPVAGL-----IDKLYLERNCMSLL----LSNNLDE 478
                W+++    +E ++ G      V G+     I  + L+   + L+    +  N D 
Sbjct: 370 TQDRPWDEIEERFEEGKERGIEAAEAVVGVDGTEGIVTVELDGEEIDLVADDGVEQNADR 429

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           +  E K +  +K   + AL+A  + R   E  K++  + E T        +  E+K  L+
Sbjct: 430 LYTEAKRIEEKK---EGALAAIEDTREDLEDAKRRRDEWEATDDHEDDDDEEDEEKNWLE 486

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           +      A++       W+++F WF +S+ YLVI GR A QNE +VK+Y+  GD  +H  
Sbjct: 487 M------ASVPIRENEPWYDRFRWFHTSDGYLVIGGRSADQNEELVKKYLEPGDTVLHTQ 540

Query: 599 LHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQV 651
            HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV
Sbjct: 541 AHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQV 600

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 601 TKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 635



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|374724028|gb|EHR76108.1| putative RNA-binding protein [uncultured marine group II
           euryarchaeote]
          Length = 723

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/391 (24%), Positives = 177/391 (45%), Gaps = 54/391 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAA---------FHK 384
           E  P +L         KF T   A+D +    ++    ++   K D A           +
Sbjct: 272 EATPTILPSHAGMAQAKFATLCEAVDAWKGAHDAGALARREAEKLDIAAPGRGHSTDVER 331

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           L +  + QE  +    +++++   +   I+ N   V++ ++ V  A+  +  W+++  M 
Sbjct: 332 LERRKVQQEKALSGFSKKIEKQQMIGHTIQNNWTHVESLLIQVTEAIEAK-GWKEVKSMA 390

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           K        +  ++     ER+ +S+L   N  E    + TL +++       S H NA+
Sbjct: 391 KS-------IPWIVSLNPAERSFLSVLPDEN-GEPKGPQATLSIDE-------SVHQNAQ 435

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT--VANISHMRKVHWFEKFNW 562
           R++   +KQ+ K +  + A        ++  + +  Q+ T  +  I   +++ WFE   W
Sbjct: 436 RFFTAARKQKDKTKGAVDALEDTLLQLQRAQKKEAKQQATGKLNKIKRSKRL-WFEHHRW 494

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST--------VIKNHRPEQ 614
            + +  +L++ G+DA+ N+ IVK+++S  D Y+HADLHGA S         V+  H+P  
Sbjct: 495 SMITGGHLLVGGKDAKGNDSIVKKHLSGQDRYLHADLHGAPSCSLRATQGFVVDQHKPAH 554

Query: 615 ---PVPPL--------------TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAP 656
               VP                 L +A    +C S+AW       + + V P QVSKTA 
Sbjct: 555 IPADVPAFRIVDKLGDERITEEKLLEAATMALCWSRAWAGGGAHGTVYSVKPAQVSKTAQ 614

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           TGE++  GSF++RG++ +     + +G G++
Sbjct: 615 TGEFVGKGSFIVRGQRQWFKDLDVQIGIGIV 645



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 56/92 (60%)

Query: 52  ESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
           E ++  L++  G R++T+   R    TP  F + LRKH++  R+  VRQLG+DR++ F F
Sbjct: 44  EQDQFDLVLVRGSRIYTSQRDRPMPMTPPPFAMVLRKHLKNARMTGVRQLGFDRVLGFDF 103

Query: 112 GLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
                ++++ +E++  GNI+LTD E  ++  L
Sbjct: 104 DTKHGSYHLYVEVFRDGNIILTDQEGVIIQPL 135


>gi|448353444|ref|ZP_21542220.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
 gi|445640304|gb|ELY93393.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
          Length = 736

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 164/364 (45%), Gaps = 34/364 (9%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+   +E+ A H+  +I   Q+  +   +QE +
Sbjct: 302 YDTFLDALDDYFFHLELEDEEEPDPTSQRPDFEEEIAKHE--RIIEQQQGAIEGFEQEAE 359

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA--------- 455
              + AEL+  N   VD  +  ++ A A    WE +    +E  + G   A         
Sbjct: 360 NLRENAELLYANYGLVDDILSTIQEARAQDRPWEAIEARFEEGAEQGIEAAEAVIDVDGS 419

Query: 456 -GLI----DKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
            G++    D  Y+E      +   N D +  E K +  +K   + AL+A  + R   E  
Sbjct: 420 EGIVTVDVDGEYIELVAHDGV-EQNADRLYTEAKRVAEKK---EGALAAIEDTREDLEDA 475

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-WFEKFNWFISSENY 569
           K++  + E++           ++       ++    +   +R+   WF++F WF +S+ Y
Sbjct: 476 KRRRDEWEESDGESGAGSGGGDEDEGEDEDRDWLAESSIPIRENEPWFDRFRWFHTSDGY 535

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 536 LVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESSIEE 595

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG + +    P+  
Sbjct: 596 AAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGA 655

Query: 683 GFGL 686
             G+
Sbjct: 656 AVGI 659



 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|448359396|ref|ZP_21548054.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
 gi|445643534|gb|ELY96581.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
          Length = 727

 Score =  113 bits (283), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 158/376 (42%), Gaps = 61/376 (16%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+    E+ A H+  +I   Q+  +   +QE +
Sbjct: 296 YDTFLNALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHE--RIIEQQQGAIEGFEQEAE 353

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYL 463
              + AEL+  N   VD  +  ++ A A    W+D+    +E  + G   A  +ID    
Sbjct: 354 NLRENAELLYANYGLVDDILSTIQEARAQDRPWDDIEARFEEGAEQGIEAAEAVID---- 409

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH----ANARRWYELKKKQESKQEK 519
                          +D  E  + V+     + L AH     NA R Y   K+   K+E 
Sbjct: 410 ---------------VDGSEGIVTVDVNGEYIELVAHDGVEQNADRLYTEAKRVAEKKEG 454

Query: 520 TITAHSKAFKAAEKKTRLQILQEK----------------------TVANISHMRKVHWF 557
            + A     +  E   R +   E+                        ++I       WF
Sbjct: 455 ALVAIEDTREDLEDAKRRRDEWEEQDGEPGAGEEDEDDEDDDRDWLAESSIPIRENEPWF 514

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-- 615
           ++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +   
Sbjct: 515 DRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASS 574

Query: 616 ----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
               +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG
Sbjct: 575 SDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRG 634

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P+    G+
Sbjct: 635 DRTYYRDTPVGAAVGI 650



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|433638964|ref|YP_007284724.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
 gi|433290768|gb|AGB16591.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
          Length = 847

 Score =  113 bits (282), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 98/385 (25%), Positives = 164/385 (42%), Gaps = 50/385 (12%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQ 392
           PL  +Q    E   F++F  ALDE++ ++E    E    A +    +A   K  +I   Q
Sbjct: 401 PLEEHQQAGLEPEAFDSFTEALDEYFYQLELAEEEPADSASQRPDFEAEIAKQQRIIEQQ 460

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E  +   ++E +   + AEL+  N   VD  +  VR A      W+++     EER A  
Sbjct: 461 EGAIEEFEREAEAERERAELLYANYGFVDEILTTVRDARTEGTPWDEI-----EERFAAG 515

Query: 453 PVAGL-IDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY---- 507
              G+   +  ++ +  +  ++  LD+          E++ +D       NA R Y    
Sbjct: 516 AEQGIDAAEAVVDVDGANGRVTIELDD----------ERIPLDADDGVEKNADRLYTEAK 565

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV-------------------ANI 548
            + +K+E  Q+       +     E+K   +   E                      ++I
Sbjct: 566 RIAEKKEGAQQAIENTREELADVRERKAAWEADDEGGDDIGGDDSDEDEPDIDWLARSSI 625

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  WF++F W  +S+ +LVI GR+A QNE +V +Y+  GD   H   HG   TV+K
Sbjct: 626 PIRENEPWFDRFRWVQTSDGFLVIGGRNADQNEELVSKYLEPGDRVFHTQAHGGPVTVLK 685

Query: 609 ------NHRPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYL 661
                 + RP+   P  ++ QA  F V ++  W D +     + V   QV+KT  +GEYL
Sbjct: 686 ATDPSESSRPDMEFPETSIEQAAQFAVSYASVWKDGRYAGDVYSVDADQVTKTPESGEYL 745

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGL 686
             G F IRG + +    P+ +  G+
Sbjct: 746 EKGGFAIRGDRTYHRDTPVGVAVGI 770



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 73/164 (44%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L  L G +    Y         K+ +        +  +V L +E 
Sbjct: 114 KRELSSVDLAAVVGELSDLEGAKVDKAYLYGDDLVRLKMRDF-------DRGRVELFIEV 166

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT A  R  D    P  F   LR  +       V Q  +DRI+ F F       
Sbjct: 167 GETKRVHTVAQERVPDAPGRPPHFAKMLRNRLSGADFAGVSQYEFDRILEFVFEREDANT 226

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            VI+EL+ +GN+ +TD E+ V+  L + R   + VA  +R+ +P
Sbjct: 227 RVIVELFGEGNVAVTDGEYEVVDSLETIRLKSRTVAPGARYEFP 270


>gi|289580546|ref|YP_003479012.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|448284209|ref|ZP_21475471.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|289530099|gb|ADD04450.1| Fibronectin-binding A domain protein [Natrialba magadii ATCC 43099]
 gi|445571291|gb|ELY25845.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
          Length = 727

 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 165/364 (45%), Gaps = 37/364 (10%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+    E+ A H+  +I   Q+  +   +QE +
Sbjct: 296 YDTFLDALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHE--RIIEQQQGAIEGFEQEAE 353

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA--------- 455
              + AEL+  N   VD  +  ++ A A    W+++    ++  + G   A         
Sbjct: 354 NLRENAELLYANYGLVDDILSTIQEARAQDRPWDEIEARFEDGAEQGIEAAEAVIDVDGS 413

Query: 456 -GLI----DKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR-WYEL 509
            G++    D  Y+E      +   N D +  E K +  +K   + AL+A  + R    + 
Sbjct: 414 EGIVTVDVDGEYIELVAHDGV-EQNADRLYTEAKRVAEKK---EGALAAIEDTREDLKDA 469

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K++++  +E+     +      ++      L E   ++I       WF++F WF +S+ Y
Sbjct: 470 KRRRDEWEEQDGKPGAGDEDEDDEDDDRDWLAE---SSIPIRENEPWFDRFRWFHTSDGY 526

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 527 LVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESSIEE 586

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +    P+  
Sbjct: 587 AAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGA 646

Query: 683 GFGL 686
             G+
Sbjct: 647 AVGI 650



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAQERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|167043365|gb|ABZ08068.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_ANIW141O9]
          Length = 632

 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 108/202 (53%), Gaps = 10/202 (4%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           EK+++DL  S    A   +   KKQ++     I +  K     E +    I + ++  ++
Sbjct: 361 EKIKIDLNSSLPTTASTLFNESKKQKA----AIGSIEKLLIKTENELEKVIEKGESAKSV 416

Query: 549 S--HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           S   +RK +WFE++ WF +++  L + GRD+  N  I+++++ K D   HA++ G+   +
Sbjct: 417 SFTQVRKKNWFERYRWFYTTDGVLAVGGRDSSSNSAIIRKHLDKNDKVFHAEISGSPFFL 476

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGS 665
           +K++    P    +L +    TVC S+ W      +SA+WV P QV K AP+G+ +  GS
Sbjct: 477 LKDNATSTPA---SLTEVAHATVCFSKVWKEAFYGSSAYWVNPDQVKKGAPSGQSMAKGS 533

Query: 666 FMIRGKKNFLPPHPLIMGFGLL 687
           FMI G++NF+    L M   ++
Sbjct: 534 FMIEGQRNFVKISTLKMCVAII 555



 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 72/147 (48%), Gaps = 28/147 (19%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKN 77
           +R+ G   SN+Y ++    +FK  +        E   +LL++ + G+ +      + + N
Sbjct: 17  KRIDGYYLSNIYGITKDGLLFKFHHP-------EKPDILLMLSTFGIWITNVKIEQIEPN 69

Query: 78  TPSGFTLKLRKHIRTR----RLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLT 133
                  KL KH+R+     +L++V+Q+G +RI+            +++EL++ GNI++ 
Sbjct: 70  -------KLLKHLRSNILRFKLKEVKQIGTERIVYLTLSYFEKEFVIVVELFSDGNIIIC 122

Query: 134 DSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           ++E  +L L  SH       +I  RHR
Sbjct: 123 NNEMKILAL--SH-------SINVRHR 140


>gi|408405775|ref|YP_006863758.1| hypothetical protein Ngar_c31850 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408366371|gb|AFU60101.1| hypothetical protein with domain of unknown function DUF814 and
           fibronectin-binding A protein [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 661

 Score =  112 bits (279), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 85/136 (62%), Gaps = 5/136 (3%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--- 612
           W+E++ WFI+++  L I GRDA  N  ++++++++ D+  HA++HG+   ++KN      
Sbjct: 436 WYERYRWFITTDGLLAIGGRDASSNSALIRKHLTEDDIVFHAEVHGSPFFIVKNAAAPAK 495

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           E  + P +L Q    TV  S+AW   + ++ A+WV P QV K APTG++L  GSF+I GK
Sbjct: 496 EGRIDP-SLLQVAKATVSFSRAWKDGLSSADAYWVMPEQVKKGAPTGQFLPKGSFVIEGK 554

Query: 672 KNFLPPHPLIMGFGLL 687
           +N+L    + +  G++
Sbjct: 555 RNYLKGVEIRLAIGIV 570


>gi|170291097|ref|YP_001737913.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
 gi|170175177|gb|ACB08230.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
          Length = 624

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 150/337 (44%), Gaps = 48/337 (14%)

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           E V++  F        S +E  RA +   +     ++K   I  + E R+ +L++E++R 
Sbjct: 238 EIVEYSAFP------LSHLEYDRARRDLLSDAIEDYYKSKGISFEDE-RISSLRREIERQ 290

Query: 407 VKMAELIEYN---LEDVDAAILA----VRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           + + E  E     L  +   IL+    V  AL    S E+ A + + + K+G  +     
Sbjct: 291 ISLKEEYERTYAQLRRIGDTILSNIHEVEEALGRARSGEEHALVKRVDWKSGKVII---- 346

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
                                    +L  E++++D+  SA  NA  +Y+  KK   K  +
Sbjct: 347 -------------------------SLEGEEIQLDIRRSASENASEYYDKAKKAREKALR 381

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
              A S   +   K+    + + K   +    ++  W+EKF WF +S   LVI GRDAQ 
Sbjct: 382 IDKALSNIMERL-KQIESSLEERKLELSPKPRKRERWYEKFRWFYTSSGNLVICGRDAQT 440

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           N  IV +YM   D++ H D+ G +  V+K    E+ V   ++ QA       S+AW   +
Sbjct: 441 NSEIVSKYMDDKDLFFHVDMPGGAVVVLKV---EREVDQRSIEQAAVAAASFSRAWKEGL 497

Query: 640 -VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
                ++V   QVSK AP G YL  GSF I GK+N+L
Sbjct: 498 SYADVYYVKGEQVSKHAPPGMYLPKGSFYITGKRNYL 534



 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 72/141 (51%), Gaps = 10/141 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M   +++  +  LRRL G     +Y++   +  F L+    V   G  E +++ +   + 
Sbjct: 6   MTGIEISHTINELRRLEGGFIKKIYNIDGNS--FSLLFHPEV--DGRRE-IVIDLRGFIF 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           L    +A  K  TPS F + LRKH+   R+E + QLG +RII F+F  GM    +I+EL+
Sbjct: 61  LTKLKWA--KPQTPSSFVMTLRKHLENARIESISQLGLERIISFEFPRGMR---LIVELF 115

Query: 126 AQGNILLTDSEFTVLTLLRSH 146
             GN++L   +  V +  R+ 
Sbjct: 116 GGGNLILLSGDEIVASQRRAE 136


>gi|448321837|ref|ZP_21511312.1| fibronectin-binding A domain-containing protein [Natronococcus
           amylolyticus DSM 10524]
 gi|445602889|gb|ELY56860.1| fibronectin-binding A domain-containing protein [Natronococcus
           amylolyticus DSM 10524]
          Length = 717

 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 155/335 (46%), Gaps = 30/335 (8%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q+   +E+ A H+  +I   QE  +   +Q+     + AEL+      VD  +  V+ A 
Sbjct: 316 QRPDFEEEIAKHE--RIIEQQEGAIEGFEQQAQSQRENAELLYAEYGVVDDILSTVQEAR 373

Query: 432 ANRMSWEDLARMVKEERKAG----NPVAGL-----IDKLYLERNCMSLL----LSNNLDE 478
           A    W+++    +E ++ G      V G+     I  + L+   + LL    +  N D 
Sbjct: 374 AQDRPWDEIEERFEEGKERGIEAAEAVVGVDGTEGIVTVELDGEEIDLLARQGVEQNADR 433

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           +  E K +  +K   + AL+A  + R   +L+  +  + E   T  +      E +    
Sbjct: 434 LYTEAKRIAEKK---EGALAAIEDTRE--DLEDAKRRRDEWEATDETDDDDEDEAQEETN 488

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            L+   +A++       W+++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H  
Sbjct: 489 WLE---LASVPIRENEPWYDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDTVLHTQ 545

Query: 599 LHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQV 651
            HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV
Sbjct: 546 AHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVTYSSVWKDGRYAGDVYAVDSDQV 605

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +KT  +GEYL  G F IRG + +    P+ +  G+
Sbjct: 606 TKTPESGEYLEKGGFAIRGDRTYHRDTPVGVAVGI 640



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FDRDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|424812620|ref|ZP_18237860.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
 gi|339756842|gb|EGQ40425.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
          Length = 628

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 98/348 (28%), Positives = 144/348 (41%), Gaps = 43/348 (12%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P  L  +   E  +FETF  ALDE + +   Q+ E +   K       + +    QE +
Sbjct: 231 APFPLQTYSEHEEERFETFSRALDELFHRRRQQKLESKRMDKYRERREGIERQLHQQEQK 290

Query: 396 VHTLKQEVDRSVKMAELIEYNLE-------DVDAAILAVRVALANRMSWEDLARMVKEER 448
              L+Q   +  + AE I  N +        VD+ I       A ++   DL  +  +ER
Sbjct: 291 AEGLEQAARQRRQAAETIYENYQVFHDLKQKVDSVIHEEGWESAEQLEVSDLESVNHQER 350

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
                + G                         E K  P E +E        A A R Y+
Sbjct: 351 FYRVAIDGA------------------------EVKLSPDESLE--------AAASRMYD 378

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             K++E K E T  A        E+    +   E+        R   WFEK+ WF + E 
Sbjct: 379 EAKEREQKAENTREALQNTRGKLEELEEDEFEVEEDSMERDESRSKRWFEKYRWFHTPEG 438

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
            LVI GR  Q NE +VK ++   D+Y+HAD  GA S  +K+    Q      + QA    
Sbjct: 439 RLVICGRGPQTNESLVKNHLEGDDLYLHADFDGAPSVALKDG---QDASEEEIRQAAKAA 495

Query: 629 VCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           V  S+AW S +     ++V P QV+K   +GEYL  G+F+IRG + +L
Sbjct: 496 VTFSKAWKSGIGADDVYYVEPSQVTKNPESGEYLEKGAFVIRGDRTYL 543



 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           + Y RD    P GF ++LRKH+    ++ +RQ G+DRI+  + G   +  +V  EL+ +G
Sbjct: 55  SEYKRDNPERPPGFCMELRKHLGG--VDRIRQRGFDRILEIRSG---DVRFVA-ELFGKG 108

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           N  L     T+   LR     D+   +     YP
Sbjct: 109 NAALVKDGKTI-GALRQEEWSDRRTVVGEEFGYP 141


>gi|325969240|ref|YP_004245432.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
 gi|323708443|gb|ADY01930.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
          Length = 668

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)

Query: 508 ELKKKQESKQE--KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EL++K +S +E    + A  +  +A  +K  ++ ++E ++  I   R+  WFE+F WFI+
Sbjct: 396 ELERKAKSAEEVMSQLRARIEELRAEGEKV-IESIREGSIHVIYGARE--WFERFRWFIT 452

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S   LVI+GRDA QNE+IV+ Y+   D++VHAD+ GA+  VI+   P        + +A 
Sbjct: 453 SGGKLVIAGRDAAQNEVIVRHYLRPWDIFVHADIPGAAVVVIRLSNPSDNASNSDIYEAA 512

Query: 626 CFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            +   +S+AW   + V   ++V   QV+K AP+GEYL  GSFMI G + ++    L +G 
Sbjct: 513 QYAAAYSRAWVMGLSVLDVFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWIRNVELRLGI 572

Query: 685 GLLFRLDESS 694
           GL  R+D  S
Sbjct: 573 GL--RIDNLS 580



 Score = 40.8 bits (94), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 61/127 (48%), Gaps = 8/127 (6%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLED 97
           ++ + NS  +    ESEK  ++  S  R   T+Y  +  +   G T  LR+ I   RL  
Sbjct: 31  VYTMSNSLLLRFRKESEKYFVIANSH-RFGLTSYVLE--HGAEGVT-PLRRLIEGMRLRS 86

Query: 98  VRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMS 157
           +  L +DRI+   F  G    Y+++EL    N +   ++  +  +LR++R  D+ + I  
Sbjct: 87  IELLNFDRIVKLVFSDG----YLVIELLEPWNAIYMSNDNVIRWVLRAYRSRDRVINIGL 142

Query: 158 RHRYPTE 164
            ++ P +
Sbjct: 143 EYKPPPQ 149


>gi|76156132|gb|AAX27365.2| SJCHGC07862 protein [Schistosoma japonicum]
          Length = 241

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/238 (31%), Positives = 123/238 (51%), Gaps = 7/238 (2%)

Query: 237 QPTLKTVLGEALGYGPALSEHII-LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL 295
           +P +   L   L YG  + EH + +    V   K     +LE +   ++ L V  F   L
Sbjct: 8   KPYVNKTLSLELPYGNVVIEHCMRIAQKEVKQAKTINDFQLESSETYLMKLYVKHFAVAL 67

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
           +D++ G    +    ++    GK H  TE G   Q Y+EF P +  Q+R +  + F++F+
Sbjct: 68  RDILLGPYSIDHQSSLKGYIFGKPHQSTEKG--LQSYEEFHPFMFEQYREKPHLAFDSFN 125

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+D F+SKIESQ+   Q    E  A  K+  I  DQE R+  LK E +  ++ A LIE 
Sbjct: 126 RAVDAFFSKIESQKTLGQISRNEQKANRKVENIKKDQERRIMLLKTEQELDMQKAYLIEA 185

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           N + VD  I+ +  AL+N++ W++L  +++E ++  +P++  I    +E NC  + LS
Sbjct: 186 NRQLVDNIIILINHALSNQIDWKELELIIEEAKQRNDPLSCHI----VELNCKRVRLS 239


>gi|307354208|ref|YP_003895259.1| Fibronectin-binding A domain-containing protein [Methanoplanus
           petrolearius DSM 11571]
 gi|307157441|gb|ADN36821.1| Fibronectin-binding A domain protein [Methanoplanus petrolearius
           DSM 11571]
          Length = 636

 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 162/350 (46%), Gaps = 46/350 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK---IHMDQENRVHTLKQEVDRSV 407
           F +F+ AL  ++   +S        A +DA   KL K   I   Q+  +   ++++    
Sbjct: 252 FSSFNDALSAYFPLPQS--------AAKDAKKEKLPKSEIIRRRQQEAIVNFEKKIAELQ 303

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           +  + I  N +D+   I  +R A ++++SW+++   +K    +  P A  I ++Y   + 
Sbjct: 304 EKVDAIYENYQDISGIIDTLRDA-SSKLSWQEIEETLK---NSSLPAAKSIVRIYPSESA 359

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           + ++                 +KV++ +  +  ANA R+Y   KK + K+   + A  K 
Sbjct: 360 VDVMAGG--------------KKVKIFINENPEANANRYYGEIKKYKKKKAGALVAMEK- 404

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           F   EK       Q K   +    +K  W+ K+ WF++S+  LVI G+DA  NE I K+Y
Sbjct: 405 FMPKEK-------QAKKRQDYKPQKK-KWYHKYRWFVTSDGVLVIGGQDAGSNEDIGKKY 456

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWV 646
           +   D +VHAD+HG S  V+K              +   F   +S AW +       +  
Sbjct: 457 LEGRDYFVHADVHGGSVVVVKGETE-------NWEEVAEFAASYSNAWKAGHFNCDVYAA 509

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
            P QVSKTA +GE++  G+F+IRG++ +     L +  GL    + + +G
Sbjct: 510 KPEQVSKTAESGEFVKRGAFIIRGERRYFRNIGLKVAIGLQLEPELAVIG 559



 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 78/158 (49%), Gaps = 9/158 (5%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M++ D+   +  +R  + +    +Y  +  ++ F+L        +GE + K   L+E G 
Sbjct: 7   MSSIDIRTMLYEIRERLPLWIGKIYQYNTNSFGFRL--------NGEDKSKYNFLVECGR 58

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T    D    PSG+++ LRK+I   R+ D++Q G  RI + + G     + +I EL
Sbjct: 59  RAHLTDNLPDAPQNPSGYSMFLRKYISGGRVLDIKQYGLQRIFIIKIGKTEKEYNLIFEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           + +GN +L D  F V+  L+     D+ +   + + +P
Sbjct: 119 FNEGNAVLCDENFIVINPLKRLHFRDREIVSGTEYIFP 156


>gi|448368844|ref|ZP_21555611.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
 gi|445651387|gb|ELZ04295.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
          Length = 722

 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 75/138 (54%), Gaps = 7/138 (5%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WF++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P + 
Sbjct: 508 WFDRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEA 567

Query: 616 ------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
                 +P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F +
Sbjct: 568 SSSDIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAV 627

Query: 669 RGKKNFLPPHPLIMGFGL 686
           RG + +    P+    G+
Sbjct: 628 RGDRTYYRDTPVGAAVGI 645



 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         KL +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDDLVRLKLRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT    R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVTPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|56753953|gb|AAW25169.1| SJCHGC08981 protein [Schistosoma japonicum]
          Length = 414

 Score =  107 bits (267), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 46/78 (58%), Positives = 57/78 (73%)

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             V  S AW S ++T AWWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG+
Sbjct: 1   MAVVLSSAWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGI 60

Query: 687 LFRLDESSLGSHLNERRV 704
           +F+L E S+  H  ERR+
Sbjct: 61  MFKLHEDSVFKHKGERRI 78



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 98/187 (52%), Gaps = 20/187 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQN-----ENASTHKEK 946
            + RGQK KLKK+K+KY +QDEEER++RM +L      Q +D  P       E   +  + 
Sbjct: 185  LKRGQKSKLKKIKQKYKEQDEEERSLRMRIL------QGDDAKPSQYHQILERDHSLNQV 238

Query: 947  KPAISPVDAPKVC-----YKCKKAGHLSKDCKEHPDDSSHGVEDN-PCVGLDETAEMDKV 1000
            K + S +D   VC        +   + + D  +H  +S  G E++  C  +D      K 
Sbjct: 239  KTSNSILDTQTVCDSDVIRNDQPDNNANLDIDDHFTESDDGSEESLRCSDVDNLKS--KD 296

Query: 1001 AMEEEDIHEIGEEEKGRL-NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPG 1059
              + +D  ++  E K  L + ++ LTG P   D+LLY IPVC PYS +  +K+RVK+ PG
Sbjct: 297  NDDGDDDEDLSSESKDDLISLLNSLTGQPNDDDLLLYAIPVCAPYSVLLKFKFRVKLNPG 356

Query: 1060 TAKKGKG 1066
              K+GK 
Sbjct: 357  NTKRGKA 363


>gi|352682802|ref|YP_004893326.1| putative RNA-binding protein [Thermoproteus tenax Kra 1]
 gi|350275601|emb|CCC82248.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Thermoproteus tenax Kra 1]
          Length = 624

 Score =  106 bits (265), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 113/459 (24%), Positives = 200/459 (43%), Gaps = 81/459 (17%)

Query: 233 ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           A A+   L+  L   LG GP ++E +                +   NA +    A+A  E
Sbjct: 157 ALAEGKDLRRALSRELGLGPEVAEEV--------------YQRSSGNADR----ALAVLE 198

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE 352
           + +++V  G + P  Y+L              +G    +     P+      +    +F+
Sbjct: 199 ELIREVTLGQLRPTLYVL--------------NGVPVTV----TPIRFISINADATEEFD 240

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK---QEVDRSVKM 409
           TF  ALD+++ +IE ++A ++  A   +   KL +     E  +   +   +E+ R  + 
Sbjct: 241 TFWKALDKYFIEIELRKAVEKKTANITSRRQKLEQTIKSLEVEIEEYRRKGEELRRIAQT 300

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
              I+Y LED     L  R+  A  +  E + R++  +RK    V        LE + + 
Sbjct: 301 MMNIKYELED-----LMGRLNTATDVENESI-RIIDVDRKRREAV--------LETSGIK 346

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
            ++   LD        LPV K    +   A        E  +K E  +E      ++  +
Sbjct: 347 FVV--KLD--------LPVGKQISSMFEKAK-------EYLRKAEKAEETLRRLRAELER 389

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             E++  L+   ++ V  ++      WFE++ W  +S    V+ GRDA QNE++VK+Y+ 
Sbjct: 390 LEEQRAELERSIKEGVVRVAER---SWFERYRWTATSRKTPVLGGRDASQNEILVKKYLR 446

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVP-PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVY 647
              ++ HAD+ GAS  + +      P+   L L +   F   +S+AW + +     ++V+
Sbjct: 447 DNYLFFHADIPGASVVITR------PIEDQLELLEVAQFAASYSKAWKAGIHSIDVFYVF 500

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             QVSK  P+GEYL  GSFMI G +N++    L +  G+
Sbjct: 501 GSQVSKQPPSGEYLARGSFMIYGTRNYIRHVRLELAIGV 539



 Score = 43.9 bits (102), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 51/111 (45%), Gaps = 16/111 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   D+ A  + +R LIG R  N+Y  +P  Y+FK    S           L++ E
Sbjct: 1   MKTSLTIVDLYASAREMRNLIGRRVENIYK-TPSGYLFKFAGGS----------YLIIDE 49

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
           +   L      RD +   +     LR  +R  +L+DV    +D+I++ +FG
Sbjct: 50  TRASLTGVLGERDYRGAET-----LRGLLRDEKLDDVTVPRFDKILVLKFG 95


>gi|18313944|ref|NP_560611.1| hypothetical protein PAE3259 [Pyrobaculum aerophilum str. IM2]
 gi|18161516|gb|AAL64793.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
          Length = 614

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 140/598 (23%), Positives = 236/598 (39%), Gaps = 154/598 (25%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRS 145
           LR   R  RL +V    +DRI    FG G     +I+EL    N++    +  V+ LL S
Sbjct: 68  LRGLFRDDRLAEVVMPRFDRIAELVFGSGK----IIVELLEPFNMVAV-RDGKVVWLLHS 122

Query: 146 HRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNA 205
           +R  D+ ++  + + YP                A      + D +E  K  + G+     
Sbjct: 123 YRGKDRVISPGAMYAYPP---------------AVFVDVLKADVDELQKAIDPGD----- 162

Query: 206 SKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLV 265
                                             L+  L   LG GP L++ +I+  G  
Sbjct: 163 ----------------------------------LRRSLIRRLGTGPELADELIVRAGTS 188

Query: 266 PNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTES 325
           P                                    I  E   L++   LGK  P    
Sbjct: 189 PRA----------------------------------IAEEFKALVEKVRLGKIEPTVCV 214

Query: 326 GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKL 385
                I     P+     +  E+ +F  F  ALD +++ +E + A  Q   +      +L
Sbjct: 215 KDGVPI--TVMPIKPLSLKCDEYKQFNAFWEALDFYFAPMELESAAIQTTQELAQRRKRL 272

Query: 386 NKIHMDQENRVHTLKQEVDRSVKMA-ELIEYNLE------DVDAAILAVRVALANRMSWE 438
                + EN++   ++E  +   +A +L+ Y LE       ++ +I  V V  A R+  E
Sbjct: 273 EASIRELENKIPEYREEAAKLKTLAHKLLMYKLEIEEALKGMETSIRVVNVD-ATRIKIE 331

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
            L    + E + G  +   I +L+ E        +  L+E   +   + +EK++ DL+  
Sbjct: 332 -LPEGEQVELRKGVSIGKQISQLFDE--------AKELEEKAQKAAQV-LEKLKKDLS-- 379

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
                    +L ++Q   +EK              K+ ++I  +K+           WFE
Sbjct: 380 ---------KLDEEQRRAEEKL-------------KSSVKIATKKS-----------WFE 406

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF+W +++    VI GRDA QNE++VK+Y+ +  ++ HAD+ GAS+ V     P +   P
Sbjct: 407 KFHWTVTTGRKPVIGGRDASQNEVVVKKYLKEHYLFFHADIPGASAVVAP---PSE--DP 461

Query: 619 LTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           L L Q   F   +S+AW   +     ++V   QV+K  P+G+YL  GSFMI GK+ ++
Sbjct: 462 LELLQIAQFAAAYSKAWKIGIHAVDVYYVKGVQVTKQPPSGQYLARGSFMIYGKREYV 519


>gi|121698891|ref|XP_001267840.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119395982|gb|EAW06414.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 1111

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 84/144 (58%), Gaps = 11/144 (7%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVSLRVSNIYDLSSRIFLFKLAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RR+  V Q+G DR+I F F  G+  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRVTSVEQIGTDRVIDFSFSDGL--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
            LE +A GNI++TD E+ +L L R
Sbjct: 111 FLEFFAGGNIIITDREYNILALFR 134



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 32/48 (66%)

Query: 1021 VDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            +  L G P P D +L  IP+C P++A+  YKYRVK+ PG  KKGK ++
Sbjct: 973  IPALIGTPRPEDEILAAIPICAPWAALGRYKYRVKLQPGAVKKGKAVK 1020


>gi|387219995|gb|AFJ69706.1| hypothetical protein NGATSA_2054800, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 94

 Score =  104 bits (260), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 46/84 (54%), Positives = 67/84 (79%)

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KA KAAE++    + +++    +S +RK +WFEKF+WFI+S+N+LV+SGRDAQQNE++VK
Sbjct: 3   KAVKAAERQAAASLSKQQRKRTLSVVRKPYWFEKFHWFITSDNHLVVSGRDAQQNELLVK 62

Query: 586 RYMSKGDVYVHADLHGASSTVIKN 609
           RY+  GD YVHADL GA+S V+++
Sbjct: 63  RYLRVGDAYVHADLPGAASCVVRH 86


>gi|359415829|ref|ZP_09208221.1| hypothetical protein HRED_04719, partial [Candidatus Haloredivivus
           sp. G17]
 gi|358033813|gb|EHK02326.1| hypothetical protein HRED_04719 [Candidatus Haloredivivus sp. G17]
          Length = 194

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/150 (39%), Positives = 82/150 (54%), Gaps = 3/150 (2%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
           L  + +++DL     A A ++Y+  K+ ESK E    A  K     E      I  E+ +
Sbjct: 47  LEEDSIKIDLHQDLEATASQYYDKAKESESKMENAEKALEKTEDEIESLGEEDIELEEVM 106

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
            + S  R   WFEK+ WF SS+ YLV  GRDAQ NEM+VK++    D+Y+HAD  GA ST
Sbjct: 107 EDKSEKRSKKWFEKYRWFYSSDGYLVCLGRDAQTNEMLVKKHTDSEDLYLHADFDGAPST 166

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
           VIK+    Q  P  TL +A   +V  ++AW
Sbjct: 167 VIKDG---QEAPESTLEEAAKASVSFTKAW 193


>gi|154304166|ref|XP_001552488.1| hypothetical protein BC1G_08353 [Botryotinia fuckeliana B05.10]
          Length = 288

 Score =  103 bits (257), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 81/144 (56%), Gaps = 11/144 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS K ++ K         +    K  +L+
Sbjct: 1   MKQRFSSIDVKVIAHELSNALVTLRVSNVYDLSSKIFLIKF--------AKPDNKQQILI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  +LRK ++TRR+  V Q+G DRII FQF  G    Y 
Sbjct: 53  DSGFRCHLTDFSRATAAAPSVFVQRLRKFLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY- 111

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
            LE YA GNI+LTD E  +LTLLR
Sbjct: 112 -LEFYAGGNIILTDKELNILTLLR 134


>gi|119872023|ref|YP_930030.1| hypothetical protein Pisl_0509 [Pyrobaculum islandicum DSM 4184]
 gi|119673431|gb|ABL87687.1| protein of unknown function DUF814 [Pyrobaculum islandicum DSM
           4184]
          Length = 613

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 80/136 (58%), Gaps = 6/136 (4%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +EK  +++  + K  WFEKF W I++    +I GRDA QNE IV++Y+ +  ++ HAD+ 
Sbjct: 388 EEKVKSSVKIVVKRAWFEKFRWSITTGKRPIIGGRDASQNETIVRKYLREHYLFFHADIP 447

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
           GAS  V+    P +   PL L Q   F   +S+AW   +     ++V   QVSK AP G+
Sbjct: 448 GASVVVMP---PSE--DPLELLQTAQFAAAYSKAWKIGIHSIDVYYVRGEQVSKHAPAGQ 502

Query: 660 YLTVGSFMIRGKKNFL 675
           YL  GSFMI GK+ ++
Sbjct: 503 YLARGSFMIYGKREYI 518


>gi|327311796|ref|YP_004338693.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
 gi|326948275|gb|AEA13381.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
          Length = 623

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 150/337 (44%), Gaps = 58/337 (17%)

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +++ F  ALD +++ +E ++A +   A+  A   KL +        +   ++  +    +
Sbjct: 238 EYDAFWKALDRYFADVELRKAVELKTAELKAKKAKLEQSIAKLRGEIQEYRKRSEELYSL 297

Query: 410 AEL---IEYNLEDVDAAIL-------AVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           A+    ++Y LE+   AIL       ++R+   NR S E +              +GL  
Sbjct: 298 AKTMLSLKYELEEAMQAILRNEEIGASIRILDVNRTSKEAVLEH-----------SGLRF 346

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           KL L+R          ++E+ +E K       + + AL      R   EL + +  + E 
Sbjct: 347 KLRLDRPV-----GRQIEEVFEEAKDYARRAAKAEEALK-----RLEEELARVESERAEA 396

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
                 +  KAAE+                      WFEKF WF++      I GRDA Q
Sbjct: 397 ERAVAERVRKAAERA---------------------WFEKFRWFLALGRVPAIGGRDASQ 435

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           NE  V+RY+    ++ HAD+ GAS+ V K  + E  +  L L Q   F   +S+AW + +
Sbjct: 436 NEAAVRRYLKDDYLFFHADVPGASAVVAKPTQDEAAL--LELAQ---FAASYSRAWRAGI 490

Query: 640 -VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
                ++V   QVSK  P+GEYL  GSFMI G KN++
Sbjct: 491 HAVDVFYVPGRQVSKQPPSGEYLARGSFMIYGSKNYI 527


>gi|424812621|ref|ZP_18237861.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
 gi|339756843|gb|EGQ40426.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
          Length = 361

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/210 (32%), Positives = 105/210 (50%), Gaps = 8/210 (3%)

Query: 471 LLSNNLDEMDDEEK--TLPVEKVEVDLAL--SAHANARRWYELKKKQESKQEKTITAHSK 526
           L  NNL+ ++ +E+   + ++  EV L+   S  A A R Y+  K++E K E    A   
Sbjct: 74  LEVNNLESVNHQERFYRVAIDGAEVKLSPDESLEAAASRMYDEAKEREQKAENAREALQN 133

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
                E+    +   E+        R   WFEK+ WF + E  LVI GR  Q NE +V  
Sbjct: 134 TQGKLEELEEDEFEVEEESMERDESRSKRWFEKYRWFHTPEGRLVICGRGPQTNESLVNN 193

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWW 645
           ++ + D+Y+HAD  GA S  +K+    Q      + QA    V  S+AW S +     ++
Sbjct: 194 HLERDDLYLHADFDGAPSVALKDG---QNASKDEIRQAAKAAVTFSKAWKSGIGADDVYY 250

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           V P QV+K+  +GEYL  G+F IRG + +L
Sbjct: 251 VGPAQVTKSPESGEYLERGAFAIRGDRTYL 280


>gi|440491782|gb|ELQ74392.1| putative RNA-binding protein [Trachipleistophora hominis]
          Length = 886

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 118/240 (49%), Gaps = 40/240 (16%)

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRK--V 554
           LS   N   +Y   K ++ K+EK I  + ++  A        I+++K V     ++K  +
Sbjct: 576 LSIDKNMNYYYNQMKNKKIKREK-IRNNLESILA-------NIVEKKAVVKPQEIKKRVL 627

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--- 611
            WFEKFN+ I+S  +LV+ G++A QNE++ KR   K  ++ HAD+ G S+  I   R   
Sbjct: 628 FWFEKFNFTITSNGFLVLGGKNASQNEVLNKR---KFLLFFHADIKGGSAVTIDGTRINI 684

Query: 612 ----------PEQPVPPLT-------------LNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                      E  +  +              +  A    + +S  W  ++V+ +++V  
Sbjct: 685 LGRCAKHESSSETSIKRIVASSDNAYGLKEEDITDASQMCMVYSNCWKDRIVSDSYYVNE 744

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
            QVSK+AP+GE+L+ G FM++GKKN++    L     LLF L E +L   +    V G++
Sbjct: 745 DQVSKSAPSGEFLSKGGFMVKGKKNYVHNVRLEYAIALLFAL-EKNLEQQIENMHVGGDK 803



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 65/112 (58%), Gaps = 11/112 (9%)

Query: 29  VYDLSPK---TYIFKLMNSSGVTESGESEKVLLLMESGVRLH-TTAYARDKKNTPSGFTL 84
           V +L PK   TYI  + +S   T    + K + L+E+G+R+H T  Y  D+    S F  
Sbjct: 14  VNELHPKIESTYIQNIYSSGQRTFYLRTNKNIFLIEAGLRIHLTNTYPSDE---ISFFAK 70

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
           +LR ++R +++  VRQ+G+DR ++ Q G       V++E+++ GN+++ + E
Sbjct: 71  RLRTYLRRKKVGGVRQVGFDRAVVVQIG----EFLVVIEMFSAGNLIVLEKE 118


>gi|269865204|ref|XP_002651842.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063777|gb|EED42214.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 323

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 135/304 (44%), Gaps = 46/304 (15%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 59  MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 109

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 110 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 169

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 170 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 207

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 208 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 262

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 263 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 316

Query: 645 WVYP 648
           +V P
Sbjct: 317 YVSP 320


>gi|302761990|ref|XP_002964417.1| hypothetical protein SELMODRAFT_405642 [Selaginella moellendorffii]
 gi|300168146|gb|EFJ34750.1| hypothetical protein SELMODRAFT_405642 [Selaginella moellendorffii]
          Length = 161

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 74/127 (58%), Gaps = 16/127 (12%)

Query: 956  PKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEK 1015
            P +CY CKK+GH++ +C +     S     N                 +E+I ++ EEE+
Sbjct: 4    PVICYNCKKSGHVASECPDSKQTESKIAAIN----------------AKENIVDLDEEER 47

Query: 1016 GRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
             +L ++D LTG PLP+DILLY + VC PYSA+QSYKY VKI PG  KKGKG+++     +
Sbjct: 48   EKLTELDALTGRPLPNDILLYAVLVCRPYSALQSYKYHVKITPGPLKKGKGVKMAMDAFI 107

Query: 1076 LMLSLTP 1082
             +  + P
Sbjct: 108  HLSDVLP 114


>gi|85091915|ref|XP_959135.1| hypothetical protein NCU09191 [Neurospora crassa OR74A]
 gi|28920536|gb|EAA29899.1| conserved hypothetical protein [Neurospora crassa OR74A]
 gi|29150083|emb|CAD79644.1| conserved hypothetical protein [Neurospora crassa]
          Length = 1097

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 80/145 (55%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDTRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GNI+LTD++  +L LLR+
Sbjct: 111 YLEFFASGNIILTDADLKILALLRN 135


>gi|387220185|gb|AFJ69801.1| hypothetical protein NGATSA_2069500, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 75

 Score =  100 bits (249), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 44/60 (73%), Positives = 48/60 (80%)

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           MVTSAWWV   QVSKTAP GE+L  GSFM+RGKKNFL P PL MG GLLF+LDE S+G H
Sbjct: 1   MVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEGSVGRH 60


>gi|429964304|gb|ELA46302.1| hypothetical protein VCUG_02190 [Vavraia culicis 'floridensis']
          Length = 943

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/224 (29%), Positives = 107/224 (47%), Gaps = 38/224 (16%)

Query: 497 LSAHANARRWYELKKKQESKQEKTIT-AHSKAFKAAEKKTRLQILQEKTVANISHMRKVH 555
           LS   N   +Y   K +++K+EK      S     +EKK  ++  + K        R++ 
Sbjct: 587 LSIDKNVNYYYNQMKSKKTKREKIRNNLESILANISEKKATVKQREYKK-------RELF 639

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR---- 611
           WFEKFN+ ++   +LV+ G++A QNE + KR   K  ++ HAD+ G S   +   +    
Sbjct: 640 WFEKFNFTVTQNGFLVLGGKNATQNETLNKR---KFKLFFHADVKGGSVVTVDGTKLNIL 696

Query: 612 ---------------------PEQ--PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                                PE    +    +  A    + +S  W  ++V  +++V  
Sbjct: 697 RRNTGYAESSSVTSIKRLQTNPENVYGLKEEDITDASQMCMVNSNCWKDRIVCDSYYVNE 756

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            QVSK+AP+GE+LT G FM++GKKN++    L    GLLF L++
Sbjct: 757 EQVSKSAPSGEFLTKGGFMVKGKKNYVHNVRLEYAVGLLFALEK 800



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 75/137 (54%), Gaps = 12/137 (8%)

Query: 29  VYDLSPK---TYIFKLMNSSGVTESGESEKVLLLMESGVRLHTT-AYARDKKNTPSGFTL 84
           V +L PK   TYI  + +S   T    + K + L+E+G+R+H T  Y     N  S F  
Sbjct: 14  VNELHPKIESTYIQNIYSSGQRTFYVRTNKNIFLIEAGLRIHLTDTYP---SNEISFFCK 70

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           +LR  +R +++  V+Q+G+DR+++ Q G       V++E++A GN+++ + E +V +   
Sbjct: 71  RLRTCLRRKKIGGVKQVGFDRVVVVQAG----EFLVVVEMFAAGNLIVLEKE-SVASERN 125

Query: 145 SHRDDDKGVAIMSRHRY 161
           S  +D+K    + R  Y
Sbjct: 126 SGEEDEKDRNGLERTEY 142


>gi|351707265|gb|EHB10184.1| Serologically defined colon cancer antigen 1 [Heterocephalus
           glaber]
          Length = 208

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 77/125 (61%), Gaps = 9/125 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELY 125
           I+ELY
Sbjct: 113 IIELY 117



 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 234 RAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
           R  +  LKT+    + YGPAL EH +++ G   N+K+ E  KLE   I+ ++  + K ED
Sbjct: 119 RCYRKILKTISSAFVAYGPALLEHCLIENGFSGNVKVDE--KLESKDIEKVLDCMQKAED 176

Query: 294 WLQDV--ISGDIVP 305
           +++      G + P
Sbjct: 177 YMKTTSNFHGKVTP 190


>gi|379003409|ref|YP_005259081.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
 gi|375158862|gb|AFA38474.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
          Length = 614

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 91/170 (53%), Gaps = 11/170 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL++K   K E+ +    K   A E++ R    +E   A+   + K  WFEKF+W +++ 
Sbjct: 359 ELEEKAR-KAEQVLEKLRKELSALEEQQRRA--EEALKASAKVVAKRSWFEKFHWTVTTG 415

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGC 626
              VI GRDA QNE +V+RY+     + HAD+ GAS+          P+  PL + Q   
Sbjct: 416 RRPVIGGRDASQNEAVVRRYLKDHYFFFHADIPGASAVA------APPMDDPLEILQVAQ 469

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           F   +S+AW   +     ++V   QVSK  P+G+YL  GSFM+ GK+ ++
Sbjct: 470 FAAAYSRAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYV 519


>gi|145591891|ref|YP_001153893.1| hypothetical protein Pars_1690 [Pyrobaculum arsenaticum DSM 13514]
 gi|145283659|gb|ABP51241.1| protein of unknown function DUF814 [Pyrobaculum arsenaticum DSM
           13514]
          Length = 614

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 11/170 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL++K   K E+ +    K   A E++ R    +E   A+   + K  WFEKF+W +++ 
Sbjct: 359 ELEEKAR-KAEQVLEKLRKELSALEEQQRRA--EEALKASAKVVAKRSWFEKFHWTVTTG 415

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGC 626
              VI GRDA QNE +V++Y+     + HAD+ GAS+          P+  PL + Q   
Sbjct: 416 RRPVIGGRDASQNEAVVRKYLKDHYFFFHADIPGASAVA------APPMDDPLEILQVAQ 469

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           F   +S+AW   +     ++V   QVSK  P+G+YL  GSFM+ GK+ ++
Sbjct: 470 FAAAYSRAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYV 519


>gi|374326819|ref|YP_005085019.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
 gi|356642088|gb|AET32767.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
          Length = 621

 Score = 96.7 bits (239), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 73/131 (55%), Gaps = 6/131 (4%)

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
           A+   + K  WFEKF+W +++    VI GRDA QNE +V++Y+    ++ HAD+ GAS+ 
Sbjct: 401 ASARAVAKKSWFEKFHWTVTTGKRPVIGGRDASQNESVVRKYLKDHYLFFHADIPGASAV 460

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVG 664
                       PL L Q   F   +S+AW   +     ++V   QVSK  P+G+YL  G
Sbjct: 461 AAPPME-----DPLELLQVAQFAAAYSKAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKG 515

Query: 665 SFMIRGKKNFL 675
           SFMI GK+ ++
Sbjct: 516 SFMIYGKREYV 526


>gi|78395025|gb|AAI07765.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 458

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 42/62 (67%), Positives = 50/62 (80%)

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E 
Sbjct: 2   VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDED 61

Query: 711 MD 712
           M+
Sbjct: 62  ME 63


>gi|126460385|ref|YP_001056663.1| hypothetical protein Pcal_1780 [Pyrobaculum calidifontis JCM 11548]
 gi|126250106|gb|ABO09197.1| protein of unknown function DUF814 [Pyrobaculum calidifontis JCM
           11548]
          Length = 616

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 78/137 (56%), Gaps = 8/137 (5%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +EK  +++  + +  WFEK++W +++    V+ GRDA QNE IV++Y+    ++ HAD+ 
Sbjct: 391 EEKVKSSVKAVVEREWFEKYHWTVTTGKRPVLGGRDASQNESIVRKYLKDHYLFFHADIP 450

Query: 601 GASSTVIKNHRPEQPVP-PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTG 658
           GAS  +        P+  PL ++Q   F   +S+AW   +     ++    QVSK  P G
Sbjct: 451 GASVVI------APPIEDPLEVHQVAQFAAAYSRAWKIGIHAIDVYYARGEQVSKQPPAG 504

Query: 659 EYLTVGSFMIRGKKNFL 675
           +YL  GSFM+ GK+ ++
Sbjct: 505 QYLARGSFMVYGKREYV 521


>gi|41615287|ref|NP_963785.1| hypothetical protein NEQ506 [Nanoarchaeum equitans Kin4-M]
 gi|40069011|gb|AAR39346.1| NEQ506 [Nanoarchaeum equitans Kin4-M]
          Length = 255

 Score = 93.6 bits (231), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 73/131 (55%), Gaps = 11/131 (8%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--- 612
           WF K+ +  +   +LVI G+DA QNE I+K Y   GD+  HAD+HGA   ++  + P   
Sbjct: 54  WFMKYRFTFTESGFLVIGGKDANQNERIMKVYRKDGDLVFHADIHGAPFALMLLNNPNAD 113

Query: 613 -------EQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVG 664
                  +  +    L QA   +  +S+AW   + +   ++V   Q+SK AP+GEYL  G
Sbjct: 114 SVEEVIEKYKITETDLMQAAGLSAVYSKAWQEGLASIDVFYVLGKQISKKAPSGEYLKHG 173

Query: 665 SFMIRGKKNFL 675
           SFM+ GKK+++
Sbjct: 174 SFMVYGKKHYI 184


>gi|440301762|gb|ELP94148.1| serologically defined colon cancer antigen 1, putative, partial
           [Entamoeba invadens IP1]
          Length = 144

 Score = 92.8 bits (229), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 47/121 (38%), Positives = 75/121 (61%), Gaps = 8/121 (6%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           RL+ M  + VYD++ + Y+ KL        S    K  +++ESGVR+H T Y RDK +TP
Sbjct: 27  RLLDMNVNTVYDINRRLYVIKL--------SKTDLKEFIVIESGVRVHLTQYNRDKSDTP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  +RL  V Q+G DR+I    G     + +I++LY+ GNI LTD+++ +
Sbjct: 79  NNFTSRLRKYLNKKRLLRVNQIGNDRVIEIVIGNATEKYNLIIDLYSNGNICLTDADYKI 138

Query: 140 L 140
           +
Sbjct: 139 V 139


>gi|171186042|ref|YP_001794961.1| hypothetical protein Tneu_1592 [Pyrobaculum neutrophilum V24Sta]
 gi|170935254|gb|ACB40515.1| protein of unknown function DUF814 [Pyrobaculum neutrophilum
           V24Sta]
          Length = 613

 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 73/121 (60%), Gaps = 6/121 (4%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFEKF+W I++    VI GRDA QNE +V++Y+    ++ HAD+ GAS+  +    P + 
Sbjct: 403 WFEKFHWTITTGRRPVIGGRDASQNETVVRKYLKDSYLFFHADIPGASAVAMP---PAE- 458

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
             PL L QA  F   +S+AW   +     ++V   QV+K AP G+YL  GSFMI GK+ +
Sbjct: 459 -DPLELLQAAQFAAAYSKAWKIGIHAVDVYYVRGEQVTKQAPAGQYLARGSFMIYGKREY 517

Query: 675 L 675
           +
Sbjct: 518 V 518


>gi|290559894|gb|EFD93216.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidophilus ARMAN-5]
          Length = 587

 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 83/161 (51%), Gaps = 11/161 (6%)

Query: 542 EKTVANISHMRKV------HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +K   N+  +R++       W+ KF +F +S N L I G+D  QNE +++++  KGD+  
Sbjct: 367 DKIKTNVIKVRRLKVITGNEWYSKFRFFSTSLNKLCIIGKDVNQNESLIQKHAEKGDIVG 426

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKT 654
           HAD+ G+   VIK    E     + L +       +S AW +       ++V P QV+KT
Sbjct: 427 HADVFGSPFGVIKTGNAE--TKEVELEEMATMIASYSSAWRAGATNLDVYFVNPEQVTKT 484

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
            P+GE L  G+F I GK+ ++    L  G  L F + E S+
Sbjct: 485 PPSGESLKKGAFYIEGKRKYIKNSSL--GIYLSFDIREDSV 523


>gi|269986196|gb|EEZ92508.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidiphilum ARMAN-4]
          Length = 587

 Score = 90.1 bits (222), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 96/188 (51%), Gaps = 19/188 (10%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           +++ +D+  + + N    Y+  K+ ++   + ITA +K  +      R+++  E      
Sbjct: 336 QQLNIDITQNLNYNLALMYQKAKRLKNIDTEAITAKTKMIR------RIKVKNEN----- 384

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  W+ KF  FI+SE  LVI G+D  QNE +++++M K D+  HAD+ G+   +IK
Sbjct: 385 ------QWYSKFRHFITSEGNLVIIGKDVNQNESLIEKHMEKEDIVGHADVFGSPFGIIK 438

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFM 667
             +  + +    + +       +S AW         +++ P QV+KT P+GE L  G+F 
Sbjct: 439 -PKEGKSISKKEIEETAIMIASYSSAWRVGATNLDVYFIKPEQVTKTPPSGESLKKGAFY 497

Query: 668 IRGKKNFL 675
           I GK++++
Sbjct: 498 IEGKRDYI 505


>gi|269862884|ref|XP_002651013.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065270|gb|EED43045.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 191

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 6/104 (5%)

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           M   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   ++V 
Sbjct: 1   MEDRDLYFHCDVIGASSVVCKGSADR------IIEDATYFALVYSKAWDEQVIKDVFYVS 54

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
             QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 55  SDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 98


>gi|255514115|gb|EET90378.1| protein of unknown function DUF814 [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 260

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 97/194 (50%), Gaps = 12/194 (6%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           V +D   SA  NA  +Y+  KK   K E   K +T   +   + E +   Q  + KT+  
Sbjct: 3   VSIDFTKSAQENANSYYQNAKKYHKKSEGAAKAMTQMEEKLNSIESEHVQQAAKTKTL-- 60

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
             H++K  W+EKF+WF +S   L I GRDAQQNE++  ++  + D++ HAD+ GAS  ++
Sbjct: 61  --HLQKKEWYEKFHWFFTSHGSLAIGGRDAQQNELLNSKHFDENDLFFHADIFGASVVIL 118

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K            +         +S AW   +V+   + +   Q+SK+   G  L  GSF
Sbjct: 119 KGGAGADKEEKAEVAAF---AASYSSAWKKMLVSVDVYAMRRDQISKSTNKGS-LGQGSF 174

Query: 667 MIRGKKNFLPPHPL 680
           +++G++ +    PL
Sbjct: 175 LMKGEREWYRNTPL 188


>gi|366991987|ref|XP_003675759.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
 gi|342301624|emb|CCC69395.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
          Length = 1020

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 87/146 (59%), Gaps = 13/146 (8%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+   A E+K    L   R +N+Y++S  T  F L  +          K+ +
Sbjct: 1   MKQRISSLDLQILAGELK--NSLESYRLNNIYNVSDSTRQFLLRFNKP------DSKLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R     PSGF +KLRKH++ +RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  IVDCGLRIHLTDFNRPIPPAPSGFVVKLRKHLKGKRLTALRQVQNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR 144
           Y++LE ++ GN++L + + T+L+L R
Sbjct: 111 YLVLEFFSAGNVILLNEDRTILSLQR 136


>gi|302761202|ref|XP_002964023.1| hypothetical protein SELMODRAFT_81700 [Selaginella moellendorffii]
 gi|300167752|gb|EFJ34356.1| hypothetical protein SELMODRAFT_81700 [Selaginella moellendorffii]
          Length = 129

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/66 (59%), Positives = 53/66 (80%), Gaps = 1/66 (1%)

Query: 1004 EEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKK 1063
            EE+I ++GEEE+ +L ++D LTG   P+DILLY +PVCG YSA+Q+YKY VKI PG +KK
Sbjct: 1    EENIVDLGEEEREKLTELDALTGRSFPNDILLYAVPVCG-YSALQNYKYHVKITPGPSKK 59

Query: 1064 GKGIQI 1069
            GKG ++
Sbjct: 60   GKGAKM 65


>gi|74223770|dbj|BAE28715.1| unnamed protein product [Mus musculus]
          Length = 290

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 73/205 (35%), Positives = 106/205 (51%), Gaps = 24/205 (11%)

Query: 865  EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
            E++KER     ++      K    G  + RGQK K+KKMKEKY DQD+E+R + M LLAS
Sbjct: 56   EKDKERESAVHTEAYQNTSKNVAAGQPMKRGQKSKMKKMKEKYKDQDDEDRELIMKLLAS 115

Query: 925  AGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDS---SH 981
            AG    N  +   +      + +P   P   P+        G    D  + P      +H
Sbjct: 116  AGS---NKEEKGKKGKKGKPKDEPVKKPPQKPR-------GGQRVLDVVKEPPSLQVLAH 165

Query: 982  GVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVC 1041
             ++D   + +D+  + DK   EE D+ + G EE    N  D LTG P P D+L++ IP+C
Sbjct: 166  DLQD---LAVDDPHD-DK---EEHDLDQQGNEE----NLFDSLTGQPHPEDVLMFAIPIC 214

Query: 1042 GPYSAVQSYKYRVKIIPGTAKKGKG 1066
             PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 215  APYTIMTNYKYKVKLTPGVQKKGKA 239


>gi|31455252|gb|AAH53488.2| Sdccag1 protein [Mus musculus]
          Length = 208

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 66/175 (37%), Positives = 92/175 (52%), Gaps = 18/175 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQD+E+R + M LLASAG    N  +   +      + +P   
Sbjct: 1    MKRGQKSKMKKMKEKYRDQDDEDRELIMKLLASAGS---NKEEKGKKGKKGKPKDEPVKK 57

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
            P   P+        G    D  + P        D   + +D+  + DK   EE D+ + G
Sbjct: 58   PPQKPR-------GGQRVLDVVKEPPSLQVLAHDLQDLAVDDPHD-DK---EEHDLDQQG 106

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             EE    N  D LTG P P D+L++ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 107  NEE----NLFDSLTGQPHPEDVLMFAIPICAPYTIMTNYKYKVKLTPGVQKKGKA 157


>gi|347828081|emb|CCD43778.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 430

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 51/77 (66%), Gaps = 7/77 (9%)

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE 701
           SAWWV   QVSK+APTGE+L  GSF   GKKNFLPP  L++GFG+LF++ + S   H N+
Sbjct: 2   SAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKARH-NK 60

Query: 702 RRVRGEEEGMDDFEDSG 718
            R++      DD   SG
Sbjct: 61  HRLQ------DDSPSSG 71



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 76/187 (40%), Gaps = 45/187 (24%)

Query: 907  YGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENAST----HKEKKPAISPVDAPKVCYK 961
            Y DQDEE+R     ++ A+AG+ +                  KE+               
Sbjct: 215  YKDQDEEDRIAAQEIIGAAAGQEKAEAEAKAKAAREAELAFQKER--------------- 259

Query: 962  CKKAGH--LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEE-DIHEIGEEEKGRL 1018
             ++A H    K+  EH                    EM K+ +E+  D HE  E E   +
Sbjct: 260  -RRAQHQRTQKETAEH-------------------EEMRKLMLEDGIDTHEDNEIE--TM 297

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLML 1078
              +D   G PLP D +L  IPVC P++A+  YKY+ KI PG  KKGK ++      +   
Sbjct: 298  TSLDSFVGLPLPGDEILEAIPVCAPWAAMGKYKYKAKIQPGAQKKGKAVREILGKWMAAS 357

Query: 1079 SLTPVFD 1085
            +   V D
Sbjct: 358  TAKGVLD 364


>gi|26328217|dbj|BAC27849.1| unnamed protein product [Mus musculus]
          Length = 346

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 72/202 (35%), Positives = 102/202 (50%), Gaps = 18/202 (8%)

Query: 865  EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
            E++KER     ++      K    G  + RGQK K+KKMKEKY DQD+E+R + M LLAS
Sbjct: 112  EKDKERESAVHTEAYQNTSKNVAAGQPMKRGQKSKMKKMKEKYKDQDDEDRELIMKLLAS 171

Query: 925  AGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVE 984
            AG    N  +   +      + +P   P   P+        G    D  + P        
Sbjct: 172  AGS---NKEEKGKKGKKGKPKDEPVKKPPQKPR-------GGQRVLDVVKEPPSLQVLAH 221

Query: 985  DNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPY 1044
            D   + +D+  + DK   EE D+ + G EE    N  D LTG P P D+L++ IP+C PY
Sbjct: 222  DLQDLAVDDPHD-DK---EEHDLDQQGNEE----NLFDSLTGQPHPEDVLMFAIPICAPY 273

Query: 1045 SAVQSYKYRVKIIPGTAKKGKG 1066
            + + +YKY+VK+ PG  KKGK 
Sbjct: 274  TIMTNYKYKVKLTPGVQKKGKA 295


>gi|302854249|ref|XP_002958634.1| hypothetical protein VOLCADRAFT_48102 [Volvox carteri f. nagariensis]
 gi|300256023|gb|EFJ40300.1| hypothetical protein VOLCADRAFT_48102 [Volvox carteri f. nagariensis]
          Length = 115

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/65 (53%), Positives = 48/65 (73%)

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
            + +E+  RL+ +D LTG P P D+LL+ +PVCGPY+A+QSYKY+VK+ PGT KKGK  + 
Sbjct: 1    LADEDAARLSVLDSLTGIPRPEDVLLFAVPVCGPYNAIQSYKYKVKVTPGTVKKGKAARQ 60

Query: 1070 FYSLL 1074
               LL
Sbjct: 61   ALELL 65


>gi|60422786|gb|AAH89999.1| Sdccag1 protein, partial [Rattus norvegicus]
          Length = 419

 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 107/203 (52%), Gaps = 20/203 (9%)

Query: 865  EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAS 924
            E++KE+     S+ +    K    G  + RGQK K+KKMKEKY DQD+E+R + M LLAS
Sbjct: 185  EKDKEKESAVHSEADQNTSKNVAAGQPMKRGQKSKMKKMKEKYKDQDDEDRELIMKLLAS 244

Query: 925  AGKVQKNDGDPQNENASTHKE-KKPAISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
            AG  ++  G    +  +  +  KK    P    +V    K+   L          S+  +
Sbjct: 245  AGSNKEEKGKKGKKGKTKDEPVKKNPQKPRGGQRVLDVVKETPSLQA--------STPDL 296

Query: 984  EDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGP 1043
            +D     +DE  + DK   EE D+ + G EE    N  D LTG P P D+L++ IP+C P
Sbjct: 297  QD---FAVDEPHD-DK---EEHDLDQQGNEE----NLFDSLTGQPHPEDVLMFAIPICAP 345

Query: 1044 YSAVQSYKYRVKIIPGTAKKGKG 1066
            Y+ + +YKY+VK+ PG  KKGK 
Sbjct: 346  YTIMTNYKYKVKLTPGVQKKGKA 368


>gi|12855522|dbj|BAB30366.1| unnamed protein product [Mus musculus]
          Length = 208

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 66/175 (37%), Positives = 92/175 (52%), Gaps = 18/175 (10%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQD+E+R + M LLASAG    N  +   +      + +P   
Sbjct: 1    MKRGQKSKMKKMKEKYKDQDDEDRELIMKLLASAGS---NKEEKGKKGKKGKPKDEPVKK 57

Query: 952  PVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
            P   P+        G    D  + P        D   + +D+  + DK   EE D+ + G
Sbjct: 58   PPQKPR-------GGQRVLDVVKEPPSLQVLAHDLQDLAVDDPHD-DK---EEHDLDQQG 106

Query: 1012 EEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             EE    N  D LTG P P D+L++ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 107  NEE----NLFDSLTGQPHPEDVLMFAIPICAPYTIMTNYKYKVKLTPGVQKKGKA 157


>gi|119586151|gb|EAW65747.1| serologically defined colon cancer antigen 1, isoform CRA_g [Homo
            sapiens]
          Length = 356

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 71/178 (39%), Positives = 97/178 (54%), Gaps = 23/178 (12%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG          N+     K KK    
Sbjct: 148  MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGS---------NKEEKGKKGKKGKTK 198

Query: 952  PVDAPKVCYKCKKAGHLSKDCK-EHP--DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
                 K   K +    +S + K E P  +  +H ++D     +D+  + DK   EE+D+ 
Sbjct: 199  DEPVKKQPQKPRGGQRVSDNIKKETPFLEVITHELQD---FAVDDPHD-DK---EEQDLD 251

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            + G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 252  QQGNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 305


>gi|302509578|ref|XP_003016749.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
 gi|291180319|gb|EFE36104.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
          Length = 1073

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 65/110 (59%), Gaps = 10/110 (9%)

Query: 35  KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
           +T++FKL        +    K  L++ +G   H T  +R   + PS F  +LRK ++TRR
Sbjct: 12  RTFLFKL--------ALPDIKKQLIINAGFHCHLTESSRTTADAPSHFVSRLRKLLKTRR 63

Query: 95  LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           +  VRQ+G DRII F+   G+   Y  LE +A GN++LTD+++ ++ LLR
Sbjct: 64  ITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAGNLILTDAKYGIVALLR 111



 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/179 (30%), Positives = 80/179 (44%), Gaps = 39/179 (21%)

Query: 894  RGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPV 953
            RG++GK KK+  KY DQDEE+R + + LL SA               ST   K    + +
Sbjct: 818  RGKRGKAKKLATKYKDQDEEDRKLALRLLGSAA------------GPSTPTTKPKTKADI 865

Query: 954  DAPKVCYK-CKKAGH---LSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
            +A +   K  ++A H   L    ++    + + VED                        
Sbjct: 866  EAEREAQKERRRAQHERALQAVKRQQEAFTRNSVEDAS---------------------- 903

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             GEE K   + +  L G P+  D +   IPVC P++A+  YKYR K+ PG  KKGK ++
Sbjct: 904  -GEEHKLDFSILPALVGTPVDGDEIEAAIPVCAPWAALGQYKYRAKLQPGKIKKGKAVK 961


>gi|3170174|gb|AAC18036.1| antigen NY-CO-1 [Homo sapiens]
          Length = 362

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/178 (39%), Positives = 97/178 (54%), Gaps = 23/178 (12%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG          N+     K KK    
Sbjct: 148  MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGS---------NKEEKGKKGKKGKTK 198

Query: 952  PVDAPKVCYKCKKAGHLSKDCK-EHP--DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
                 K   K +    +S + K E P  +  +H ++D     +D+  + DK   EE+D+ 
Sbjct: 199  DEPVKKQPQKPRGGQRVSDNIKKETPFLEVITHELQD---FAVDDPHD-DK---EEQDLD 251

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            + G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 252  QQGNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 305


>gi|34189862|gb|AAH20794.2| SDCCAG1 protein [Homo sapiens]
          Length = 397

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 71/178 (39%), Positives = 97/178 (54%), Gaps = 23/178 (12%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG          N+     K KK    
Sbjct: 189  MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGS---------NKEEKGKKGKKGKTK 239

Query: 952  PVDAPKVCYKCKKAGHLSKDCK-EHP--DDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
                 K   K +    +S + K E P  +  +H ++D     +D+  + DK   EE+D+ 
Sbjct: 240  DEPVKKQPQKPRGGQRVSDNIKKETPFLEVITHELQD---FAVDDPHD-DK---EEQDLD 292

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            + G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 293  QQGNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 346


>gi|34364931|emb|CAE45886.1| hypothetical protein [Homo sapiens]
          Length = 276

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 96/179 (53%), Gaps = 25/179 (13%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            + RGQK K+KKMKEKY DQDEE+R + M LL SAG    N  +   +      + +P   
Sbjct: 68   MKRGQKSKMKKMKEKYKDQDEEDRELIMKLLGSAGS---NKEEKGKKGKKGKTKDEPVKK 124

Query: 952  PVDAPKVCYKCKKAGHLSKDC--KEHP--DDSSHGVEDNPCVGLDETAEMDKVAMEEEDI 1007
                P+        G    D   KE P  +  +H ++D     +D+  + DK   EE+D+
Sbjct: 125  QPQKPR-------GGQRVSDNIKKETPFLEVITHELQD---FAVDDPHD-DK---EEQDL 170

Query: 1008 HEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
             + G EE    N  D LTG P P D+LL+ IP+C PY+ + +YKY+VK+ PG  KKGK 
Sbjct: 171  DQQGNEE----NLFDSLTGQPHPEDVLLFAIPICAPYTTMTNYKYKVKLTPGVQKKGKA 225


>gi|269862032|ref|XP_002650678.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065783|gb|EED43376.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 166

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 49/68 (72%)

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           A  F + +S+AWD +++   ++V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G
Sbjct: 6   ATYFALVYSKAWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYG 65

Query: 684 FGLLFRLD 691
            G++FR++
Sbjct: 66  VGVVFRIN 73


>gi|374850433|dbj|BAL53422.1| hypothetical conserved protein [uncultured crenarchaeote]
          Length = 530

 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 4/116 (3%)

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F  FI+S  +  + GRDA+ N M++KR++ + D+ +H ++ G+ + V+ N          
Sbjct: 332 FREFITSGGFRALLGRDARSNIMLLKRHLGENDLVLHTEIPGSPAAVLINGVKASET--- 388

Query: 620 TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            + +      C+S+AW       S + V   QVS T P+G+YL  GSFM+ G K F
Sbjct: 389 DVEECAQMVGCYSRAWRENFSNVSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKF 444


>gi|322712137|gb|EFZ03710.1| DUF814 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 959

 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 83/180 (46%), Gaps = 31/180 (17%)

Query: 890  GKISRGQKGKLKKMKEKYGDQDEEERNIRMALL-ASAGKVQKNDGDPQNENASTHKEKKP 948
            G   RGQ+GK KK+  KY DQDEE+R     L+ A+ G         Q    +  K +  
Sbjct: 727  GPPKRGQRGKAKKVALKYKDQDEEDRAAAEVLIGATVG---------QKRQEAEAKARAD 777

Query: 949  AISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIH 1008
              + +DA +   + +      K+  EH                    E+ +V M +E I 
Sbjct: 778  RQAELDAARERRRAQHQ-RQQKEVAEH-------------------EEIRRVMM-DEGIE 816

Query: 1009 EIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
             +  +E  +   +D L G PLP D +L  IPVC P++A+  +KY+ K+ PG  KKGK  +
Sbjct: 817  VLDADEAEKATPLDALVGTPLPGDEILEAIPVCAPWNALGKFKYKAKLQPGAVKKGKATK 876


>gi|315427275|dbj|BAJ48887.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343485854|dbj|BAJ51508.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 628

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 4/116 (3%)

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F  F++S  +  + GRDA+ N M++KR++ + D+ +H ++ G+ + V+ N          
Sbjct: 430 FREFVTSGGFRALLGRDARSNIMLLKRHLGENDLVLHTEIPGSPAAVLINGVKASET--- 486

Query: 620 TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            + +      C+S+AW       S + V   QVS T P+G+YL  GSFM+ G K F
Sbjct: 487 DVQECAQMVGCYSRAWRENFSNVSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKF 542



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 67/128 (52%), Gaps = 12/128 (9%)

Query: 6   MNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           +NT ++   V +C  R++     NVY    +  + K+   S    SGE     L + +G 
Sbjct: 4   LNTYEIGVLVAECRDRVLDSYVRNVYGFGSRAILLKVWKPS--IGSGE-----LWLTAGY 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
            +     + +K++TPS   L+LR+ +  +R+ D++Q+G +R++     LG++   +++E 
Sbjct: 57  SVFYIDQSVEKESTPSTHVLQLRRKVVGKRITDIKQVGGERLVT----LGLDGFELVVEC 112

Query: 125 YAQGNILL 132
              GNI+L
Sbjct: 113 MPPGNIVL 120


>gi|402470262|gb|EJW04606.1| hypothetical protein EDEG_01190 [Edhazardia aedis USNM 41457]
          Length = 393

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 46/72 (63%)

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           L++ +     +C S+ W  K+  + ++V   QVSK A +GEYL  GSFMIRGKKN++  +
Sbjct: 131 LSIEETASMALCLSKFWKEKVTGNVYYVKSDQVSKKAQSGEYLKAGSFMIRGKKNYVDVY 190

Query: 679 PLIMGFGLLFRL 690
            L  G G++F++
Sbjct: 191 RLEYGIGIVFKI 202



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 34/40 (85%)

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
           LVI+GR AQ+N+++VK+++S  D++ HAD+ GA++ ++KN
Sbjct: 2   LVIAGRSAQENDLLVKKHLSNDDLFFHADVAGAATVILKN 41


>gi|402470263|gb|EJW04607.1| hypothetical protein EDEG_01191 [Edhazardia aedis USNM 41457]
          Length = 499

 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 21/148 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L+ +       NVY ++ KTY+FKL           S K  +L+
Sbjct: 1   MKQRFTFLDIRAVVNELQTIPTNTYIQNVYSINNKTYVFKL-----------SSKHFILV 49

Query: 61  ESGVRLHTTAYARDKKNTPSG----FTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           E GVRLH  + + D  N  SG    F  K+R+ ++ ++L  ++Q+G+DRI++F+    ++
Sbjct: 50  EIGVRLHLISQS-DFDNLNSGELTFFCTKIRQLLKRQQLAQIKQVGFDRIVVFE----LS 104

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLR 144
              +  E +A GN+++ D ++ V  + R
Sbjct: 105 NVCIYFEFFAAGNLVICDKDYVVKLVYR 132


>gi|47230000|emb|CAG10414.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 393

 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 36/46 (78%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQI 1069
            LTG P P D+LL+ +PVC PY+A+ SYK++VK+ PG+ KKGK  ++
Sbjct: 241  LTGQPHPEDVLLFAVPVCAPYTALSSYKHKVKVTPGSQKKGKAARV 286


>gi|224108806|ref|XP_002314974.1| predicted protein [Populus trichocarpa]
 gi|222864014|gb|EEF01145.1| predicted protein [Populus trichocarpa]
          Length = 104

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 34/57 (59%), Positives = 39/57 (68%), Gaps = 4/57 (7%)

Query: 1025 TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLLLMLSLT 1081
            TGNPLP+DILLY +PVC    AVQSYKY VK+IPGT KKGK  +   +L   M   T
Sbjct: 4    TGNPLPTDILLYAVPVC----AVQSYKYHVKVIPGTVKKGKAAKTATNLFSHMPEAT 56


>gi|313242815|emb|CBY39580.1| unnamed protein product [Oikopleura dioica]
          Length = 96

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 57/104 (54%), Gaps = 9/104 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A +  +R  L+     N+YD+  KTY+ KL   +         K +LL 
Sbjct: 1   MKTRFTVLDIKAALAEIRDNLLHHYVLNIYDIDSKTYLLKLRKCAS--------KHVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
           ESG R+H T     K   PSGF++KLRKH++ +RL +  QLG+D
Sbjct: 53  ESGNRVHPTEMEWPKNTAPSGFSMKLRKHLKGKRLINATQLGFD 96


>gi|344304197|gb|EGW34446.1| hypothetical protein SPAPADRAFT_70556 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 865

 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 85/178 (47%), Gaps = 36/178 (20%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAIS 951
            +SRG++ KLKK+  KY DQDEEER +RM  L +  ++++ +           + K+  + 
Sbjct: 656  LSRGKRSKLKKIAAKYADQDEEERRLRMDALGTLKQIEQKE----------QQTKREVLE 705

Query: 952  PVDAPKVCYKCKKAGHLSK--DCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHE 1009
             ++A K   + +      K  D KE+    S+ V+       DE+   + + +       
Sbjct: 706  KMEATKRMQEMQAVRERRKKQDEKEYQKYLSNEVDS------DESHVTNYLEI------- 752

Query: 1010 IGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGI 1067
                       +D     P   D ++ ++PV  P+ ++Q +KY+VKI PG+ KKGK I
Sbjct: 753  -----------LDSFAPKPSTKDEIISMVPVFAPWISLQKFKYKVKIQPGSGKKGKCI 799



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/172 (23%), Positives = 84/172 (48%), Gaps = 11/172 (6%)

Query: 280 AIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPL 338
            +Q +  A+   ED    ++       G+I+       K +  +++ SS + IYDEF P 
Sbjct: 106 GLQSVANALGACEDAYLSLVDSKNENTGFIV------AKRNKASDTNSSFEFIYDEFHPF 159

Query: 339 L---LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
                NQ    ++ +   ++  LD F+S +ES + E + +  +  A  +L+K   +++ +
Sbjct: 160 KPYKANQ-EDYQYTEVSGYNKTLDRFFSTLESSKFELKVEQLKQTAAKRLDKAKSERDKQ 218

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
           + +L ++ D + K  ELI+Y+ + VD     ++  L   M W ++  +++ E
Sbjct: 219 IQSLLEQQDLNAKKGELIQYHADLVDDCRAYIQSFLDQSMDWTNIETVLELE 270


>gi|70913606|ref|XP_731580.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56501553|emb|CAH83949.1| hypothetical protein PC300777.00.0 [Plasmodium chabaudi chabaudi]
          Length = 56

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 42/56 (75%)

Query: 73  RDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           R+K   PSGFT+KLRKH+R+R++ ++ QLG DR++  QFG   N +++I+ELY  G
Sbjct: 1   REKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVVDIQFGYDDNVYHLIVELYIAG 56


>gi|70918391|ref|XP_733179.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56504739|emb|CAH85243.1| hypothetical protein PC301461.00.0 [Plasmodium chabaudi chabaudi]
          Length = 169

 Score = 66.6 bits (161), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 73/133 (54%), Gaps = 7/133 (5%)

Query: 330 QIYDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDAAF 382
           +++ EF P+LL    ++      E +KF  F+  +D ++SK+E  + ++ Q   K   A 
Sbjct: 16  RLFVEFIPILLKNHINKIDEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 75

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++KI +D E R+  L++EV+   K   LI+ N E V  AI  +R A++   +WE +  
Sbjct: 76  TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 135

Query: 443 MVKEERKAGNPVA 455
            VK  +K  +PVA
Sbjct: 136 HVKLFKKRNHPVA 148


>gi|240978880|ref|XP_002403059.1| Sdccag1 protein, putative [Ixodes scapularis]
 gi|215491283|gb|EEC00924.1| Sdccag1 protein, putative [Ixodes scapularis]
          Length = 130

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 38/52 (73%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            LTG P+P D LL+ +PVC PY A+Q++K++VK+ PGT ++GK  +   ++ +
Sbjct: 39   LTGCPVPEDGLLFAVPVCAPYGAMQNFKHKVKVTPGTGRRGKAAKTALTVFM 90


>gi|301617503|ref|XP_002938179.1| PREDICTED: serologically defined colon cancer antigen 1-like [Xenopus
            (Silurana) tropicalis]
          Length = 104

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/50 (52%), Positives = 37/50 (74%)

Query: 1019 NDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            N +D LTG P   D+LL+ +PVC PY+++ +YKY+VK+ PGT KKGK  +
Sbjct: 6    NLLDSLTGQPHGEDVLLFSVPVCAPYTSMTNYKYKVKLTPGTHKKGKAAK 55


>gi|308512689|gb|ADO32998.1| caliban [Biston betularia]
          Length = 186

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 37/51 (72%), Gaps = 4/51 (7%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK----GIQIF 1070
            LTG+P   D LL+ +PV  PYSA+ +YKY+VK+ PGT+K+GK     +Q+F
Sbjct: 94   LTGSPFAEDELLFAVPVVAPYSALHNYKYKVKLTPGTSKRGKAAKTAVQVF 144


>gi|21227915|ref|NP_633837.1| hypothetical protein MM_1813 [Methanosarcina mazei Go1]
 gi|20906335|gb|AAM31509.1| conserved protein [Methanosarcina mazei Go1]
          Length = 407

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 11/139 (7%)

Query: 6   MNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      L++E
Sbjct: 5   MSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN---LVIE 57

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R     P  F + LRK++   R+  V Q  +DRI+            +I
Sbjct: 58  AGKRLHMTKHIRPSPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIVKIGIERAGVRSTLI 117

Query: 122 LELYAQGNILLTDSEFTVL 140
           +EL+A+GN+L+ DSE  ++
Sbjct: 118 VELFARGNVLIVDSENKII 136



 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/135 (24%), Positives = 65/135 (48%), Gaps = 2/135 (1%)

Query: 316 LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK-IESQRAEQQH 374
           L   H   E     + +D   P  LN++   E   F++F+ ALDEF+ K    Q AE + 
Sbjct: 269 LRPQHIKQEINGKMETFD-VVPFDLNRYSEYEKEYFDSFNTALDEFFGKKALEQVAEVKE 327

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
             K++       +  M QE  +   ++E++++  +AE +  N + ++     +  A A  
Sbjct: 328 AEKKEKTLGVFERRLMQQEESLAKFEKEIEKNNALAETVYANYQIIEELFSVLNGARAKG 387

Query: 435 MSWEDLARMVKEERK 449
            SW+++  ++K+ +K
Sbjct: 388 YSWDEIRSILKQAKK 402


>gi|83033026|ref|XP_729297.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23486664|gb|EAA20862.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 161

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 28/61 (45%), Positives = 45/61 (73%), Gaps = 1/61 (1%)

Query: 1006 DIHEIGEEE-KGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
            +  EI E+E K +L++++ LT +P   D ++  IP+C PYSA+Q +KY+VK++PG AKKG
Sbjct: 53   NFEEINEDEMKMKLSELNKLTFSPKEEDDIICAIPMCAPYSAIQGHKYKVKLVPGNAKKG 112

Query: 1065 K 1065
            +
Sbjct: 113  Q 113


>gi|430813961|emb|CCJ28738.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 441

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/50 (52%), Positives = 35/50 (70%)

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           EY TVG+FMI+GKKNFLPP  LI+G+G+L+ +DE S    L  +  +  E
Sbjct: 3   EYSTVGTFMIQGKKNFLPPSQLILGYGILWTIDEVSKARRLENKLSKNNE 52


>gi|302419579|ref|XP_003007620.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261353271|gb|EEY15699.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 224

 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 46/84 (54%)

Query: 1002 MEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTA 1061
            M EE +  +  +E  ++  +D L G PL  D ++  IPVC P++A+  +KY+VK  PG  
Sbjct: 71   MHEEGVELLEADEAEKVTALDGLVGTPLVGDEIVEAIPVCAPWNALGRFKYKVKFQPGPV 130

Query: 1062 KKGKGIQIFYSLLLLMLSLTPVFD 1085
            KKGK ++       L+ +   V D
Sbjct: 131  KKGKAVKEVLERWKLVATKKGVVD 154


>gi|435853658|ref|YP_007314977.1| putative RNA-binding protein, snRNP like protein [Halobacteroides
           halobius DSM 5150]
 gi|433670069|gb|AGB40884.1| putative RNA-binding protein, snRNP like protein [Halobacteroides
           halobius DSM 5150]
          Length = 584

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 131/657 (19%), Positives = 250/657 (38%), Gaps = 155/657 (23%)

Query: 12  AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTA 70
           A + +   +LIG R   +Y   PK  +  +     + + GE+ K+L+       R+H T 
Sbjct: 10  AIKTELQNKLIGGRVDKIY--QPKENLLTIR----IRQPGENIKLLISANPQNPRIHITE 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI--ILFQF----GLGMNAHYVILEL 124
              D    P  F + LRKH+++ R++++ Q  ++RI  I+ Q+    G  ++   VI  +
Sbjct: 64  QDFDNPYQPPTFCMLLRKHLQSGRIKEINQPNFERILEIIIQYKNNQGELVDKKLVIELM 123

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
               NI+LT  +  +L  ++      +    +SR+R               +L      +
Sbjct: 124 GRHSNIILTKPDEQILDCIK------RVTKKISRYR---------------ELLPGKDYN 162

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
             P   + + +  D N       +NL   K                          + ++
Sbjct: 163 PPPQQGKKNPLTADFNQFKEVLSDNLNKDK------------------------MYRIIM 198

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
               G GP + + I+   G  P  +L +  ++++          + F D    +      
Sbjct: 199 NNYRGIGPLIGQEIVHRAGFNPQQELIKPKEIDN--------LWSAFNDIFNKI------ 244

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
                              E  + T + D+      N  +  E  K + FD   + F S 
Sbjct: 245 -----------------KNEKFNPTLVLDK-----ENNLKEYEAFKLKQFDLPQESFTS- 281

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
                        +   ++  N+I   + NR+         + KM  +I  N+E++    
Sbjct: 282 -----------VNQLLDYYFTNRIIQKKVNRL---------TNKMNNIIRDNIENIKKKY 321

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
             VR  L         A+   + +  G  +   I +L   +N ++L    N    +++E 
Sbjct: 322 SKVRGQLKG-------AKNADKHQLKGELITANIYQLEKGQNKVTLQNYYN----NNQEV 370

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA---EKKTRLQILQ 541
           T+     E+D  L+   NA+R++E K ++  K  K +   +K  KA     ++  + I Q
Sbjct: 371 TI-----ELDPELTPAENAQRYFE-KYEKAKKSVKYLRREAKKAKAEFEYLQQVEVNINQ 424

Query: 542 EKTVANISHMRKVHWFEKFN-----------------WFISSENYLVISGRDAQQNEMIV 584
            +T+A +  + K    E +                   F S+  Y ++ GR+ +QN+ + 
Sbjct: 425 SETLAELQEIEKELVQEGYIKEQKQNNNKQNDKLPPLKFASTAGYDILVGRNNRQNDGLT 484

Query: 585 KRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           K+  +  D +VH  DL G S T+I+NH  ++ +P  TL +A      +S+   S  V
Sbjct: 485 KKIANNQDTWVHVKDLPG-SHTIIRNHTGKK-IPEETLLEAAQIAAFYSKGRKSSNV 539


>gi|115443352|ref|XP_001218483.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114188352|gb|EAU30052.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 858

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 32/51 (62%)

Query: 1018 LNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQ 1068
            L  +  L G P P D +L  IPVC P+ ++  YKYRVK+ PG  KKGK ++
Sbjct: 717  LEWIPALVGTPHPDDEILAAIPVCAPWGSLGRYKYRVKLQPGAVKKGKAVK 767


>gi|269863395|ref|XP_002651206.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064951|gb|EED42851.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 150

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|412992730|emb|CCO18710.1| predicted protein [Bathycoccus prasinos]
          Length = 1191

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 35/52 (67%)

Query: 1024 LTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKGIQIFYSLLL 1075
            LT  P   D + + +PVC P+  + SYK+R+K+IPGT K+GK ++   ++LL
Sbjct: 1077 LTAQPFELDGVSFCLPVCAPFQVLASYKFRIKLIPGTQKRGKTVKDCANILL 1128


>gi|269863464|ref|XP_002651232.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064908|gb|EED42825.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 164

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269863970|ref|XP_002651409.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064579|gb|EED42648.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 185

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269865201|ref|XP_002651841.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063780|gb|EED42216.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 142

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 34/43 (79%)

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
            QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 7   RQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 49


>gi|410667776|ref|YP_006920147.1| fibronectin-binding A domain-containing protein [Thermacetogenium
           phaeum DSM 12270]
 gi|409105523|gb|AFV11648.1| fibronectin-binding A domain-containing protein [Thermacetogenium
           phaeum DSM 12270]
          Length = 587

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 96/422 (22%), Positives = 166/422 (39%), Gaps = 107/422 (25%)

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
           G G +++  ++   GL P ++L    + E +A+         F+  +  ++ G+  PE  
Sbjct: 204 GIGRSMAREVVYRAGLDPELRLEFCGEYELHAL------FQSFQKTVIPLLRGN-KPEPV 256

Query: 309 ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS-KIES 367
           I+ Q               +T +  ++ PL L  +R  + +  ET +  LD +Y+ K ES
Sbjct: 257 IIFQG--------------TTAV--DYAPLPLTHYRGLKSIPCETVNEMLDRYYAAKAES 300

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
            R +Q              K H++       ++Q +DR  K   L E   +D   A  A+
Sbjct: 301 NRLKQ-------------IKTHLET-----VIRQNMDRCSKKLTLQE---KDEAEAREAL 339

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP 487
           ++ L   M +  L  +    R+   P                     NL + D      P
Sbjct: 340 KLRLLGEMIFAHLHLIRPGSREVELP---------------------NLYQPDA-----P 373

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--------KTRLQI 539
             K+E+D +LSA  NA+R +    ++  K   TI A  K  K+ ++        KT L+ 
Sbjct: 374 SLKIELDPSLSAVQNAQRLF----RRYDKARDTIKALEKQIKSTKEEIQYLNSIKTALE- 428

Query: 540 LQEKTVANISHM-----------------RKVHWFEK----FNWFISSENYLVISGRDAQ 578
            Q + +A+   +                 R+    +K       F S + Y ++ G++ Q
Sbjct: 429 -QAECLADYQEIHEELEDAGYIRSDGKKSRRSKGTKKAPPQIMRFTSRDGYQILVGKNNQ 487

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QN+ I  R     D ++H     A + VI   +P Q +PP TL +A       S+A  S 
Sbjct: 488 QNDYITMRLARDEDYWLHVK-DSAGAHVIVKSKPGQEIPPSTLEEAAGLAAHFSEARYSS 546

Query: 639 MV 640
            V
Sbjct: 547 KV 548


>gi|269863903|ref|XP_002651387.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064622|gb|EED42668.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 172

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269864916|ref|XP_002651741.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063963|gb|EED42314.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 184

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|428671810|gb|EKX72725.1| hypothetical protein BEWA_012840 [Babesia equi]
          Length = 842

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 53/174 (30%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAG-KVQKNDGDPQNENASTHKEKKPAI 950
            +S+  + KL KMK+KYG  DEE + +R  L  S   KV K              E++PA+
Sbjct: 678  MSKAARNKLAKMKKKYGSDDEETQELRRLLTGSTKLKVIK------------QAEEEPAV 725

Query: 951  SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEI 1010
             P  AP      +     S+D  +  DD                                
Sbjct: 726  QP-SAP------RPRTQPSQDTLKTIDD-------------------------------- 746

Query: 1011 GEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKG 1064
             +E +  +   + L  +P   DI+L  IP+C P+SA++ +K R+K++PG  KKG
Sbjct: 747  -KELERYMKQFNRLCKDPKEDDIILNAIPMCAPFSALREFKTRIKLVPGNTKKG 799


>gi|269865384|ref|XP_002651904.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063549|gb|EED42152.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 224

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|170576161|ref|XP_001893523.1| hypothetical protein [Brugia malayi]
 gi|158600426|gb|EDP37645.1| conserved hypothetical protein [Brugia malayi]
          Length = 109

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 3/61 (4%)

Query: 1006 DIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGK 1065
            D+  +  EE   LN    LT  PL  D+LL+ + V  PY  +Q++KY+VK+ PGT K+GK
Sbjct: 3    DMAVMDAEETKMLNS---LTWRPLDEDVLLFALVVVAPYQTMQNFKYKVKLTPGTGKRGK 59

Query: 1066 G 1066
             
Sbjct: 60   A 60


>gi|269866242|ref|XP_002652204.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062960|gb|EED41852.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 240

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269867274|ref|XP_002652541.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062265|gb|EED41515.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 246

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269865392|ref|XP_002651907.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063544|gb|EED42149.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 275

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|381179596|ref|ZP_09888446.1| Fibronectin-binding A domain protein [Treponema saccharophilum DSM
           2985]
 gi|380768543|gb|EIC02532.1| Fibronectin-binding A domain protein [Treponema saccharophilum DSM
           2985]
          Length = 511

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 83/174 (47%), Gaps = 19/174 (10%)

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES---KQEKTIT-------- 522
           +N  E DD E    V ++ +D +LSAH NA+ +YE  +K ES   + E+ I+        
Sbjct: 287 SNFIEADDWESGEKV-RIRIDPSLSAHENAQSYYEKYRKSESGIAELERDISIAEGELEK 345

Query: 523 --AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A      A +   +L+ +  KT       +K H   +F    S + + +I GRDA +N
Sbjct: 346 LDAQYAEMVAEKNPIKLEQVLRKTQRPKQLEKKTHPGLEF----SVDGWTIIVGRDADEN 401

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           + +++  +   D+++H   +      IKN RP + VP   L  AG   V +S+A
Sbjct: 402 DELLRHNVKGQDMWLHVRDYSGGYVFIKN-RPGKTVPLEILLYAGNLAVFYSKA 454


>gi|385810177|ref|YP_005846573.1| RNA-binding protein [Ignavibacterium album JCM 16511]
 gi|383802225|gb|AFH49305.1| Putative RNA-binding protein [Ignavibacterium album JCM 16511]
          Length = 538

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/218 (21%), Positives = 98/218 (44%), Gaps = 32/218 (14%)

Query: 446 EERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
           E  + GN +   I+K++   N + L      D++ + +KT+   ++++D  L+   N  R
Sbjct: 294 EYNRLGNILLININKIHSGMNSIIL------DDIYESDKTI---EIKLDPKLTPKENVNR 344

Query: 506 WYELKKKQESKQEKTI-------------------TAHSKAFKAAEKKTRLQILQEKTVA 546
           ++E  K+ +++  K I                   T++S   K  E+  +   ++ KT  
Sbjct: 345 YFEKAKESKTQYHKAIELIEIVSREKDRLIEFKNRTSNSSTVKELEQIAKGLKIKMKTEK 404

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           NI         EKF  ++    Y V  G+D++ N+M+  ++  + D++ HA     S  V
Sbjct: 405 NIQESIS----EKFKQYLVDGKYKVYVGKDSKSNDMLTLKFAKQNDLWFHARAVPGSHVV 460

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           ++    ++P+P   L +       HS+A  + +V  ++
Sbjct: 461 LRIENTKEPIPKSVLKKVASLAAYHSKAKTAGLVPVSY 498


>gi|345892116|ref|ZP_08842940.1| hypothetical protein HMPREF1022_01600 [Desulfovibrio sp.
           6_1_46AFAA]
 gi|345047527|gb|EGW51391.1| hypothetical protein HMPREF1022_01600 [Desulfovibrio sp.
           6_1_46AFAA]
          Length = 534

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FIS + + ++ GRDA+ N +  ++  +  D+++HAD    S  +I+     QPVP  TL+
Sbjct: 412 FISEDGFALLRGRDAKGN-LAARKLAAPHDIWLHADNGPGSHVIIRRAHGGQPVPERTLD 470

Query: 623 QAGCFTVCHSQAWDSKMV 640
           QAG    C S   D+ + 
Sbjct: 471 QAGGLAACKSWQRDAAVA 488


>gi|303326372|ref|ZP_07356815.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302864288|gb|EFL87219.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FIS + + ++ GRDA+ N +  ++  +  D+++HAD    S  +I+     QPVP  TL+
Sbjct: 434 FISEDGFALLRGRDAKGN-LAARKLAAPHDIWLHADNGPGSHVIIRRAHGGQPVPERTLD 492

Query: 623 QAGCFTVCHSQAWDSKMV 640
           QAG    C S   D+ + 
Sbjct: 493 QAGGLAACKSWQRDAAVA 510


>gi|312143921|ref|YP_003995367.1| fibronectin-binding A domain-containing protein [Halanaerobium
           hydrogeniformans]
 gi|311904572|gb|ADQ15013.1| Fibronectin-binding A domain protein [Halanaerobium
           hydrogeniformans]
          Length = 582

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F+SS  Y ++ GR+ +QN+ + K+  +KGD+++H      S  +IK    ++ +P  TLN
Sbjct: 462 FVSSNGYQILIGRNNKQNDKLTKKIANKGDIWLHTKTIAGSHVIIKRDTSKE-IPDTTLN 520

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A +SK V
Sbjct: 521 EAASLAAYFSKARNSKNV 538


>gi|397906011|ref|ZP_10506838.1| Fibronectin/fibrinogen-binding protein [Caloramator australicus
           RC3]
 gi|397160925|emb|CCJ34173.1| Fibronectin/fibrinogen-binding protein [Caloramator australicus
           RC3]
          Length = 574

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNA-HY 119
           R+  T   ++   T   F + LRK+++  RLED++Q+ +DRI+  +F     LG ++ +Y
Sbjct: 57  RIQITNINKENPQTAPNFVMVLRKYLQNSRLEDIKQINFDRIVEIKFEGKDELGYSSYYY 116

Query: 120 VILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           +I+E+  +  NI+L D ++ ++  ++    D      M+R+R
Sbjct: 117 IIIEIMGKHSNIILLDEKYKIIDAIKHLGSD------MNRYR 152


>gi|312097061|ref|XP_003148860.1| hypothetical protein LOAG_13303 [Loa loa]
          Length = 106

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 26/54 (48%), Positives = 34/54 (62%), Gaps = 3/54 (5%)

Query: 1013 EEKGRLNDVDYLTGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            EE   LN    LT  PL  D+LLY + V  PY  +Q++KY+VK+ PGT K+GK 
Sbjct: 7    EETKMLNS---LTWRPLDGDVLLYALVVVAPYQTMQNFKYKVKLTPGTGKRGKA 57


>gi|212704765|ref|ZP_03312893.1| hypothetical protein DESPIG_02829 [Desulfovibrio piger ATCC 29098]
 gi|212671828|gb|EEB32311.1| hypothetical protein DESPIG_02829 [Desulfovibrio piger ATCC 29098]
          Length = 604

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 56/125 (44%), Gaps = 8/125 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL   Q ++QE  +     A   A K  R       TVA +  +R          F+S +
Sbjct: 423 ELATVQAARQEALLGGIGHAAGEAGKPDR------STVA-LGALRGAALPRNVQLFVSDD 475

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            + ++ GRDA+ N +  ++  +  D+++H D    S  +I+     Q VP  TL+QAG  
Sbjct: 476 GFALLRGRDAKGN-IAARKLAAAHDIWLHTDGGPGSHVIIRRAHAGQEVPERTLDQAGAL 534

Query: 628 TVCHS 632
             C S
Sbjct: 535 AACKS 539


>gi|310779110|ref|YP_003967443.1| fibronectin-binding A domain-containing protein [Ilyobacter
           polytropus DSM 2926]
 gi|309748433|gb|ADO83095.1| Fibronectin-binding A domain protein [Ilyobacter polytropus DSM
           2926]
          Length = 539

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 54/98 (55%), Gaps = 10/98 (10%)

Query: 69  TAYARDKKN----TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYV 120
             Y +D K     TP  F+L LRKH+    + +V QLGYDRI++F+F     LG    Y+
Sbjct: 55  VCYLKDNKENAPETPMSFSLNLRKHLLNSIITEVSQLGYDRILVFKFRKLNELGQYKDYI 114

Query: 121 I-LELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIM 156
           +  E+  +  N++LTD +  +L L++    ++  + ++
Sbjct: 115 LYFEIMGKHSNLILTDKDGGILDLMKKFSLEENKLRVL 152


>gi|410131096|gb|AFV61763.1| gag protein [Equine infectious anemia virus]
          Length = 483

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 15/117 (12%)

Query: 907  YGDQDEEERNIRMALLASA------GKVQKN--DGDPQNENASTHKEKKPA--ISPVDAP 956
            Y  +D   +  +MALLA A      G ++     G P     + +   KP    S   AP
Sbjct: 340  YACRDVGSQRQKMALLAKALQTGLVGPMKAGVLKGGPLKAKQTCYNCGKPGHLSSQCRAP 399

Query: 957  KVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP-----CVGLDETAEMDKVAMEEEDIH 1008
            KVC+KCK+ GH SK CK++P +  +G +  P      V   ETA   K A   + ++
Sbjct: 400  KVCFKCKEPGHFSKQCKQNPKNGKNGAQGRPHKKTFPVHQQETANPAKTATPTQSLY 456


>gi|343473499|emb|CCD14625.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 211

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 11/86 (12%)

Query: 980  SHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRLNDVDYLTGNPLPSDILLYVIP 1039
            SH  + NP         +D  A+  E +    EEE  R  +  + T NP P D + Y + 
Sbjct: 88   SHPSKSNP---------VDPAAVNLEPLCSANEEEFER--EWVHFTANPRPDDCVQYAVV 136

Query: 1040 VCGPYSAVQSYKYRVKIIPGTAKKGK 1065
             C P SA++SYKY+ ++  G AKKG+
Sbjct: 137  TCAPMSALESYKYKTELFYGNAKKGQ 162


>gi|300811062|gb|ADK35798.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   MDK  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGRQGAQGRPQKQTFPVQKGSMDKTQMEEKQQGTLYPDLSQVKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|300811110|gb|ADK35839.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   MDK  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEKQQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|300811082|gb|ADK35815.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   MDK  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEKQQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|300811076|gb|ADK35810.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   MDK  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEKQQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|300811103|gb|ADK35833.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 46/102 (45%), Gaps = 16/102 (15%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE 1005
            GH SK C+  P +   G +  P       +   MDK  MEE+
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEK 452


>gi|385799646|ref|YP_005836050.1| fibronectin-binding A domain-containing protein [Halanaerobium
           praevalens DSM 2228]
 gi|309389010|gb|ADO76890.1| Fibronectin-binding A domain protein [Halanaerobium praevalens DSM
           2228]
          Length = 583

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS  Y ++ GR+ +QN+ + K+  + GD+++H  +   S  +IK    E  VP  TL 
Sbjct: 462 FISSNGYQILVGRNNKQNDRLSKKIANNGDIWLHTKVIAGSHVIIK-RDTEVEVPEQTLT 520

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       SQA +S  V
Sbjct: 521 EAAAIAAYFSQARESTNV 538


>gi|110456080|gb|ABG74581.1| RNA-binding protein-like protein [Musa acuminata AAA Group]
          Length = 53

 Score = 51.2 bits (121), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/24 (83%), Positives = 24/24 (100%)

Query: 12 AAEVKCLRRLIGMRCSNVYDLSPK 35
          AAE+KCLR+LIGMRC+NVYD+SPK
Sbjct: 1  AAELKCLRKLIGMRCANVYDISPK 24


>gi|255513711|gb|EET89976.1| Predicted fibronectin-binding protein [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 374

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 67/156 (42%), Gaps = 15/156 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV---L 57
           M   +++T ++A+  K LR L G      Y +    +  K         S + EKV   +
Sbjct: 1   MASRQVSTLEIASLSKELRFLEGFHIDKFYQVDESRFRIK--------ASSKGEKVNLGI 52

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L     R  T   A    + P+ F++ +R+ I    ++ V  L  DRII  +   G   
Sbjct: 53  WLCRYIGRTETITIA----DKPTNFSIAVRRRISGFVVDSVVMLNSDRIIEIKCSKGQET 108

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
             VI E++ +GNI+L D  +T+      H   D+ V
Sbjct: 109 KSVIFEMFGRGNIILCDGSYTIELAYAPHTFKDRAV 144


>gi|300811069|gb|ADK35804.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVG--LDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   MDK  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQPFPVQKGSMDKTQMEEKQQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|39992427|gb|AAH64364.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 435

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 29/40 (72%)

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           NFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 1   NFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 40


>gi|315272251|gb|ADU02701.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 16/102 (15%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C++CK+ 
Sbjct: 353  KMALLAKALQTGLAGPRKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKICFRCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE 1005
            GH SK C+  P +   G +  P       +   MDK  MEE+
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEK 452


>gi|429961216|gb|ELA40761.1| hypothetical protein VICG_02202, partial [Vittaforma corneae ATCC
           50505]
          Length = 147

 Score = 50.1 bits (118), Expect = 0.006,   Method: Composition-based stats.
 Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L  RL      N Y    +    K  N           K  LL+
Sbjct: 1   MKQRFTLLDLRATVNELNERLTNTFIQNFYSTQQRFIYIKFSN-----------KDTLLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E G R H T   ++  +  S F  KLR+  R  R+  + Q G+DRI +    + +    +
Sbjct: 50  EPGFRFHLT---QNADSEISHFCKKLREKCRHARVHRIYQFGFDRIAI----IDLQRVRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ++E ++ GN+L+ D    +L LLR
Sbjct: 103 VIEFFSAGNMLVLDENDQILELLR 126


>gi|146400055|gb|ABQ28725.1| gag protein [Equine infectious anemia virus]
          Length = 488

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 24/37 (64%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APKVC+KCK+AGH SK C+  P +   GV+  P
Sbjct: 394 SQCRAPKVCFKCKQAGHFSKQCRNAPKNGKQGVQGRP 430


>gi|315272265|gb|ADU02713.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 46/102 (45%), Gaps = 16/102 (15%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C++CK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKICFRCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE 1005
            GH SK C+  P +   G +  P       +   MDK  MEE+
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQMEEK 452


>gi|429961917|gb|ELA41461.1| hypothetical protein VICG_01445 [Vittaforma corneae ATCC 50505]
          Length = 179

 Score = 49.7 bits (117), Expect = 0.010,   Method: Composition-based stats.
 Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L  RL      N Y    +    K  N           K  LL+
Sbjct: 1   MKQRFTLLDLRATVNELNERLTNTFIQNFYSTQQRFIYIKFSN-----------KDTLLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E G R H T   ++  +  S F  KLR+  R  R+  + Q G+DRI +    + +    +
Sbjct: 50  EPGFRFHLT---QNADSEISHFCKKLREKCRHARVHRIYQFGFDRIAI----IDLQRVRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ++E ++ GN+L+ D    +L LLR
Sbjct: 103 VIEFFSAGNMLVLDENDQILELLR 126


>gi|315272174|gb|ADU02635.1| gag protein [Equine infectious anemia virus]
          Length = 485

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 46/99 (46%), Gaps = 15/99 (15%)

Query: 902 KMKEK-YGDQDEEERNIRMALLASA----------GKVQKNDGDPQNENASTHKEKKPA- 949
           K++EK Y  +D      +MALLA A          G + K  G P     + +   KP  
Sbjct: 336 KLEEKMYACRDIGTVKQKMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGH 393

Query: 950 -ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
             S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 394 FSSQCKAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 432


>gi|397691486|ref|YP_006528740.1| RNA-binding protein snRNP [Melioribacter roseus P3M]
 gi|395812978|gb|AFN75727.1| RNA-binding protein snRNP [Melioribacter roseus P3M]
          Length = 363

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 41/182 (22%), Positives = 78/182 (42%), Gaps = 25/182 (13%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEK------- 543
           +++D  LS   N  R++E  K ++ + EK+I  ++      E K +  IL+E        
Sbjct: 158 IKLDPKLSPQKNIDRYFEKAKSEKIEYEKSIELYN------ELKNKYDILKELDEKLNKE 211

Query: 544 -TVANISHMRKVHWFEK-----------FNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
            T+  +  + K    +K           F  FI    Y V  G+D++ N+ +  R+  + 
Sbjct: 212 LTLEELQTIEKQLGIKKKMEMQDKSRPNFRHFIIDGKYNVYVGKDSKNNDELTLRFAKQN 271

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           D + HA     S  V++   P++ VP   L +A      +S+A  + +   ++    + V
Sbjct: 272 DYWFHARSVSGSHVVLRTDNPKEVVPKSVLKKAASIAAFYSKAKTAGLAPVSYTFKKYVV 331

Query: 652 SK 653
            K
Sbjct: 332 KK 333


>gi|402694375|gb|AFQ90121.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 471

 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G PQ    + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 374 GGPQKAKQTCYNCGKPGHLSSQCRAPKVCFKCKEPGHFSKQCRNAPKNGKQGAQGRP 430


>gi|300811055|gb|ADK35792.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGSMKGGIFK--GGPLGAKQTCYNCGKPGHLSSQCKAPKVCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   M K  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMGKTQMEEKLQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|315272188|gb|ADU02647.1| gag protein [Equine infectious anemia virus]
          Length = 487

 Score = 49.3 bits (116), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 355 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 412

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 413 GHFSKQCRNAPKNGKQGAQGRP 434


>gi|407477620|ref|YP_006791497.1| hypothetical protein Eab7_1781 [Exiguobacterium antarcticum B7]
 gi|407061699|gb|AFS70889.1| Hypothetical protein Eab7_1781 [Exiguobacterium antarcticum B7]
          Length = 564

 Score = 49.3 bits (116), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 17/135 (12%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV---RLHTTAY 71
           V+ L+ L+G R + ++       IF +          E + V+LL  +     RLH T+ 
Sbjct: 12  VRELQPLVGARINKIHQPYALDLIFSV--------RAERKNVMLLASANAMYARLHLTSE 63

Query: 72  ARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA 126
                + P  F + LRKH+    +E + QLG DRIIL +      LG   A  + +EL  
Sbjct: 64  TTTNPSEPPMFCMMLRKHLEGGFIESIEQLGRDRIILMRVRSRNELGDEEAKKLYIELMG 123

Query: 127 Q-GNILLTDSEFTVL 140
           +  NILLTD +  +L
Sbjct: 124 RHSNILLTDGQDKIL 138


>gi|300811089|gb|ADK35821.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 23/121 (19%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKLCFKCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE-------DIHEIGEEEKG 1016
            GH SK C+  P +   G +  P       +   M+K  MEE+       D+ ++ +E K 
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMNKTQMEEKLQGTLYPDLSQMKQEYKI 470

Query: 1017 R 1017
            R
Sbjct: 471  R 471


>gi|300811117|gb|ADK35845.1| gag protein [Equine infectious anemia virus]
 gi|300811124|gb|ADK35851.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|417002378|ref|ZP_11941767.1| putative fibronectin-binding protein [Anaerococcus prevotii
           ACS-065-V-Col13]
 gi|325479519|gb|EGC82615.1| putative fibronectin-binding protein [Anaerococcus prevotii
           ACS-065-V-Col13]
          Length = 582

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 15/148 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  +  E+K   +L+G +   +   S    +F       +   G S K+LL   +   R+
Sbjct: 8   TRKITNELK--EKLLGGKIQKISQPSKNDIVF------NIYSMGNSYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYV 120
           + T    +  + P  F + LRKHI   ++ D+ Q G DR+I+F      + G   +   +
Sbjct: 60  NITNIKYENPDVPPNFCMVLRKHINQGKIVDINQKGLDRVIIFSISSIDEMGYDTSKKLI 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD 148
           I  +    NI+L D +F ++  ++   D
Sbjct: 120 IEIMGKYSNIILVDDDFKIIDSIKRVND 147


>gi|315272230|gb|ADU02683.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|315272244|gb|ADU02695.1| gag protein [Equine infectious anemia virus]
          Length = 485

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|300811131|gb|ADK35857.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|300811138|gb|ADK35863.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|172057940|ref|YP_001814400.1| fibronectin-binding A domain-containing protein [Exiguobacterium
           sibiricum 255-15]
 gi|171990461|gb|ACB61383.1| Fibronectin-binding A domain protein [Exiguobacterium sibiricum
           255-15]
          Length = 564

 Score = 48.9 bits (115), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 67/139 (48%), Gaps = 17/139 (12%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV---RLHTTAY 71
           V+ L+ L+G R + ++       IF +          E + V+LL  +     RLH T+ 
Sbjct: 12  VQELQPLVGARINKIHQPYALDLIFSV--------RAERKNVMLLASANAMYARLHLTSE 63

Query: 72  ARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA 126
           +    + P  F + LRKH+    +E + QLG DR+IL +      LG   A  + +EL  
Sbjct: 64  STSNPSEPPMFCMMLRKHLEGGFIESIEQLGRDRVILMRVRSRNELGDEEAKKLYIELMG 123

Query: 127 Q-GNILLTDSEFTVLTLLR 144
           +  NILLTD +  +L  ++
Sbjct: 124 RHSNILLTDGQDKILDAIK 142


>gi|315272258|gb|ADU02707.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.9 bits (115), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 45/102 (44%), Gaps = 16/102 (15%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C++CK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKICFRCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNP--CVGLDETAEMDKVAMEEE 1005
            GH SK C+  P +   G +  P       +   MDK   EEE
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRPQKQTFPVQKGSMDKTQKEEE 452


>gi|374711077|ref|ZP_09715511.1| fibronectin-binding A domain-containing protein, partial
           [Sporolactobacillus inulinus CASD]
          Length = 306

 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 13/131 (9%)

Query: 13  AEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYA 72
           A V+ L+   G R + +Y  +P   IF L      +     + ++ +  +  R+H T  +
Sbjct: 10  AAVEELQDFTGGRIAKIYQPTPTDLIFHLR-----SRHARGKLLISINAAFARMHLTEQS 64

Query: 73  RDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYA 126
            D    P  F + LRKH+    ++ + QLG++RI+        +FG  +    +I+EL  
Sbjct: 65  ADNPQEPPMFCMLLRKHLEGSVIQRIEQLGFERIVHIDARSRNEFG-DLTEKQLIIELMG 123

Query: 127 Q-GNILLTDSE 136
           +  N++L D E
Sbjct: 124 RHSNVILIDKE 134


>gi|315272195|gb|ADU02653.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|13383730|gb|AAK21105.1|AF327877_1 gag protein [Equine infectious anemia virus]
          Length = 400

 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 267 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKVCFKCKQP 324

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 325 GHFSKQCRNAPKNGKQGAQGRP 346


>gi|398310663|ref|ZP_10514137.1| hypothetical protein BmojR_14603 [Bacillus mojavensis RO-H-1]
          Length = 570

 Score = 48.5 bits (114), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +   G+++K+LL    S  R+H TA   +  + 
Sbjct: 18  RITGGRITKVHQPYKHDVIFH------IRADGKNQKLLLSAHPSYSRVHITAQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHY-VILELYAQ-GNILL 132
           P  F + LRKHI    +E + Q G DRI++F       +G   H  + +E+  +  NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETHRKLYVEIMGRHSNIIL 131

Query: 133 TDSE 136
           TD E
Sbjct: 132 TDGE 135


>gi|403234858|ref|ZP_10913444.1| Fibronectin-binding A domain-containing protein [Bacillus sp.
           10403023]
          Length = 569

 Score = 48.5 bits (114), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 68/136 (50%), Gaps = 15/136 (11%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRL 66
           T  +A E+K  + L   R S +Y       IF+      +  +G++ K+LL    S  R+
Sbjct: 8   THAIANELK--QTLESGRISKIYQPYKNELIFQ------IRSNGKNHKLLLSAHPSYARI 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVI 121
           H T    D  + P  F + LRKH+    +E +RQ+  DRII+F       LG ++   +I
Sbjct: 60  HLTNELYDNPHEPPMFCMLLRKHLEGSIIEAIRQVDKDRIIIFDIKGRNELGDVSYKQLI 119

Query: 122 LELYAQ-GNILLTDSE 136
           +E+  +  NI+L D+E
Sbjct: 120 IEIMGRHSNIILVDTE 135


>gi|315272216|gb|ADU02671.1| gag protein [Equine infectious anemia virus]
          Length = 400

 Score = 48.1 bits (113), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 267 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKICFKCKQP 324

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 325 GHFSKQCRNAPKNGKQGAQGRP 346


>gi|13383737|gb|AAK21111.1|AF327878_1 gag protein [Equine infectious anemia virus]
          Length = 400

 Score = 48.1 bits (113), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 267 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKICFKCKQP 324

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 325 GHFSKQCRNAPKNGRQGAQGRP 346


>gi|385264690|ref|ZP_10042777.1| hypothetical protein MY7_1447 [Bacillus sp. 5B6]
 gi|385149186|gb|EIF13123.1| hypothetical protein MY7_1447 [Bacillus sp. 5B6]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+HTT  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHTTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|317057453|ref|YP_004105920.1| fibronectin-binding A domain-containing protein [Ruminococcus albus
           7]
 gi|315449722|gb|ADU23286.1| Fibronectin-binding A domain protein [Ruminococcus albus 7]
          Length = 594

 Score = 47.8 bits (112), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 13/126 (10%)

Query: 16  KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARD 74
           K L  LIG R   ++  S    +  L    G+      +K+L+   +G  RLH TA   +
Sbjct: 13  KELMPLIGGRVDKIHQPSKGELLIALRTYDGI------KKLLINTVAGTARLHLTAAEIE 66

Query: 75  KKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA-QG 128
               P  F + +RKH+   +L D+RQ  ++R+I+  F     LG M    V +EL   + 
Sbjct: 67  NPKQPPMFCMLMRKHLSGAKLADIRQPEHERVIMLDFDATNELGDMVRLTVTVELMGRRA 126

Query: 129 NILLTD 134
           N+LLTD
Sbjct: 127 NLLLTD 132


>gi|300811096|gb|ADK35827.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 47.8 bits (112), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGARQTCYNCGKPGHFSSQCKAPKLCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGKQGAQGRP 432


>gi|255528127|ref|ZP_05394955.1| Fibronectin-binding A domain protein [Clostridium carboxidivorans
           P7]
 gi|255508168|gb|EET84580.1| Fibronectin-binding A domain protein [Clostridium carboxidivorans
           P7]
          Length = 541

 Score = 47.8 bits (112), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 57  LLLMESGV--RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF--- 111
           LL+  S V  ++H T  ++     P  F + LRKHI T RL ++RQL  DR+I   F   
Sbjct: 11  LLISASSVYPKIHLTQLSKTNPMQPPLFCMVLRKHINTGRLVNIRQLDTDRVIFLDFEST 70

Query: 112 -GLGMNAHYVI 121
             LG N+ Y +
Sbjct: 71  DELGFNSIYTL 81


>gi|315272272|gb|ADU02719.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 47.4 bits (111), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 20/113 (17%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C++CK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKLCFRCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEKGRL 1018
            GH SK C+  P +   G +  P     +T  + K +M      + GE+++G L
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRP---QKQTFPVQKGSMNNT---QKGEKQQGTL 457


>gi|341868843|gb|AEK98539.1| gag protein [Equine infectious anemia virus]
          Length = 426

 Score = 47.4 bits (111), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA   K     G P     + +   KP    S   APKVC+KC++ GH SK CK+ P + 
Sbjct: 363 LAGPNKASVIKGGPLRAPQTCYNCGKPGHFSSQCRAPKVCFKCRQPGHFSKQCKDQPKNG 422

Query: 980 SHGV 983
             G+
Sbjct: 423 KQGL 426


>gi|315272223|gb|ADU02677.1| gag protein [Equine infectious anemia virus]
          Length = 400

 Score = 47.4 bits (111), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 267 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKLCFKCKQP 324

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 325 GHFSKQCRNAPKNGKQGAQGRP 346


>gi|317059002|ref|ZP_07923487.1| fibronectin-binding protein [Fusobacterium sp. 3_1_5R]
 gi|313684678|gb|EFS21513.1| fibronectin-binding protein [Fusobacterium sp. 3_1_5R]
          Length = 541

 Score = 47.4 bits (111), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 10/89 (11%)

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLT 133
           S F   LRKH+    L  V Q+G+DR ++F+F     LG    +++I EL  +  N+ L 
Sbjct: 79  SSFLNTLRKHLMNSFLYQVEQVGWDRTLIFRFSKLTELGDYKQYFLIFELMGRNSNLFLC 138

Query: 134 DSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           D ++ +L LL+    D+    + +R+ +P
Sbjct: 139 DQDYKILDLLKRFSLDE----VQTRNLFP 163


>gi|428279159|ref|YP_005560894.1| hypothetical protein BSNT_02575 [Bacillus subtilis subsp. natto
           BEST195]
 gi|291484116|dbj|BAI85191.1| hypothetical protein BSNT_02575 [Bacillus subtilis subsp. natto
           BEST195]
          Length = 570

 Score = 47.4 bits (111), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|146400059|gb|ABQ28727.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 47.4 bits (111), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA A K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 361 LAGAMKGGIMKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKEPGHFSKQCRNTPKNG 420

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 421 KQGAQGRP 428


>gi|414152026|gb|AFW99182.1| gag polyprotein [Equine infectious anemia virus]
          Length = 487

 Score = 47.4 bits (111), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P   + + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 374 GGPLKASQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 430


>gi|394993902|ref|ZP_10386641.1| YloA [Bacillus sp. 916]
 gi|393805226|gb|EJD66606.1| YloA [Bacillus sp. 916]
          Length = 568

 Score = 47.0 bits (110), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 65/133 (48%), Gaps = 15/133 (11%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG---MNAHYVILELYAQGNIL 131
           P  F + LRKHI    +E + Q G DRI++F+      +G   + A YV + +    NI+
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRALYVEI-MGRHSNII 130

Query: 132 LTDSEFTVLTLLR 144
           LTD E  ++  L+
Sbjct: 131 LTDGEGAIIDGLK 143


>gi|261872048|gb|ACY02858.1| gag polyprotein [Equine infectious anemia virus]
          Length = 426

 Score = 47.0 bits (110), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
           G PQ    + +   KP    S   APKVC+KCK+ GH SK C+  P +   G 
Sbjct: 374 GGPQKTKQTCYNCGKPGHLSSQCRAPKVCFKCKEPGHFSKQCRNAPKNGKQGA 426


>gi|315272181|gb|ADU02641.1| gag protein [Equine infectious anemia virus]
          Length = 485

 Score = 47.0 bits (110), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 396 SQCKAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 432


>gi|147678138|ref|YP_001212353.1| RNA-binding protein [Pelotomaculum thermopropionicum SI]
 gi|146274235|dbj|BAF59984.1| hypothetical RNA-binding protein [Pelotomaculum thermopropionicum
           SI]
          Length = 290

 Score = 47.0 bits (110), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 47/88 (53%), Gaps = 4/88 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTL 621
           F+S++ + +  GR+ +QN+ + ++     D+++HA D+ GA   +IK    E  VPP TL
Sbjct: 163 FVSTDGFQIFIGRNNKQNDYLTQKIARDNDIWLHARDIPGA-HVIIKTEGKE--VPPATL 219

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            +A       S+  +SK+V   +    H
Sbjct: 220 EEAAGLAAYFSKGRNSKIVPVDYTFKKH 247


>gi|315272202|gb|ADU02659.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 47.0 bits (110), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 50/110 (45%), Gaps = 22/110 (20%)

Query: 918  RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
            +MALLA A          G + K  G P     + +   KP    S   APK+C++CK+ 
Sbjct: 353  KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGKPGHFSSQCKAPKLCFRCKQP 410

Query: 966  GHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIGEEEK 1015
            GH SK C+  P +   G +  P        +     +++E +++  +EEK
Sbjct: 411  GHFSKQCRNAPKNGKQGAQGRP--------QKQTFPVQKESMNKTQKEEK 452


>gi|146400057|gb|ABQ28726.1| gag protein [Equine infectious anemia virus]
          Length = 488

 Score = 47.0 bits (110), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KC++ GH SK CK  P +   G +  P
Sbjct: 374 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCRQPGHFSKQCKNAPKNGKQGAQGRP 430


>gi|386360884|ref|YP_006059129.1| RNA-binding protein [Thermus thermophilus JL-18]
 gi|383509911|gb|AFH39343.1| putative RNA-binding protein, snRNP like protein [Thermus
           thermophilus JL-18]
          Length = 512

 Score = 46.6 bits (109), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLKTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|220903575|ref|YP_002478887.1| hypothetical protein Ddes_0294 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219867874|gb|ACL48209.1| protein of unknown function DUF814 [Desulfovibrio desulfuricans
           subsp. desulfuricans str. ATCC 27774]
          Length = 577

 Score = 46.6 bits (109), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + ++ GRDA+ N + V++  +  D+++HA+    S  +I+     Q VP  TL+
Sbjct: 456 FISSDGFALLRGRDARGN-LAVRKLAAPHDIWLHAENGPGSHVIIRRAHGGQEVPARTLD 514

Query: 623 QAGCFTVCHS 632
           +AG      S
Sbjct: 515 EAGALAANKS 524


>gi|315272209|gb|ADU02665.1| gag protein [Equine infectious anemia virus]
          Length = 486

 Score = 46.6 bits (109), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 38/82 (46%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +MALLA A          G + K  G P     + +   +P    S   APK+C+KCK+ 
Sbjct: 353 KMALLAKALQTGLAGPMKGGIFK--GGPLGAKQTCYNCGRPGHFSSQCKAPKLCFKCKQP 410

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH SK C+  P +   G +  P
Sbjct: 411 GHFSKQCRNAPKNGRQGAQGRP 432


>gi|384430804|ref|YP_005640164.1| fibronectin-binding A domain-containing protein [Thermus
           thermophilus SG0.5JP17-16]
 gi|333966272|gb|AEG33037.1| Fibronectin-binding A domain protein [Thermus thermophilus
           SG0.5JP17-16]
          Length = 512

 Score = 46.6 bits (109), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|326790867|ref|YP_004308688.1| fibronectin-binding A domain-containing protein [Clostridium
           lentocellum DSM 5427]
 gi|326541631|gb|ADZ83490.1| Fibronectin-binding A domain protein [Clostridium lentocellum DSM
           5427]
          Length = 586

 Score = 46.6 bits (109), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 79/170 (46%), Gaps = 22/170 (12%)

Query: 9   ADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLH 67
           A++  E+K +  LIG R   +Y +  +  +F + N+      G   K+LL   S   R+H
Sbjct: 9   ANIVHELKDV--LIGGRIDKIYQIEKEDILFTIRNN------GNVYKLLLTANSNYPRVH 60

Query: 68  TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLG-MNAHYVIL 122
            +  A++    P  F + LRKH+   RL D+ Q   +RI+ F       LG      +I+
Sbjct: 61  LSTLAKNPSQDPPMFCMLLRKHLGGGRLLDIVQPDLERIVEFHIEATNELGDKETKKLII 120

Query: 123 ELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           E+  +  NI+LT  +  +L  ++   +D   V    R   P    RV++R
Sbjct: 121 EIMGRHSNIILTKEDHLILDSIKHISNDKSSV----REILPN---RVYQR 163


>gi|433446087|ref|ZP_20410218.1| fibrinogen binding protein [Anoxybacillus flavithermus TNO-09.006]
 gi|432000832|gb|ELK21724.1| fibrinogen binding protein [Anoxybacillus flavithermus TNO-09.006]
          Length = 569

 Score = 46.6 bits (109), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 56/125 (44%), Gaps = 13/125 (10%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKN 77
           R L+G R S +Y   P   +  + +       G + K+LL    +  R+H T    D  +
Sbjct: 17  RTLVGGRISKIYQPFPHELVLHIRSY------GNNYKLLLSAHPTYARIHLTNEVYDHPS 70

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNIL 131
            P  F + LRKHI    +E + Q+ +DRII+       + G       +I  +    NI+
Sbjct: 71  EPPMFCMLLRKHIEGGVIEAITQVDFDRIIIIHVKARNELGDVCTKQLIIEMMGRHSNII 130

Query: 132 LTDSE 136
           L D++
Sbjct: 131 LVDAQ 135


>gi|16078628|ref|NP_389447.1| persistent RNA/DNA binding protein [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|402775809|ref|YP_006629753.1| persistent RNA/DNA binding protein [Bacillus subtilis QB928]
 gi|81637590|sp|O34693.1|YLOA_BACSU RecName: Full=Uncharacterized protein YloA
 gi|2462963|emb|CAA04416.1| putative fibronectin-binding protein [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|2633937|emb|CAB13438.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. subtilis str. 168]
 gi|402480991|gb|AFQ57500.1| Putative persistent RNA/DNA binding protein [Bacillus subtilis
           QB928]
          Length = 572

 Score = 46.6 bits (109), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 133

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 134 TDAAENVI 141


>gi|399888866|ref|ZP_10774743.1| RNA-binding protein [Clostridium arbusti SL206]
          Length = 576

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 98/235 (41%), Gaps = 33/235 (14%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHY- 119
           ++H T   +    TP  F + LRK++   R+ D+RQ+  DRII+F F     LG N+ Y 
Sbjct: 59  KIHITKNNKTNPLTPPMFCMVLRKYLLNGRIVDIRQVSTDRIIIFDFESVDDLGFNSIYS 118

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASK-L 177
           +++E+  +          + +TL+R  RD+     IM   ++ T EI R        K +
Sbjct: 119 LVVEIMGRH---------SNITLIR-QRDN----IIMDSIKHITPEINRFRSLYPGIKYV 164

Query: 178 HAALTSSKEP-DANEPDKVNEDGNNVSNASKENLGGQKGGKS--------FDLSKN---S 225
           +   +    P D N+ D  N   +N  +  ++       G S        F LSKN    
Sbjct: 165 YPPKSERLNPFDFNKSDFTNYLTSNAIDIDEKMFSKIFTGVSKPLSKEVFFRLSKNIKMD 224

Query: 226 NKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA 280
           N NSND           +      Y       II D   +    LS ++K+E N+
Sbjct: 225 NINSNDIYEYIANLFNDIKNYKFSYNAYSENGIIKDFSCIDLTNLSTMDKIEYNS 279


>gi|221309439|ref|ZP_03591286.1| hypothetical protein Bsubs1_08641 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221313764|ref|ZP_03595569.1| hypothetical protein BsubsN3_08577 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221318688|ref|ZP_03599982.1| hypothetical protein BsubsJ_08511 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221322959|ref|ZP_03604253.1| hypothetical protein BsubsS_08617 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|418033289|ref|ZP_12671766.1| hypothetical protein BSSC8_27100 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|452914213|ref|ZP_21962840.1| fibronectin-binding A family protein [Bacillus subtilis MB73/2]
 gi|351469437|gb|EHA29613.1| hypothetical protein BSSC8_27100 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|407958971|dbj|BAM52211.1| persistent RNA/DNA binding protein [Bacillus subtilis BEST7613]
 gi|407964548|dbj|BAM57787.1| persistent RNA/DNA binding protein [Bacillus subtilis BEST7003]
 gi|452116633|gb|EME07028.1| fibronectin-binding A family protein [Bacillus subtilis MB73/2]
          Length = 570

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|384175306|ref|YP_005556691.1| fibronectin-binding protein [Bacillus subtilis subsp. subtilis str.
           RO-NN-1]
 gi|349594530|gb|AEP90717.1| fibronectin-binding protein [Bacillus subtilis subsp. subtilis str.
           RO-NN-1]
          Length = 570

 Score = 46.6 bits (109), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|166236167|gb|ABY85873.1| gag protein [Equine infectious anemia virus]
          Length = 487

 Score = 46.6 bits (109), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 363 LAGSMKGGVCKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 422

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 423 KQGAQGRP 430


>gi|430759013|ref|YP_007209734.1| Fibronectin-binding protein YloA [Bacillus subtilis subsp. subtilis
           str. BSP1]
 gi|430023533|gb|AGA24139.1| Fibronectin-binding protein YloA [Bacillus subtilis subsp. subtilis
           str. BSP1]
          Length = 572

 Score = 46.6 bits (109), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 133

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 134 TDAAENVI 141


>gi|315924518|ref|ZP_07920739.1| fibronectin-binding protein [Pseudoramibacter alactolyticus ATCC
           23263]
 gi|315622222|gb|EFV02182.1| fibronectin-binding protein [Pseudoramibacter alactolyticus ATCC
           23263]
          Length = 595

 Score = 46.6 bits (109), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 18/106 (16%)

Query: 51  GESEKVLLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           G++  VLL+  +    R+H T   +   NTP  F + LRKH+   R+E +RQ   DR+IL
Sbjct: 46  GKTNYVLLMSANANQPRVHLTNKKKKNPNTPPSFCMALRKHLINGRIEAIRQHESDRVIL 105

Query: 109 F------QFGLGMNAHYVILELYAQ-----GNILLTDSEFTVLTLL 143
                  +FG       VI  L A+      NI+LT +E   L ++
Sbjct: 106 LDIATKNEFGDP-----VIKSLIAEITGRHANIILTKTEADALVII 146


>gi|321315330|ref|YP_004207617.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           BSn5]
 gi|320021604|gb|ADV96590.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           BSn5]
          Length = 570

 Score = 46.6 bits (109), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|414152173|gb|AFW99273.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 46.6 bits (109), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P   + + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 88  GGPLKASQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 144


>gi|414152170|gb|AFW99271.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152176|gb|AFW99275.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152179|gb|AFW99277.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 46.6 bits (109), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P   + + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 88  GGPLKASQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 144


>gi|300854261|ref|YP_003779245.1| RNA-binding protein [Clostridium ljungdahlii DSM 13528]
 gi|300434376|gb|ADK14143.1| putative RNA binding protein [Clostridium ljungdahlii DSM 13528]
          Length = 578

 Score = 46.6 bits (109), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 6/80 (7%)

Query: 49  ESGESEKVLLLMESGV--RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
           ++G     LLL  S V  ++H T  ++     P  F + LRKH+   +L D+RQL  DRI
Sbjct: 40  KNGRKNYKLLLSASPVYPKMHITVKSKQNPLQPPMFCMVLRKHLSPSKLVDIRQLDTDRI 99

Query: 107 ILFQF----GLGMNAHYVIL 122
           +   F     LG N+ Y ++
Sbjct: 100 VFLDFESSDELGFNSIYTLV 119


>gi|414152152|gb|AFW99259.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 46.2 bits (108), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 33/68 (48%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K +   G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 77  LAGSMKGRICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 136

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 137 KQGAQGRP 144


>gi|414152012|gb|AFW99170.1| gag polyprotein [Equine infectious anemia virus]
          Length = 487

 Score = 46.2 bits (108), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 363 LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 422

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 423 KQGAQGRP 430


>gi|46198551|ref|YP_004218.1| fibronectin/fibrinogen-binding protein [Thermus thermophilus HB27]
 gi|46196173|gb|AAS80591.1| fibronectin/fibrinogen-binding protein [Thermus thermophilus HB27]
          Length = 516

 Score = 46.2 bits (108), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLKTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKVGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|55980577|ref|YP_143874.1| RNA-biniding protein [Thermus thermophilus HB8]
 gi|55771990|dbj|BAD70431.1| probable RNA-biniding protein [Thermus thermophilus HB8]
          Length = 516

 Score = 46.2 bits (108), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKVGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|414152005|gb|AFW99164.1| gag polyprotein [Equine infectious anemia virus]
 gi|414152019|gb|AFW99176.1| gag polyprotein [Equine infectious anemia virus]
          Length = 487

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 363 LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 422

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 423 KQGAQGRP 430


>gi|312898711|ref|ZP_07758100.1| fibronectin-binding protein A [Megasphaera micronuciformis F0359]
 gi|310620142|gb|EFQ03713.1| fibronectin-binding protein A [Megasphaera micronuciformis F0359]
          Length = 574

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 14/156 (8%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L G + + +Y L+ +   F++ N   +        +++ ++   RL  +       + P+
Sbjct: 19  LTGGQITKIYQLNGRGLYFRVFNDKSLYH------LIITLDGSPRLFLSDNQPPTPDVPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILELYAQ-----GNILLTD 134
           G  + LRK+    R+  + QL  DRII      L M+   V  +++ +      N++ T+
Sbjct: 73  GLAMFLRKYYENGRIASITQLHLDRIIDVNIDVLNMSGQLVTRKMHVELMGKYSNVIFTE 132

Query: 135 SEFTVLTLLRSHRDDDKGVAIMSRHRY--PTEICRV 168
               +  L+++H+D      I  +H Y  P    R+
Sbjct: 133 DGMILEALIKTHKDKQALRTIYPKHPYEFPPNFMRM 168


>gi|166236165|gb|ABY85872.1| gag protein [Equine infectious anemia virus]
          Length = 487

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 363 LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 422

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 423 KQGAQGRP 430


>gi|443632767|ref|ZP_21116946.1| hypothetical protein BSI_20210 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
 gi|443347590|gb|ELS61648.1| hypothetical protein BSI_20210 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
          Length = 570

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMMGGRITKVHQPYKHDVIFH------IRANGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E++ Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIENIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|257066456|ref|YP_003152712.1| fibronectin-binding A domain-containing protein [Anaerococcus
           prevotii DSM 20548]
 gi|256798336|gb|ACV28991.1| Fibronectin-binding A domain protein [Anaerococcus prevotii DSM
           20548]
          Length = 582

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 30/132 (22%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKKNT 78
           +L+G +   V   S    +F       V   G++ K+LL   +   R++ T    +  + 
Sbjct: 18  KLLGGKIQKVTQPSKNDIVF------NVYSMGKNYKLLLSANNNEARINITNKKYENPDV 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNILL 132
           P  F + LRKHI   ++ D+ Q G DR+++F      + G   +   ++  +    NI+L
Sbjct: 72  PPNFCMVLRKHINQGKIIDISQRGLDRVVIFSISSIDEMGFDTSKKLIVEIMGKYSNIIL 131

Query: 133 TDSEFTVLTLLR 144
            D  + ++  ++
Sbjct: 132 VDDNYKIIDAIK 143


>gi|451347065|ref|YP_007445696.1| hypothetical protein KSO_011620 [Bacillus amyloliquefaciens IT-45]
 gi|449850823|gb|AGF27815.1| hypothetical protein KSO_011620 [Bacillus amyloliquefaciens IT-45]
          Length = 568

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E +++  L+
Sbjct: 132 TDGEGSIIDGLK 143


>gi|253987314|gb|ACT52162.1| gag protein [Equine infectious anemia virus]
          Length = 489

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 14/82 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +M LLA A          G + K  G P     + +   KP    S   APKVC+KCK+ 
Sbjct: 351 KMMLLARALQSGLAGPMKGGIYK--GGPLKTPQTCYNCGKPGHLSSQCRAPKVCFKCKQP 408

Query: 966 GHLSKDCKEHPDDSSHGVEDNP 987
           GH+S+ CK  P +   G    P
Sbjct: 409 GHMSRQCKNAPKNGKQGAXGRP 430


>gi|159505443|gb|ABW97698.1| gag protein [Equine infectious anemia virus]
          Length = 487

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 394 SQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 430


>gi|452974532|gb|EME74352.1| fibronectin-binding protein YloA [Bacillus sonorensis L12]
          Length = 571

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T  A D  + P  F + LRKH+    +E + Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTEEAYDNPSAPPMFCMLLRKHLEGGFVEQIEQIGLDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    V+  +    NI+LTD E  V+
Sbjct: 99  VMVFHIRSRNEVGDTLIRKLVVEIMGRHSNIVLTDGEKDVI 139


>gi|189182786|gb|ACD81986.1| gag protein [Equine infectious anemia virus]
          Length = 487

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 394 SQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGAQGRP 430


>gi|146400053|gb|ABQ28724.1| gag protein [Equine infectious anemia virus]
          Length = 488

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KC++ GH SK C+  P +   G +  P
Sbjct: 374 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCRQPGHFSKQCRNAPKNGKQGAQGRP 430


>gi|242280078|ref|YP_002992207.1| hypothetical protein Desal_2613 [Desulfovibrio salexigens DSM 2638]
 gi|242122972|gb|ACS80668.1| protein of unknown function DUF814 [Desulfovibrio salexigens DSM
           2638]
          Length = 503

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 35/70 (50%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ +L+I G++++ N  I+ +  S  D + H      S  V+K   P Q VP  TL 
Sbjct: 381 FISSDGFLMIRGKNSKANHEILSKVSSVFDYWFHVQGGPGSHVVLKRDHPSQEVPEQTLR 440

Query: 623 QAGCFTVCHS 632
           +A       S
Sbjct: 441 EAAVLAALKS 450


>gi|375362208|ref|YP_005130247.1| hypothetical protein BACAU_1518 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|371568202|emb|CCF05052.1| hypothetical protein BACAU_1518 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
          Length = 568

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E +++  L+
Sbjct: 132 TDGEGSIIDGLK 143


>gi|9929861|dbj|BAB12103.1| gag polyprotein [Equine infectious anemia virus]
          Length = 488

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 374 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRSVPKNGKQGAQGRP 430


>gi|9929868|dbj|BAB12109.1| gag polyprotein [Equine infectious anemia virus]
          Length = 488

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 374 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRSVPKNGKQGAQGRP 430


>gi|323778|gb|AAA43013.1| polyprotein, partial [Equine infectious anemia virus]
          Length = 122

 Score = 46.2 bits (108), Expect = 0.12,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK CK  P +   G +  P
Sbjct: 35  GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCKSVPKNGKQGAQGRP 91


>gi|9626531|ref|NP_056901.1| gag protein [Equine infectious anemia virus]
 gi|62288102|sp|P69730.1|GAG_EIAV9 RecName: Full=Gag polyprotein; Contains: RecName: Full=Matrix
           protein p15; Short=MA; Contains: RecName: Full=Capsid
           protein p26; Short=CA; Contains: RecName: Full=p1;
           Contains: RecName: Full=Nucleocapsid protein p11;
           Short=NC; Contains: RecName: Full=p9
 gi|62288103|sp|P69731.1|GAG_EIAVC RecName: Full=Gag polyprotein; Contains: RecName: Full=Matrix
           protein p15; Short=MA; Contains: RecName: Full=Capsid
           protein p26; Short=CA; Contains: RecName: Full=p1;
           Contains: RecName: Full=Nucleocapsid protein p11;
           Short=NC; Contains: RecName: Full=p9
 gi|62288104|sp|P69732.1|GAG_EIAVY RecName: Full=Gag polyprotein; Contains: RecName: Full=Matrix
           protein p15; Short=MA; Contains: RecName: Full=Capsid
           protein p26; Short=CA; Contains: RecName: Full=p1;
           Contains: RecName: Full=Nucleocapsid protein p11;
           Short=NC; Contains: RecName: Full=p9
 gi|9944517|gb|AAG02701.1|AF247394_1 gag protein [Equine infectious anemia virus]
 gi|290628|gb|AAA43003.1| gag protein [Equine infectious anemia virus]
 gi|323837|gb|AAB59861.1| gag protein [Equine infectious anemia virus]
 gi|2801511|gb|AAC82599.1| gag protein [Equine infectious anemia virus]
 gi|2905987|gb|AAC03760.1| gag polyprotein [Equine infectious anemia virus]
 gi|3248894|gb|AAC24014.1| gag polyprotein [Equine infectious anemia virus]
 gi|3248901|gb|AAC24020.1| gag polyprotein [Equine infectious anemia virus]
 gi|89954445|gb|ABD83644.1| codon usage optimized EIAV-gag protein [synthetic construct]
          Length = 486

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 374 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRSVPKNGKQGAQGRP 430


>gi|2337794|emb|CAA74268.1| YloA protein [Bacillus subtilis subsp. subtilis str. 168]
          Length = 200

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 133

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 134 TDAAENVI 141


>gi|317132057|ref|YP_004091371.1| fibronectin-binding A domain-containing protein [Ethanoligenens
           harbinense YUAN-3]
 gi|315470036|gb|ADU26640.1| Fibronectin-binding A domain protein [Ethanoligenens harbinense
           YUAN-3]
          Length = 588

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 80/173 (46%), Gaps = 23/173 (13%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEK---------- 533
           PVE + +D+ L+   NA+++Y+   K  + +    + I A  +  +  E           
Sbjct: 371 PVE-IALDVRLTPAQNAQKYYKEYHKAAAAERFLTEQIAAGEEELRYLETVLDEIARAGG 429

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFN-----WFISSENYLVISGRDAQQNEMIVKRYM 588
           ++ L  ++++ V +    R+    EK        F+S + + ++ GR+ +QN+ +  +  
Sbjct: 430 ESELAEIRDELVGSGYLRRRGQKREKLRENAPRRFVSDDGFEILVGRNNKQNDRLTLKTA 489

Query: 589 SKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +K D++ H  ++ GA   V+   R    VP  TL QA      HS+A DS  V
Sbjct: 490 AKTDMWFHTKNIPGAHVIVLAGGR---EVPERTLTQAAVLAATHSKAKDSAQV 539


>gi|323775|gb|AAA43011.1| gag [Equine infectious anemia virus]
          Length = 512

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 400 GGPLKAAQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRSVPKNGKQGAQGRP 456


>gi|384159452|ref|YP_005541525.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens
           TA208]
 gi|328553540|gb|AEB24032.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens
           TA208]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|387209294|gb|AFJ69115.1| hypothetical protein NGATSA_3044600, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 106

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 48/87 (55%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q  +A+E+A   +  ++  + E R+  L+    R +  A L+E + + VD  +L +R A+
Sbjct: 3   QAVRAQEEAVRSRPLRVQRENEARLKELEATEARLLDAARLVECHSDAVDKVLLVLRSAI 62

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLI 458
           A    W+ L   +++E+  GNP+A +I
Sbjct: 63  ATGADWQTLDEYIRKEQAGGNPLARMI 89


>gi|167769343|ref|ZP_02441396.1| hypothetical protein ANACOL_00669 [Anaerotruncus colihominis DSM
           17241]
 gi|167668311|gb|EDS12441.1| fibronectin-binding protein [Anaerotruncus colihominis DSM 17241]
          Length = 590

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 29/189 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL-LMESGVRLHTTAYARDKKNTP 79
           ++G R   ++  + +T +  +    G      + K+LL    S  R+H T  A+D   +P
Sbjct: 19  VVGGRVDKIHQPARETIVIAMRARVG------NRKLLLSASASNPRVHFTELAQDNPKSP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN-AHYVILELYAQ-----GNILLT 133
             F + +RKH+   +L D+ Q G DRI+ F F         V+L L A+      NI+L 
Sbjct: 73  PMFCMLMRKHLTGAKLVDITQAGLDRILHFHFETTNELGDRVVLTLSAEIMGRHSNIILV 132

Query: 134 DSEFTVLTLLRSHRDDDKGV-----AIMSRH-----------RYPTEICRVFERTTASKL 177
             +  ++  ++   D+   V      +M  H             P+EI +    T    L
Sbjct: 133 GQDGRIIDAVKRVSDEMSRVRPVLPGMMYTHVPAGSRLDIYKAAPSEIVKRLHDTPEQPL 192

Query: 178 HAALTSSKE 186
           + AL S+ E
Sbjct: 193 YKALISALE 201



 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F+S + + ++ GR+  QN+ +  +   K D+++H      S  VI      Q VP  TL 
Sbjct: 466 FVSDDGFTILCGRNNLQNDRLTLKDSRKNDIWLHTQKIPGSHVVIVTQ--GQEVPDRTLE 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAW------WVYP 648
           QA      HS+A +S  V   +      W +P
Sbjct: 524 QAAVIAAYHSKARESGKVAVDYTQVRNVWKHP 555


>gi|384164113|ref|YP_005545492.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens LL3]
 gi|384168499|ref|YP_005549877.1| uroporphyrin-III C-methyltransferase [Bacillus amyloliquefaciens
           XH7]
 gi|328911668|gb|AEB63264.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens LL3]
 gi|341827778|gb|AEK89029.1| putative uroporphyrin-III C-methyltransferase [Bacillus
           amyloliquefaciens XH7]
          Length = 571

 Score = 45.8 bits (107), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 21  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 74

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 75  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 134

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 135 TDGEGAIIDGLK 146


>gi|154685980|ref|YP_001421141.1| hypothetical protein RBAM_015470 [Bacillus amyloliquefaciens FZB42]
 gi|154351831|gb|ABS73910.1| YloA [Bacillus amyloliquefaciens FZB42]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|402694377|gb|AFQ90122.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 488

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 375 GGPLKAKQTCYNCGKPGHLSSQCRAPKVCFKCKEPGHFSKQCRNAPKNGRTGAQGKP 431


>gi|253326816|gb|ACT31322.1| gag polyprotein [Equine infectious anemia virus]
          Length = 486

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 22/37 (59%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 394 SQCRAPKVCFKCKEPGHFSKQCRNAPKNGRPGAQGKP 430


>gi|429505115|ref|YP_007186299.1| hypothetical protein B938_08035 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
 gi|429486705|gb|AFZ90629.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           AS43.3]
          Length = 568

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|421731766|ref|ZP_16170889.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           M27]
 gi|407073979|gb|EKE46969.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           M27]
          Length = 568

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|325679051|ref|ZP_08158645.1| putative fibronectin-binding protein [Ruminococcus albus 8]
 gi|324109175|gb|EGC03397.1| putative fibronectin-binding protein [Ruminococcus albus 8]
          Length = 594

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 63/127 (49%), Gaps = 13/127 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           LIG R   ++  S    +  +    G+      +K+L+   +G  RLH T    +    P
Sbjct: 18  LIGGRVDKIHQPSKGELLIAVRTFDGI------KKLLINTVAGTARLHLTTAEIENPKQP 71

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA-QGNILLT 133
             F + +RKH+ + +L D+RQ  ++R+I+  F     LG +    V +EL   + N++LT
Sbjct: 72  PMFCMLMRKHLSSAKLVDIRQPAFERVIMLDFDASNELGDIVRLTVTVELMGRRANLMLT 131

Query: 134 DSEFTVL 140
           D++  ++
Sbjct: 132 DADGKII 138


>gi|160933821|ref|ZP_02081209.1| hypothetical protein CLOLEP_02682 [Clostridium leptum DSM 753]
 gi|156867698|gb|EDO61070.1| fibronectin-binding protein [Clostridium leptum DSM 753]
          Length = 585

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 88/185 (47%), Gaps = 25/185 (13%)

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---------------- 518
           +L+   DE + +   +V++D AL+A  NA+++Y+  +K ++ Q+                
Sbjct: 363 DLENFYDENRLM---RVKLDPALNATQNAQKYYKEYRKAKTAQQVLGEQIAQAEQELLYV 419

Query: 519 -KTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
                  S+A   +E  + R ++ +E  +  +   RK         F+SSE + ++ GR+
Sbjct: 420 DSVFDCLSRAQSESELNEIRQELREEGYLKAVRDKRKPPAPLAPLEFVSSEGFRILVGRN 479

Query: 577 AQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
            +QN+ +  +  +  D+++H  ++ G+ + ++   R  QP    TL +A      HS+A 
Sbjct: 480 NRQNDKLTLKQANNNDIWLHTKNIPGSHTIIVTGGR--QP-GDATLKEAAMLAAYHSRAK 536

Query: 636 DSKMV 640
           DS  V
Sbjct: 537 DSSQV 541



 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 58/132 (43%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNT 78
           R +G R   +Y  + +  +F L          E+ K+LL   +   R+H T YA +    
Sbjct: 18  RALGARVDKIYQPNKEELVFLLRTRQ------EAFKLLLSARANSPRIHFTQYAPENPKV 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF------GLGMNAHYVILELYAQGNILL 132
           P    + LRK +   +L +VRQ G +R++   F      G  +    VI  +    NI+L
Sbjct: 72  PPMLCMLLRKRLSGAKLVEVRQPGLERLLYLDFDAANELGDKVRLSLVIEIMGKYSNIIL 131

Query: 133 TDSEFTVLTLLR 144
            D +  ++  L+
Sbjct: 132 VDGQGKIVDALK 143


>gi|315272237|gb|ADU02689.1| gag protein [Equine infectious anemia virus]
          Length = 482

 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 15/37 (40%), Positives = 22/37 (59%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           S   APK+C+KCK+ GH S+ C+  P +   G +  P
Sbjct: 392 SQCKAPKICFKCKQPGHFSRQCRNAPKNGKQGAQGRP 428


>gi|402694379|gb|AFQ90123.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 488

 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G +  P
Sbjct: 375 GGPLKAKQTCYNCGKPGHLSSQCRAPKVCFKCKEPGHFSKQCRNAPKNGRTGAQGKP 431


>gi|52080167|ref|YP_078958.1| fibronectin binding protein [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319646053|ref|ZP_08000283.1| YloA protein [Bacillus sp. BT1B_CT2]
 gi|404489055|ref|YP_006713161.1| fibronectin-binding protein YloA [Bacillus licheniformis DSM 13 =
           ATCC 14580]
 gi|52003378|gb|AAU23320.1| putative fibronectin binding protein [Bacillus licheniformis DSM 13
           = ATCC 14580]
 gi|52348046|gb|AAU40680.1| putative fibronectin-binding protein YloA [Bacillus licheniformis
           DSM 13 = ATCC 14580]
 gi|317391803|gb|EFV72600.1| YloA protein [Bacillus sp. BT1B_CT2]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T    D  +TP  F + LRKH+    ++ V Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTNETYDNPSTPPMFCMLLRKHLEGGFIDQVEQIGMDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    ++  +    NI+LTD E  V+
Sbjct: 99  MMVFHIRSRNEIGDTLTRKLMVEIMGRHSNIVLTDGEKDVI 139


>gi|423682109|ref|ZP_17656948.1| fibronectin binding protein [Bacillus licheniformis WX-02]
 gi|383438883|gb|EID46658.1| fibronectin binding protein [Bacillus licheniformis WX-02]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T    D  +TP  F + LRKH+    ++ V Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTNETYDNPSTPPMFCMLLRKHLEGGFIDQVEQIGMDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    ++  +    NI+LTD E  V+
Sbjct: 99  MMVFHIRSRNEIGDTLTRKLMVEIMGRHSNIVLTDGEKDVI 139


>gi|449094256|ref|YP_007426747.1| hypothetical protein C663_1608 [Bacillus subtilis XF-1]
 gi|449028171|gb|AGE63410.1| hypothetical protein C663_1608 [Bacillus subtilis XF-1]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|452855511|ref|YP_007497194.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens subsp. plantarum UCMB5036]
 gi|452079771|emb|CCP21528.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens subsp. plantarum UCMB5036]
          Length = 571

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 21  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 74

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 75  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 134

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 135 TDGEGAIIDGLK 146


>gi|414152164|gb|AFW99267.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 77  LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 136

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 137 RQGAQGRP 144


>gi|414152158|gb|AFW99263.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 45.4 bits (106), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 77  LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 136

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 137 KQGAQGRP 144


>gi|414152134|gb|AFW99247.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152137|gb|AFW99249.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152140|gb|AFW99251.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152143|gb|AFW99253.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152146|gb|AFW99255.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152155|gb|AFW99261.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152161|gb|AFW99265.1| gag polyprotein, partial [Equine infectious anemia virus]
 gi|414152167|gb|AFW99269.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 45.4 bits (106), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 77  LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 136

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 137 KQGAQGRP 144


>gi|419841188|ref|ZP_14364565.1| fibronectin-binding protein A [Fusobacterium necrophorum subsp.
           funduliforme ATCC 51357]
 gi|386905940|gb|EIJ70691.1| fibronectin-binding protein A [Fusobacterium necrophorum subsp.
           funduliforme ATCC 51357]
          Length = 533

 Score = 45.4 bits (106), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKRF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|308173527|ref|YP_003920232.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens DSM
           7]
 gi|307606391|emb|CBI42762.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens DSM 7]
          Length = 568

 Score = 45.4 bits (106), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + ++       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRIHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|340756150|ref|ZP_08692781.1| fibronectin-binding protein [Fusobacterium sp. D12]
 gi|421500707|ref|ZP_15947699.1| fibronectin-binding protein A, N-terminal domain protein
           [Fusobacterium necrophorum subsp. funduliforme Fnf 1007]
 gi|313686904|gb|EFS23739.1| fibronectin-binding protein [Fusobacterium sp. D12]
 gi|402267261|gb|EJU16657.1| fibronectin-binding protein A, N-terminal domain protein
           [Fusobacterium necrophorum subsp. funduliforme Fnf 1007]
          Length = 533

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKHF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|440781920|ref|ZP_20960148.1| Fibronectin-binding protein [Clostridium pasteurianum DSM 525]
 gi|440220638|gb|ELP59845.1| Fibronectin-binding protein [Clostridium pasteurianum DSM 525]
          Length = 577

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 39/65 (60%), Gaps = 5/65 (7%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHY- 119
           ++H T  ++    TP  F + LRK++   ++ D+RQ+  DRII+F F     LG N+ Y 
Sbjct: 59  KIHITDNSKKNPLTPPMFCMVLRKYLLNSKIVDIRQIETDRIIIFDFQSVDDLGFNSIYS 118

Query: 120 VILEL 124
           +I+E+
Sbjct: 119 LIIEI 123


>gi|392394834|ref|YP_006431436.1| RNA-binding protein [Desulfitobacterium dehalogenans ATCC 51507]
 gi|390525912|gb|AFM01643.1| putative RNA-binding protein, snRNP like protein
           [Desulfitobacterium dehalogenans ATCC 51507]
          Length = 637

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 55/96 (57%), Gaps = 13/96 (13%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSG--FTLKLRKHIRTRRLEDVRQLGYDRII 107
           G+S ++LL +  +G RLH +   ++KKN PS   F + LRKHI   ++  + QLG +RI+
Sbjct: 43  GQSYRLLLNISATGARLHLSQ--KNKKNPPSPPMFCMILRKHIEGGKILALEQLGLERIV 100

Query: 108 LF------QFGLGMNAHYVILELYAQ-GNILLTDSE 136
           L       ++G  +   Y+ LE+  +  N++L D +
Sbjct: 101 LLTVQNYNEYG-DLATFYLYLEIMGKHSNLILVDPQ 135


>gi|381190336|ref|ZP_09897859.1| fibronectin/fibrinogen-binding protein [Thermus sp. RL]
 gi|380451929|gb|EIA39530.1| fibronectin/fibrinogen-binding protein [Thermus sp. RL]
          Length = 516

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   E+ +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAERALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     +    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGPRIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKA---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|373114330|ref|ZP_09528543.1| hypothetical protein HMPREF9466_02576 [Fusobacterium necrophorum
           subsp. funduliforme 1_1_36S]
 gi|371652324|gb|EHO17740.1| hypothetical protein HMPREF9466_02576 [Fusobacterium necrophorum
           subsp. funduliforme 1_1_36S]
          Length = 533

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKRF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|414152149|gb|AFW99257.1| gag polyprotein, partial [Equine infectious anemia virus]
          Length = 201

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 77  LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 136

Query: 980 SHGVEDNP 987
             G +  P
Sbjct: 137 KQGAQGRP 144


>gi|317496576|ref|ZP_07954925.1| fibronectin-binding protein A [Gemella morbillorum M424]
 gi|316913379|gb|EFV34876.1| fibronectin-binding protein A [Gemella morbillorum M424]
          Length = 556

 Score = 45.1 bits (105), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 9/109 (8%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGM-NAHY 119
           R   T    +  NTPS F   LRK++    ++++ Q+  DRII+F+      LG    +Y
Sbjct: 57  RFQLTKNTYENPNTPSNFCTVLRKYLIGGIIQNIEQINNDRIIVFKIKNFDELGYEKYYY 116

Query: 120 VILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTE 164
           +I EL  +  NI+LTD    ++  L++    D   + ++   Y   PTE
Sbjct: 117 LIAELMGKHSNIILTDDNKVIIESLKNSYSIDYKRSTIANMNYILPPTE 165


>gi|317121734|ref|YP_004101737.1| fibronectin-binding A domain-containing protein [Thermaerobacter
           marianensis DSM 12885]
 gi|315591714|gb|ADU51010.1| Fibronectin-binding A domain protein [Thermaerobacter marianensis
           DSM 12885]
          Length = 681

 Score = 45.1 bits (105), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 82/212 (38%), Gaps = 39/212 (18%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV- 64
           MN   +AA V+ L  L+  R   VY   P   + +L        +G    +L+  +  + 
Sbjct: 1   MNGLLLAAVVQELGNLLPARVERVYQPDPHVLVLRLY-------AGRELNLLISADPNLP 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLG-YDRIILFQFGL------GMNA 117
           RLH TA        P  F + LRKH+ + RL   RQ   +DR +   F            
Sbjct: 54  RLHLTARPPANPPAPPAFCMLLRKHLESLRLVGARQGPEFDRWLWLDFAAPGADEPARRL 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-------------PTE 164
           H  +  L  + N++L D +  +L  LR       G +++    Y             P  
Sbjct: 114 HLAVELLDRRANVVLLDGQGRILDALRRVPGSPGGRSLLPGIPYEPPPPPSPLPQGDPAS 173

Query: 165 I-CRVFERTTASKLHAALTSSKEPDANEPDKV 195
           + CR  E         ALT +  PDA +PD V
Sbjct: 174 LGCRWLE---------ALTGAG-PDAEDPDAV 195


>gi|328957541|ref|YP_004374927.1| putative persistent RNA/DNA binding protein [Carnobacterium sp.
           17-4]
 gi|328673865|gb|AEB29911.1| putative persistent RNA/DNA binding protein [Carnobacterium sp.
           17-4]
          Length = 575

 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 74/152 (48%), Gaps = 14/152 (9%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ K+LL    S  R+  T    +  ++P  F + +RKH+    LED++Q+G DR+I 
Sbjct: 48  NGKNHKLLLSAHPSYARIQLTEIPYENPSSPPNFCMIMRKHLEGAILEDIQQVGNDRVIH 107

Query: 109 FQF------GLGMNAHYVILELYAQGNILLT--DSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           F+F      G   N   ++  +    NILL   D++  + T+       +    IM    
Sbjct: 108 FRFKSRDEIGDVQNVILIVELMGRHSNILLIEQDTQRILDTIKHVPTSQNSFRFIMPGAT 167

Query: 161 YPT----EICRVFERTTASKLHAALTSSKEPD 188
           Y +    +    FE T++S+L   +T+ ++PD
Sbjct: 168 YQSPPHQDKLNPFE-TSSSELAELITAFEDPD 198


>gi|365128101|ref|ZP_09340417.1| hypothetical protein HMPREF1032_02181 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363623448|gb|EHL74567.1| hypothetical protein HMPREF1032_02181 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 587

 Score = 44.7 bits (104), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 86/189 (45%), Gaps = 30/189 (15%)

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQEKT- 520
           ++R   ++ L+N     D +E T+P+     D+ LS  ANA++++ E KKKQ + +  T 
Sbjct: 358 IQRGAKNVTLTNY---YDGKEVTIPL-----DVRLSPSANAQKYFKEYKKKQTAARMLTE 409

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMR---KVHWFEK-------------FNWFI 564
           + A S A        + ++   +  A ++ +R   K   + K             F  ++
Sbjct: 410 LIAESDAEAEYLATVQYEVETAEGEAALAEIRAELKSQGYLKYYKAKDKKQKPADFLRYV 469

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQ 623
           SS+ + ++ GR+  QN+ +  +     DV+ H  +  G+ + V+      QPVP  T  +
Sbjct: 470 SSDGFPILVGRNNAQNDRLTLKTARGRDVWFHVKNAPGSHAVVLSGG---QPVPDTTKTE 526

Query: 624 AGCFTVCHS 632
           A      HS
Sbjct: 527 AAVLAAVHS 535


>gi|392531657|ref|ZP_10278794.1| putative persistent RNA/DNA binding protein [Carnobacterium
           maltaromaticum ATCC 35586]
          Length = 569

 Score = 44.7 bits (104), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ KVLL    S  R+  T    +  NTP  F + +RK +    LE++ Q+G DR+I 
Sbjct: 42  NGKNHKVLLSAHPSYARIQITEIPYENPNTPPNFCMMMRKQLEGAILENIEQIGNDRVIH 101

Query: 109 FQF 111
           F F
Sbjct: 102 FTF 104


>gi|163790397|ref|ZP_02184828.1| fibronectin/fibrinogen-binding protein, putative [Carnobacterium
           sp. AT7]
 gi|159874301|gb|EDP68374.1| fibronectin/fibrinogen-binding protein, putative [Carnobacterium
           sp. AT7]
          Length = 569

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 7/94 (7%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ K+LL    S  R+  T    +  ++P  F + +RKH+    LED++Q+G DR+I 
Sbjct: 42  NGKNHKLLLSAHPSYARIQLTEIPYENPSSPPNFCMIMRKHLEGAILEDIQQVGNDRVIH 101

Query: 109 FQF------GLGMNAHYVILELYAQGNILLTDSE 136
           F+F      G   N   ++  +    NILL + +
Sbjct: 102 FRFKSRDEIGDVQNVILIVELMGRHSNILLIEQD 135


>gi|414083819|ref|YP_006992527.1| fibronectin-binding A N-terminus family protein [Carnobacterium
           maltaromaticum LMA28]
 gi|412997403|emb|CCO11212.1| fibronectin-binding A N-terminus family protein [Carnobacterium
           maltaromaticum LMA28]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ KVLL    S  R+  T    +  NTP  F + +RK +    LE++ Q+G DR+I 
Sbjct: 42  NGKNHKVLLSAHPSYARIQITEIPYENPNTPPNFCMMMRKQLEGAILENIEQIGNDRVIH 101

Query: 109 FQF 111
           F F
Sbjct: 102 FTF 104


>gi|329767576|ref|ZP_08259097.1| hypothetical protein HMPREF0428_00794 [Gemella haemolysans M341]
 gi|328839203|gb|EGF88787.1| hypothetical protein HMPREF0428_00794 [Gemella haemolysans M341]
          Length = 555

 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 80/169 (47%), Gaps = 32/169 (18%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFT 83
           R + V +LS   ++F +         G++ K+ L    S  R+  T  + +  +TPS F 
Sbjct: 23  RINKVNNLSTDEFVFSI-------RKGKNLKLFLSANPSASRIQLTNNSYENPSTPSNFC 75

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGM-NAHYVILELYAQ-GNILLTDSEF 137
             LRK++    +++++Q+  DR+++F+      LG    +Y+I EL  +  NI+LT+ + 
Sbjct: 76  SVLRKYLTGGIIQEIKQVNNDRVLVFKIKNFDDLGYEKYYYLITELMGKHSNIILTNEDN 135

Query: 138 TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKE 186
            +L  L              ++ Y  E    F+R+T S +   L  +KE
Sbjct: 136 IILESL--------------KNSYSLE----FKRSTISNMAYTLPPTKE 166


>gi|384265146|ref|YP_005420853.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
 gi|380498499|emb|CCG49537.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
          Length = 568

 Score = 44.3 bits (103), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F   LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCTLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|387898143|ref|YP_006328439.1| hypothetical protein MUS_1715 [Bacillus amyloliquefaciens Y2]
 gi|387172253|gb|AFJ61714.1| conserved hypothetical protein YloA [Bacillus amyloliquefaciens Y2]
          Length = 563

 Score = 44.3 bits (103), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 13  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 66

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F   LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 67  PPMFCTLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 126

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 127 TDGEGAIIDGLK 138


>gi|350265877|ref|YP_004877184.1| fibronectin-binding protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
 gi|349598764|gb|AEP86552.1| fibronectin-binding protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
          Length = 570

 Score = 44.3 bits (103), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           ++ G R + ++       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMTGGRITKIHQPYKHDVIFH------IRANGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|323454649|gb|EGB10519.1| hypothetical protein AURANDRAFT_8451, partial [Aureococcus
            anophagefferens]
          Length = 94

 Score = 44.3 bits (103), Expect = 0.42,   Method: Composition-based stats.
 Identities = 19/42 (45%), Positives = 26/42 (61%)

Query: 1025 TGNPLPSDILLYVIPVCGPYSAVQSYKYRVKIIPGTAKKGKG 1066
            TG P   D L + +PVC P +A + Y + +K+ PGT KKGK 
Sbjct: 1    TGAPKDGDALAWALPVCAPTAAARHYAHALKLQPGTQKKGKA 42


>gi|319649630|ref|ZP_08003786.1| fibronectin/fibrinogen-binding protein [Bacillus sp. 2_A_57_CT2]
 gi|317398792|gb|EFV79474.1| fibronectin/fibrinogen-binding protein [Bacillus sp. 2_A_57_CT2]
          Length = 566

 Score = 44.3 bits (103), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G + ++LL    S  R+  T  A +  + P  F + LRKH+    LEDV Q+G DR
Sbjct: 39  VRANGRNHRLLLSAHPSYARVQLTNEAHENPSEPPMFCMLLRKHLEGYILEDVHQIGLDR 98

Query: 106 IILFQ 110
           II+F+
Sbjct: 99  IIVFE 103


>gi|261872046|gb|ACY02857.1| gag polyprotein [Equine infectious anemia virus]
          Length = 427

 Score = 44.3 bits (103), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
           G P     + +   KP    S   APKVC+KCK+ GH SK C+  P +   G 
Sbjct: 375 GGPLKAKQTCYNCGKPGHLSSQCKAPKVCFKCKEPGHFSKQCRNAPKNGKQGA 427


>gi|212639624|ref|YP_002316144.1| Fibronectin/fibrinogen-binding protein [Anoxybacillus flavithermus
           WK1]
 gi|212561104|gb|ACJ34159.1| Fibronectin/fibrinogen-binding protein [Anoxybacillus flavithermus
           WK1]
          Length = 653

 Score = 43.9 bits (102), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 54/125 (43%), Gaps = 13/125 (10%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKN 77
           R L+G R S +Y   P +Y         V   G + K+LL    +  R+H T    D   
Sbjct: 100 RTLVGGRISKIYQ--PSSYEL----VCHVRSHGRNYKLLLCAHPTYARIHLTNETYDNPP 153

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNIL 131
            P  F + LRKH+    +E + Q+ +DRII+       + G       +I  +    NI+
Sbjct: 154 EPPMFCMLLRKHMEGGIIEAITQVDFDRIIIIHVKARNELGDVCTKQLIIEMMGRHSNII 213

Query: 132 LTDSE 136
           L D +
Sbjct: 214 LVDEQ 218


>gi|452992516|emb|CCQ96047.1| Fibronectin-binding protein A [Clostridium ultunense Esp]
          Length = 590

 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 121/615 (19%), Positives = 236/615 (38%), Gaps = 121/615 (19%)

Query: 46  GVTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
            +   G++ K+L+   S   R+H T   +   ++P  F + LRKH+    + ++ Q   D
Sbjct: 38  NIYNRGKNRKLLISASSNNPRIHLTNCGKSNPSSPPMFCMLLRKHLTGGIILNIEQFHMD 97

Query: 105 RIILF------QFGLGMNAHYVILELYAQGNILLTDS-EFTVLTLLRSHRDDDKGVAIMS 157
           RII        + G  +    ++  +    NI+L D   F V+  ++    D      MS
Sbjct: 98  RIIFIDISSLDELGQPIEKRLIVEIMGKYSNIILIDKISFRVIDSIKRVTPD------MS 151

Query: 158 RHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGK 217
           R R                    L   +    ++ +K+N    +++      L GQ  G 
Sbjct: 152 RIR------------------QVLPGVEYKYPHQNNKINPL--DLAEDQFFQLIGQDNGN 191

Query: 218 SFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLE 277
                              +P  +      +G GP +S+ I   + +  +  L+ +   E
Sbjct: 192 -------------------RPIYRFFYTNYIGLGPLISKEICFQSNIDMDRPLASITFEE 232

Query: 278 DNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCP 337
              I  + +A+ K       +   +  P   IL++N H G++            Y  F  
Sbjct: 233 KKKIFSIFMAIVK------RIRDNNFKP---ILIKNNH-GRN------------YKAFYA 270

Query: 338 LLLNQFRSREFVKFETFDAALDEFYSKIES-----QRAEQQHKAKEDAAFHKLNKIHMDQ 392
           L + QF + + +   +    LDE+Y K ++     Q+A+   K+ +      LNK+   +
Sbjct: 271 LDIEQFGNNKIL-LASISQVLDEYYIKNDTLDRVNQKAQSLRKSVQTKLERSLNKLAKQK 329

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVK--EERKA 450
           +  + +  +E  +    A+LI  NL  +D  +   +V L N  S E++ +++   +ER +
Sbjct: 330 QELLDSKNRE--KFKIYADLISANLYRIDKGL--SQVELENFYS-ENMEKIIVPLDERYS 384

Query: 451 GNPVAGLIDKLYLE-RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
               A    K Y + +N   LLL           + +P  + E+D   +   +     E+
Sbjct: 385 PAENAQKYYKRYSKLKNANQLLL-----------EQIPETEEEIDYLENVLNSIDHCTEV 433

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            +  E K+E     + K                    +I   +K     K   +ISS+ +
Sbjct: 434 LELDEIKEELIKEGYLKG-------------------SIKKKQKKDMVSKPYQYISSDGF 474

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
            +  G++ +QN+ +  +   K D+++H      S  ++K     + V   TL +A     
Sbjct: 475 HIFVGKNNRQNDFLTLKTAHKEDLWLHVQKMPGSHVIVKTE--NRRVSEKTLEEAAILAA 532

Query: 630 CHSQAWDSKMVTSAW 644
            +S+A +S  V   +
Sbjct: 533 YYSKAKNSTNVAVDY 547


>gi|325846551|ref|ZP_08169466.1| putative fibronectin-binding protein [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
 gi|325481309|gb|EGC84350.1| putative fibronectin-binding protein [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
          Length = 582

 Score = 43.9 bits (102), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  V  E+K L  L+G +   +   S    I        +   G++ K+LL   +   R+
Sbjct: 8   TRAVTFEIKKL--LLGAKIQKISQPSKNDIIL------NIYSFGKTYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMN-AHYVI 121
           H T    +    P  F + LRKH+   ++  + Q   DR+I+F+      +G + ++ +I
Sbjct: 60  HITEKKYENPEVPPNFCMVLRKHLSQSKIIGIDQYKLDRVIVFKISSVDEMGFDVSNKLI 119

Query: 122 LELYAQ-GNILLTDSEFTVL 140
           +E+  +  NI+LTD ++ ++
Sbjct: 120 VEIMGKYSNIILTDDKYKII 139


>gi|218290470|ref|ZP_03494590.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius LAA1]
 gi|218239491|gb|EED06686.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius LAA1]
          Length = 594

 Score = 43.9 bits (102), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 43/178 (24%), Positives = 80/178 (44%), Gaps = 34/178 (19%)

Query: 490 KVEVDLALSAHANARRWYELKKKQ-------ESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           ++E+D AL A ANA+R + +  K+       E+++E T+    +  +  E    LQ L +
Sbjct: 369 RIELDPALDAIANAQRLFRMAAKRKRARQWIEAERENTL----RDLRYLEDV--LQALAD 422

Query: 543 KTVANISHMRKVHWFEKF-NW-------------------FISSENYLVISGRDAQQNEM 582
            ++ N+  +R+    + F  W                   F SS+ +++  GR+  QN+ 
Sbjct: 423 TSLENLEEVRRELEAQGFLAWAARRGTGGKRRSGETEPHAFRSSDGFVIRVGRNNVQNDR 482

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +  R   K D+++H      S  VI+  + E+ +P  T+ +A       S+  DS  V
Sbjct: 483 LTFRKADKRDLWLHVKDAPGSHVVIERGQAEE-IPERTIEEAAVLAAYFSRMRDSANV 539


>gi|302854072|ref|XP_002958547.1| hypothetical protein VOLCADRAFT_108171 [Volvox carteri f.
           nagariensis]
 gi|300256122|gb|EFJ40396.1| hypothetical protein VOLCADRAFT_108171 [Volvox carteri f.
           nagariensis]
          Length = 233

 Score = 43.9 bits (102), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 3/70 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 134 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGS--GAGAGVKQAAAAGVCFKCNKPG 190

Query: 967 HLSKDCKEHP 976
           H +K+CKE+P
Sbjct: 191 HFAKECKENP 200


>gi|374849978|dbj|BAL52979.1| fibronectin-binding A domain protein [uncultured candidate division
           OP1 bacterium]
 gi|374856393|dbj|BAL59247.1| fibronectin-binding A domain protein [uncultured candidate division
           OP1 bacterium]
          Length = 576

 Score = 43.9 bits (102), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 8/98 (8%)

Query: 11  VAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTT 69
           V+A V  LR RL G R   +Y   P T   +L        +GE + +L+      R+H T
Sbjct: 8   VSALVAELRERLCGSRVQQIYHPRPSTITLELW-------AGEEQSLLIETAEQPRVHLT 60

Query: 70  AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
                   TPS F + LRK++R   +  V Q   +RII
Sbjct: 61  QQRFPHPKTPSAFCMLLRKYLRNGIIVGVSQPALERII 98


>gi|260935368|gb|ACX54356.1| gag polyprotein [Equine infectious anemia virus]
          Length = 427

 Score = 43.9 bits (102), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
           S   APKVC+KCK+ GH SK C+  P +   G+
Sbjct: 395 SQCRAPKVCFKCKEPGHFSKQCRNAPKNGKQGL 427


>gi|291544501|emb|CBL17610.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Ruminococcus champanellensis 18P13]
          Length = 591

 Score = 43.5 bits (101), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 8/101 (7%)

Query: 11  VAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTA 70
           +  E+ CL   +  R   VY  S ++ I         T+ G  + ++    S  R+H T 
Sbjct: 11  IQGELDCL---LEGRIDKVYQPSRESVILGFR-----TKQGARKLLISAAPSSARVHMTQ 62

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
            A D    P  F + LRKH+   RL  +RQ G +RI+   F
Sbjct: 63  VAVDNPAKPPMFCMLLRKHLTGGRLIAIRQDGLERILFLDF 103



 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 39/175 (22%), Positives = 78/175 (44%), Gaps = 26/175 (14%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK--------------AFKAAE 532
           P  ++ +D+ L+   NA+R+Y  K ++ S  EK +    +              A     
Sbjct: 372 PTVEIPLDVRLTPSQNAQRYYA-KYRKASTAEKVLVEQIRNGEEELRYIDSVFDALTRCT 430

Query: 533 KKTRLQILQEKTVANISHMRKVHWFEKFN------WFISSENYLVISGRDAQQNEMIVKR 586
            +T + +L+E+ +A   ++R      K         F SS+ + ++ GR+ +QN+ +  +
Sbjct: 431 SETDIAVLREE-LAGEGYLRAARRGTKPARSQPPLVFRSSDGFQILVGRNNRQNDQLTLK 489

Query: 587 YMSKGDVYVHAD-LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
             +K D+++H   + G+   V+   R    +P  T+ +A      HS+  DS  V
Sbjct: 490 QAAKQDLWLHTQGIPGSHVIVVSQGR---EIPESTIYEAALLAAHHSKGRDSAQV 541


>gi|300244841|gb|ADJ93853.1| gag polyprotein [Equine infectious anemia virus]
          Length = 426

 Score = 43.5 bits (101), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 30/64 (46%), Gaps = 2/64 (3%)

Query: 922 LASAGKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDS 979
           LA + K     G P     + +   KP    S   APKVC+KCK+ GH SK C+  P + 
Sbjct: 363 LAGSMKGGICKGGPLKAPQTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRNAPKNG 422

Query: 980 SHGV 983
             G 
Sbjct: 423 KQGA 426


>gi|386758287|ref|YP_006231503.1| hypothetical protein MY9_1710 [Bacillus sp. JS]
 gi|384931569|gb|AFI28247.1| hypothetical protein MY9_1710 [Bacillus sp. JS]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H T    +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITTQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|261872050|gb|ACY02859.1| gag polyprotein [Equine infectious anemia virus]
          Length = 426

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 933 GDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
           G P     + +  +KP    S    PKVC+KCK+ GH S+ C+ +P +   G 
Sbjct: 374 GGPIKAKQTCYNCRKPGHLSSQCRTPKVCFKCKEPGHFSRQCRNNPKNGKQGA 426


>gi|20807959|ref|NP_623130.1| RNA-binding protein snRNP [Thermoanaerobacter tengcongensis MB4]
 gi|20516530|gb|AAM24734.1| predicted RNA-binding protein homologous to eukaryotic snRNP
           [Thermoanaerobacter tengcongensis MB4]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 13  AEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTA 70
           A VK L++ I G R   +Y    +  IF       +   G++ K+LL   +   R+H T 
Sbjct: 10  AIVKELKKEIEGGRIEKIYQPEKEDLIF------TIRSKGKNYKLLLSANANYPRIHLTK 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             R+    P  F + LRKH++  R+ ++RQ+ +DRI+
Sbjct: 64  EDRENPLEPPMFCMLLRKHLQNGRIAEIRQVEFDRIV 100


>gi|256545176|ref|ZP_05472542.1| fibronectin-binding protein [Anaerococcus vaginalis ATCC 51170]
 gi|256399217|gb|EEU12828.1| fibronectin-binding protein [Anaerococcus vaginalis ATCC 51170]
          Length = 582

 Score = 43.5 bits (101), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 46  GVTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
            +   G+S K+LL   +   R+H T    +   +P  F + LRK++   ++ ++ Q   D
Sbjct: 38  NIYSVGKSYKLLLSANNNEARVHITEKKYENPISPPNFCMVLRKYLNQSKIVEIEQYKMD 97

Query: 105 RIILFQFG----LGMN-AHYVILELYAQ-GNILLTDSEFTVL 140
           R+I+F       +G + ++ +I+E+  +  NI+LTD  + ++
Sbjct: 98  RVIIFHISSVDEMGFDISNKLIVEIMGKYSNIILTDENYKII 139


>gi|390357067|ref|XP_003728921.1| PREDICTED: uncharacterized protein LOC100894010 [Strongylocentrotus
           purpuratus]
          Length = 1702

 Score = 43.5 bits (101), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 86/203 (42%), Gaps = 11/203 (5%)

Query: 775 GIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSA--SISSTKHGIETTQFDLSEEDKHVE 832
           G+   +  ++  + A V    E L D + GL      + S   G E     +  ++   E
Sbjct: 122 GLKETVQPLSTELHARVQKSGETLADFSSGLIRLYDRMESAASGDERAALTMLRDNTLKE 181

Query: 833 RTAT-VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGK 891
           R  T VRDK  I +  RR L   +G   +D + E  +    D +      +R+ ++E  +
Sbjct: 182 RFVTGVRDK-QIQRELRRILFSAEGKPFIDMRKEVLQTFQDDDTVTSRPSIRECEVETAR 240

Query: 892 IS-RGQKGKLKKMKEKYGDQDEEERNIRMALLA-SAGKVQKNDGDPQNENASTHKEKKPA 949
            S   +   +K MK +  +  E  + +  A+   +    Q +     N N   H +++  
Sbjct: 241 ASVTAEDQTIKSMKSEITELKETLKEVVQAMRGMTNNPRQSSTSFCYNCNKKGHLKRE-- 298

Query: 950 ISPVDAPKVCYKCKKAGHLSKDC 972
               ++P +CY CK+ GH+ +DC
Sbjct: 299 ---CNSPTLCYGCKQTGHMRRDC 318


>gi|227499520|ref|ZP_03929627.1| fibrinogen-binding protein [Anaerococcus tetradius ATCC 35098]
 gi|227218399|gb|EEI83650.1| fibrinogen-binding protein [Anaerococcus tetradius ATCC 35098]
          Length = 582

 Score = 43.5 bits (101), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/144 (22%), Positives = 66/144 (45%), Gaps = 15/144 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  +  E+K   +L+G +   +   S    +F L +       G+S K+LL   +   R+
Sbjct: 8   TRKIVNELK--EKLLGAKIQKISQPSKNDIVFNLYSM------GKSYKLLLSANNNEARI 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYV 120
           + T    +  +    F + LRKHI   ++ +++Q G DR+++F      + G   +   +
Sbjct: 60  NITKRKFENPDIAPNFCMVLRKHINQGKIIEIKQKGLDRVVIFSIASIDEMGFDTSKKLI 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I  +    NI+L D  + ++  ++
Sbjct: 120 IEIMGKYSNIVLVDDNYKIIDAIK 143


>gi|296331140|ref|ZP_06873614.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii ATCC 6633]
 gi|305674295|ref|YP_003865967.1| persistent RNA/DNA binding protein [Bacillus subtilis subsp.
           spizizenii str. W23]
 gi|296151784|gb|EFG92659.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii ATCC 6633]
 gi|305412539|gb|ADM37658.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii str. W23]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           ++ G R + ++       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMTGGRITKIHQPYKHDVIFH------IRVNGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|269864365|ref|XP_002651547.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064321|gb|EED42509.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 322

 Score = 43.5 bits (101), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 51/224 (22%), Positives = 90/224 (40%), Gaps = 40/224 (17%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 129 MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 179

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 180 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 239

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 240 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 277

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSEN 568
                K  ++ +Q K      H+    R  +WFEKF++FIS  N
Sbjct: 278 -----KIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENN 316


>gi|341868845|gb|AEK98540.1| gag protein [Equine infectious anemia virus]
          Length = 426

 Score = 43.5 bits (101), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 35/78 (44%), Gaps = 14/78 (17%)

Query: 918 RMALLASA----------GKVQKNDGDPQNENASTHKEKKPA--ISPVDAPKVCYKCKKA 965
           +M LLA A          G V K  G P     + +   KP    S   APK+C+KCK+ 
Sbjct: 351 KMMLLARALQTGLAGPMKGGVLK--GGPLKAKQTCYNCGKPGHLSSQCRAPKLCFKCKEP 408

Query: 966 GHLSKDCKEHPDDSSHGV 983
           GH SK CK  P +   G 
Sbjct: 409 GHFSKQCKNAPKNGKQGA 426


>gi|410583545|ref|ZP_11320651.1| putative RNA-binding protein, snRNP like protein [Thermaerobacter
           subterraneus DSM 13965]
 gi|410506365|gb|EKP95874.1| putative RNA-binding protein, snRNP like protein [Thermaerobacter
           subterraneus DSM 13965]
          Length = 696

 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 16/164 (9%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV- 64
           MN   +AA ++ L  L+  R   +Y   P   + +L        +G    +L+  +  + 
Sbjct: 1   MNGLVLAAVLQELSSLLPARVERIYQPEPHLLVLRLY-------AGREVHLLIGADPSLP 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQ-LGYDRIILFQF---GLGMNA--H 118
           RLH TA        P  F + LRKH+ + RL    Q   +DR +   F   G    A   
Sbjct: 54  RLHLTARPPANPPAPPAFCMLLRKHLESLRLVAAHQGPAFDRWVQLAFVAPGPDEPARRR 113

Query: 119 YVILELYA-QGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           Y+I+EL   + N++LTD E  +L  LR    D    +++   RY
Sbjct: 114 YLIVELLERRANVVLTDGEGRILDALR-RTPDSASRSLLPGSRY 156


>gi|398304107|ref|ZP_10507693.1| persistent RNA/DNA binding protein [Bacillus vallismortis DV1-F-3]
          Length = 570

 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G+++K+LL    S  R+H T  A +  + P  F + LRKHI    +E + Q G DR
Sbjct: 39  IRANGKNQKLLLSAHPSYSRVHITTQAYENPSEPPMFCMLLRKHIEGGFIEKIEQAGLDR 98

Query: 106 IILFQF-GLGMNAHYVILELYAQ-----GNILLTD 134
           I++F            + +LY +      NI+LTD
Sbjct: 99  IMIFHIKSRNEIGDETVRKLYVEIMGRHSNIILTD 133


>gi|302391733|ref|YP_003827553.1| fibronectin-binding A domain protein [Acetohalobium arabaticum DSM
           5501]
 gi|302203810|gb|ADL12488.1| Fibronectin-binding A domain protein [Acetohalobium arabaticum DSM
           5501]
          Length = 589

 Score = 43.1 bits (100), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F SS+ + +  GR+  QN+ +VK   S  D+++HA     S  +IKNH  ++ VP  T+ 
Sbjct: 469 FKSSDGFDIRVGRNNHQNDKLVKYESSDQDLWLHAKDIPGSHVIIKNHTRDE-VPQNTIE 527

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           +A      +S+  +S  V   + +  H
Sbjct: 528 EAAHLAAYYSKGKNSSNVPVDYALAKH 554



 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 19/156 (12%)

Query: 12  AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTA 70
           A +++    LIG R   +Y   PK  +  L       + GE+ K+LL       R+H T 
Sbjct: 10  AIKIELEEELIGGRLDKIY--QPKENLLTLR----FRQPGENIKLLLSASPQNPRIHITD 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV---ILELYAQ 127
              +    P  F + LRKH+   RL  + Q  ++RI+        N   +   IL +   
Sbjct: 64  SDHENPLRPPTFCMLLRKHLEHGRLRKIEQPDFERILKIYIDSKNNQGEIETKILLIEVM 123

Query: 128 G---NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           G   NI+L D++  +L  ++    D      MSRHR
Sbjct: 124 GRHSNIILIDNKNQILDSIKRVTSD------MSRHR 153


>gi|295706340|ref|YP_003599415.1| fibronectin-binding protein [Bacillus megaterium DSM 319]
 gi|294803999|gb|ADF41065.1| fibronectin-binding protein [Bacillus megaterium DSM 319]
          Length = 573

 Score = 43.1 bits (100), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 20/148 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 22  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 75

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLT 133
             F + LRKH+    +E V QLG DRI++ +      +G +    +I+E+  +  N++L 
Sbjct: 76  PMFCMLLRKHLEGSIIEQVYQLGLDRILVMETKGRNEIGDVTYKQLIIEIMGRHSNVVLV 135

Query: 134 DSEFTVLTLLRSHRDDDKGVAI-MSRHR 160
           D E   +       D  K V + ++RHR
Sbjct: 136 DKEKQTII------DSIKHVPMALNRHR 157


>gi|373497493|ref|ZP_09588017.1| hypothetical protein HMPREF0402_01890 [Fusobacterium sp. 12_1B]
 gi|371963247|gb|EHO80817.1| hypothetical protein HMPREF0402_01890 [Fusobacterium sp. 12_1B]
          Length = 541

 Score = 43.1 bits (100), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVI-LELYAQ-GNILLTD 134
           G    +RKH+    L DV+QLG+DRI+ F+F     LG   +Y I  E+  +  N + TD
Sbjct: 71  GLAANMRKHLLNAMLTDVQQLGFDRILCFKFAKINELGEVKNYSIYFEIMGKYSNFIFTD 130

Query: 135 SEFTVLTLLR 144
            +  ++ LL+
Sbjct: 131 EDDRIIDLLK 140


>gi|300244839|gb|ADJ93852.1| gag polyprotein [Equine infectious anemia virus]
          Length = 426

 Score = 43.1 bits (100), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 951 SPVDAPKVCYKCKKAGHLSKDCKEHPDDSSHGV 983
           S   APKVC+KCK+ GH SK C+  P +   G 
Sbjct: 394 SQCRAPKVCFKCKQPGHFSKQCRNAPKNGKQGA 426


>gi|404366578|ref|ZP_10971960.1| hypothetical protein FUAG_01772 [Fusobacterium ulcerans ATCC 49185]
 gi|313689422|gb|EFS26257.1| hypothetical protein FUAG_01772 [Fusobacterium ulcerans ATCC 49185]
          Length = 541

 Score = 43.1 bits (100), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVI-LELYAQ-GNILLTD 134
           G    +RKH+    L DV+QLG+DRI+ F+F     LG   +Y I  E+  +  N + TD
Sbjct: 71  GLAANMRKHLLNAMLTDVQQLGFDRILCFKFAKINELGEIKNYSIYFEIMGKYSNFIFTD 130

Query: 135 SEFTVLTLLR 144
            +  ++ LL+
Sbjct: 131 EDDRIIDLLK 140


>gi|254479575|ref|ZP_05092888.1| Fibronectin-binding protein A domain protein [Carboxydibrachium
           pacificum DSM 12653]
 gi|214034487|gb|EEB75248.1| Fibronectin-binding protein A domain protein [Carboxydibrachium
           pacificum DSM 12653]
          Length = 469

 Score = 43.1 bits (100), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 13  AEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTA 70
           A VK L++ I G R   +Y    +  IF       +   G++ K+LL   +   R+H T 
Sbjct: 12  AIVKELKKEIEGGRIEKIYQPEKEDLIF------TIRSKGKNYKLLLSANANYPRIHLTK 65

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             R+    P  F + LRKH++  R+ ++RQ+ +DRI+
Sbjct: 66  EDRENPLEPPMFCMLLRKHLQNGRIAEIRQVEFDRIV 102


>gi|384045157|ref|YP_005493174.1| Fibronectin-binding A-like protein [Bacillus megaterium WSH-002]
 gi|345442848|gb|AEN87865.1| Fibronectin-binding A-like protein [Bacillus megaterium WSH-002]
          Length = 570

 Score = 43.1 bits (100), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 58/123 (47%), Gaps = 13/123 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 19  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLT 133
             F + LRKH+    +E V QLG DRI++ +      +G +    +I+E+  +  N++L 
Sbjct: 73  PMFCMLLRKHLEGSIIEQVYQLGLDRILVIETKGRNEIGDVTYKQLIIEIMGRHSNVVLV 132

Query: 134 DSE 136
           D E
Sbjct: 133 DKE 135


>gi|294500991|ref|YP_003564691.1| fibronectin-binding protein [Bacillus megaterium QM B1551]
 gi|294350928|gb|ADE71257.1| fibronectin-binding protein [Bacillus megaterium QM B1551]
          Length = 573

 Score = 43.1 bits (100), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 22  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 75

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
             F + LRKH+    +E V QLG DRI++ +
Sbjct: 76  PMFCMLLRKHLEGSIIEQVYQLGLDRILVIE 106


>gi|312111736|ref|YP_003990052.1| Fibronectin-binding A domain-containing protein [Geobacillus sp.
           Y4.1MC1]
 gi|423720651|ref|ZP_17694833.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidans TNO-09.020]
 gi|311216837|gb|ADP75441.1| Fibronectin-binding A domain protein [Geobacillus sp. Y4.1MC1]
 gi|383366004|gb|EID43295.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidans TNO-09.020]
          Length = 571

 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 51  GESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G + K+LL       R+H T    D    P  F + LRKH+    +E +RQ+ +DRII+ 
Sbjct: 43  GRNYKLLLSAHPNYARVHLTNETYDNPAEPPMFCMLLRKHLEGSIIEAIRQVDFDRIIII 102

Query: 110 QFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +      +G ++A  +I+E+  +  NI+L D E
Sbjct: 103 ETKGRDEIGDIHAKQLIIEIMGRHSNIILVDEE 135


>gi|336236110|ref|YP_004588726.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidasius C56-YS93]
 gi|335362965|gb|AEH48645.1| Fibronectin-binding A domain protein [Geobacillus
           thermoglucosidasius C56-YS93]
          Length = 571

 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 51  GESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G + K+LL       R+H T    D    P  F + LRKH+    +E +RQ+ +DRII+ 
Sbjct: 43  GRNYKLLLSAHPNYARVHLTNETYDNPAEPPMFCMLLRKHLEGSIIEAIRQVDFDRIIII 102

Query: 110 QFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +      +G ++A  +I+E+  +  NI+L D E
Sbjct: 103 ETKGRDEIGDIHAKQLIIEIMGRHSNIILVDEE 135


>gi|126649682|ref|ZP_01721918.1| hypothetical protein BB14905_15830 [Bacillus sp. B14905]
 gi|126593401|gb|EAZ87346.1| hypothetical protein BB14905_15830 [Bacillus sp. B14905]
          Length = 591

 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 13/126 (10%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKK 76
           L++L+  R + ++  + +  I        V  +G++ K+L  + S   R+H T  + +  
Sbjct: 42  LQQLVTGRITKIHQPNAQEVILH------VRANGKNHKLLFSIHSSYARVHLTEQSIENP 95

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNI 130
             P  F + LRKH+    +  V+QLG+DRII+ +          ++ +L+A+      N+
Sbjct: 96  AEPPMFCMLLRKHLEGGFISSVKQLGFDRIIIVEIESKNEIGDPIVRQLHAEIMGRHSNL 155

Query: 131 LLTDSE 136
           LL D E
Sbjct: 156 LLIDKE 161


>gi|212696157|ref|ZP_03304285.1| hypothetical protein ANHYDRO_00693 [Anaerococcus hydrogenalis DSM
           7454]
 gi|212676786|gb|EEB36393.1| hypothetical protein ANHYDRO_00693 [Anaerococcus hydrogenalis DSM
           7454]
          Length = 326

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  V  E+K L  L+G +   +   S    I        +   G++ K+LL   +   R+
Sbjct: 8   TRAVTFEIKKL--LLGAKIQKISQPSKNDIIL------NIYSFGKTYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMN-AHYVI 121
           H T    +    P  F + LRKH+   ++  + Q   DR+I+F+      +G + ++ +I
Sbjct: 60  HITEKKYENPEVPPNFCMVLRKHLSQSKIIGIDQYKLDRVIVFKISSVDEMGFDVSNKLI 119

Query: 122 LELYAQ-GNILLTDSEFTVL 140
           +E+  +  NI+LTD ++ ++
Sbjct: 120 VEIMGKYSNIILTDDKYKII 139


>gi|328948692|ref|YP_004366029.1| fibronectin-binding A domain-containing protein [Treponema
           succinifaciens DSM 2489]
 gi|328449016|gb|AEB14732.1| Fibronectin-binding A domain protein [Treponema succinifaciens DSM
           2489]
          Length = 482

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 56  VLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           V+       R++ T     K   P  F   L+  ++  R+   +QLG DRI+ F      
Sbjct: 47  VICTSPQSCRINKTNSKSPKNEKPLRFNEFLKSRVQGMRINSCKQLGLDRIVKFDVSTWK 106

Query: 116 NAHYVILELYAQ-GNILLTDSEFTVLTLL--RSHRDDDKGVAIMSRHRYPTE 164
           +  ++   L++   NI++TD    +L  L  R  +D+  G   + + + PTE
Sbjct: 107 DRLFIYARLWSNAANIIVTDENGKILDCLYRRPAKDEITGGVFVPQEKIPTE 158


>gi|258511297|ref|YP_003184731.1| fibronectin-binding A domain-containing protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius DSM 446]
 gi|257478023|gb|ACV58342.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius DSM 446]
          Length = 594

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 41/178 (23%), Positives = 80/178 (44%), Gaps = 34/178 (19%)

Query: 490 KVEVDLALSAHANARRWYELKKKQ-------ESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           ++E+D AL A ANA+R + +  K+       E+++E T+    +  +  E    LQ L +
Sbjct: 369 RIELDPALDAIANAQRLFRMAAKRKRARQWIEAERENTL----RDLRYLEDV--LQALAD 422

Query: 543 KTVANISHMRKVHWFEKF--------------------NWFISSENYLVISGRDAQQNEM 582
            ++ N+  +R+    + F                    + F SS+ +++  GR+  QN+ 
Sbjct: 423 TSLENLEEVRRELQAQGFLARADRRGTGGKRRAAESEPHAFRSSDGFVIRVGRNNVQNDR 482

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +  R   K D+++H      S  VI+  + ++ +P  T+ +A       S+  DS  V
Sbjct: 483 LTFRRADKRDLWLHVKDAPGSHVVIERGQADE-IPERTIEEAAALAAYFSRMRDSANV 539


>gi|333978737|ref|YP_004516682.1| fibronectin-binding A domain-containing protein [Desulfotomaculum
           kuznetsovii DSM 6115]
 gi|333822218|gb|AEG14881.1| Fibronectin-binding A domain protein [Desulfotomaculum kuznetsovii
           DSM 6115]
          Length = 585

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+  R   +Y  SP   I  L++  G      +  +L       R+H T   R+   +P 
Sbjct: 19  LLDGRIDRIYQPSP-LEIHLLIHRPGT----RARLLLSAHPENARVHLTGRVRENPPSPP 73

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
            F + LRKH+   R+  ++Q G DR+++FQ
Sbjct: 74  VFCMVLRKHLEGGRIRGIQQRGLDRVLVFQ 103


>gi|443895584|dbj|GAC72930.1| E3 ubiquitin ligase interacting with arginine methyltransferase
           [Pseudozyma antarctica T-34]
          Length = 130

 Score = 42.7 bits (99), Expect = 1.3,   Method: Composition-based stats.
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 956 PKVCYKCKKAGHLSKDCKEHPDDSSHG 982
           PK CYKC + GH+S+DC  +P  SS G
Sbjct: 47  PKTCYKCNETGHISRDCPSNPAPSSGG 73


>gi|268610540|ref|ZP_06144267.1| fibronectin-binding A-like protein [Ruminococcus flavefaciens FD-1]
          Length = 597

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKKNTP 79
           LIG R   ++  S +  +  +   +G      S+K+ +   +G  R+H T  + D   TP
Sbjct: 31  LIGGRVEKIHQPSREEIVISIRTRNG------SKKLYISANAGSARVHLTEKSVDNPQTP 84

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRK + + +L D+RQ G +RI+   F
Sbjct: 85  PMFCMLLRKRLGSGKLIDIRQDGLERILFLDF 116


>gi|359417662|ref|ZP_09209759.1| RNA-binding protein, partial [Candidatus Haloredivivus sp. G17]
 gi|358031981|gb|EHK00788.1| RNA-binding protein [Candidatus Haloredivivus sp. G17]
          Length = 101

 Score = 42.4 bits (98), Expect = 1.4,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 6/64 (9%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           + Y RD    P GF ++LRKH+   +++ + Q G+DRI++ + G       +I EL+ +G
Sbjct: 35  SKYKRDNPMKPPGFCMELRKHL--GKVDRIEQKGFDRILVIESG----DTKLICELFGRG 88

Query: 129 NILL 132
           N +L
Sbjct: 89  NYIL 92


>gi|256828054|ref|YP_003156782.1| hypothetical protein Dbac_0239 [Desulfomicrobium baculatum DSM
           4028]
 gi|256577230|gb|ACU88366.1| protein of unknown function DUF814 [Desulfomicrobium baculatum DSM
           4028]
          Length = 498

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%)

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           K   + SS+ +L++ GR AQ N  ++ +  S  D ++HA     +  ++K   P Q VP 
Sbjct: 369 KVQAYRSSDGFLIVRGRSAQANHQLLTQAASPFDYWLHAQDGPGAHVIVKRDFPAQEVPE 428

Query: 619 LTLNQ 623
            T+ Q
Sbjct: 429 RTIQQ 433


>gi|302872268|ref|YP_003840904.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor obsidiansis OB47]
 gi|302575127|gb|ADL42918.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           obsidiansis OB47]
          Length = 585

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|225019375|ref|ZP_03708567.1| hypothetical protein CLOSTMETH_03328 [Clostridium methylpentosum
           DSM 5476]
 gi|224948006|gb|EEG29215.1| hypothetical protein CLOSTMETH_03328 [Clostridium methylpentosum
           DSM 5476]
          Length = 582

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 51  GESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G S ++LL    S  R+H T++  +   TP  F + LRKH+ + +L  VRQL  DR++  
Sbjct: 42  GGSGRLLLSASASNARIHFTSFPPENPKTPPMFCMLLRKHLGSGKLIAVRQLELDRVLCL 101

Query: 110 QF 111
            F
Sbjct: 102 DF 103


>gi|395213235|ref|ZP_10400120.1| hypothetical protein O71_05359 [Pontibacter sp. BAB1700]
 gi|394456814|gb|EJF11060.1| hypothetical protein O71_05359 [Pontibacter sp. BAB1700]
          Length = 523

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 47/100 (47%), Gaps = 8/100 (8%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
              +E + ++ G+ AQ N+++ +R+  K D+++HA     S  VIK H+  + VP   L 
Sbjct: 406 LFETEGFKILVGKSAQNNDLLTQRHTYKEDIWLHAKDVSGSHVVIK-HQAGKTVPATVLE 464

Query: 623 QAGCFTVCHSQAWDSKMV----TSAWWVYPHQVSKTAPTG 658
           +A      +S+     +     T   WV   +  K AP G
Sbjct: 465 KAAQLAAYYSKRKSDTLCPVLYTPKKWV---RKPKGAPAG 501


>gi|329768836|ref|ZP_08260265.1| hypothetical protein HMPREF0433_00029 [Gemella sanguinis M325]
 gi|328838229|gb|EGF87842.1| hypothetical protein HMPREF0433_00029 [Gemella sanguinis M325]
          Length = 555

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 6/90 (6%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNA 117
           S  R+  T  + +   TPS F   LRK++    +E++ Q+  DRII F+      LG   
Sbjct: 54  SASRIQLTNNSYENPQTPSNFCSVLRKYLMGGIIEEINQINNDRIIKFKIKNFDELGYEK 113

Query: 118 HY-VILELYAQ-GNILLTDSEFTVLTLLRS 145
           +Y +I EL  +  NI+LT+S+  ++  L++
Sbjct: 114 YYFLITELMGKHSNIILTNSDNIIIESLKN 143


>gi|205373309|ref|ZP_03226113.1| fibronectin-binding protein / fibrinogen-binding protein [Bacillus
           coahuilensis m4-4]
          Length = 545

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 13/129 (10%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYAR 73
           V  L+ LIG R + V+   P    FKL     +  +G+++K+LL    S  R+  T  + 
Sbjct: 12  VNELQPLIGGRINKVH--QP----FKLEILLNIRANGKNQKLLLSSHPSYARVQLTEQSY 65

Query: 74  DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ- 127
           D   TP  F + LRKH+    +E++ Q   +R+I+ +      +G ++   +I+E+  + 
Sbjct: 66  DNPTTPPMFCMLLRKHLEGYIIENIYQKDLERMIIMEVKGRNEIGDISYKQLIIEIMGRH 125

Query: 128 GNILLTDSE 136
            NI+L D E
Sbjct: 126 SNIILVDKE 134


>gi|312793063|ref|YP_004025986.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor kristjanssonii 177R1B]
 gi|312180203|gb|ADQ40373.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           kristjanssonii 177R1B]
          Length = 585

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|311068085|ref|YP_003973008.1| persistent RNA/DNA binding protein [Bacillus atrophaeus 1942]
 gi|419823934|ref|ZP_14347467.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           C89]
 gi|310868602|gb|ADP32077.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           1942]
 gi|388471971|gb|EIM08761.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           C89]
          Length = 570

 Score = 42.0 bits (97), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 56/122 (45%), Gaps = 13/122 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + ++       IF       +  +G+++K+LL    S  R+H T    +  + 
Sbjct: 18  RITGGRITKIHQPFKHDVIFH------IRANGKNQKLLLSAHPSYSRVHLTNQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIESIEQSGMDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TD 134
           TD
Sbjct: 132 TD 133


>gi|295696031|ref|YP_003589269.1| fibronectin-binding A domain-containing protein [Kyrpidia tusciae
           DSM 2912]
 gi|295411633|gb|ADG06125.1| Fibronectin-binding A domain protein [Kyrpidia tusciae DSM 2912]
          Length = 599

 Score = 42.0 bits (97), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 12/121 (9%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISSE   +  G++ +QN+ +  +   K D ++HA     S  VI++    + VPP TL 
Sbjct: 478 FISSEGIDIFVGKNNRQNDELTTKTAHKQDTWLHAQNIPGSHVVIRS----REVPPKTLE 533

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI--RGKKNFLPPHPL 680
           +A      +S+A  +  V   + +  H V K  PTG       F++    K  F+PP P 
Sbjct: 534 EAARLAAYYSKARHAGTVAVDYTLVKH-VWK--PTG---ARPGFVLYDHQKTVFVPPDPA 587

Query: 681 I 681
           +
Sbjct: 588 L 588


>gi|300813244|ref|ZP_07093609.1| putative fibronectin-binding protein [Peptoniphilus sp. oral taxon
           836 str. F0141]
 gi|300512651|gb|EFK39786.1| putative fibronectin-binding protein [Peptoniphilus sp. oral taxon
           836 str. F0141]
          Length = 584

 Score = 42.0 bits (97), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 57  LLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-- 112
           LLL  SG   R+H T    D  + P  F + LRKH+    L  + Q   DRII F F   
Sbjct: 48  LLLSASGNYPRVHLTENIIDNPSNPPAFCMLLRKHLEGSILNQITQYKMDRIIKFDFSSK 107

Query: 113 --LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLR 144
             LG +    +ILE+  +  NI+L + +  +L  L+
Sbjct: 108 DELGLLEDKSLILEIMGKYSNIILVNKDSKILDSLK 143


>gi|168031469|ref|XP_001768243.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680421|gb|EDQ66857.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 779

 Score = 42.0 bits (97), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 83/383 (21%), Positives = 161/383 (42%), Gaps = 66/383 (17%)

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
           G GP L+  +I  +GL P+M  + + + E  ++ V+ L      DWL+ V+       G 
Sbjct: 376 GVGPGLAVELISRSGLSPSMDPAAMTEDEWFSLHVVWL------DWLR-VLEESTFKPGL 428

Query: 309 ILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI- 365
           +     +  LG D P     S+ Q  ++    +L               A LD++Y+++ 
Sbjct: 429 VRSTGSYSVLGGDGPYIL--STDQDSEDAATGIL---------------AMLDDYYTRVY 471

Query: 366 ---ESQRAEQQHKAKEDAAFHKL-NKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
              + Q+  QQ  AK  AA  K  +K+++ ++    ++  E  +  KMA+L+  NL   +
Sbjct: 472 ETEKFQQLRQQLVAKVSAATKKAQSKVNLFEDQIKASM--EYSKISKMADLLMANLHVCE 529

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI----DKLYLERNCMSLLLSNNLD 477
              L++ +        E+   +  + R+     A  +     KL      ++ LL+   D
Sbjct: 530 PGALSITLP---DFETEEPTTIALDPRQTALVTAQKLYKRSQKLKKSEKAVAPLLAEARD 586

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           E+            +V+++L       R  +L+  +E + E    A+ K   A       
Sbjct: 587 ELTYLS--------QVEVSLQQLDRYTRSTDLRSLEEVRDELVEGAYLKPIIAGTPPPSS 638

Query: 538 QILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  ++ +  +   ++MR+         F S   Y V+ GR+ +QN+++  R  ++ D++ 
Sbjct: 639 KRKKKSSPLDNFAANMRR---------FTSPSGYEVLVGRNNRQNDVLANRVATEYDLWF 689

Query: 596 HADLHGASSTVIKNHRPEQPVPP 618
           HA     S TV++       VPP
Sbjct: 690 HARNIPGSHTVLR-------VPP 705


>gi|386714204|ref|YP_006180527.1| fibronectin/fibrinogen-binding protein [Halobacillus halophilus DSM
           2266]
 gi|384073760|emb|CCG45253.1| fibronectin/fibrinogen-binding protein, putative [Halobacillus
           halophilus DSM 2266]
          Length = 578

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 31/52 (59%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
           F+SS+  L+  GR+ +QNE +  R  +K D+++HA     S  VI+N  P +
Sbjct: 453 FLSSDGTLIYVGRNNKQNEYLTNRMANKSDIWLHAKDIPGSHVVIRNEDPSE 504


>gi|406981505|gb|EKE02969.1| hypothetical protein ACD_20C00301G0015 [uncultured bacterium]
          Length = 587

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 39/177 (22%), Positives = 77/177 (43%), Gaps = 23/177 (12%)

Query: 491 VEVDLALSAHANARRWYEL--KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           +++D   S +ANA+R+Y+L  K K  S+  K I    +      +     I Q  ++A++
Sbjct: 373 IQLDPVKSPNANAQRYYKLYNKAKTASRISKDIVRQVQEELDYLESIETFINQSDSLADL 432

Query: 549 SHMR--------------KVHWFEKF-------NWFISSENYLVISGRDAQQNEMIVKRY 587
             ++              ++   EK        + + S++ Y +  G++ +QNE ++ + 
Sbjct: 433 KQIKDELISQNLLKTTGKQIKSPEKLKKEGISLSEYTSTDGYKIYVGKNNRQNEYLISKI 492

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            S  D+++H      S  +IK +     VP  T+ +A       SQA +S  V   +
Sbjct: 493 ASPNDIWLHTQNIPGSHVLIKINDENVEVPASTIEEAASIAAYFSQAKNSANVAVIY 549


>gi|336400749|ref|ZP_08581522.1| hypothetical protein HMPREF0404_00813 [Fusobacterium sp. 21_1A]
 gi|423136512|ref|ZP_17124155.1| hypothetical protein HMPREF9942_00293 [Fusobacterium nucleatum
           subsp. animalis F0419]
 gi|336161774|gb|EGN64765.1| hypothetical protein HMPREF0404_00813 [Fusobacterium sp. 21_1A]
 gi|371961666|gb|EHO79290.1| hypothetical protein HMPREF9942_00293 [Fusobacterium nucleatum
           subsp. animalis F0419]
          Length = 541

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  NI+ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNIIFTDEENKI 136

Query: 140 LTLLR 144
           L  L+
Sbjct: 137 LDTLK 141


>gi|169827056|ref|YP_001697214.1| hypothetical protein Bsph_1482 [Lysinibacillus sphaericus C3-41]
 gi|168991544|gb|ACA39084.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
          Length = 587

 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 7/94 (7%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKK 76
           L++L+  R + ++  + +  +        V  +G++ K+L  + S   R+H T    +  
Sbjct: 38  LQQLVTGRITKIHQPNAQEVVLH------VRANGKNHKLLFSIHSSYARVHLTEQTIENP 91

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
             P  F + LRKH+    +  V+QLG+DRII+ +
Sbjct: 92  AEPPMFCMLLRKHLEGGFISSVKQLGFDRIIIVE 125


>gi|260890517|ref|ZP_05901780.1| hypothetical protein GCWU000323_01695 [Leptotrichia hofstadii
           F0254]
 gi|260859759|gb|EEX74259.1| hypothetical protein GCWU000323_01695 [Leptotrichia hofstadii
           F0254]
          Length = 322

 Score = 41.6 bits (96), Expect = 2.3,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 53/104 (50%), Gaps = 11/104 (10%)

Query: 68  TTAYARDKKNT----PSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNA 117
           T  Y +D+K+      S F L L+KH++   L ++RQ G+DRI+ F      QFG  +  
Sbjct: 54  TIFYLKDEKDPNTDFQSKFLLSLKKHLQNSILINIRQEGFDRIVYFDFEKLNQFG-DVEK 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           + +I+E+  + + +   S+  +L+ L     D     IM+  RY
Sbjct: 113 YTLIIEIMGKASNIFLTSKDKILSALYFTSIDVGNRVIMTGARY 156


>gi|158320452|ref|YP_001512959.1| fibronectin-binding A domain-containing protein [Alkaliphilus
           oremlandii OhILAs]
 gi|158140651|gb|ABW18963.1| Fibronectin-binding A domain protein [Alkaliphilus oremlandii
           OhILAs]
          Length = 593

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 51/97 (52%), Gaps = 7/97 (7%)

Query: 47  VTESGESEKVLLLMESGV-RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G++ K+LL  +S   ++H T   ++  ++P  F + LRKH+   R+ D+ Q  ++R
Sbjct: 39  VRSNGKNHKILLSADSNYPKIHFTTSNKENPSSPPNFCMVLRKHLMGGRIVDIVQPQFER 98

Query: 106 II------LFQFGLGMNAHYVILELYAQGNILLTDSE 136
           I+      L +  +  +   +I  +    NI+L DSE
Sbjct: 99  IVKIIIESLDELNILKSKELMIEIMGKHSNIILVDSE 135


>gi|282881987|ref|ZP_06290628.1| fibronectin-binding protein [Peptoniphilus lacrimalis 315-B]
 gi|281298017|gb|EFA90472.1| fibronectin-binding protein [Peptoniphilus lacrimalis 315-B]
          Length = 584

 Score = 41.6 bits (96), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 57  LLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-- 112
           LLL  SG   R+H T    D  + P  F + LRKH+    L  + Q   DRII F F   
Sbjct: 48  LLLSASGNYPRVHLTENLIDNPSNPPAFCMLLRKHLEGSILNKITQYKMDRIIKFDFSSK 107

Query: 113 --LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLR 144
             LG +    +ILE+  +  NI+L + +  +L  L+
Sbjct: 108 DELGLLEDKSLILEIMGKYSNIILVNKDSKILDSLK 143


>gi|312621969|ref|YP_004023582.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor kronotskyensis 2002]
 gi|312202436|gb|ADQ45763.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 585

 Score = 41.6 bits (96), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLV 523

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A  S  V
Sbjct: 524 EAALLASYFSKAKHSTKV 541


>gi|312127148|ref|YP_003992022.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor hydrothermalis 108]
 gi|311777167|gb|ADQ06653.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           hydrothermalis 108]
          Length = 585

 Score = 41.6 bits (96), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A  S  V
Sbjct: 524 EAALLASYFSKAKHSTKV 541


>gi|227485000|ref|ZP_03915316.1| fibrinogen-binding protein [Anaerococcus lactolyticus ATCC 51172]
 gi|227236997|gb|EEI87012.1| fibrinogen-binding protein [Anaerococcus lactolyticus ATCC 51172]
          Length = 580

 Score = 41.6 bits (96), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 6/82 (7%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII------LFQFGLGMNAH 118
           R++ T    +    P  F + LRKHI   ++ D++Q G DR++      + + G   +  
Sbjct: 58  RINFTEKKYENPEKPDNFCMVLRKHINQGKIIDIKQYGLDRVVELSIVSIDEMGFDTSKK 117

Query: 119 YVILELYAQGNILLTDSEFTVL 140
            +I  +    N++LTD+ + ++
Sbjct: 118 LIIEIMGKHSNVILTDTNYKII 139


>gi|222529807|ref|YP_002573689.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor bescii DSM 6725]
 gi|222456654|gb|ACM60916.1| Fibronectin-binding A domain protein [Caldicellulosiruptor bescii
           DSM 6725]
          Length = 585

 Score = 41.6 bits (96), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLV 523

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A  S  V
Sbjct: 524 EAALLASYFSKAKHSTKV 541


>gi|237744728|ref|ZP_04575209.1| fibronectin-binding protein [Fusobacterium sp. 7_1]
 gi|229431957|gb|EEO42169.1| fibronectin-binding protein [Fusobacterium sp. 7_1]
          Length = 541

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|348027019|ref|YP_004766824.1| fibronectin-binding A [Megasphaera elsdenii DSM 20460]
 gi|341823073|emb|CCC73997.1| fibronectin-binding A [Megasphaera elsdenii DSM 20460]
          Length = 567

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 30/156 (19%), Positives = 70/156 (44%), Gaps = 14/156 (8%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L G + S +Y L  ++  F++ N +G+        +++ ++   RL+         + P+
Sbjct: 19  LKGGQISKIYQLDARSLYFRIFNDAGI------HHLVITLDDSPRLYIAETMPPTPDVPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILELYAQ-----GNILLTD 134
           G  + LRK+    R+  + QL  DR+I      L M+   V  +++ +      N++ T+
Sbjct: 73  GLCMFLRKYYENGRIAAIAQLHLDRLIDIDIDVLDMSGRLVTRKIHVELMGKYSNVIFTE 132

Query: 135 SEFTVLTLLRSHRDDD--KGVAIMSRHRYPTEICRV 168
               +  L+++ ++    + +A    + +P    R+
Sbjct: 133 DGTIIEALIKTGKNKQALRTIAPHEPYAFPPNFMRM 168


>gi|89098716|ref|ZP_01171598.1| fibronectin/fibrinogen-binding protein, putative [Bacillus sp. NRRL
           B-14911]
 gi|89086678|gb|EAR65797.1| fibronectin/fibrinogen-binding protein, putative [Bacillus sp. NRRL
           B-14911]
          Length = 570

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R   T  A +  + P  F + LRKH+    LED+RQ+G DR
Sbjct: 39  IRANGKNHKLLLSAHPSYARAQLTHEAYENPSEPPMFCMLLRKHLEGYILEDIRQVGLDR 98

Query: 106 IILF 109
           I++ 
Sbjct: 99  ILIL 102


>gi|260494593|ref|ZP_05814723.1| fibronectin-binding protein A [Fusobacterium sp. 3_1_33]
 gi|260197755|gb|EEW95272.1| fibronectin-binding protein A [Fusobacterium sp. 3_1_33]
          Length = 357

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|323489530|ref|ZP_08094757.1| hypothetical protein GPDM_09295 [Planococcus donghaensis MPA1U2]
 gi|323396661|gb|EGA89480.1| hypothetical protein GPDM_09295 [Planococcus donghaensis MPA1U2]
          Length = 554

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 7/97 (7%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G++ K+L+ +  S  R+H TA A    + P  F + LRKHI    + ++ Q G DR+I+ 
Sbjct: 42  GKNHKLLISIHPSYSRIHLTATANVNPSEPPMFCMLLRKHIEGGVITEISQYGMDRLIML 101

Query: 110 ------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
                 + G  +     +  +    N++L D+E T++
Sbjct: 102 KIKAKNEIGDDIERELHVEMMGRHSNVILIDAERTMI 138


>gi|302857459|ref|XP_002959875.1| hypothetical protein VOLCADRAFT_108784 [Volvox carteri f.
           nagariensis]
 gi|300254053|gb|EFJ39061.1| hypothetical protein VOLCADRAFT_108784 [Volvox carteri f.
           nagariensis]
          Length = 182

 Score = 41.2 bits (95), Expect = 3.3,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 117 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGSGA--GAGVKQATAAGVCFKCNKPG 173

Query: 967 HLSKDCKE 974
           H +K+CKE
Sbjct: 174 HFAKECKE 181


>gi|167751125|ref|ZP_02423252.1| hypothetical protein EUBSIR_02110 [Eubacterium siraeum DSM 15702]
 gi|167655840|gb|EDR99969.1| fibronectin-binding protein A domain protein [Eubacterium siraeum
           DSM 15702]
          Length = 587

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|389815947|ref|ZP_10207184.1| hypothetical protein A1A1_03947 [Planococcus antarcticus DSM 14505]
 gi|388465441|gb|EIM07758.1| hypothetical protein A1A1_03947 [Planococcus antarcticus DSM 14505]
          Length = 554

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 7/97 (7%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G++ K+L+ +  S  R+H TA A    + P  F + LRKHI    + ++ Q G DR+I+ 
Sbjct: 42  GKNHKLLISIHPSYSRIHLTAAANVNPSEPPMFCMLLRKHIEGGVITEISQYGMDRLIML 101

Query: 110 ------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
                 + G  +     +  +    N++L D+E T++
Sbjct: 102 KIKAKNEIGDDIERELHVEMMGRHSNVILIDAERTMI 138


>gi|291531786|emb|CBK97371.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Eubacterium siraeum 70/3]
          Length = 587

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|291556680|emb|CBL33797.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Eubacterium siraeum V10Sc8a]
          Length = 587

 Score = 41.2 bits (95), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|336418046|ref|ZP_08598325.1| hypothetical protein HMPREF0401_00343 [Fusobacterium sp. 11_3_2]
 gi|336160505|gb|EGN63550.1| hypothetical protein HMPREF0401_00343 [Fusobacterium sp. 11_3_2]
          Length = 541

 Score = 41.2 bits (95), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAILTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|449539657|gb|EMD30706.1| hypothetical protein CERSUDRAFT_101067, partial [Ceriporiopsis
           subvermispora B]
          Length = 619

 Score = 40.8 bits (94), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 889 GGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKP 948
           GG+ SR Q+    ++K + G     +R       +S+G+VQ     P+N N   +K  + 
Sbjct: 130 GGQSSRNQQSHQHRLKRERG-----QRPFGNKGSSSSGQVQIR---PKNGNQGDNKLSEQ 181

Query: 949 AISPVDAPKVCYKCKKAGHLSKDCKE 974
             + + A   CYKCK+ GH +++C +
Sbjct: 182 EKARLAAEDRCYKCKEKGHFARNCPQ 207


>gi|421145721|ref|ZP_15605566.1| fibronectin-binding protein-like protein A [Fusobacterium nucleatum
           subsp. fusiforme ATCC 51190]
 gi|395487876|gb|EJG08786.1| fibronectin-binding protein-like protein A [Fusobacterium nucleatum
           subsp. fusiforme ATCC 51190]
          Length = 541

 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L D+ QLG+DRI++F F     LG +  + +  E   +  N++ TD E  V
Sbjct: 77  LRKHLMNAMLTDIEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKV 136

Query: 140 LTLLR 144
           L  L+
Sbjct: 137 LDTLK 141


>gi|397615164|gb|EJK63262.1| hypothetical protein THAOC_16092, partial [Thalassiosira oceanica]
          Length = 429

 Score = 40.8 bits (94), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 31/53 (58%), Gaps = 6/53 (11%)

Query: 959  CYKCKKAGHLSKDCKEHPDDSSHGVEDNPCVGLDETAEMDKVAMEEEDIHEIG 1011
            CYKC + GH +KDCK+ P+++ H +     + + E  E D    + E I+E+G
Sbjct: 277  CYKCGERGHYAKDCKQRPNETQHEL---ARLAIGELGEYDS---DNESINELG 323


>gi|344996726|ref|YP_004799069.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor lactoaceticus 6A]
 gi|343964945|gb|AEM74092.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           lactoaceticus 6A]
          Length = 585

 Score = 40.8 bits (94), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  ++ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLKFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|241889368|ref|ZP_04776669.1| fibronectin-binding A domain-containing protein [Gemella
           haemolysans ATCC 10379]
 gi|241863911|gb|EER68292.1| fibronectin-binding A domain-containing protein [Gemella
           haemolysans ATCC 10379]
          Length = 552

 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 67/132 (50%), Gaps = 14/132 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           ++  R + + +LS   ++F +         G++ K+ L   S   R+  T  + +  +TP
Sbjct: 19  ILNGRINKINNLSTDEFVFSV-------RKGKNLKLFLSANSSASRIQLTNNSFENPSTP 71

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHYVIL-ELYAQ-GNILLT 133
           S F   LRK++    + ++ Q+  DRI++F+      LG   +Y ++ EL  +  NI+LT
Sbjct: 72  SNFCSVLRKYLTGGIILEINQVNNDRIVIFKIKNFDDLGYEKYYYLISELMGKHSNIILT 131

Query: 134 DSEFTVLTLLRS 145
           + +  +L  L++
Sbjct: 132 NEDNIILESLKN 143


>gi|302834840|ref|XP_002948982.1| hypothetical protein VOLCADRAFT_104154 [Volvox carteri f.
           nagariensis]
 gi|302846674|ref|XP_002954873.1| hypothetical protein VOLCADRAFT_106546 [Volvox carteri f.
           nagariensis]
 gi|302857318|ref|XP_002959842.1| hypothetical protein VOLCADRAFT_108765 [Volvox carteri f.
           nagariensis]
 gi|300254164|gb|EFJ39094.1| hypothetical protein VOLCADRAFT_108765 [Volvox carteri f.
           nagariensis]
 gi|300259848|gb|EFJ44072.1| hypothetical protein VOLCADRAFT_106546 [Volvox carteri f.
           nagariensis]
 gi|300265727|gb|EFJ49917.1| hypothetical protein VOLCADRAFT_104154 [Volvox carteri f.
           nagariensis]
          Length = 253

 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 188 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGSGA--GAGVKQAAAAGVCFKCNKPG 244

Query: 967 HLSKDCKE 974
           H +K+CKE
Sbjct: 245 HFAKECKE 252


>gi|302837744|ref|XP_002950431.1| hypothetical protein VOLCADRAFT_104687 [Volvox carteri f.
           nagariensis]
 gi|302856295|ref|XP_002959556.1| hypothetical protein VOLCADRAFT_108658 [Volvox carteri f.
           nagariensis]
 gi|300254900|gb|EFJ39379.1| hypothetical protein VOLCADRAFT_108658 [Volvox carteri f.
           nagariensis]
 gi|300264436|gb|EFJ48632.1| hypothetical protein VOLCADRAFT_104687 [Volvox carteri f.
           nagariensis]
          Length = 199

 Score = 40.4 bits (93), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 134 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGSGA--GAGVKQAAAAGVCFKCNKPG 190

Query: 967 HLSKDCKE 974
           H +K+CKE
Sbjct: 191 HFAKECKE 198


>gi|297617030|ref|YP_003702189.1| Fibronectin-binding A domain-containing protein [Syntrophothermus
           lipocalidus DSM 12680]
 gi|297144867|gb|ADI01624.1| Fibronectin-binding A domain protein [Syntrophothermus lipocalidus
           DSM 12680]
          Length = 602

 Score = 40.4 bits (93), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 71/324 (21%), Positives = 129/324 (39%), Gaps = 55/324 (16%)

Query: 334 EFCPLLL----NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI- 388
           EF P  L    +Q    E + F + + A+D ++                   +HKL+++ 
Sbjct: 264 EFSPFSLLPMASQEAGEEVLTFASVNQAVDYYF-------------------YHKLSQLR 304

Query: 389 -HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            +  + N + TLK  ++++ + A L E +L   +              +W +L      +
Sbjct: 305 AYSYKTNLLRTLKAHLEKAYRKALLQEGDLVQAEKTF--------PYRTWGELLTAYGHQ 356

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
            + G     LID    E   + LL               P+E  +    L A   A   +
Sbjct: 357 IEKGQTEVELIDFYTGESVTVGLL-----------PHLTPIENAQRYFKLYAKGKAAALH 405

Query: 508 ELKKKQESKQE-KTITAHSKAFKAAEKKTRLQILQEKT----VANISHMRKVHWFE---K 559
             K+ +E++QE   + +   A + AE    ++ + E+       N    RK    E   +
Sbjct: 406 AEKRLRETRQEIAYLESVQFALEQAETMDEIEEIAEELDREGYINKDKKRKARVKEERLQ 465

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTV--IKNHRPEQPV 616
              F+SS+ Y ++ GR+  QNE +  +     D+++HA D+ G+   V   KN +    V
Sbjct: 466 PRMFLSSDGYKILVGRNNLQNEQLTLKASGHNDLWLHAKDVPGSHVIVRLSKNIQSIHEV 525

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMV 640
           P  TL +A       S++ +S  V
Sbjct: 526 PDHTLEEAALLAAYFSKSRESDKV 549


>gi|302835832|ref|XP_002949477.1| hypothetical protein VOLCADRAFT_104285 [Volvox carteri f.
           nagariensis]
 gi|300265304|gb|EFJ49496.1| hypothetical protein VOLCADRAFT_104285 [Volvox carteri f.
           nagariensis]
          Length = 217

 Score = 40.4 bits (93), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 152 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGSGA--GAGVKQAAAAGVCFKCNKPG 208

Query: 967 HLSKDCKE 974
           H +K+CKE
Sbjct: 209 HFAKECKE 216


>gi|261199101|ref|XP_002625952.1| zinc knuckle domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239595104|gb|EEQ77685.1| zinc knuckle domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 226

 Score = 40.0 bits (92), Expect = 6.9,   Method: Composition-based stats.
 Identities = 16/33 (48%), Positives = 23/33 (69%), Gaps = 3/33 (9%)

Query: 955 APKVCYKCKKAGHLSKDCKEHPDDSSHGVEDNP 987
           A KVCYKC +AGH+S+DC   P++++  V   P
Sbjct: 172 AGKVCYKCSQAGHISRDC---PNNATEVVASTP 201


>gi|302852817|ref|XP_002957927.1| hypothetical protein VOLCADRAFT_107870 [Volvox carteri f.
           nagariensis]
 gi|300256804|gb|EFJ41063.1| hypothetical protein VOLCADRAFT_107870 [Volvox carteri f.
           nagariensis]
          Length = 252

 Score = 40.0 bits (92), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 3/68 (4%)

Query: 907 YGDQDEEERNIRMALLASAGKVQKNDGDPQNENASTHKEKKPAISPVDAPKVCYKCKKAG 966
           Y + +EE + ++ A+ A AGK  K    P +  A +       +    A  VC+KC K G
Sbjct: 187 YAETEEERKALKQAVKAVAGKKPKQ-AKPASVPAGSGA--GAGVKQAAAAGVCFKCNKPG 243

Query: 967 HLSKDCKE 974
           H +K+CKE
Sbjct: 244 HFAKECKE 251


>gi|358467879|ref|ZP_09177544.1| fibronectin-binding protein A [Fusobacterium sp. oral taxon 370
           str. F0437]
 gi|357066553|gb|EHI76702.1| fibronectin-binding protein A [Fusobacterium sp. oral taxon 370
           str. F0437]
          Length = 538

 Score = 40.0 bits (92), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 9/135 (6%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 65  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSKINELGEIKKYKIYFECLGKLSNV 124

Query: 131 LLTDSEFTVL-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEP 187
           + TD E  +L TL + H  +  D+ + +   +  P    ++     +     +L +S   
Sbjct: 125 IFTDEEDKILDTLKKFHISENIDRTLFLGETYSRPKYDKKILPTELSKDKFDSLLASGNV 184

Query: 188 DANEPDKVNEDGNNV 202
            +NE + V +  NN+
Sbjct: 185 FSNEVEGVGKYLNNI 199


>gi|293400918|ref|ZP_06645063.1| putative fibronectin-binding protein [Erysipelotrichaceae bacterium
           5_2_54FAA]
 gi|291305944|gb|EFE47188.1| putative fibronectin-binding protein [Erysipelotrichaceae bacterium
           5_2_54FAA]
          Length = 556

 Score = 40.0 bits (92), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 162/403 (40%), Gaps = 83/403 (20%)

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDI--VPEGYILMQNKHLGKDHPPTESGSS-TQIYD 333
           ED  I   +  +  FE+  + ++ G +  +PE +    NK     H P ++  S ++ + 
Sbjct: 136 EDGRIVDALKRIPPFENSKRTILPGAVFTLPEPH---SNKQDPYHHGPFDAEESFSKQFH 192

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            F PLL  + + R   K E FD  L     KI           K+   FH +   H+   
Sbjct: 193 GFSPLLSKEVQYR-MHKGEAFDDIL----KKIHDSNTLYISDVKDQVYFHCIPLTHLTDT 247

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE-RKAGN 452
            R + L   +D       ++ Y  E+       VR+    +   +DL R VK E  K  +
Sbjct: 248 YRQYPLMHGMD-------ILFYEKEE------KVRI----KQQSQDLYRSVKRELHKNTS 290

Query: 453 PVAGLIDKLYLERNCMSL-----LLSNNLDEMDDEEK-TLPVEK------VEVDLALSAH 500
            +  L   L    +C        LL   + E++ +   TLP  +      + +D+     
Sbjct: 291 KLPKLKQSLAESMDCDKYREYGDLLFAYMHEIEKQPIITLPSFETGEEIAIPIDMRFDIK 350

Query: 501 ANARRWYELKKKQESKQEKTITAHSKA--------FKAAEKK-----------TRLQILQ 541
            NA RWY+  K  +SK+ ++I     A        F+A E +            R ++++
Sbjct: 351 GNANRWYQ--KYHKSKRAQSILKEQIALCEKEIAYFEAMETQLSQAGVQDAIEIREELVK 408

Query: 542 EKTV-ANISHMRK-----VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + A  S +RK     +  +E F +    ++Y +  G++  QN+ +  +   K D ++
Sbjct: 409 QGYLRAQKSRIRKKKKQELPHYETFLF----DDYRIYVGKNNLQNDYVTWKLARKKDTWL 464

Query: 596 HA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           HA DLHGA   +      EQP      N+A   T     AW S
Sbjct: 465 HAKDLHGAHVILT----LEQP------NEAALRTAAMLAAWYS 497


>gi|291460975|ref|ZP_06026217.2| fibronectin-binding protein A [Fusobacterium periodonticum ATCC
           33693]
 gi|291379669|gb|EFE87187.1| fibronectin-binding protein A [Fusobacterium periodonticum ATCC
           33693]
          Length = 538

 Score = 40.0 bits (92), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 9/135 (6%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 65  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSKINELGEIKKYKIYFECLGKLSNV 124

Query: 131 LLTDSEFTVL-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEP 187
           + TD E  +L TL + H  +  D+ + +   +  P    ++     +     +L +S   
Sbjct: 125 IFTDEEDKILDTLKKFHISENIDRTLFLGETYSRPKYNKKILPTELSKDKFDSLLASGNV 184

Query: 188 DANEPDKVNEDGNNV 202
            +NE + V +  NN+
Sbjct: 185 LSNEVEGVGKYLNNI 199


>gi|373451686|ref|ZP_09543605.1| hypothetical protein HMPREF0984_00647 [Eubacterium sp. 3_1_31]
 gi|371967907|gb|EHO85374.1| hypothetical protein HMPREF0984_00647 [Eubacterium sp. 3_1_31]
          Length = 553

 Score = 40.0 bits (92), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 162/403 (40%), Gaps = 83/403 (20%)

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDI--VPEGYILMQNKHLGKDHPPTESGSS-TQIYD 333
           ED  I   +  +  FE+  + ++ G +  +PE +    NK     H P ++  S ++ + 
Sbjct: 133 EDGRIVDALKRIPPFENSKRTILPGAVFTLPEPH---SNKQDPYHHGPFDAEESFSKQFH 189

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            F PLL  + + R   K E FD  L     KI           K+   FH +   H+   
Sbjct: 190 GFSPLLSKEVQYR-MHKGEAFDDIL----KKIHDSNTLYISDVKDQVYFHCIPLTHLTDT 244

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE-RKAGN 452
            R + L   +D       ++ Y  E+       VR+    +   +DL R VK E  K  +
Sbjct: 245 YRQYPLMHGMD-------ILFYEKEE------KVRI----KQQSQDLYRSVKRELHKNTS 287

Query: 453 PVAGLIDKLYLERNCMSL-----LLSNNLDEMDDEEK-TLPVEK------VEVDLALSAH 500
            +  L   L    +C        LL   + E++ +   TLP  +      + +D+     
Sbjct: 288 KLPKLKQSLAESMDCDKYREYGDLLFAYMHEIEKQPIITLPSFETGEEIAIPIDMRFDIK 347

Query: 501 ANARRWYELKKKQESKQEKTITAHSKA--------FKAAEKK-----------TRLQILQ 541
            NA RWY+  K  +SK+ ++I     A        F+A E +            R ++++
Sbjct: 348 GNANRWYQ--KYHKSKRAQSILKEQIALCEKEIAYFEAMETQLSQAGVQDAIEIREELVK 405

Query: 542 EKTV-ANISHMRK-----VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + A  S +RK     +  +E F +    ++Y +  G++  QN+ +  +   K D ++
Sbjct: 406 QGYLRAQKSRIRKKKKQELPHYETFLF----DDYRIYVGKNNLQNDYVTWKLARKKDTWL 461

Query: 596 HA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           HA DLHGA   +      EQP      N+A   T     AW S
Sbjct: 462 HAKDLHGAHVILT----LEQP------NEAALRTAAMLAAWYS 494


>gi|290968771|ref|ZP_06560308.1| fibronectin-binding A, N-terminal domain protein [Megasphaera
           genomosp. type_1 str. 28L]
 gi|335049115|ref|ZP_08542125.1| fibronectin-binding protein A [Megasphaera sp. UPII 199-6]
 gi|290781067|gb|EFD93658.1| fibronectin-binding A, N-terminal domain protein [Megasphaera
           genomosp. type_1 str. 28L]
 gi|333764227|gb|EGL41627.1| fibronectin-binding protein A [Megasphaera sp. UPII 199-6]
          Length = 573

 Score = 40.0 bits (92), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 20/89 (22%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           + L G + + +Y    +T  F++ +++G+        V++ ++   R++         +T
Sbjct: 17  KELTGGQITKIYQPRARTLYFRIFSATGL------HHVIITLDESPRIYIAEKMPPMPDT 70

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
           PS   + LRK+    R+  +RQL  DR++
Sbjct: 71  PSALCMFLRKYYENGRISSLRQLHLDRLL 99


>gi|410460737|ref|ZP_11314410.1| Fibronectin-binding A domain-containing protein [Bacillus
           azotoformans LMG 9581]
 gi|409926667|gb|EKN63823.1| Fibronectin-binding A domain-containing protein [Bacillus
           azotoformans LMG 9581]
          Length = 570

 Score = 39.7 bits (91), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 22/145 (15%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFT 83
           R S +Y    + Y + L+ +  +  +G+++++L+    S  RLH T    D    P  F 
Sbjct: 23  RISRIY----QPYKYDLIFT--IRANGKNQQLLISANPSYARLHITKETYDNPKEPPMFC 76

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQ-GNILLTDSE 136
           + LRKH+    +E + Q G +RII F      + G   +   +I+E+  +  NILL D E
Sbjct: 77  MLLRKHLEGSFIEKIEQDGLERIIKFYVRTKNEIG-DESIKILIVEVMGRHSNILLVDQE 135

Query: 137 FTVLTLLRSHRDDDKGVA-IMSRHR 160
             ++       D  K V+  ++RHR
Sbjct: 136 KNIIM------DSIKHVSPAVNRHR 154


>gi|392963797|ref|ZP_10329218.1| protein of unknown function DUF814 [Fibrisoma limi BUZ 3]
 gi|387846692|emb|CCH51262.1| protein of unknown function DUF814 [Fibrisoma limi BUZ 3]
          Length = 547

 Score = 39.7 bits (91), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 41/181 (22%), Positives = 82/181 (45%), Gaps = 26/181 (14%)

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE-------------KTITAH 524
           E+ D  +  P+  +++   LS   NA  +Y   K ++ ++E             + I A+
Sbjct: 340 ELYDFYRDQPI-TIKLKTDLSPQKNAENYYRKAKNEKIEEEHLNQQIANREAEIERINAY 398

Query: 525 SKAFKAAE--KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
             A +  +  K+ R  I Q   ++  +    V  F++    +  EN+ ++ GR+A+ N++
Sbjct: 399 QTALETIQTLKELRKYIKQHNLLSESAIEGPVQLFKE----VIFENFRILIGRNAKNNDL 454

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           + ++Y  K D+++HA     S  VIK ++  +  P   + +A         AW SK  T 
Sbjct: 455 LTQKYAHKEDLWLHARDVSGSHVVIK-YQAGKTFPKSVIERAAELA-----AWYSKRRTD 508

Query: 643 A 643
           +
Sbjct: 509 S 509


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.131    0.375 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,222,378,621
Number of Sequences: 23463169
Number of extensions: 760282706
Number of successful extensions: 2209614
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 847
Number of HSP's successfully gapped in prelim test: 2285
Number of HSP's that attempted gapping in prelim test: 2190798
Number of HSP's gapped (non-prelim): 15277
length of query: 1102
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 948
effective length of database: 8,745,867,341
effective search space: 8291082239268
effective search space used: 8291082239268
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 83 (36.6 bits)