BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy17547
         (186 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
          Length = 695

 Score =  191 bits (486), Expect = 9e-47,   Method: Composition-based stats.
 Identities = 82/104 (78%), Positives = 88/104 (84%), Gaps = 3/104 (2%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           LE NGIRPRKPCNCTKS CLKLYCDCFANGEFCY CNC +CFNN+E+ED R  AI+ CLE
Sbjct: 474 LESNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLE 533

Query: 146 RNPNAFRPKIGKCLVGEGE---RRHTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKIGK     G+   R+HTKGCNCKRSGCLKNYCECY
Sbjct: 534 RNPNAFRPKIGKAKDVAGDSSIRKHTKGCNCKRSGCLKNYCECY 577



 Score =  150 bits (380), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 64/84 (76%), Positives = 70/84 (83%), Gaps = 3/84 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YCDCFANGEFCY CNC +CFNN+E+ED R  AI+ CLERNPNAFRPKIGK     G+   
Sbjct: 496 YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSSI 555

Query: 58  RRHTKGCNCKRSGCLKNYCECYEA 81
           R+HTKGCNCKRSGCLKNYCECYEA
Sbjct: 556 RKHTKGCNCKRSGCLKNYCECYEA 579


>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
          Length = 752

 Score =  189 bits (479), Expect = 6e-46,   Method: Composition-based stats.
 Identities = 82/104 (78%), Positives = 88/104 (84%), Gaps = 3/104 (2%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           LE NGIRPRKPCNCTKS CLKLYCDCFANGEFCY CNC +CFNN+E+ED R  AI+ CLE
Sbjct: 531 LESNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLE 590

Query: 146 RNPNAFRPKIGKCLVGEGE---RRHTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKIGK     G+   R+HTKGCNCKRSGCLKNYCECY
Sbjct: 591 RNPNAFRPKIGKAKDVAGDSSIRKHTKGCNCKRSGCLKNYCECY 634



 Score =  148 bits (374), Expect = 8e-34,   Method: Composition-based stats.
 Identities = 64/84 (76%), Positives = 70/84 (83%), Gaps = 3/84 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YCDCFANGEFCY CNC +CFNN+E+ED R  AI+ CLERNPNAFRPKIGK     G+   
Sbjct: 553 YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSSI 612

Query: 58  RRHTKGCNCKRSGCLKNYCECYEA 81
           R+HTKGCNCKRSGCLKNYCECYEA
Sbjct: 613 RKHTKGCNCKRSGCLKNYCECYEA 636


>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
          Length = 750

 Score =  179 bits (455), Expect = 3e-43,   Method: Composition-based stats.
 Identities = 75/95 (78%), Positives = 85/95 (89%), Gaps = 1/95 (1%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + +KPCNCT+S CLKLYCDCFANGEFCYQCNCNSC+NN+EHE+DR  AIR  L+RNPNAF
Sbjct: 535 KTKKPCNCTRSQCLKLYCDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAF 594

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           RPKI K  VG+ ER+HTKGCNCKRSGCLKNYCEC+
Sbjct: 595 RPKI-KSFVGQAERQHTKGCNCKRSGCLKNYCECF 628



 Score =  156 bits (394), Expect = 4e-36,   Method: Composition-based stats.
 Identities = 66/81 (81%), Positives = 73/81 (90%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFCYQCNCNSC+NN+EHE+DR  AIR  L+RNPNAFRPKI K  VG+ ER+H
Sbjct: 551 YCDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKI-KSFVGQAERQH 609

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCEC+EA
Sbjct: 610 TKGCNCKRSGCLKNYCECFEA 630


>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
          Length = 916

 Score =  169 bits (429), Expect = 4e-40,   Method: Composition-based stats.
 Identities = 79/95 (83%), Positives = 85/95 (89%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+CFNN+EHE++R  AI  CLERNPNAF
Sbjct: 661 RPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAF 720

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           RPKIGK   G+ ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 721 RPKIGKGKEGDHERRHTKGCNCKRSGCLKNYCECY 755



 Score =  137 bits (345), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 66/81 (81%), Positives = 72/81 (88%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC+ CNCN+CFNN+EHE++R  AI  CLERNPNAFRPKIGK   G+ ERRH
Sbjct: 677 YCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAFRPKIGKGKEGDHERRH 736

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 737 TKGCNCKRSGCLKNYCECYEA 757


>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
          Length = 822

 Score =  169 bits (427), Expect = 6e-40,   Method: Composition-based stats.
 Identities = 80/103 (77%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T+EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 593 TVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 652

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 653 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 695



 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 616 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 675

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 676 HNKGCNCKRSGCLKNYCECYEA 697


>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
          Length = 834

 Score =  169 bits (427), Expect = 6e-40,   Method: Composition-based stats.
 Identities = 80/102 (78%), Positives = 88/102 (86%), Gaps = 1/102 (0%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           +EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLE
Sbjct: 606 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 665

Query: 146 RNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKIGK    GE  RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 RNPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 707



 Score =  125 bits (314), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    GE  RR
Sbjct: 628 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 687

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 688 HNKGCNCKRSGCLKNYCECYEA 709


>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
          Length = 804

 Score =  168 bits (425), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 598 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 657

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 658 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 700



 Score =  124 bits (311), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 621 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 680

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 681 HNKGCNCKRSGCLKNYCECYEA 702


>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
          Length = 802

 Score =  167 bits (424), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 597 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 656

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 657 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 699



 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 679

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 680 HNKGCNCKRSGCLKNYCECYEA 701


>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
          Length = 859

 Score =  167 bits (424), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 80/102 (78%), Positives = 88/102 (86%), Gaps = 1/102 (0%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           +EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLE
Sbjct: 631 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 690

Query: 146 RNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKIGK    GE  RRH KGCNCKRSGCLKNYCECY
Sbjct: 691 RNPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 732



 Score =  124 bits (312), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    GE  RR
Sbjct: 653 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 712

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 713 HNKGCNCKRSGCLKNYCECYEA 734


>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
          Length = 835

 Score =  167 bits (424), Expect = 1e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 606 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 665

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 708



 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 629 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 688

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 689 HNKGCNCKRSGCLKNYCECYEA 710


>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
          Length = 835

 Score =  167 bits (424), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 606 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 665

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 708



 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 629 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 688

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 689 HNKGCNCKRSGCLKNYCECYEA 710


>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
          Length = 829

 Score =  167 bits (423), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 600 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 659

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 660 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 702



 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 623 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 682

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 683 HNKGCNCKRSGCLKNYCECYEA 704


>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
          Length = 829

 Score =  167 bits (423), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 600 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 659

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 660 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 702



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 623 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 682

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 683 HNKGCNCKRSGCLKNYCECYEA 704


>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
          Length = 827

 Score =  167 bits (422), Expect = 2e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 598 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 657

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 658 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 700



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 621 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 680

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 681 HNKGCNCKRSGCLKNYCECYEA 702


>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
          Length = 826

 Score =  167 bits (422), Expect = 3e-39,   Method: Composition-based stats.
 Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           ++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CL
Sbjct: 597 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 656

Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           ERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 657 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 699



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 679

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 680 HNKGCNCKRSGCLKNYCECYEA 701


>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
          Length = 825

 Score =  166 bits (421), Expect = 3e-39,   Method: Composition-based stats.
 Identities = 80/108 (74%), Positives = 90/108 (83%), Gaps = 1/108 (0%)

Query: 80  EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
           + G   +EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  A
Sbjct: 591 QRGYGNIEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRA 650

Query: 140 IRQCLERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           I+ CLERNPNAFRPKIGK    G+  RRH KGCNCKRSGCLKNYCECY
Sbjct: 651 IKSCLERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 698



 Score =  122 bits (307), Expect = 5e-26,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    G+  RR
Sbjct: 619 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 678

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 679 HNKGCNCKRSGCLKNYCECYEA 700


>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
           occidentalis]
          Length = 609

 Score =  166 bits (420), Expect = 4e-39,   Method: Composition-based stats.
 Identities = 72/93 (77%), Positives = 80/93 (86%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPCNCTKS CLKLYCDCFANGEFC  CNC  C+NN+EHE++R  A++ CLERNPNAF P
Sbjct: 392 KKPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHP 451

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIGK   G+ ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 452 KIGKYKPGDKERRHTKGCNCKRSGCLKNYCECY 484



 Score =  142 bits (359), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 69/81 (85%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC  C+NN+EHE++R  A++ CLERNPNAF PKIGK   G+ ERRH
Sbjct: 406 YCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHPKIGKYKPGDKERRH 465

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 466 TKGCNCKRSGCLKNYCECYEA 486


>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
 gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
          Length = 725

 Score =  164 bits (415), Expect = 2e-38,   Method: Composition-based stats.
 Identities = 68/100 (68%), Positives = 78/100 (78%), Gaps = 1/100 (1%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           P+    ++PCNCTKS CLKLYCDCFANGE+CY CNC  CFN  +H+++R  AIR  LERN
Sbjct: 449 PDEAYKKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERN 508

Query: 148 PNAFRPKIGKC-LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           PNAF+PKIG      +G R HTKGCNCKRSGCLKNYCECY
Sbjct: 509 PNAFKPKIGSIGSTDDGTRLHTKGCNCKRSGCLKNYCECY 548



 Score =  138 bits (347), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 57/82 (69%), Positives = 64/82 (78%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-LVGEGERR 59
           YCDCFANGE+CY CNC  CFN  +H+++R  AIR  LERNPNAF+PKIG      +G R 
Sbjct: 469 YCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFKPKIGSIGSTDDGTRL 528

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           HTKGCNCKRSGCLKNYCECYE 
Sbjct: 529 HTKGCNCKRSGCLKNYCECYEG 550


>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
 gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
          Length = 818

 Score =  163 bits (413), Expect = 2e-38,   Method: Composition-based stats.
 Identities = 78/101 (77%), Positives = 86/101 (85%), Gaps = 1/101 (0%)

Query: 87  EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLER 146
           + NGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLER
Sbjct: 590 DANGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLER 649

Query: 147 NPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           NPNAFRPKIGK    GE  RRH KGCNCKRSGCLKNYCECY
Sbjct: 650 NPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 690



 Score =  125 bits (314), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC+ CNCN+C NN+ +E++R  AI+ CLERNPNAFRPKIGK    GE  RR
Sbjct: 611 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 670

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 671 HNKGCNCKRSGCLKNYCECYEA 692


>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
          Length = 804

 Score =  163 bits (412), Expect = 4e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 575 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 634

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 635 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 669



 Score =  129 bits (325), Expect = 4e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 591 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 650

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 651 SKGCNCKRSGCLKNYCECYEA 671



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 571 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 630

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 631 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 689

Query: 133 EDDR 136
             +R
Sbjct: 690 SPER 693


>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
          Length = 827

 Score =  162 bits (411), Expect = 4e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 598 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 657

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 658 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 692



 Score =  129 bits (324), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 614 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 673

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 674 SKGCNCKRSGCLKNYCECYEA 694



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 594 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 653

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 654 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 712

Query: 133 EDDR 136
             +R
Sbjct: 713 SPER 716


>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
          Length = 709

 Score =  162 bits (411), Expect = 4e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (324), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
          Length = 748

 Score =  162 bits (410), Expect = 6e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (323), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  162 bits (410), Expect = 6e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (323), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
          Length = 749

 Score =  162 bits (410), Expect = 6e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (323), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  162 bits (410), Expect = 6e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (323), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  129 bits (323), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)

Query: 45  RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
           RP     +  E   R  K CNC +S CLK YC+C+  G              LE      
Sbjct: 506 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 565

Query: 88  ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
                     P   +P+              K CNC +S CLK YC+C+     C   C 
Sbjct: 566 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 625

Query: 123 CNSCFNNIEHEDDR 136
           C  C  N E   +R
Sbjct: 626 CVGC-KNFEESPER 638


>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
           gorilla]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)

Query: 45  RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
           RP     +  E   R  K CNC +S CLK YC+C+  G              LE      
Sbjct: 506 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 565

Query: 88  ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
                     P   +P+              K CNC +S CLK YC+C+     C   C 
Sbjct: 566 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 625

Query: 123 CNSCFNNIEHEDDR 136
           C  C  N E   +R
Sbjct: 626 CVGC-KNFEESPER 638


>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
 gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
 gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
 gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
           domain-containing protein 1
 gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
 gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
 gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
 gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
 gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
 gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
 gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
 gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
 gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
 gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
 gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
 gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
 gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
          Length = 749

 Score =  162 bits (409), Expect = 7e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
 gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
           familiaris]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
 gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
           carolinensis]
          Length = 749

 Score =  162 bits (409), Expect = 8e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 8e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 516 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
          Length = 749

 Score =  161 bits (408), Expect = 9e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
          Length = 748

 Score =  161 bits (408), Expect = 9e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 519 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 578

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 579 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 613



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 535 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 594

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 595 SKGCNCKRSGCLKNYCECYEA 615



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 575 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 633

Query: 133 EDDR 136
             +R
Sbjct: 634 SPER 637


>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
 gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
 gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
          Length = 748

 Score =  161 bits (408), Expect = 9e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 519 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 578

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 579 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 613



 Score =  128 bits (322), Expect = 9e-28,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 535 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 594

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 595 SKGCNCKRSGCLKNYCECYEA 615



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 575 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 633

Query: 133 EDDR 136
             +R
Sbjct: 634 SPER 637


>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
          Length = 749

 Score =  161 bits (408), Expect = 9e-38,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (322), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
          Length = 749

 Score =  161 bits (408), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614



 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634

Query: 133 EDDR 136
             +R
Sbjct: 635 SPER 638


>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
          Length = 751

 Score =  161 bits (408), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 522 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 581

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 582 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 616



 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 538 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 597

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 598 SKGCNCKRSGCLKNYCECYEA 618



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 518 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 577

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 578 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 636

Query: 133 EDDR 136
             +R
Sbjct: 637 SPER 640


>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
 gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
          Length = 757

 Score =  161 bits (408), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 69/103 (66%), Positives = 81/103 (78%), Gaps = 1/103 (0%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T+ P+    ++PCNCTKS CLKLYCDCFANGEFCY CNC  C+NN+++E++R  AIR  L
Sbjct: 480 TIFPDEAYKKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATL 539

Query: 145 ERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
           ERNP+AF+PKIG     E   R HTKGCNCKRSGCLKNYCECY
Sbjct: 540 ERNPSAFKPKIGAVSADEDALRLHTKGCNCKRSGCLKNYCECY 582



 Score =  135 bits (341), Expect = 5e-30,   Method: Composition-based stats.
 Identities = 58/82 (70%), Positives = 66/82 (80%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
           YCDCFANGEFCY CNC  C+NN+++E++R  AIR  LERNP+AF+PKIG     E   R 
Sbjct: 503 YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFKPKIGAVSADEDALRL 562

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           HTKGCNCKRSGCLKNYCECYEA
Sbjct: 563 HTKGCNCKRSGCLKNYCECYEA 584


>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
          Length = 793

 Score =  161 bits (407), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 564 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 623

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 624 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 658



 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 580 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 639

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 640 SKGCNCKRSGCLKNYCECYEA 660



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 620 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 678

Query: 133 EDDR 136
             +R
Sbjct: 679 SPER 682


>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
          Length = 752

 Score =  161 bits (407), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 523 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 582

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 583 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 617



 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 539 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 598

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 599 SKGCNCKRSGCLKNYCECYEA 619



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 519 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 578

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 579 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 637

Query: 133 EDDR 136
             +R
Sbjct: 638 SPER 641


>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
          Length = 756

 Score =  161 bits (407), Expect = 1e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 527 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 586

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 587 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 621



 Score =  128 bits (321), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 543 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 602

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 603 SKGCNCKRSGCLKNYCECYEA 623



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 582

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 583 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 641

Query: 133 EDDR 136
             +R
Sbjct: 642 SPER 645


>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
 gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
 gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
          Length = 741

 Score =  160 bits (405), Expect = 2e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 512 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 571

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 572 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 606



 Score =  127 bits (319), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 528 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 587

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 588 SKGCNCKRSGCLKNYCECYEA 608



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 38/137 (27%), Positives = 49/137 (35%), Gaps = 43/137 (31%)

Query: 42  NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--- 87
           N  R  +   +  E   R  K CNC +S CLK YC+C+  G              LE   
Sbjct: 495 NQPRGPLNGIISSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHEN 554

Query: 88  -------------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ 120
                        P   +P+              K CNC +S CLK YC+C+     C  
Sbjct: 555 ERQKAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSS 614

Query: 121 -CNCNSCFNNIEHEDDR 136
            C C  C  N E   +R
Sbjct: 615 ICKCIGC-KNFEESPER 630


>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
          Length = 757

 Score =  160 bits (405), Expect = 2e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 528 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 587

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 588 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 622



 Score =  127 bits (319), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 544 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 603

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 604 SKGCNCKRSGCLKNYCECYEA 624



 Score = 52.4 bits (124), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC +S CLK YC+C+  G                                
Sbjct: 524 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 583

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 584 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 642

Query: 133 EDDR 136
             +R
Sbjct: 643 SPER 646


>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
          Length = 763

 Score =  160 bits (405), Expect = 2e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 534 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 593

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 594 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 628



 Score =  127 bits (319), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 550 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 609

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 610 SKGCNCKRSGCLKNYCECYEA 630



 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 530 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 589

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 590 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 648

Query: 133 EDDR 136
             +R
Sbjct: 649 SPER 652


>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
          Length = 813

 Score =  160 bits (404), Expect = 3e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 584 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 643

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 644 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 678



 Score =  127 bits (318), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 600 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 659

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 660 SKGCNCKRSGCLKNYCECYEA 680



 Score = 53.5 bits (127), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 580 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 639

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 640 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 698

Query: 133 EDDR 136
             +R
Sbjct: 699 SPER 702


>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 660

 Score =  159 bits (402), Expect = 4e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (317), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 56.2 bits (134), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)

Query: 45  RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
           RP     +  E   R  K CNC +S CLK YC+C+  G              LE      
Sbjct: 417 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 476

Query: 88  ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
                     P   +P+              K CNC +S CLK YC+C+     C   C 
Sbjct: 477 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 536

Query: 123 CNSCFNNIEHEDDR 136
           C  C  N E   +R
Sbjct: 537 CVGC-KNFEESPER 549


>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Saccoglossus kowalevskii]
          Length = 946

 Score =  159 bits (402), Expect = 5e-37,   Method: Composition-based stats.
 Identities = 77/105 (73%), Positives = 85/105 (80%)

Query: 82  GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           G+  +E  G RPRKPCNCTKS CLKLYCDCFANGEFC  CNC +CFNN+EHE +R  AI+
Sbjct: 490 GSVQVEAPGTRPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIK 549

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            CLERNP AF PKIGK   G+ +RRH KGCNCKRSGCLKNYCECY
Sbjct: 550 SCLERNPLAFHPKIGKGKEGQADRRHNKGCNCKRSGCLKNYCECY 594



 Score =  122 bits (305), Expect = 8e-26,   Method: Composition-based stats.
 Identities = 61/81 (75%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +CFNN+EHE +R  AI+ CLERNP AF PKIGK   G+ +RRH
Sbjct: 516 YCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAFHPKIGKGKEGQADRRH 575

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 576 NKGCNCKRSGCLKNYCECYEA 596



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/120 (30%), Positives = 43/120 (35%), Gaps = 43/120 (35%)

Query: 59  RHTKGCNCKRSGCLKNYCECYEAGART-------------------------LEPNGI-- 91
           R  K CNC +S CLK YC+C+  G                            LE N +  
Sbjct: 500 RPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAF 559

Query: 92  --------------RPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDR 136
                         R  K CNC +S CLK YC+C+     C   C C  C  N E   DR
Sbjct: 560 HPKIGKGKEGQADRRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEESPDR 618


>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
 gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
           gorilla]
 gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
          Length = 660

 Score =  159 bits (401), Expect = 6e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 4e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
          Length = 660

 Score =  159 bits (401), Expect = 6e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 4e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
          Length = 660

 Score =  159 bits (401), Expect = 6e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 4e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 55.8 bits (133), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)

Query: 45  RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
           RP     +  E   R  K CNC +S CLK YC+C+  G              LE      
Sbjct: 417 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 476

Query: 88  ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
                     P   +P+              K CNC +S CLK YC+C+     C   C 
Sbjct: 477 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 536

Query: 123 CNSCFNNIEHEDDR 136
           C  C  N E   +R
Sbjct: 537 CVGC-KNFEESPER 549


>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
          Length = 660

 Score =  159 bits (401), Expect = 6e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 4e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
          Length = 660

 Score =  159 bits (401), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
           familiaris]
          Length = 660

 Score =  159 bits (401), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
          Length = 756

 Score =  159 bits (401), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 75/95 (78%), Positives = 85/95 (89%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNCN+CFNN+EHE++R  AI+ CL+RNP AF
Sbjct: 527 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAF 586

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 587 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 621



 Score =  125 bits (314), Expect = 8e-27,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 71/81 (87%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 543 YCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 602

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 603 SKGCNCKRSGCLKNYCECYEA 623



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 582

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 583 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 641

Query: 133 EDDR 136
             +R
Sbjct: 642 SPER 645


>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
          Length = 660

 Score =  159 bits (401), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525



 Score =  126 bits (316), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545

Query: 133 EDDR 136
             +R
Sbjct: 546 SPER 549


>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
          Length = 659

 Score =  159 bits (401), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 430 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 489

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 490 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 524



 Score =  126 bits (316), Expect = 5e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 446 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 505

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 506 SKGCNCKRSGCLKNYCECYEA 526



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 426 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 485

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 486 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 544

Query: 133 EDDR 136
             +R
Sbjct: 545 SPER 548


>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
          Length = 667

 Score =  157 bits (398), Expect = 1e-36,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF
Sbjct: 438 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 497

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 498 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 532



 Score =  125 bits (314), Expect = 8e-27,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN++HE+DR  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 454 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 513

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 514 SKGCNCKRSGCLKNYCECYEA 534


>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
 gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
          Length = 921

 Score =  156 bits (394), Expect = 4e-36,   Method: Composition-based stats.
 Identities = 65/97 (67%), Positives = 74/97 (76%), Gaps = 1/97 (1%)

Query: 90  GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           G   RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+
Sbjct: 703 GASRRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPS 762

Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           AF+PKI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 763 AFKPKITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 798



 Score =  134 bits (336), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 721 YCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 779

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 780 NKGCNCKRSGCLKNYCECYEA 800


>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
          Length = 821

 Score =  156 bits (394), Expect = 4e-36,   Method: Composition-based stats.
 Identities = 75/95 (78%), Positives = 83/95 (87%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS CLKLYCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF
Sbjct: 592 RPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 651

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 652 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 686



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 608 YCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 667

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 668 SKGCNCKRSGCLKNYCECYEA 688



 Score = 55.5 bits (132), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 36/125 (28%), Positives = 45/125 (36%), Gaps = 43/125 (34%)

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--------------- 87
            E   R  K CNC +S CLK YC+C+  G              LE               
Sbjct: 587 AETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDR 646

Query: 88  -PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
            P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E
Sbjct: 647 NPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFE 705

Query: 132 HEDDR 136
              +R
Sbjct: 706 ESPER 710


>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
          Length = 807

 Score =  155 bits (393), Expect = 5e-36,   Method: Composition-based stats.
 Identities = 75/95 (78%), Positives = 83/95 (87%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS CLKLYCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF
Sbjct: 578 RPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 637

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 638 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 672



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 594 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 653

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 654 SKGCNCKRSGCLKNYCECYEA 674



 Score = 54.7 bits (130), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 574 ETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 633

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 634 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 692

Query: 133 EDDR 136
             +R
Sbjct: 693 SPER 696


>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
          Length = 775

 Score =  155 bits (393), Expect = 6e-36,   Method: Composition-based stats.
 Identities = 75/95 (78%), Positives = 83/95 (87%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS CLKLYCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF
Sbjct: 546 RPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 605

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 606 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 640



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 562 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 621

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 622 SKGCNCKRSGCLKNYCECYEA 642



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 36/125 (28%), Positives = 45/125 (36%), Gaps = 43/125 (34%)

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--------------- 87
            E   R  K CNC +S CLK YC+C+  G              LE               
Sbjct: 541 AETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDR 600

Query: 88  -PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
            P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E
Sbjct: 601 NPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFE 659

Query: 132 HEDDR 136
              +R
Sbjct: 660 ESPER 664


>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
          Length = 621

 Score =  155 bits (392), Expect = 7e-36,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 392 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 451

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 452 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 486



 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 408 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 467

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 468 SKGCNCKRSGCLKNYCECYEA 488



 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 388 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 447

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 448 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 506

Query: 133 EDDR 136
             +R
Sbjct: 507 SPER 510


>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
 gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
          Length = 978

 Score =  154 bits (390), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 765 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 824

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 825 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 856



 Score =  133 bits (334), Expect = 3e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 779 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 837

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 838 NKGCNCKRSGCLKNYCECYEA 858


>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
 gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
          Length = 950

 Score =  154 bits (390), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829



 Score =  133 bits (334), Expect = 3e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831


>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
          Length = 705

 Score =  154 bits (390), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 74/105 (70%), Positives = 85/105 (80%)

Query: 82  GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           G  T + +G RPRKPCNCTKS CLKLYCDCFANGEFC  CNCN+C NN++HED+R  A++
Sbjct: 470 GMLTGDGSGSRPRKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVK 529

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            CL+RNP+AF PKIGK    +  RRH KGCNCKRSGCLKNYCECY
Sbjct: 530 ACLDRNPHAFHPKIGKGHGSQTNRRHNKGCNCKRSGCLKNYCECY 574



 Score =  120 bits (301), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 58/81 (71%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+C NN++HED+R  A++ CL+RNP+AF PKIGK    +  RRH
Sbjct: 496 YCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHPKIGKGHGSQTNRRH 555

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 556 NKGCNCKRSGCLKNYCECYEA 576



 Score = 52.4 bits (124), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 38/128 (29%), Positives = 48/128 (37%), Gaps = 44/128 (34%)

Query: 52  LVGEGE-RRHTKGCNCKRSGCLKNYCECYEAGA-------------------RT------ 85
           L G+G   R  K CNC +S CLK YC+C+  G                    RT      
Sbjct: 472 LTGDGSGSRPRKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKAC 531

Query: 86  --LEPNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
               P+   P+              K CNC +S CLK YC+C+     C   C C  C  
Sbjct: 532 LDRNPHAFHPKIGKGHGSQTNRRHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGC-K 590

Query: 129 NIEHEDDR 136
           N E   +R
Sbjct: 591 NFEESPER 598


>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
 gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
          Length = 980

 Score =  154 bits (390), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 67/101 (66%), Positives = 75/101 (74%), Gaps = 1/101 (0%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           L+  G   RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E  R  AIR CLE
Sbjct: 758 LDALGGSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLE 817

Query: 146 RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           RNP+AF+PKI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 818 RNPSAFKPKITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 857



 Score =  133 bits (334), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 56/81 (69%), Positives = 62/81 (76%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E  R  AIR CLERNP+AF+PKI     G+  R H
Sbjct: 780 YCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFKPKITAPNSGD-MRLH 838

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 839 NKGCNCKRSGCLKNYCECYEA 859


>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 560

 Score =  154 bits (389), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 331 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 390

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 391 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 425



 Score =  122 bits (307), Expect = 5e-26,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 347 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 406

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 407 SKGCNCKRSGCLKNYCECYEA 427



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
           E   R  K CNC +S CLK YC+C+  G              LE                
Sbjct: 327 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 386

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC +S CLK YC+C+     C   C C  C  N E 
Sbjct: 387 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 445

Query: 133 EDDR 136
             +R
Sbjct: 446 SPER 449


>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 995

 Score =  154 bits (389), Expect = 2e-35,   Method: Composition-based stats.
 Identities = 76/103 (73%), Positives = 86/103 (83%)

Query: 84  RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
           R LE NGIR +KPCNCT+S CLKLYC+CFANGEFC+QCNCN+C+NNIEHE+ R  +I+ C
Sbjct: 771 RVLEANGIRMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSC 830

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           LERNP AFRPKI      E ERRH KGC+CKRSGCLKNYCECY
Sbjct: 831 LERNPCAFRPKIQIGETEEDERRHNKGCHCKRSGCLKNYCECY 873



 Score =  119 bits (297), Expect = 8e-25,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 68/81 (83%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFANGEFC+QCNCN+C+NNIEHE+ R  +I+ CLERNP AFRPKI      E ERRH
Sbjct: 795 YCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSCLERNPCAFRPKIQIGETEEDERRH 854

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGC+CKRSGCLKNYCECYEA
Sbjct: 855 NKGCHCKRSGCLKNYCECYEA 875


>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
 gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
          Length = 944

 Score =  154 bits (388), Expect = 2e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 732 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSAFQP 791

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 792 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 823



 Score =  132 bits (333), Expect = 5e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 746 YCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSAFQPKITAPNSGD-MRLH 804

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 805 NKGCNCKRSGCLKNYCECYEA 825


>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
 gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
          Length = 952

 Score =  153 bits (387), Expect = 3e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 740 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 799

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 800 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 831



 Score =  132 bits (332), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 754 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 812

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 813 NKGCNCKRSGCLKNYCECYEA 833


>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
           protein of 120 kDa
 gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
          Length = 950

 Score =  153 bits (387), Expect = 3e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829



 Score =  132 bits (332), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831


>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
 gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
          Length = 949

 Score =  153 bits (386), Expect = 4e-35,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 737 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 796

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 797 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 828



 Score =  132 bits (331), Expect = 9e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 751 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 809

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 810 NKGCNCKRSGCLKNYCECYEA 830


>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
          Length = 950

 Score =  152 bits (384), Expect = 6e-35,   Method: Composition-based stats.
 Identities = 63/93 (67%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC++S CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829



 Score =  132 bits (331), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831


>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
 gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
 gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
          Length = 771

 Score =  152 bits (383), Expect = 7e-35,   Method: Composition-based stats.
 Identities = 75/105 (71%), Positives = 86/105 (81%), Gaps = 1/105 (0%)

Query: 82  GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           G  T E N  RPRKPCNCT+S CLKLYCDCFANGEFC  CNC +CFNN++HE +R  AI+
Sbjct: 534 GLSTSEANS-RPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIK 592

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            CL+RNP AF+PKIGK   GE +RRH+KGCNCK+SGCLKNYCECY
Sbjct: 593 ACLDRNPVAFKPKIGKGKEGESDRRHSKGCNCKKSGCLKNYCECY 637



 Score =  119 bits (298), Expect = 6e-25,   Method: Composition-based stats.
 Identities = 59/81 (72%), Positives = 69/81 (85%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +CFNN++HE +R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 559 YCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRRH 618

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCK+SGCLKNYCECYEA
Sbjct: 619 SKGCNCKKSGCLKNYCECYEA 639



 Score = 57.0 bits (136), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 36/124 (29%), Positives = 43/124 (34%), Gaps = 43/124 (34%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
           E   R  K CNC RS CLK YC+C+  G                                
Sbjct: 539 EANSRPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRN 598

Query: 88  PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
           P   +P+              K CNC KS CLK YC+C+     C   C C  C  N E 
Sbjct: 599 PVAFKPKIGKGKEGESDRRHSKGCNCKKSGCLKNYCECYEAKIMCSSICKCMGC-KNFEE 657

Query: 133 EDDR 136
             +R
Sbjct: 658 SPER 661


>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
 gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
          Length = 879

 Score =  152 bits (383), Expect = 8e-35,   Method: Composition-based stats.
 Identities = 65/93 (69%), Positives = 72/93 (77%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E  R  AIR CLERNP+AF+P
Sbjct: 664 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFKP 723

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 724 KITAPNSGD-IRLHNKGCNCKRSGCLKNYCECY 755



 Score =  130 bits (328), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 56/81 (69%), Positives = 62/81 (76%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E  R  AIR CLERNP+AF+PKI     G+  R H
Sbjct: 678 YCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFKPKITAPNSGD-IRLH 736

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 737 NKGCNCKRSGCLKNYCECYEA 757


>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
 gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
          Length = 950

 Score =  150 bits (379), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 63/93 (67%), Positives = 72/93 (77%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  C NN+++E +R  AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKP 797

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829



 Score =  129 bits (324), Expect = 6e-28,   Method: Composition-based stats.
 Identities = 54/81 (66%), Positives = 62/81 (76%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  C NN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831


>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
          Length = 209

 Score =  149 bits (375), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 76/104 (73%), Positives = 85/104 (81%), Gaps = 5/104 (4%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
            LE NG++PRKPCNCTKS CLKLYCDCFANGEFC+ CNC  C NN+EHE+ R  AIR CL
Sbjct: 13  VLESNGLKPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCL 72

Query: 145 ERNPNAFRPKIGKCLVGEG--ERRHTKGCNCKRSGCLKNYCECY 186
           +RNP+AF+PKIG   VG G   RRH KGC+CKRSGCLKNYCECY
Sbjct: 73  DRNPHAFKPKIG---VGWGPEPRRHNKGCHCKRSGCLKNYCECY 113



 Score =  115 bits (288), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 60/83 (72%), Positives = 67/83 (80%), Gaps = 5/83 (6%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
           YCDCFANGEFC+ CNC  C NN+EHE+ R  AIR CL+RNP+AF+PKIG   VG G   R
Sbjct: 36  YCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCLDRNPHAFKPKIG---VGWGPEPR 92

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
           RH KGC+CKRSGCLKNYCECYEA
Sbjct: 93  RHNKGCHCKRSGCLKNYCECYEA 115


>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
          Length = 605

 Score =  148 bits (374), Expect = 8e-34,   Method: Composition-based stats.
 Identities = 67/93 (72%), Positives = 74/93 (79%), Gaps = 2/93 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKSMCLKLYC+CFANG FC  CNC +C NN+E + DR  AI+ CLERNP AFRP
Sbjct: 377 RKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAFRP 436

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG+    +  R H KGCNCKRSGCLKNYCECY
Sbjct: 437 KIGRGR--DANRTHQKGCNCKRSGCLKNYCECY 467



 Score =  121 bits (304), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 62/81 (76%), Gaps = 2/81 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFANG FC  CNC +C NN+E + DR  AI+ CLERNP AFRPKIG+    +  R H
Sbjct: 391 YCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAFRPKIGRGR--DANRTH 448

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 449 QKGCNCKRSGCLKNYCECYEA 469


>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 528

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
 gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
 gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
 gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
 gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
 gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
 gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
 gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
 gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
 gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
 gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
 gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
          Length = 528

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
          Length = 528

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 521

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 292 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 351

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 352 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 386



 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 308 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 367

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 368 SKGCNCKRSGCLKNYCECYEA 388


>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
          Length = 527

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 298 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 357

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 358 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 392



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 314 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 373

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 374 SKGCNCKRSGCLKNYCECYEA 394


>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
          Length = 528

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
           familiaris]
 gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
          Length = 528

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
          Length = 528

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
          Length = 515

 Score =  147 bits (372), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 286 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 345

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 346 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 380



 Score =  119 bits (298), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 302 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 361

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 362 SKGCNCKRSGCLKNYCECYEA 382


>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
          Length = 528

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  119 bits (298), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
           magnipapillata]
          Length = 530

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 70/94 (74%), Positives = 78/94 (82%), Gaps = 1/94 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPCNCTKS CLKLYC+CFANGEFC  CNC  CFNNI HE +R  AI+ CL+RNP AF P
Sbjct: 318 KKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHP 377

Query: 154 KIGKCLV-GEGERRHTKGCNCKRSGCLKNYCECY 186
           KIGK  + G+ ERRHTKGCNC+RSGCLKNYCECY
Sbjct: 378 KIGKGKIKGDTERRHTKGCNCRRSGCLKNYCECY 411



 Score =  126 bits (317), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/82 (73%), Positives = 67/82 (81%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV-GEGERR 59
           YC+CFANGEFC  CNC  CFNNI HE +R  AI+ CL+RNP AF PKIGK  + G+ ERR
Sbjct: 332 YCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHPKIGKGKIKGDTERR 391

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           HTKGCNC+RSGCLKNYCECYEA
Sbjct: 392 HTKGCNCRRSGCLKNYCECYEA 413


>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
          Length = 494

 Score =  147 bits (370), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 265 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 324

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 325 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 359



 Score =  118 bits (296), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 281 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 340

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 341 SKGCNCKRSGCLKNYCECYEA 361


>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
          Length = 495

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 266 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 325

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 326 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 360



 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 282 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 341

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 342 SKGCNCKRSGCLKNYCECYEA 362


>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
          Length = 614

 Score =  146 bits (368), Expect = 4e-33,   Method: Composition-based stats.
 Identities = 75/107 (70%), Positives = 85/107 (79%), Gaps = 1/107 (0%)

Query: 81  AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
           AG+   +    RPRK CNCTKS CLKLYCDCFANGEFC  CNC +CFNN+EHE++R  +I
Sbjct: 380 AGSAAEQNAAARPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSI 439

Query: 141 RQCLERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
           + CLERNP AF PKIGK   G+G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 440 KTCLERNPQAFHPKIGKSKAGQGDIRRHNKGCNCKRSGCLKNYCECY 486



 Score =  117 bits (292), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
           YCDCFANGEFC  CNC +CFNN+EHE++R  +I+ CLERNP AF PKIGK   G+G+ RR
Sbjct: 407 YCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAFHPKIGKSKAGQGDIRR 466

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCKRSGCLKNYCECYEA
Sbjct: 467 HNKGCNCKRSGCLKNYCECYEA 488



 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 35/121 (28%), Positives = 46/121 (38%), Gaps = 44/121 (36%)

Query: 59  RHTKGCNCKRSGCLKNYCECYEAG---------------------ARTLE------PNGI 91
           R  K CNC +S CLK YC+C+  G                     +R+++      P   
Sbjct: 391 RPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAF 450

Query: 92  RPR---------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDD 135
            P+               K CNC +S CLK YC+C+     C   C C  C  N E   D
Sbjct: 451 HPKIGKSKAGQGDIRRHNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVGC-KNFEESPD 509

Query: 136 R 136
           R
Sbjct: 510 R 510


>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
 gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
 gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
          Length = 419

 Score =  145 bits (367), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 190 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 249

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 250 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 284



 Score =  117 bits (293), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 206 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 265

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 266 SKGCNCKRSGCLKNYCECYEA 286


>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
          Length = 438

 Score =  145 bits (367), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 209 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 268

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 269 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 303



 Score =  117 bits (293), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 225 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 284

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 285 SKGCNCKRSGCLKNYCECYEA 305


>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
          Length = 419

 Score =  145 bits (366), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 190 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 249

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 250 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 284



 Score =  117 bits (293), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 206 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 265

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 266 SKGCNCKRSGCLKNYCECYEA 286


>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
          Length = 348

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/95 (76%), Positives = 84/95 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 119 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 178

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 179 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 213



 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 194

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 195 SKGCNCKRSGCLKNYCECYEA 215


>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
          Length = 528

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 72/95 (75%), Positives = 83/95 (87%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDC ANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393



 Score =  113 bits (283), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/81 (72%), Positives = 69/81 (85%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDC ANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 315 YCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395


>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
          Length = 844

 Score =  142 bits (357), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 79/109 (72%), Positives = 86/109 (78%), Gaps = 2/109 (1%)

Query: 80  EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
           E    T E  G+RPRK CNCTKS CLKLYCDCFANGEFC +CNCN+C NN+E+E+ R  A
Sbjct: 614 ETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNLENEELRQKA 673

Query: 140 IRQCLERNPNAFRPKIGKCLVGEGE--RRHTKGCNCKRSGCLKNYCECY 186
           IR CL+RNPNAFRPKIGK   G  E  RRH KGCNCKRSGCLKNYCECY
Sbjct: 674 IRGCLDRNPNAFRPKIGKSKAGGPEIIRRHNKGCNCKRSGCLKNYCECY 722



 Score =  108 bits (271), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 73/152 (48%), Positives = 87/152 (57%), Gaps = 40/152 (26%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--R 58
           YCDCFANGEFC +CNCN+C NN+E+E+ R  AIR CL+RNPNAFRPKIGK   G  E  R
Sbjct: 642 YCDCFANGEFCNRCNCNNCHNNLENEELRQKAIRGCLDRNPNAFRPKIGKSKAGGPEIIR 701

Query: 59  RHTKGCNCKRSGCLKNYCECYEA-----------GARTLE-----------PNGIRP--- 93
           RH KGCNCKRSGCLKNYCECYEA           G R +E           P  ++P   
Sbjct: 702 RHNKGCNCKRSGCLKNYCECYEAKIACSSICKCVGCRNVEETLERGRRRDAPRALQPLAG 761

Query: 94  -------------RKPCNCTKSMCLKLYCDCF 112
                        ++PC+   S  ++  C C 
Sbjct: 762 PPTYRPHVALAHAKQPCSFMTSEVIEAVCQCL 793


>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
          Length = 407

 Score =  142 bits (357), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 80/147 (54%), Positives = 90/147 (61%), Gaps = 17/147 (11%)

Query: 45  RPKIGKCLVGEGERRHTKGCNCKRSGC----LKNYCECYEAGARTLEPNGIRPRKPCNCT 100
           RP   K  VG  E+R  K  N  R       +K   + Y  G R          KPCNCT
Sbjct: 106 RPPKSKSEVGGIEQR--KAFNVMREQVNQVRMKTKKKVYAPGQR----------KPCNCT 153

Query: 101 KSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL- 159
           KS CLKLYCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+PKIG    
Sbjct: 154 KSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARG 213

Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
             + ER H KGC+CK+SGCLKNYCECY
Sbjct: 214 TADIERLHQKGCHCKKSGCLKNYCECY 240



 Score =  117 bits (293), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/82 (68%), Positives = 63/82 (76%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
           YCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+PKIG      + ER 
Sbjct: 161 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGTADIERL 220

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGC+CK+SGCLKNYCECYEA
Sbjct: 221 HQKGCHCKKSGCLKNYCECYEA 242


>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
 gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
          Length = 458

 Score =  141 bits (355), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/105 (75%), Positives = 85/105 (80%)

Query: 82  GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           G    E  G RPRKPCNCTKS CLKLYCDCFANGEFC  CNCN+CFNN+EHE +R  +I+
Sbjct: 223 GTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIK 282

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            CLERNP AF PKIGK   GEG+RRH KGCNCKRSGCLKNYCECY
Sbjct: 283 ACLERNPYAFHPKIGKGKEGEGDRRHNKGCNCKRSGCLKNYCECY 327



 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/81 (77%), Positives = 69/81 (85%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE +R  +I+ CLERNP AF PKIGK   GEG+RRH
Sbjct: 249 YCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIKACLERNPYAFHPKIGKGKEGEGDRRH 308

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 309 NKGCNCKRSGCLKNYCECYEA 329


>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
          Length = 360

 Score =  141 bits (355), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/95 (78%), Positives = 85/95 (89%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNCN+CFNN+EHE++R  AI+ CL+RNP AF
Sbjct: 131 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAF 190

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 191 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 225



 Score =  113 bits (282), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 62/81 (76%), Positives = 71/81 (87%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 147 YCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 206

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 207 SKGCNCKRSGCLKNYCECYEA 227


>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
          Length = 437

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/148 (53%), Positives = 90/148 (60%), Gaps = 13/148 (8%)

Query: 41  PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCT 100
           P A RP  G+ L GE  +   +         +K   + Y  G R          KPCNCT
Sbjct: 132 PGASRPAGGE-LTGEQRKAFIEMKEQVNQVRMKTKKKVYAPGQR----------KPCNCT 180

Query: 101 KSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV 160
           KS CLKLYCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+PKIG    
Sbjct: 181 KSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARG 240

Query: 161 GEG--ERRHTKGCNCKRSGCLKNYCECY 186
           G    ER H KGC+CK+SGCLKNYCECY
Sbjct: 241 GTADIERLHQKGCHCKKSGCLKNYCECY 268



 Score =  117 bits (292), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 57/83 (68%), Positives = 63/83 (75%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
           YCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+PKIG    G    ER
Sbjct: 188 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTADIER 247

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+SGCLKNYCECYEA
Sbjct: 248 LHQKGCHCKKSGCLKNYCECYEA 270


>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
 gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
          Length = 429

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 69/95 (72%), Positives = 75/95 (78%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKS CLKLYCDCFANGEFC  CNC  C NNIE++  R  AIRQ LERNPNAF+P
Sbjct: 168 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKP 227

Query: 154 KIGKCLVG--EGERRHTKGCNCKRSGCLKNYCECY 186
           KIG    G  + ER H KGC+CK+SGCLKNYCECY
Sbjct: 228 KIGIARGGITDIERLHQKGCHCKKSGCLKNYCECY 262



 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 58/83 (69%), Positives = 64/83 (77%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG--EGER 58
           YCDCFANGEFC  CNC  C NNIE++  R  AIRQ LERNPNAF+PKIG    G  + ER
Sbjct: 182 YCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITDIER 241

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+SGCLKNYCECYEA
Sbjct: 242 LHQKGCHCKKSGCLKNYCECYEA 264


>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
 gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
           protein 54
 gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
          Length = 435

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 68/95 (71%), Positives = 75/95 (78%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKS CLKLYCDCFANGEFC  CNC  C NNIE++  R  AIRQ LERNPNAF+P
Sbjct: 174 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKP 233

Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG     + + ER H KGC+CK+SGCLKNYCECY
Sbjct: 234 KIGIARGGITDIERLHQKGCHCKKSGCLKNYCECY 268



 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/83 (68%), Positives = 64/83 (77%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
           YCDCFANGEFC  CNC  C NNIE++  R  AIRQ LERNPNAF+PKIG     + + ER
Sbjct: 188 YCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITDIER 247

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+SGCLKNYCECYEA
Sbjct: 248 LHQKGCHCKKSGCLKNYCECYEA 270


>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
 gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
          Length = 587

 Score =  140 bits (353), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+P
Sbjct: 365 RKHCNCSKSQCLKLYCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKP 424

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI     G+  R H KGCNCKRSGCLKNYCECY
Sbjct: 425 KITAPASGD-MRLHNKGCNCKRSGCLKNYCECY 456



 Score =  120 bits (302), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF+PKI     G+  R H
Sbjct: 379 YCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASGD-MRLH 437

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 438 NKGCNCKRSGCLKNYCECYEA 458


>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
          Length = 408

 Score =  139 bits (350), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 67/96 (69%), Positives = 77/96 (80%), Gaps = 2/96 (2%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           +PRKPCNC  S CLKLYCDCFANGEFC   CNC +C N+ +HE DR  A++ CLERNPNA
Sbjct: 176 KPRKPCNCKNSQCLKLYCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNA 235

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           F+PK+G   VG+ ERRH KGCNCK+S CLKNYCECY
Sbjct: 236 FKPKVGHGRVGD-ERRHIKGCNCKKSSCLKNYCECY 270



 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/82 (69%), Positives = 66/82 (80%), Gaps = 2/82 (2%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANGEFC   CNC +C N+ +HE DR  A++ CLERNPNAF+PK+G   VG+ ERR
Sbjct: 192 YCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNAFKPKVGHGRVGD-ERR 250

Query: 60  HTKGCNCKRSGCLKNYCECYEA 81
           H KGCNCK+S CLKNYCECYEA
Sbjct: 251 HIKGCNCKKSSCLKNYCECYEA 272


>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
          Length = 348

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 71/95 (74%), Positives = 83/95 (87%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKP NCTKS+CL+LYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 119 RPRKPRNCTKSLCLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 178

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 179 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 213



 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 60/81 (74%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 194

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 195 SKGCNCKRSGCLKNYCECYEA 215


>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
 gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
          Length = 433

 Score =  139 bits (349), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 68/95 (71%), Positives = 74/95 (77%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKS CLKLYCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+P
Sbjct: 172 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKP 231

Query: 154 KIGKCLVGEG--ERRHTKGCNCKRSGCLKNYCECY 186
           KIG    G    ER H KGC+CK+SGCLKNYCECY
Sbjct: 232 KIGIARGGTADIERLHQKGCHCKKSGCLKNYCECY 266



 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 57/83 (68%), Positives = 63/83 (75%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
           YCDCFANGEFC  CNC  C NNI+++  R  AIRQ LERNPNAF+PKIG    G    ER
Sbjct: 186 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTADIER 245

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+SGCLKNYCECYEA
Sbjct: 246 LHQKGCHCKKSGCLKNYCECYEA 268


>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
 gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
          Length = 226

 Score =  138 bits (348), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 75/95 (78%), Positives = 80/95 (84%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS CLKLYCDCFA GEFC  CNC +C NNIEHE +R  AI+ CLERNP+AF
Sbjct: 2   RPRKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAF 61

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            PKIGK  VGE ERRH KGC+CKRSGCLKNYCECY
Sbjct: 62  HPKIGKGKVGESERRHNKGCHCKRSGCLKNYCECY 96



 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/81 (76%), Positives = 67/81 (82%)

Query: 1  YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
          YCDCFA GEFC  CNC +C NNIEHE +R  AI+ CLERNP+AF PKIGK  VGE ERRH
Sbjct: 18 YCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHPKIGKGKVGESERRH 77

Query: 61 TKGCNCKRSGCLKNYCECYEA 81
           KGC+CKRSGCLKNYCECYEA
Sbjct: 78 NKGCHCKRSGCLKNYCECYEA 98


>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
          Length = 448

 Score =  138 bits (347), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 67/100 (67%), Positives = 78/100 (78%), Gaps = 2/100 (2%)

Query: 89  NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNP 148
           +GI  +KPCNCTKSMCLKLYCDCFANGEFC  CNC  C NN+++E +R  AI+  LERNP
Sbjct: 216 SGIGIKKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNP 275

Query: 149 NAFRPKIGKCLVG--EGERRHTKGCNCKRSGCLKNYCECY 186
           NAF+PKIG    G  + ER H KGC+CK+S CLKNYCECY
Sbjct: 276 NAFKPKIGVASRGRVDSERLHQKGCHCKKSNCLKNYCECY 315



 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 54/83 (65%), Positives = 63/83 (75%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG--EGER 58
           YCDCFANGEFC  CNC  C NN+++E +R  AI+  LERNPNAF+PKIG    G  + ER
Sbjct: 235 YCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKPKIGVASRGRVDSER 294

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+S CLKNYCECYEA
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEA 317


>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
          Length = 460

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 74/95 (77%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + ++PCNCTKS CLKLYC+CFANGEFC  C C +CFNN EH  +R  A++ CL+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           RPKIG    G  +  HTKGCNCKRSGCLKNYCECY
Sbjct: 322 RPKIGSRKQGNVKGCHTKGCNCKRSGCLKNYCECY 356



 Score =  114 bits (284), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 62/81 (76%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFANGEFC  C C +CFNN EH  +R  A++ CL+RNP AFRPKIG    G  +  H
Sbjct: 278 YCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAFRPKIGSRKQGNVKGCH 337

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 338 TKGCNCKRSGCLKNYCECYEA 358


>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
          Length = 452

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPC+CT+SMCLKLYCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+P
Sbjct: 221 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 280

Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG       E ER H KGC+CK+S CLKNYCECY
Sbjct: 281 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 315



 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
           YCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+PKIG       E ER
Sbjct: 235 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 294

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+S CLKNYCECYEA
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEA 317


>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
          Length = 495

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPC+CT+SMCLKLYCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+P
Sbjct: 264 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 323

Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG       E ER H KGC+CK+S CLKNYCECY
Sbjct: 324 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 358



 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
           YCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+PKIG       E ER
Sbjct: 278 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 337

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+S CLKNYCECYEA
Sbjct: 338 LHQKGCHCKKSNCLKNYCECYEA 360


>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
 gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
          Length = 469

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPC+CT+SMCLKLYCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+P
Sbjct: 238 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 297

Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG       E ER H KGC+CK+S CLKNYCECY
Sbjct: 298 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 332



 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
           YCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+PKIG       E ER
Sbjct: 252 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 311

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+S CLKNYCECYEA
Sbjct: 312 LHQKGCHCKKSNCLKNYCECYEA 334


>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
           garnettii]
          Length = 544

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 60/98 (61%), Positives = 73/98 (74%)

Query: 89  NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNP 148
           + I P KPC C  S+CLKL CDCFANGEF   CNC S +N++EHE++R  AI+ CL+RNP
Sbjct: 339 SSIWPGKPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNP 398

Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            AF+PK GK    + +  H+KGCNCK SGCLKNYCECY
Sbjct: 399 EAFKPKTGKGKEEDSDXHHSKGCNCKXSGCLKNYCECY 436



 Score =  113 bits (283), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 51/80 (63%), Positives = 62/80 (77%)

Query: 2   CDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHT 61
           CDCFANGEF   CNC S +N++EHE++R  AI+ CL+RNP AF+PK GK    + +  H+
Sbjct: 359 CDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPKTGKGKEEDSDXHHS 418

Query: 62  KGCNCKRSGCLKNYCECYEA 81
           KGCNCK SGCLKNYCECYEA
Sbjct: 419 KGCNCKXSGCLKNYCECYEA 438


>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
          Length = 391

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +KPC+CT+SMCLKLYCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+P
Sbjct: 234 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 293

Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG       E ER H KGC+CK+S CLKNYCECY
Sbjct: 294 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 328



 Score =  109 bits (272), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
           YCDCFANGEFC  C+C  C N IE+E +R  AIR  LERNPNAF+PKIG       E ER
Sbjct: 248 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 307

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
            H KGC+CK+S CLKNYCECYEA
Sbjct: 308 LHQKGCHCKKSNCLKNYCECYEA 330


>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 450

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/151 (51%), Positives = 91/151 (60%), Gaps = 22/151 (14%)

Query: 36  CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRK 95
            L  NPN  +P  G   +  G      G     SG L N                   RK
Sbjct: 105 ILTLNPNIVQPMTGPPGLTPGGVNQLGGS----SGPLTNLTNS---------------RK 145

Query: 96  PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           PCNCTKS CLKLYC+CFA G+ C  CNC++C NN+ +E++R  AI+  LERNP AF PKI
Sbjct: 146 PCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHPKI 205

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           G+   GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 206 GR---GEGERKHTKGCNCKRSGCLKNYCECY 233



 Score =  102 bits (255), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 66/81 (81%), Gaps = 3/81 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G+ C  CNC++C NN+ +E++R  AI+  LERNP AF PKIG+   GEGER+H
Sbjct: 158 YCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 214

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 215 TKGCNCKRSGCLKNYCECYEA 235


>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
          Length = 764

 Score =  124 bits (312), Expect = 1e-26,   Method: Composition-based stats.
 Identities = 60/81 (74%), Positives = 69/81 (85%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC+ CNCN+C NN++HE++R  AI+ CL+RNP AF PKIGK   GE +RRH
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRRH 588

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 589 NKGCNCKRSGCLKNYCECYEA 609



 Score =  120 bits (302), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 58/79 (73%), Positives = 67/79 (84%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
           YCDCFANGEFC+ CNCN+C NN++HE++R  AI+ CL+RNP AF PKIGK   GE +RRH
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRRH 588

Query: 168 TKGCNCKRSGCLKNYCECY 186
            KGCNCKRSGCLKNYCECY
Sbjct: 589 NKGCNCKRSGCLKNYCECY 607


>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
          Length = 363

 Score =  124 bits (310), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/93 (72%), Positives = 77/93 (82%), Gaps = 3/93 (3%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKS CLKLYC+CFA G+ C  CNCN+C NN+ +E++R  AI+  LERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG+   GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 204 KIGR---GEGERKHTKGCNCKRSGCLKNYCECY 233



 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 66/81 (81%), Gaps = 3/81 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G+ C  CNCN+C NN+ +E++R  AI+  LERNP AF PKIG+   GEGER+H
Sbjct: 158 YCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 214

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 215 TKGCNCKRSGCLKNYCECYEA 235


>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
          Length = 364

 Score =  124 bits (310), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/93 (72%), Positives = 77/93 (82%), Gaps = 3/93 (3%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RKPCNCTKS CLKLYC+CFA G+ C  CNCN+C NN+ +E++R  AI+  LERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KIG+   GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 205 KIGR---GEGERKHTKGCNCKRSGCLKNYCECY 234



 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 66/81 (81%), Gaps = 3/81 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G+ C  CNCN+C NN+ +E++R  AI+  LERNP AF PKIG+   GEGER+H
Sbjct: 159 YCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 215

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           TKGCNCKRSGCLKNYCECYEA
Sbjct: 216 TKGCNCKRSGCLKNYCECYEA 236


>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
          Length = 702

 Score =  124 bits (310), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 58/95 (61%), Positives = 67/95 (70%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C+NN EHE  R  A+   LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212

Query: 154 KI--GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI  G      G  RH KGCNCK+SGCLK YCEC+
Sbjct: 213 KIAPGADNRRAGGARHNKGCNCKKSGCLKKYCECF 247



 Score =  109 bits (272), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 51/84 (60%), Positives = 60/84 (71%), Gaps = 2/84 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLVGEGER 58
           YC+CFA+G +C  CNC +C+NN EHE  R  A+   LERNPNAFRPKI  G      G  
Sbjct: 167 YCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRPKIAPGADNRRAGGA 226

Query: 59  RHTKGCNCKRSGCLKNYCECYEAG 82
           RH KGCNCK+SGCLK YCEC++AG
Sbjct: 227 RHNKGCNCKKSGCLKKYCECFQAG 250



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 39/111 (35%), Positives = 44/111 (39%), Gaps = 44/111 (39%)

Query: 62  KGCNCKRSGCLKNYCECYEAG-------------------------ARTLE--PNGIRPR 94
           K CNCK S CLK YCEC+ +G                            LE  PN  RP+
Sbjct: 154 KHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRPK 213

Query: 95  ----------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
                           K CNC KS CLK YC+CF  G  C + C C  C N
Sbjct: 214 IAPGADNRRAGGARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCKN 264


>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 798

 Score =  123 bits (309), Expect = 3e-26,   Method: Composition-based stats.
 Identities = 62/81 (76%), Positives = 70/81 (86%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF+PKIGK   GE +RRH
Sbjct: 590 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 649

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 650 SKGCNCKRSGCLKNYCECYEA 670



 Score =  120 bits (302), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 62/84 (73%), Positives = 70/84 (83%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           M L  YCDCFANGEFC  CNCN+CFNN+EHE +R  AI+ CL+RNP AF+PKIGK   GE
Sbjct: 585 MPLNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGE 644

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 645 SDRRHSKGCNCKRSGCLKNYCECY 668


>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
          Length = 559

 Score =  118 bits (295), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 56/106 (52%), Positives = 69/106 (65%), Gaps = 11/106 (10%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAF
Sbjct: 89  KKQKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAF 148

Query: 152 RPKIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           RPKI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 149 RPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECF 194



 Score =  101 bits (252), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 105 YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 164

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 165 DAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 196


>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
          Length = 559

 Score =  117 bits (293), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 56/106 (52%), Positives = 69/106 (65%), Gaps = 11/106 (10%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAF
Sbjct: 87  KKQKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAF 146

Query: 152 RPKIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           RPKI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 147 RPKIASSPHGTRDSREEAGEVLILGKHNKGCHCKKSGCLKKYCECF 192



 Score =  100 bits (250), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 103 YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 162

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 163 EAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 194


>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
           tropicalis]
          Length = 183

 Score =  115 bits (289), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/97 (64%), Positives = 78/97 (80%)

Query: 90  GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           G + +KPCNCT+S CLKLYCDCFANG+ C  CNC++C+NN EHE +R  A++ CL RNP+
Sbjct: 7   GSKTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPD 66

Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           AF+PKI K  +G+ + RH KGCNCKRSGC KNYCECY
Sbjct: 67  AFQPKIEKGKIGDVKPRHNKGCNCKRSGCFKNYCECY 103



 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/126 (47%), Positives = 77/126 (61%), Gaps = 18/126 (14%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+ C  CNC++C+NN EHE +R  A++ CL RNP+AF+PKI K  +G+ + RH
Sbjct: 25  YCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAFQPKIEKGKIGDVKPRH 84

Query: 61  TKGCNCKRSGCLKNYCECYEA--------------GARTLEPNGIRPRKPCNCTKSMCLK 106
            KGCNCKRSGC KNYCECYEA                ++L+   I P     CT    ++
Sbjct: 85  NKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVKSLDSGSIIP----TCTSREVIE 140

Query: 107 LYCDCF 112
             C C 
Sbjct: 141 ATCACL 146


>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
          Length = 321

 Score =  115 bits (288), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 57/95 (60%), Positives = 70/95 (73%), Gaps = 2/95 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           RK CNC  S CLKLYC+CFA+G++C Q CNC  C NN+E+   R  A+   LERNPNAFR
Sbjct: 20  RKTCNCRNSRCLKLYCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERNPNAFR 79

Query: 153 PKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
           PK+   L  GEG+ +H KGC+CK+SGCLK YCEC+
Sbjct: 80  PKVLPTLNGGEGDEKHNKGCHCKKSGCLKKYCECF 114



 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/84 (57%), Positives = 62/84 (73%), Gaps = 2/84 (2%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGER 58
           YC+CFA+G++C Q CNC  C NN+E+   R  A+   LERNPNAFRPK+   L  GEG+ 
Sbjct: 34  YCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE 93

Query: 59  RHTKGCNCKRSGCLKNYCECYEAG 82
           +H KGC+CK+SGCLK YCEC++A 
Sbjct: 94  KHNKGCHCKKSGCLKKYCECFQAS 117


>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
 gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
          Length = 1133

 Score =  115 bits (288), Expect = 8e-24,   Method: Composition-based stats.
 Identities = 49/93 (52%), Positives = 56/93 (60%), Gaps = 11/93 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
           YC+CFA+G +C  CNC  CFNN EHE  R  A+   LERNPNAFRPKI            
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
                    RH KGCNCK+S CLK YCEC++AG
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAG 236



 Score =  112 bits (279), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 48/96 (50%), Positives = 56/96 (58%), Gaps = 11/96 (11%)

Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------ 155
           +M  + YC+CFA+G +C  CNC  CFNN EHE  R  A+   LERNPNAFRPKI      
Sbjct: 138 NMTAQSYCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQA 197

Query: 156 -----GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                          RH KGCNCK+S CLK YCEC+
Sbjct: 198 PAAAAAVVNNAAAPGRHLKGCNCKKSSCLKKYCECF 233


>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 586

 Score =  115 bits (287), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 43/68 (63%), Positives = 55/68 (80%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T+ P+    ++PCNCTKS CLKLYCDCFANGEFCY CNC  C+NN+++E++R  AIR  L
Sbjct: 519 TIFPDEAYKKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATL 578

Query: 145 ERNPNAFR 152
           ERNP+AF+
Sbjct: 579 ERNPSAFK 586



 Score = 84.7 bits (208), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 30/45 (66%), Positives = 38/45 (84%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 45
           YCDCFANGEFCY CNC  C+NN+++E++R  AIR  LERNP+AF+
Sbjct: 542 YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586


>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
 gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
          Length = 766

 Score =  115 bits (287), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 54/98 (55%), Positives = 63/98 (64%), Gaps = 8/98 (8%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY--QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           K CNC  S CLKLYC+CFA G  C   +C C +C N +EH  +R  AI   L+RNPNAFR
Sbjct: 16  KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFR 75

Query: 153 PKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECY 186
           PKI    V    R    RH +GC+CKRSGCLK YCEC+
Sbjct: 76  PKIAA--VASPSRDVALRHNRGCHCKRSGCLKKYCECF 111



 Score = 98.6 bits (244), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 46/88 (52%), Positives = 56/88 (63%), Gaps = 8/88 (9%)

Query: 1   YCDCFANGEFCY--QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
           YC+CFA G  C   +C C +C N +EH  +R  AI   L+RNPNAFRPKI    V    R
Sbjct: 29  YCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFRPKIAA--VASPSR 86

Query: 59  ----RHTKGCNCKRSGCLKNYCECYEAG 82
               RH +GC+CKRSGCLK YCEC++A 
Sbjct: 87  DVALRHNRGCHCKRSGCLKKYCECFQAA 114


>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
          Length = 1772

 Score =  113 bits (282), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 49/93 (52%), Positives = 57/93 (61%), Gaps = 6/93 (6%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           KPCNC  S CLKLYCDCFA G FC + C C  C N+  H  DR  AI+  LE+NP AFR 
Sbjct: 331 KPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRA 390

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K     V      H  GC+CK++ CLK YCEC+
Sbjct: 391 K-----VDAEATTHQNGCHCKKTKCLKKYCECF 418



 Score = 95.5 bits (236), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 42/83 (50%), Positives = 50/83 (60%), Gaps = 6/83 (7%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA G FC + C C  C N+  H  DR  AI+  LE+NP AFR K     V      
Sbjct: 344 YCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRAK-----VDAEATT 398

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
           H  GC+CK++ CLK YCEC+EAG
Sbjct: 399 HQNGCHCKKTKCLKKYCECFEAG 421


>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
 gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
          Length = 316

 Score =  112 bits (281), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 62/81 (76%), Positives = 69/81 (85%), Gaps = 1/81 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFANGEFC  CNC +C NNIEHE++R  AI+ CLERNP+AFRPKIGK   G  ERRH
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEG-NERRH 196

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 197 SKGCNCKRSGCLKNYCECYEA 217



 Score =  109 bits (272), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 60/79 (75%), Positives = 67/79 (84%), Gaps = 1/79 (1%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
           YC+CFANGEFC  CNC +C NNIEHE++R  AI+ CLERNP+AFRPKIGK   G  ERRH
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEG-NERRH 196

Query: 168 TKGCNCKRSGCLKNYCECY 186
           +KGCNCKRSGCLKNYCECY
Sbjct: 197 SKGCNCKRSGCLKNYCECY 215


>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 638

 Score =  112 bits (279), Expect = 8e-23,   Method: Composition-based stats.
 Identities = 50/100 (50%), Positives = 67/100 (67%), Gaps = 2/100 (2%)

Query: 87  EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLER 146
           + + +   + C+C++S CLKLYC+CFA G FC  C+C  C+N  +HE+ R  AI + + R
Sbjct: 155 QQDSVLTSRRCSCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNR 214

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            P+AF+ KI   L  E    H +GCNCKRSGCLKNYCECY
Sbjct: 215 KPDAFQSKIAHSL--ENATVHARGCNCKRSGCLKNYCECY 252



 Score = 99.4 bits (246), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 46/84 (54%), Positives = 56/84 (66%), Gaps = 2/84 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G FC  C+C  C+N  +HE+ R  AI + + R P+AF+ KI   L  E    H
Sbjct: 176 YCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSKIAHSL--ENATVH 233

Query: 61  TKGCNCKRSGCLKNYCECYEAGAR 84
            +GCNCKRSGCLKNYCECYEA  R
Sbjct: 234 ARGCNCKRSGCLKNYCECYEARVR 257


>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
 gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
 gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
 gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
          Length = 629

 Score =  112 bits (279), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 54/115 (46%), Positives = 69/115 (60%), Gaps = 20/115 (17%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + ++ CNC  S CLKLYC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNPNAF
Sbjct: 125 KKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAF 184

Query: 152 RPKIGKCLVGEGERR--------------------HTKGCNCKRSGCLKNYCECY 186
           RPKI     G  + R                    H KGC+CK+SGCLK YCEC+
Sbjct: 185 RPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECF 239



 Score = 96.3 bits (238), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 60/101 (59%), Gaps = 20/101 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 141 YCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPQGARDSRV 200

Query: 60  -------------------HTKGCNCKRSGCLKNYCECYEA 81
                              H KGC+CK+SGCLK YCEC++A
Sbjct: 201 SRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECFQA 241


>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 151

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 64/118 (54%), Positives = 73/118 (61%), Gaps = 13/118 (11%)

Query: 82  GARTLEP-NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
            AR   P +    RK CNC  S CLKLYC+CFA+G++C  CNC  C NN EH  +R  AI
Sbjct: 17  AARASTPRDSASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNNDEHAHERQSAI 76

Query: 141 RQCLERNPNAFRPKI-----GKCLVGEG-------ERRHTKGCNCKRSGCLKNYCECY 186
            Q LERNPNAFRPKI        L G G       E RH +GC+CKRSGCLK YCEC+
Sbjct: 77  EQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGCLKKYCECF 134



 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 12/99 (12%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVGE 55
           YC+CFA+G++C  CNC  C NN EH  +R  AI Q LERNPNAFRPKI        L G 
Sbjct: 44  YCECFASGKYCDACNCAGCKNNDEHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLDGA 103

Query: 56  G-------ERRHTKGCNCKRSGCLKNYCECYEAGARTLE 87
           G       E RH +GC+CKRSGCLK YCEC++A    +E
Sbjct: 104 GAGDGATTEARHNRGCHCKRSGCLKKYCECFQAAIYCVE 142


>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
          Length = 371

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354



 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G 
Sbjct: 273 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 328

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGCNCKRSGCLKNYCECY
Sbjct: 329 AKLRHSKGCNCKRSGCLKNYCECY 352


>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
 gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
          Length = 682

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/107 (54%), Positives = 69/107 (64%), Gaps = 13/107 (12%)

Query: 93  PRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           P+KP  CNC  SMCLKLYC+CFA G  C  CNC +C NN+E+E +R  A+   L+RNPNA
Sbjct: 105 PKKPKQCNCKNSMCLKLYCECFAAGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNA 164

Query: 151 FRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
           FRPKI     G  +            +H KGCNCKRSGCLK YCEC+
Sbjct: 165 FRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGCLKKYCECF 211



 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 11/91 (12%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA G  C  CNC +C NN+E+E +R  A+   L+RNPNAFRPKI     G  +   
Sbjct: 122 YCECFAAGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTE 181

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYE 80
                    +H KGCNCKRSGCLK YCEC++
Sbjct: 182 DAQEVQMLGKHNKGCNCKRSGCLKKYCECFQ 212


>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
          Length = 536

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRP
Sbjct: 83  KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142

Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186



 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 97  YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156

Query: 58  --------RRHTKGCNCKRSGCLKNYCECYEA 81
                    RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188


>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
          Length = 532

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRP
Sbjct: 83  KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142

Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186



 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 97  YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156

Query: 58  --------RRHTKGCNCKRSGCLKNYCECYEA 81
                    RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188


>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
 gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific
          Length = 475

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354



 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G 
Sbjct: 273 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 328

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGCNCKRSGCLKNYCECY
Sbjct: 329 AKLRHSKGCNCKRSGCLKNYCECY 352


>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
           norvegicus]
          Length = 467

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 270 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 325

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 326 SKGCNCKRSGCLKNYCECYEA 346



 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G 
Sbjct: 265 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 320

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGCNCKRSGCLKNYCECY
Sbjct: 321 AKLRHSKGCNCKRSGCLKNYCECY 344


>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
          Length = 602

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 57/103 (55%), Positives = 66/103 (64%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  S CLKLYC+CFA+G +C  CNC +C NN+EHE  R  A+   LERNPNAFRPK
Sbjct: 128 KLCNCKHSRCLKLYCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPK 187

Query: 155 IGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           I     G  E R           H KGC+CK+SGCLK YCEC+
Sbjct: 188 IANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECF 230



 Score = 99.0 bits (245), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 49/92 (53%), Positives = 59/92 (64%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C NN+EHE  R  A+   LERNPNAFRPKI     G  E R 
Sbjct: 141 YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRD 200

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 201 EIGELVMLGKHNKGCHCKKSGCLKKYCECFQA 232


>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
 gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
 gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
 gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
          Length = 475

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354



 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           P+G   + P   T    L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RN
Sbjct: 262 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 313

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P AF+PK+GK  +G  + RH+KGCNCKRSGCLKNYCECY
Sbjct: 314 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 352


>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
 gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
 gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
 gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 571

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRP
Sbjct: 118 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 177

Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            RH KGC+CK+SGCLK YCEC+
Sbjct: 178 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 221



 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 132 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 191

Query: 58  --------RRHTKGCNCKRSGCLKNYCECYEA 81
                    RH KGC+CK+SGCLK YCEC++A
Sbjct: 192 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 223


>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 578

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRP
Sbjct: 118 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 177

Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            RH KGC+CK+SGCLK YCEC+
Sbjct: 178 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 221



 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 132 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 191

Query: 58  --------RRHTKGCNCKRSGCLKNYCECYEA 81
                    RH KGC+CK+SGCLK YCEC++A
Sbjct: 192 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 223


>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
          Length = 543

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRP
Sbjct: 83  KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142

Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186



 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
           YC+CFA+G +C  CNC +CFNN+E+E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 97  YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156

Query: 58  --------RRHTKGCNCKRSGCLKNYCECYEA 81
                    RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188


>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
          Length = 380

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  C+CN    N+ HE +R  AI+ C++RNP AF+PK+GK  +G  + RH
Sbjct: 183 YCDCFASGDFCTSCSCN----NLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSKLRH 238

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 239 SKGCNCKRSGCLKNYCECYEA 259



 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  C+CN    N+ HE +R  AI+ C++RNP AF+PK+GK  +G 
Sbjct: 178 IALAGYCDCFASGDFCTSCSCN----NLRHELERFKAIKACIDRNPEAFQPKMGKGRLGV 233

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGCNCKRSGCLKNYCECY
Sbjct: 234 SKLRHSKGCNCKRSGCLKNYCECY 257


>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
          Length = 475

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354



 Score =  107 bits (268), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           P+G   + P   T    L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RN
Sbjct: 262 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 313

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P AF+PK+GK  +G  + RH+KGCNCKRSGCLKNYCECY
Sbjct: 314 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 352


>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
          Length = 584

 Score =  109 bits (273), Expect = 4e-22,   Method: Composition-based stats.
 Identities = 48/100 (48%), Positives = 65/100 (65%), Gaps = 1/100 (1%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLER 146
           P G  P + CNC KS CLKLYC+C+A G +C  QC C +C NN+E+E  R  A+   L++
Sbjct: 400 PVGRPPLRSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDK 459

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           NP+AFRPKI +      +  H +GC C +S CL+ YCEC+
Sbjct: 460 NPDAFRPKIKQETTTNRKANHLRGCKCTKSKCLQRYCECF 499



 Score = 90.5 bits (223), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 38/83 (45%), Positives = 55/83 (66%), Gaps = 1/83 (1%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+C+A G +C  QC C +C NN+E+E  R  A+   L++NP+AFRPKI +      +  
Sbjct: 420 YCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRPKIKQETTTNRKAN 479

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
           H +GC C +S CL+ YCEC++AG
Sbjct: 480 HLRGCKCTKSKCLQRYCECFQAG 502


>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
          Length = 360

 Score =  109 bits (272), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 163 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 218

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 219 SKGCNCKRSGCLKNYCECYEA 239



 Score =  107 bits (266), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           P+G   + P   T    L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RN
Sbjct: 147 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 198

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P AF+PK+GK  +G  + RH+KGCNCKRSGCLKNYCECY
Sbjct: 199 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 237


>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
 gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
 gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
 gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
 gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
 gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
          Length = 295

 Score =  108 bits (271), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 98  YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 153

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 154 SKGCNCKRSGCLKNYCECYEA 174



 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           P+G   + P   T    L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RN
Sbjct: 82  PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 133

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P AF+PK+GK  +G  + RH+KGCNCKRSGCLKNYCECY
Sbjct: 134 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 172


>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
 gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
          Length = 191

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G  + RH
Sbjct: 98  YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 153

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNCKRSGCLKNYCECYEA
Sbjct: 154 SKGCNCKRSGCLKNYCECYEA 174



 Score =  105 bits (263), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCF++G+FC  C+CN    N+ HE +R  AI+ CL+RNP AF+PK+GK  +G 
Sbjct: 93  IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 148

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGCNCKRSGCLKNYCECY
Sbjct: 149 AKLRHSKGCNCKRSGCLKNYCECY 172


>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
          Length = 503

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/81 (62%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  C+ ++C NN+ HE +R  AI+ CL+RNP AF+PKIG   +G+ + RH
Sbjct: 302 YCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLGDVKPRH 361

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 362 NKGCNCRRSGCLKNYCECYEA 382



 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/84 (59%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  C+ ++C NN+ HE +R  AI+ CL+RNP AF+PKIG   +G+
Sbjct: 297 LTLAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLGD 356

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNC+RSGCLKNYCECY
Sbjct: 357 VKPRHNKGCNCRRSGCLKNYCECY 380


>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
 gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
          Length = 601

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRP
Sbjct: 120 QKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 179

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 180 KIASSPHGTRDSREENGEGLILGKHNKGCHCKKSGCLKKYCECF 223



 Score = 95.9 bits (237), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 134 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 193

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 194 ENGEGLILGKHNKGCHCKKSGCLKKYCECFQA 225


>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
 gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
          Length = 519

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRP
Sbjct: 112 QKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 171

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 172 KIASSPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECF 215



 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 126 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 185

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 186 ETGEGLVFVKHNKGCHCKKSGCLKKYCECFQA 217


>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
           [Cucumis sativus]
          Length = 556

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 56/103 (54%), Positives = 65/103 (63%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  S CLKLYC+CFA+G +C  CNC +C NN+EHE  R  A+   LERNPNAFRPK
Sbjct: 128 KLCNCKHSRCLKLYCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPK 187

Query: 155 IGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
                 G  E R           H KGC+CK+SGCLK YCEC+
Sbjct: 188 XANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECF 230



 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 48/92 (52%), Positives = 58/92 (63%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C NN+EHE  R  A+   LERNPNAFRPK      G  E R 
Sbjct: 141 YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRD 200

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 201 EIGELVMLGKHNKGCHCKKSGCLKKYCECFQA 232


>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 568

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/104 (55%), Positives = 69/104 (66%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C NN+E+E +R  AI   LERNPNAFRP
Sbjct: 116 KKQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRP 175

Query: 154 KIGKCL---------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
           KI             VGE     RH KGC+CK+SGCLK YCEC+
Sbjct: 176 KIAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECF 219



 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 61/92 (66%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-------- 52
           YC+CFA+G +C  CNC +C NN+E+E +R  AI   LERNPNAFRPKI            
Sbjct: 130 YCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPKIAASPHGGRDNRE 189

Query: 53  -VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
            VGE     RH KGC+CK+SGCLK YCEC++A
Sbjct: 190 EVGEVVLLGRHNKGCHCKKSGCLKKYCECFQA 221


>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
          Length = 561

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRP
Sbjct: 92  QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 151

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 152 KIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECF 195



 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 106 YCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 165

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 166 DAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 197


>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 615

 Score =  107 bits (267), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 68/105 (64%), Gaps = 19/105 (18%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC +CFNN+E+E  R  AI   LERNP+AFRPKIG
Sbjct: 163 CNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPDAFRPKIG 222

Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                            L+G    +H KGC+CK+SGCLK YCEC+
Sbjct: 223 SSPHTNRNNMEVAGDLPLIG----KHNKGCHCKKSGCLKKYCECF 263



 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 49/96 (51%), Positives = 62/96 (64%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC +CFNN+E+E  R  AI   LERNP+AFRPKIG           
Sbjct: 174 YCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNME 233

Query: 52  ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 L+G    +H KGC+CK+SGCLK YCEC++A
Sbjct: 234 VAGDLPLIG----KHNKGCHCKKSGCLKKYCECFQA 265


>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
 gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 619

 Score =  107 bits (266), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG
Sbjct: 162 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 221

Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                            LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 222 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 262



 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG           
Sbjct: 173 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 232

Query: 52  ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 233 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 264


>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
          Length = 576

 Score =  107 bits (266), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG
Sbjct: 114 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 173

Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                            LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 174 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 214



 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG           
Sbjct: 125 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 184

Query: 52  ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 185 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 216


>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
           CCMP2712]
          Length = 138

 Score =  107 bits (266), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/114 (53%), Positives = 74/114 (64%), Gaps = 15/114 (13%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLER 146
           P     RKPCNC  S CLKLYC+CFA+G +C  QCNC+SC NN+  E DR  A+   LER
Sbjct: 8   PGTPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLER 67

Query: 147 NPNAFRPKIGKC----LV---GEGER-------RHTKGCNCKRSGCLKNYCECY 186
           NP+AFRPKI +     +V    +G R       RH KGC+CK+SGCLK YCEC+
Sbjct: 68  NPHAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECF 121



 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 50/98 (51%), Positives = 64/98 (65%), Gaps = 15/98 (15%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC----LV-- 53
           YC+CFA+G +C  QCNC+SC NN+  E DR  A+   LERNP+AFRPKI +     +V  
Sbjct: 28  YCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRPKITQTGQQGIVDS 87

Query: 54  -GEGER-------RHTKGCNCKRSGCLKNYCECYEAGA 83
             +G R       RH KGC+CK+SGCLK YCEC++A  
Sbjct: 88  PAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAV 125


>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
          Length = 529

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG
Sbjct: 68  CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 127

Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                            LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 128 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 168



 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG           
Sbjct: 79  YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 138

Query: 52  ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 139 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 170


>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
 gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
          Length = 545

 Score =  106 bits (265), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           ++ CNC  S CLKLYC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRP
Sbjct: 133 QRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 192

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 193 KIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCLKKYCECF 236



 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C+NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 147 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 206

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 207 ETGDGLVFVKHNKGCHCKKSGCLKKYCECFQA 238


>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 601

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNP AFRP
Sbjct: 127 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRP 186

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  ++R           H KGC+CK+SGCLK YCEC+
Sbjct: 187 KIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECF 230



 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNP AFRPKI     G  ++R 
Sbjct: 141 YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGVRDKRE 200

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 201 DIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 232


>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
 gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
 gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
 gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
 gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
 gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  106 bits (264), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNP AFRP
Sbjct: 131 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRP 190

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  ++R           H KGC+CK+SGCLK YCEC+
Sbjct: 191 KIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECF 234



 Score = 93.6 bits (231), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNP AFRPKI     G  ++R 
Sbjct: 145 YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGRDKRE 204

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 205 DIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 236


>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1852

 Score =  106 bits (264), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 47/103 (45%), Positives = 58/103 (56%), Gaps = 4/103 (3%)

Query: 88   PNGIRP----RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
            PNGI P    R  CNC KS CLKLYC+CF+    C  CNC  C N       R  A+  C
Sbjct: 1498 PNGITPMSTPRNRCNCKKSQCLKLYCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDC 1557

Query: 144  LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            L +N +AF+P+I         + H+ GC CK+S CLK YCEC+
Sbjct: 1558 LNKNAHAFKPRIAAAPGAADLQGHSMGCKCKKSECLKKYCECF 1600



 Score = 82.4 bits (202), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 46/82 (56%)

Query: 1    YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
            YC+CF+    C  CNC  C N       R  A+  CL +N +AF+P+I         + H
Sbjct: 1522 YCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAPGAADLQGH 1581

Query: 61   TKGCNCKRSGCLKNYCECYEAG 82
            + GC CK+S CLK YCEC++AG
Sbjct: 1582 SMGCKCKKSECLKKYCECFQAG 1603



 Score = 58.2 bits (139), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 35/107 (32%), Positives = 41/107 (38%), Gaps = 42/107 (39%)

Query: 64   CNCKRSGCLKNYCECYEA----------GARTL-----------------EPNGIRPRKP 96
            CNCK+S CLK YCEC+ A            R L                   +  +PR  
Sbjct: 1511 CNCKKSQCLKLYCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIA 1570

Query: 97   --------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                          C C KS CLK YC+CF  G  C  +C C SC N
Sbjct: 1571 AAPGAADLQGHSMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKN 1617


>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
          Length = 592

 Score =  105 bits (262), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 67/104 (64%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRP
Sbjct: 106 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 165

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 166 KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECF 209



 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 120 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSRE 179

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 180 ESGEALVLGKHNKGCHCKKSGCLKKYCECFQA 211


>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
          Length = 524

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 55/104 (52%), Positives = 67/104 (64%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRP
Sbjct: 38  QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 97

Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
           KI     G  + R           H KGC+CK+SGCLK YCEC+
Sbjct: 98  KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECF 141



 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI     G  + R 
Sbjct: 52  YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSRE 111

Query: 60  ----------HTKGCNCKRSGCLKNYCECYEA 81
                     H KGC+CK+SGCLK YCEC++A
Sbjct: 112 ESGEALVLGKHNKGCHCKKSGCLKKYCECFQA 143


>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
 gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
          Length = 550

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 57/104 (54%), Positives = 66/104 (63%), Gaps = 18/104 (17%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG
Sbjct: 114 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 173

Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                           LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 174 SSPHTNRNNEVSSDLPLVG----KHNKGCHCKKSGCLKKYCECF 213



 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 50/95 (52%), Positives = 60/95 (63%), Gaps = 18/95 (18%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG           
Sbjct: 125 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNEV 184

Query: 52  -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 185 SSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 215


>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
          Length = 728

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/81 (70%), Positives = 68/81 (83%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNCN+C+NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDIKPRH 586

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 587 NKGCNCKRSGCLKNYCECYEA 607



 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/102 (59%), Positives = 74/102 (72%), Gaps = 4/102 (3%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T  P+G   + P   T    L  YCDCFANG+FC  CNCN+C+NN+ HE +R  AI+ CL
Sbjct: 508 TTFPSGSNLQGPAKIT----LAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACL 563

Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +RNP AF+PKIGK  +G+ + RH KGCNCKRSGCLKNYCECY
Sbjct: 564 DRNPEAFQPKIGKGKLGDIKPRHNKGCNCKRSGCLKNYCECY 605


>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 412

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/104 (54%), Positives = 66/104 (63%), Gaps = 18/104 (17%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG
Sbjct: 157 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216

Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                           LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 217 SSPHTNRNNEVSSDLPLVG----KHNKGCHCKKSGCLKKYCECF 256



 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 50/95 (52%), Positives = 60/95 (63%), Gaps = 18/95 (18%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG           
Sbjct: 168 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNEV 227

Query: 52  -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 228 SSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 258


>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
          Length = 511

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/81 (70%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390



 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/84 (66%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFANG+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 305 ITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388


>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
 gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
          Length = 511

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/81 (70%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390



 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/84 (66%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFANG+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 305 ITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388


>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
          Length = 512

 Score =  103 bits (258), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C N++ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 311 YCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 370

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 371 NKGCNCKRSGCLKNYCECYEA 391



 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C N++ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 306 ITLAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 365

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 366 VKPRHNKGCNCKRSGCLKNYCECY 389


>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
 gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
          Length = 220

 Score =  103 bits (258), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNPNAFRP
Sbjct: 82  QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRP 141

Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
           KI     G  +            +H KGC+CK+SGCLK YCEC+
Sbjct: 142 KIASSPHGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECF 185



 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA+G +C  CNC +CFNN+++E  R  A+   LERNPNAFRPKI     G  +   
Sbjct: 96  YCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 155

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 156 ETGEVKVLVKHNKGCHCKKSGCLKKYCECFQA 187


>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 1042

 Score =  103 bits (257), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 52/94 (55%), Positives = 62/94 (65%), Gaps = 8/94 (8%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           +  CNC KS CLKLYCDCFA G  C   CNC  C N  E  ++R  A+ Q  ERNPNAF+
Sbjct: 651 KTTCNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANT-EGNEERKQAMDQITERNPNAFK 709

Query: 153 PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           PKI      + +  H+KGC+CK+SGCLK YCECY
Sbjct: 710 PKI------QEKGFHSKGCHCKKSGCLKKYCECY 737



 Score = 87.8 bits (216), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 44/83 (53%), Positives = 55/83 (66%), Gaps = 8/83 (9%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA G  C   CNC  C N  E  ++R  A+ Q  ERNPNAF+PKI      + +  
Sbjct: 665 YCDCFAYGLGCSPDCNCADCANT-EGNEERKQAMDQITERNPNAFKPKI------QEKGF 717

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
           H+KGC+CK+SGCLK YCECY++G
Sbjct: 718 HSKGCHCKKSGCLKKYCECYQSG 740



 Score = 53.9 bits (128), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 34/101 (33%), Positives = 43/101 (42%), Gaps = 36/101 (35%)

Query: 64  CNCKRSGCLKNYCECYEAG------------ART---------------LEPNGIRPR-- 94
           CNCK+S CLK YC+C+  G            A T                 PN  +P+  
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERKQAMDQITERNPNAFKPKIQ 713

Query: 95  ------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
                 K C+C KS CLK YC+C+ +G  C   C C  C N
Sbjct: 714 EKGFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754


>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
 gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
          Length = 1987

 Score =  103 bits (257), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 51/105 (48%), Positives = 60/105 (57%), Gaps = 6/105 (5%)

Query: 82  GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           G RT E +     K C C KS CLKLYCDCFA G++C  C+C SC N  EH D       
Sbjct: 779 GRRTSENSS----KSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRRE 834

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
               R+P AF  KI   L   G  +H +GCNC++S CLK YCECY
Sbjct: 835 DIAARDPQAFTRKIQ--LAPNGNGKHKRGCNCRKSHCLKKYCECY 877



 Score = 88.2 bits (217), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 39/84 (46%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA G++C  C+C SC N  EH D           R+P AF  KI   L   G  +H
Sbjct: 801 YCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAARDPQAFTRKIQ--LAPNGNGKH 858

Query: 61  TKGCNCKRSGCLKNYCECYEAGAR 84
            +GCNC++S CLK YCECY+ G +
Sbjct: 859 KRGCNCRKSHCLKKYCECYQGGVK 882



 Score = 58.9 bits (141), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 40/108 (37%)

Query: 61  TKGCNCKRSGCLKNYCECYEAG-----------------------------AR------- 84
           +K C CK+S CLK YC+C+ AG                             AR       
Sbjct: 787 SKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAARDPQAFTR 846

Query: 85  --TLEPNGI-RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
              L PNG  + ++ CNC KS CLK YC+C+  G  C  QC C  C N
Sbjct: 847 KIQLAPNGNGKHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECEN 894


>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  103 bits (257), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 55/137 (40%), Positives = 66/137 (48%), Gaps = 43/137 (31%)

Query: 93  PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           P+K PCNC KS CLKLYC+CFA+G +C + CNC  C N    ED R  AI   LE+NPNA
Sbjct: 559 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNA 618

Query: 151 FRPKIGK-----CLVGEGERR------------------------------------HTK 169
           F+PKIG       +   G RR                                    H  
Sbjct: 619 FKPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKH 678

Query: 170 GCNCKRSGCLKNYCECY 186
           GC+CK+S C K YCEC+
Sbjct: 679 GCHCKKSACQKKYCECF 695



 Score = 83.6 bits (205), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/124 (36%), Positives = 56/124 (45%), Gaps = 42/124 (33%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA+G +C + CNC  C N    ED R  AI   LE+NPNAF+PKIG       +  
Sbjct: 575 YCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTS 634

Query: 55  EGERR------------------------------------HTKGCNCKRSGCLKNYCEC 78
            G RR                                    H  GC+CK+S C K YCEC
Sbjct: 635 GGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHGCHCKKSACQKKYCEC 694

Query: 79  YEAG 82
           ++AG
Sbjct: 695 FQAG 698


>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  103 bits (257), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 55/137 (40%), Positives = 66/137 (48%), Gaps = 43/137 (31%)

Query: 93  PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           P+K PCNC KS CLKLYC+CFA+G +C + CNC  C N    ED R  AI   LE+NPNA
Sbjct: 559 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNA 618

Query: 151 FRPKIGK-----CLVGEGERR------------------------------------HTK 169
           F+PKIG       +   G RR                                    H  
Sbjct: 619 FKPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKH 678

Query: 170 GCNCKRSGCLKNYCECY 186
           GC+CK+S C K YCEC+
Sbjct: 679 GCHCKKSACQKKYCECF 695



 Score = 83.6 bits (205), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 45/124 (36%), Positives = 56/124 (45%), Gaps = 42/124 (33%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA+G +C + CNC  C N    ED R  AI   LE+NPNAF+PKIG       +  
Sbjct: 575 YCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTS 634

Query: 55  EGERR------------------------------------HTKGCNCKRSGCLKNYCEC 78
            G RR                                    H  GC+CK+S C K YCEC
Sbjct: 635 GGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHGCHCKKSACQKKYCEC 694

Query: 79  YEAG 82
           ++AG
Sbjct: 695 FQAG 698


>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
          Length = 548

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 57/81 (70%), Positives = 68/81 (83%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNCN+C+NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDIKPRH 406

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 407 NKGCNCKRSGCLKNYCECYEA 427



 Score =  102 bits (253), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/102 (59%), Positives = 74/102 (72%), Gaps = 4/102 (3%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T  P+G   + P   T    L  YCDCFANG+FC  CNCN+C+NN+ HE +R  AI+ CL
Sbjct: 328 TTFPSGSNLQGPAKIT----LAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACL 383

Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +RNP AF+PKIGK  +G+ + RH KGCNCKRSGCLKNYCECY
Sbjct: 384 DRNPEAFQPKIGKGKLGDIKPRHNKGCNCKRSGCLKNYCECY 425


>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
          Length = 509

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 308 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 367

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 368 NKGCNCKRSGCLKNYCECYEA 388



 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 303 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 362

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 363 VKPRHNKGCNCKRSGCLKNYCECY 386


>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
          Length = 511

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 310 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390



 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 305 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388


>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
          Length = 356

 Score =  102 bits (253), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 54/104 (51%), Positives = 67/104 (64%), Gaps = 18/104 (17%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC +CFNN ++E  R  A+   L+RNP+AFRPKIG
Sbjct: 152 CNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIG 211

Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                           LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 212 SSPHMHRNNEVPSDLPLVG----KHNKGCHCKKSGCLKKYCECF 251



 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 18/95 (18%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC +CFNN ++E  R  A+   L+RNP+AFRPKIG           
Sbjct: 163 YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRNNEV 222

Query: 52  -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 223 PSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 253


>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
          Length = 656

 Score =  102 bits (253), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 56/113 (49%), Positives = 70/113 (61%), Gaps = 15/113 (13%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           L+    + +K CNC  S CLKLYC+CFA G +C  CNC +C NN+++E  R  A+   LE
Sbjct: 193 LKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLE 252

Query: 146 RNPNAFRPKIGKCLVGEGERR------------HTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKI    +   ERR            H KGC+CK+SGCLK YCEC+
Sbjct: 253 RNPNAFRPKIASSPL---ERRDSKECEIQVIGKHNKGCHCKKSGCLKKYCECF 302



 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 15/93 (16%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
           YC+CFA G +C  CNC +C NN+++E  R  A+   LERNPNAFRPKI    +   ERR 
Sbjct: 215 YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPL---ERRD 271

Query: 60  -----------HTKGCNCKRSGCLKNYCECYEA 81
                      H KGC+CK+SGCLK YCEC++A
Sbjct: 272 SKECEIQVIGKHNKGCHCKKSGCLKKYCECFQA 304


>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
          Length = 383

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 182 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 241

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 242 NKGCNCKRSGCLKNYCECYEA 262



 Score = 99.4 bits (246), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 177 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 236

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 237 VKPRHNKGCNCKRSGCLKNYCECY 260


>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
          Length = 515

 Score =  101 bits (252), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+ + RH
Sbjct: 314 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 373

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 374 NKGCNCKRSGCLKNYCECYEA 394



 Score = 99.0 bits (245), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+
Sbjct: 309 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 368

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 369 VKPRHNKGCNCKRSGCLKNYCECY 392


>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
          Length = 712

 Score =  101 bits (252), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 55/131 (41%), Positives = 67/131 (51%), Gaps = 39/131 (29%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFAN +FC   C CN C N+ EH+D+R HA  Q L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508

Query: 154 KI--------------------------------------GKCLVGEGERRHTKGCNCKR 175
           K+                                      G   V + ++RH +GCNCK+
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568

Query: 176 SGCLKNYCECY 186
           S C K YCEC+
Sbjct: 569 SNCQKKYCECF 579



 Score = 84.3 bits (207), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 46/121 (38%), Positives = 59/121 (48%), Gaps = 39/121 (32%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC+CFAN +FC   C CN C N+ EH+D+R HA  Q L RNP AFRPK+           
Sbjct: 462 YCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRPKVETQQDETSLFS 521

Query: 49  ---------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                                      G   V + ++RH +GCNCK+S C K YCEC++ 
Sbjct: 522 KDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKKSNCQKKYCECFQQ 581

Query: 82  G 82
           G
Sbjct: 582 G 582


>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
 gi|224030123|gb|ACN34137.1| unknown [Zea mays]
 gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
          Length = 607

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 18/103 (17%)

Query: 98  NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 157
           NC  S CLKLYC+CFA+G +C  CNC +CFNN ++E  R  A+   L+RNP+AFRPKIG 
Sbjct: 153 NCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGS 212

Query: 158 C--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
                          LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 213 SPHMHRNNEVPSDLPLVG----KHNKGCHCKKSGCLKKYCECF 251



 Score = 92.0 bits (227), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 18/95 (18%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA+G +C  CNC +CFNN ++E  R  A+   L+RNP+AFRPKIG           
Sbjct: 163 YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRNNEV 222

Query: 52  -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 223 PSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 253


>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
 gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
          Length = 510

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE  R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 309 YCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 368

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 369 NKGCNCRRSGCLKNYCECYEA 389



 Score = 98.6 bits (244), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE  R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 304 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLGD 363

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNC+RSGCLKNYCECY
Sbjct: 364 VKPRHNKGCNCRRSGCLKNYCECY 387


>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 912

 Score =  100 bits (250), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  S CLKLYC+CFA+G +C  CNC +C NN EHE  R  A+   LERNPNAFRPK
Sbjct: 469 KQCNCKNSRCLKLYCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPK 528

Query: 155 IGKC--LVGEGER----------RHTKGCNCKRSGCLKNYCECY 186
           I     ++ +G+           +H KGC+CK+SGCLK YCECY
Sbjct: 529 IANSPRILHDGKEDVVDRNGSAAKHNKGCHCKKSGCLKKYCECY 572



 Score = 84.3 bits (207), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 12/93 (12%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGEGER 58
           YC+CFA+G +C  CNC +C NN EHE  R  A+   LERNPNAFRPKI     ++ +G+ 
Sbjct: 482 YCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKE 541

Query: 59  ----------RHTKGCNCKRSGCLKNYCECYEA 81
                     +H KGC+CK+SGCLK YCECY+A
Sbjct: 542 DVVDRNGSAAKHNKGCHCKKSGCLKKYCECYQA 574


>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1223

 Score =  100 bits (250), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 51/104 (49%), Positives = 59/104 (56%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA  EFC   C C +CFN  E+E    +  +Q   RNP AF P
Sbjct: 737 KRCNCKKSKCLKLYCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAP 796

Query: 154 KI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI            + L      RH +GCNCK+S CLK YCECY
Sbjct: 797 KIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLKKYCECY 840



 Score = 84.7 bits (208), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 43/94 (45%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC+CFA  EFC   C C +CFN  E+E    +  +Q   RNP AF PKI           
Sbjct: 750 YCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPG 809

Query: 49  GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
            + L      RH +GCNCK+S CLK YCECY+AG
Sbjct: 810 DEALDTPASARHKRGCNCKKSLCLKKYCECYQAG 843



 Score = 36.6 bits (83), Expect = 4.9,   Method: Composition-based stats.
 Identities = 15/26 (57%), Positives = 17/26 (65%)

Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
           GE      K CNCK+S CLK YCEC+
Sbjct: 729 GEKPGDGCKRCNCKKSKCLKLYCECF 754


>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 238

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 46/66 (69%), Positives = 53/66 (80%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           I  R+PCNCTKS CLKLYCDCFANGEFC  C+C +C NN+ +E DR  AI+ CLERNP A
Sbjct: 52  INNRRPCNCTKSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMA 111

Query: 151 FRPKIG 156
           F+PKIG
Sbjct: 112 FQPKIG 117



 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 33/49 (67%), Positives = 39/49 (79%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
           YCDCFANGEFC  C+C +C NN+ +E DR  AI+ CLERNP AF+PKIG
Sbjct: 69  YCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG 117


>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  100 bits (250), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/95 (58%), Positives = 63/95 (66%), Gaps = 5/95 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYCDCFA G  C   CNC SC NN +H  +R + I+Q +ERNP AFRPK+
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV 263

Query: 156 GKCLVGEGE----RRHTKGCNCKRSGCLKNYCECY 186
                 E E     RH KGCNCK+S CLK YCECY
Sbjct: 264 DSRSNSEDEIDHKPRHFKGCNCKKSNCLKKYCECY 298



 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 5/89 (5%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
           YCDCFA G  C   CNC SC NN +H  +R + I+Q +ERNP AFRPK+      E E  
Sbjct: 215 YCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKVDSRSNSEDEID 274

Query: 58  --RRHTKGCNCKRSGCLKNYCECYEAGAR 84
              RH KGCNCK+S CLK YCECY+ G +
Sbjct: 275 HKPRHFKGCNCKKSNCLKKYCECYQMGVK 303


>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Oryctolagus cuniculus]
          Length = 450

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 64/81 (79%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN  HE +R  AI+ CL+RNP AF+PKIGK   G  + RH
Sbjct: 250 YCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAKPRH 309

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 310 NKGCNCKRSGCLKNYCECYEA 330



 Score = 98.6 bits (244), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 64/84 (76%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN  HE +R  AI+ CL+RNP AF+PKIGK   G 
Sbjct: 245 IALSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGS 304

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 305 AKPRHNKGCNCKRSGCLKNYCECY 328


>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
          Length = 516

 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE  R  AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 374

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 375 NKGCNCRRSGCLKNYCECYEA 395



 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE  R  AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 310 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLGD 369

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNC+RSGCLKNYCECY
Sbjct: 370 VKPRHNKGCNCRRSGCLKNYCECY 393


>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
          Length = 515

 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/110 (48%), Positives = 69/110 (62%), Gaps = 9/110 (8%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           L+    + +K CNC  S CLKLYC+CFA G +C  CNC +C NN+++E  R  A+   LE
Sbjct: 122 LKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLE 181

Query: 146 RNPNAFRPKIGKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
           RNPNAFRPKI      + +          +H KGC+CK+SGCLK YCEC+
Sbjct: 182 RNPNAFRPKIASSPQEQRDSKECEIKVIGKHNKGCHCKKSGCLKKYCECF 231



 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 9/90 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA G +C  CNC +C NN+++E  R  A+   LERNPNAFRPKI      + +   
Sbjct: 144 YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPQEQRDSKE 203

Query: 59  -------RHTKGCNCKRSGCLKNYCECYEA 81
                  +H KGC+CK+SGCLK YCEC++A
Sbjct: 204 CEIKVIGKHNKGCHCKKSGCLKKYCECFQA 233


>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
          Length = 391

 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK   G+ + RH
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGKGRSGDVKPRH 245

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 246 NKGCNCKRSGCLKNYCECYEA 266



 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK   G+
Sbjct: 181 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGKGRSGD 240

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 241 VKPRHNKGCNCKRSGCLKNYCECY 264


>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
 gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 459

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/118 (46%), Positives = 73/118 (61%), Gaps = 13/118 (11%)

Query: 80  EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
           E  ++   PN  + +K CNC  S CLKLYC+CFA+G +C  CNC +C N +E+E  R  A
Sbjct: 82  EVESKENTPN--KQQKHCNCKNSKCLKLYCECFASGSYCNGCNCVNCHNKLENESSRQVA 139

Query: 140 IRQCLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
           I   LERNP+AF+PKI     G  +            +H+KGC+C++SGCLK YCECY
Sbjct: 140 ISGILERNPDAFKPKIAGSPHGMKDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECY 197



 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 59/92 (64%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA+G +C  CNC +C N +E+E  R  AI   LERNP+AF+PKI     G  +   
Sbjct: 108 YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQE 167

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H+KGC+C++SGCLK YCECY+A
Sbjct: 168 NVQQVLLIGKHSKGCHCRKSGCLKKYCECYQA 199


>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
          Length = 508

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+ + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387



 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 361

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPRHNKGCNCRRSGCLKNYCECY 385


>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
          Length = 377

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 68/81 (83%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNCN+C+NN +HE +R +AI+ CL+RNP AF PKIGK  +G+ + RH
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGDIKPRH 235

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCEC+EA
Sbjct: 236 NKGCNCKRSGCLKNYCECFEA 256



 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 54/79 (68%), Positives = 66/79 (83%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
           YCDCFANG+FC  CNCN+C+NN +HE +R +AI+ CL+RNP AF PKIGK  +G+ + RH
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGDIKPRH 235

Query: 168 TKGCNCKRSGCLKNYCECY 186
            KGCNCKRSGCLKNYCEC+
Sbjct: 236 NKGCNCKRSGCLKNYCECF 254


>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/95 (58%), Positives = 63/95 (66%), Gaps = 5/95 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYCDCFA G  C   CNC SC NN +H  +R + I+Q +ERNP AFRPK+
Sbjct: 195 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKV 254

Query: 156 GKCLVGEGE----RRHTKGCNCKRSGCLKNYCECY 186
                 E E     RH KGCNCK+S CLK YCECY
Sbjct: 255 ESRSNSEDEIDQKPRHFKGCNCKKSNCLKKYCECY 289



 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 5/89 (5%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
           YCDCFA G  C   CNC SC NN +H  +R + I+Q +ERNP AFRPK+      E E  
Sbjct: 206 YCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEID 265

Query: 58  --RRHTKGCNCKRSGCLKNYCECYEAGAR 84
              RH KGCNCK+S CLK YCECY+ G +
Sbjct: 266 QKPRHFKGCNCKKSNCLKKYCECYQMGVK 294


>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
           fascicularis]
          Length = 490

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+ + RH
Sbjct: 289 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 348

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 349 NKGCNCRRSGCLKNYCECYEA 369



 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+
Sbjct: 284 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 343

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNC+RSGCLKNYCECY
Sbjct: 344 VKPRHNKGCNCRRSGCLKNYCECY 367


>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
          Length = 676

 Score =  100 bits (248), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 11/96 (11%)

Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
           S CLKLYC+CFA+G +C  C+C++C NN+ +E  R  A+   L+RNPNAFRPKI     G
Sbjct: 199 SRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHG 258

Query: 162 E---GER--------RHTKGCNCKRSGCLKNYCECY 186
               GE         +H KGC+CK+SGCLK YCEC+
Sbjct: 259 SHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECF 294



 Score = 93.6 bits (231), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 45/92 (48%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---GE 57
           YC+CFA+G +C  C+C++C NN+ +E  R  A+   L+RNPNAFRPKI     G    GE
Sbjct: 205 YCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGE 264

Query: 58  R--------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 265 EAGKVPLVGKHNKGCHCKKSGCLKKYCECFQA 296


>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
 gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
          Length = 897

 Score =  100 bits (248), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 54/105 (51%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCFA G FC   C+C  CFN  E+ +      +Q   RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568

Query: 154 KIGKC------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI K             L      RHT+GCNCKRS CLK YCECY
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECY 613



 Score = 81.6 bits (200), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 44/94 (46%), Positives = 51/94 (54%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
           YCDCFA G FC   C+C  CFN  E+ +      +Q   RNP AF PKI K         
Sbjct: 522 YCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNM 581

Query: 52  ----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
               L      RHT+GCNCKRS CLK YCECY++
Sbjct: 582 EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQS 615


>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
 gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
          Length = 629

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/107 (50%), Positives = 64/107 (59%), Gaps = 19/107 (17%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  S CLKLYC+CFA G  C  CNC +C NN+E+E  R  A+   LERNP AF+PK
Sbjct: 147 KHCNCKNSRCLKLYCECFAAGVHCTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPK 206

Query: 155 IGKC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           I                  LVG    +H KGC+CK+SGCLK YCEC+
Sbjct: 207 IASSPHGSRDAKEDAMEVQLVG----KHNKGCHCKKSGCLKKYCECF 249



 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 57/96 (59%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
           YC+CFA G  C  CNC +C NN+E+E  R  A+   LERNP AF+PKI            
Sbjct: 160 YCECFAAGVHCTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKE 219

Query: 52  ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 LVG    +H KGC+CK+SGCLK YCEC++A
Sbjct: 220 DAMEVQLVG----KHNKGCHCKKSGCLKKYCECFQA 251


>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
           distachyon]
          Length = 540

 Score = 99.8 bits (247), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 57/111 (51%), Positives = 65/111 (58%), Gaps = 13/111 (11%)

Query: 89  NGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           NG   RK C C  S CLKLYCDCFA+G +C   CNC +C N++ HE  R  AI   +ERN
Sbjct: 143 NGTPRRKNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERN 202

Query: 148 PNAFRPKIGKCL----------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
           P AF PKIG               EG R  +H KGC CKRS CLK YCEC+
Sbjct: 203 PVAFMPKIGNIPRHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECF 253



 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 56/94 (59%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL------- 52
           YCDCFA+G +C   CNC +C N++ HE  R  AI   +ERNP AF PKIG          
Sbjct: 162 YCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIPRHAAQNR 221

Query: 53  ---VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
                EG R  +H KGC CKRS CLK YCEC+++
Sbjct: 222 EYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQS 255


>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 542

 Score = 99.8 bits (247), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 50/97 (51%), Positives = 62/97 (63%), Gaps = 2/97 (2%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + RK C C  SMCLKLYC+CFA G+ C  C+C +C N+ +HE +   A    L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287

Query: 152 RPKIGKCLVGEGER--RHTKGCNCKRSGCLKNYCECY 186
            PK+      +G    +H KGCNCKRSGC KNYCEC+
Sbjct: 288 EPKMTAVPKEDGALAIKHQKGCNCKRSGCQKNYCECF 324



 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 53/85 (62%), Gaps = 2/85 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA G+ C  C+C +C N+ +HE +   A    L RNP AF PK+      +G    
Sbjct: 244 YCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAFEPKMTAVPKEDGALAI 303

Query: 59  RHTKGCNCKRSGCLKNYCECYEAGA 83
           +H KGCNCKRSGC KNYCEC+ AG 
Sbjct: 304 KHQKGCNCKRSGCQKNYCECFHAGV 328


>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
          Length = 263

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 67/81 (82%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN++HE +R  AI+ CL RNP AF+PKIG   +G+ + RH
Sbjct: 53  YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGDVKPRH 112

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGC+C+RSGCLKNYCECYEA
Sbjct: 113 SKGCSCRRSGCLKNYCECYEA 133



 Score = 96.7 bits (239), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 67/84 (79%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN++HE +R  AI+ CL RNP AF+PKIG   +G+
Sbjct: 48  ITLAGYCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGD 107

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH+KGC+C+RSGCLKNYCECY
Sbjct: 108 VKPRHSKGCSCRRSGCLKNYCECY 131


>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
 gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
          Length = 497

 Score = 99.4 bits (246), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 57/116 (49%), Positives = 76/116 (65%), Gaps = 12/116 (10%)

Query: 83  ARTLEPNGI-RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           ART   NG  +  + C+C +S CLKLYC+CFA+G +C +C+C +C NN+E+ED R  A  
Sbjct: 80  ARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYCDECSCANCHNNVENEDVRREAAE 139

Query: 142 QCLERNPNAFRPKI--GKCLV---GEGER------RHTKGCNCKRSGCLKNYCECY 186
             LERNPNAF+PKI    C     G+  +      +H KGC+CKR+GCLK YCEC+
Sbjct: 140 CILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCKRTGCLKKYCECF 195



 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 62/92 (67%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLV---GE 55
           YC+CFA+G +C +C+C +C NN+E+ED R  A    LERNPNAF+PKI    C     G+
Sbjct: 106 YCECFASGSYCDECSCANCHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGD 165

Query: 56  GER------RHTKGCNCKRSGCLKNYCECYEA 81
             +      +H KGC+CKR+GCLK YCEC++A
Sbjct: 166 AAKDVLVMAKHIKGCHCKRTGCLKKYCECFQA 197


>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
           both male and female reproductiv [Ectocarpus
           siliculosus]
          Length = 735

 Score = 99.0 bits (245), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 47/90 (52%), Positives = 56/90 (62%), Gaps = 3/90 (3%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC KS CLKLYC+CF   ++C  CNC  C N    ED R  AI+  +ERNP AF  K  
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFE 499

Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +     G+R H  GCNCK+S CLK YCEC+
Sbjct: 500 R---RAGKRSHNAGCNCKKSACLKKYCECF 526



 Score = 84.7 bits (208), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 40/82 (48%), Positives = 50/82 (60%), Gaps = 3/82 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CF   ++C  CNC  C N    ED R  AI+  +ERNP AF  K  +     G+R H
Sbjct: 451 YCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFER---RAGKRSH 507

Query: 61  TKGCNCKRSGCLKNYCECYEAG 82
             GCNCK+S CLK YCEC++AG
Sbjct: 508 NAGCNCKKSACLKKYCECFQAG 529



 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 38/120 (31%), Positives = 45/120 (37%), Gaps = 39/120 (32%)

Query: 48  IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYE-------------------AGARTLEP 88
           +G    G G    +  CNCK+S CLK YCEC++                      R L  
Sbjct: 424 VGMAGCGGGRDPLSVTCNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAI 483

Query: 89  NGIRPRKP-------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
            G   R P                   CNC KS CLK YC+CF  G  C   C C +C N
Sbjct: 484 QGTIERNPQAFVSKFERRAGKRSHNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKN 543


>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
           [Cucumis sativus]
          Length = 788

 Score = 99.0 bits (245), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 56/117 (47%), Positives = 65/117 (55%), Gaps = 14/117 (11%)

Query: 84  RTLEPNGIRPR--KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAI 140
           R LEP G      K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      
Sbjct: 480 RRLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATR 539

Query: 141 RQCLERNPNAFRPKI---GKCLVGEGER--------RHTKGCNCKRSGCLKNYCECY 186
           +Q   RNP AF PK+      L   G+         RH +GCNCK+S CLK YCECY
Sbjct: 540 KQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECY 596



 Score = 80.1 bits (196), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+      L   G
Sbjct: 506 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPG 565

Query: 57  ER--------RHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 566 DESNKTPASARHKRGCNCKKSSCLKKYCECYQGG 599


>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
          Length = 999

 Score = 99.0 bits (245), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 53/139 (38%), Positives = 66/139 (47%), Gaps = 45/139 (32%)

Query: 93  PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           P+K PCNC KS CLKLYC+CFA+G +C + CNC  C N    E+ R  AI   LE+NPNA
Sbjct: 562 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNA 621

Query: 151 FRPKIG-----------KCLVGE--------------------------------GERRH 167
           F+PKIG           +  VG                                   + H
Sbjct: 622 FKPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMH 681

Query: 168 TKGCNCKRSGCLKNYCECY 186
             GC+CK+S C K YCEC+
Sbjct: 682 KHGCHCKKSACQKKYCECF 700



 Score = 78.6 bits (192), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 43/126 (34%), Positives = 56/126 (44%), Gaps = 44/126 (34%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---------- 49
           YC+CFA+G +C + CNC  C N    E+ R  AI   LE+NPNAF+PKIG          
Sbjct: 578 YCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAFKPKIGATPGMTPGSA 637

Query: 50  -KCLVGE--------------------------------GERRHTKGCNCKRSGCLKNYC 76
            +  VG                                   + H  GC+CK+S C K YC
Sbjct: 638 RRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHKHGCHCKKSACQKKYC 697

Query: 77  ECYEAG 82
           EC++AG
Sbjct: 698 ECFQAG 703



 Score = 37.0 bits (84), Expect = 3.8,   Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 149 NAFRPKIGKCLVGEGERRHTKG-CNCKRSGCLKNYCECY 186
           +A +  +G   VG   +   K  CNCK+S CLK YCEC+
Sbjct: 544 SASKAPLGPAPVGMATQVPKKAPCNCKKSKCLKLYCECF 582


>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
          Length = 788

 Score = 99.0 bits (245), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 56/117 (47%), Positives = 65/117 (55%), Gaps = 14/117 (11%)

Query: 84  RTLEPNGIRPR--KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAI 140
           R LEP G      K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      
Sbjct: 480 RRLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATR 539

Query: 141 RQCLERNPNAFRPKI---GKCLVGEGER--------RHTKGCNCKRSGCLKNYCECY 186
           +Q   RNP AF PK+      L   G+         RH +GCNCK+S CLK YCECY
Sbjct: 540 KQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECY 596



 Score = 80.1 bits (196), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+      L   G
Sbjct: 506 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPG 565

Query: 57  ER--------RHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 566 DESNKTPASARHKRGCNCKKSSCLKKYCECYQGG 599


>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
          Length = 517

 Score = 99.0 bits (245), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 54/81 (66%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+ + RH
Sbjct: 320 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 379

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGC+C+RSGCLKNYCECYEA
Sbjct: 380 NKGCSCRRSGCLKNYCECYEA 400



 Score = 96.7 bits (239), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 53/84 (63%), Positives = 66/84 (78%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ HE +R  AI+ CL RNP AF+PKIGK  +G+
Sbjct: 315 IALARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 374

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGC+C+RSGCLKNYCECY
Sbjct: 375 VKPRHNKGCSCRRSGCLKNYCECY 398


>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
          Length = 821

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 54/105 (51%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCFA G FC   C+C  CFN  E+ +      +Q   RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492

Query: 154 KIGKC------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI K             L      RHT+GCNCKRS CLK YCECY
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECY 537



 Score = 80.5 bits (197), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 44/94 (46%), Positives = 51/94 (54%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
           YCDCFA G FC   C+C  CFN  E+ +      +Q   RNP AF PKI K         
Sbjct: 446 YCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNM 505

Query: 52  ----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
               L      RHT+GCNCKRS CLK YCECY++
Sbjct: 506 EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQS 539


>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
 gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
          Length = 921

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 49/106 (46%), Positives = 55/106 (51%), Gaps = 14/106 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCF  G FC   C C  C N +E +D      +Q   RNP AF P
Sbjct: 520 KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAP 579

Query: 154 KIGKC-------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI  C             +      RH +GCNCKRS C K YCEC+
Sbjct: 580 KIVPCAADVPPNNMEDVNMTTPASARHKRGCNCKRSKCTKKYCECF 625



 Score = 80.1 bits (196), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 40/95 (42%), Positives = 47/95 (49%), Gaps = 14/95 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
           YCDCF  G FC   C C  C N +E +D      +Q   RNP AF PKI  C        
Sbjct: 533 YCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAPKIVPCAADVPPNN 592

Query: 52  -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                +      RH +GCNCKRS C K YCEC++A
Sbjct: 593 MEDVNMTTPASARHKRGCNCKRSKCTKKYCECFQA 627


>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
          Length = 484

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/103 (54%), Positives = 73/103 (70%), Gaps = 5/103 (4%)

Query: 89  NGIRPRKPCNCTKS-----MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
           N I  + P   T+S     + ++ YCDCFA+G+ C  CNCN+C NN+ HE +R  AI+ C
Sbjct: 274 NLITQQSPAEVTESSMDSDVGVRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKAC 333

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           L+RNP AF+PKIG   +G+ + RH +GCNCKRSGCLKNYCECY
Sbjct: 334 LDRNPEAFQPKIGPGSLGDAKPRHNRGCNCKRSGCLKNYCECY 376



 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 52/80 (65%), Positives = 64/80 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+ C  CNCN+C NN+ HE +R  AI+ CL+RNP AF+PKIG   +G+ + RH
Sbjct: 298 YCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGPGSLGDAKPRH 357

Query: 61  TKGCNCKRSGCLKNYCECYE 80
            +GCNCKRSGCLKNYCECYE
Sbjct: 358 NRGCNCKRSGCLKNYCECYE 377


>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
          Length = 579

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 11/96 (11%)

Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
           S CLKLYC+CFA+G +C  C+C++C NN+ +E  R  A+   L+RNPNAFRPKI     G
Sbjct: 118 SRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHG 177

Query: 162 E---GER--------RHTKGCNCKRSGCLKNYCECY 186
               GE         +H KGC+CK+SGCLK YCEC+
Sbjct: 178 SHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECF 213



 Score = 91.7 bits (226), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 45/92 (48%), Positives = 60/92 (65%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---GE 57
           YC+CFA+G +C  C+C++C NN+ +E  R  A+   L+RNPNAFRPKI     G    GE
Sbjct: 124 YCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGE 183

Query: 58  R--------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 184 EAGKVPLVGKHNKGCHCKKSGCLKKYCECFQA 215


>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
          Length = 283

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 68/81 (83%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  C+CN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK  +G+ + RH
Sbjct: 77  YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLGDVKPRH 136

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCECYEA
Sbjct: 137 NKGCNCKRSGCLKNYCECYEA 157



 Score = 95.5 bits (236), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 68/84 (80%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  C+CN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK  +G+
Sbjct: 72  ITLAGYCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLGD 131

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + RH KGCNCKRSGCLKNYCECY
Sbjct: 132 VKPRHNKGCNCKRSGCLKNYCECY 155


>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 491

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 50/94 (53%), Positives = 63/94 (67%), Gaps = 2/94 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           +K CNC  S CLKLYC+CFA+G++C   C C SC NN  +++ R  A+   LERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196

Query: 153 PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           PKI      E   RH+KGC+C++S C K YCECY
Sbjct: 197 PKINSAAT-ESWARHSKGCHCRKSNCQKRYCECY 229



 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 2/85 (2%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA+G++C   C C SC NN  +++ R  A+   LERNP AFRPKI      E   R
Sbjct: 151 YCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFRPKINSAAT-ESWAR 209

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
           H+KGC+C++S C K YCECY+AG +
Sbjct: 210 HSKGCHCRKSNCQKRYCECYQAGIQ 234


>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
          Length = 508

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/81 (66%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPRH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387



 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/82 (64%), Positives = 64/82 (78%)

Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
           L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  +
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVK 363

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH KGCNC+RSGCLKNYCECY
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECY 385


>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
          Length = 508

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/81 (66%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPRH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387



 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/82 (64%), Positives = 64/82 (78%)

Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
           L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  +
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVK 363

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH KGCNC+RSGCLKNYCECY
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECY 385


>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 983

 Score = 97.8 bits (242), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 51/131 (38%), Positives = 65/131 (49%), Gaps = 35/131 (26%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           +  + PCNC KS CLKLYC+CFANG +C + CNC  C N +E  D+R  A+   LE+NPN
Sbjct: 581 VSKKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPN 640

Query: 150 AFRPKI----------------------------------GKCLVGEGERRHTKGCNCKR 175
           AF+PKI                                  G       +R H  GC+CK+
Sbjct: 641 AFKPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKK 700

Query: 176 SGCLKNYCECY 186
           S C K YCEC+
Sbjct: 701 SACQKKYCECF 711



 Score = 84.0 bits (206), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 58/172 (33%), Positives = 77/172 (44%), Gaps = 6/172 (3%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFANG +C + CNC  C N +E  D+R  A+   LE+NPNAF+PKI        +  
Sbjct: 598 YCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAFKPKILPSQSSAVKSL 657

Query: 60  HTKGCNCKR-SGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC 118
           +    + K  S  +         G RT        +  C+C KS C K YC+CF  G  C
Sbjct: 658 YEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSACQKKYCECFQAGVLC 717

Query: 119 Y-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTK 169
              C C  C N    E  +  A    L   P +F P         G ++HTK
Sbjct: 718 GDNCRCIDCRNV---EKKKEIAATDALLSTPESFLPSTPLSQAIYGSKKHTK 766


>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 255

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 64/99 (64%), Gaps = 7/99 (7%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G  C  CNC +C NN+     R  A+   LERNPNAFRP
Sbjct: 6   KKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAFRP 65

Query: 154 KIGKCLVG------EGERRHTKGCNCKRSGCLKNYCECY 186
           KI    VG      EG  RH +GC+CK+S CLK YCEC+
Sbjct: 66  KIAPG-VGPLEKRREGASRHNRGCHCKKSSCLKKYCECF 103



 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/87 (52%), Positives = 56/87 (64%), Gaps = 7/87 (8%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG------ 54
           YC+CFA+G  C  CNC +C NN+     R  A+   LERNPNAFRPKI    VG      
Sbjct: 20  YCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAFRPKIAPG-VGPLEKRR 78

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEA 81
           EG  RH +GC+CK+S CLK YCEC++A
Sbjct: 79  EGASRHNRGCHCKKSSCLKKYCECFQA 105


>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
          Length = 502

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC+ CNCN+C NN+ HE  R  A++ CL+RNP AF+PKIGK   G+   RH
Sbjct: 302 YCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGKGRPGDVTPRH 361

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            +GCNC+RSGCLKNYCECYEA
Sbjct: 362 NRGCNCRRSGCLKNYCECYEA 382



 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC+ CNCN+C NN+ HE  R  A++ CL+RNP AF+PKIGK   G+
Sbjct: 297 ISLAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGKGRPGD 356

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
              RH +GCNC+RSGCLKNYCECY
Sbjct: 357 VTPRHNRGCNCRRSGCLKNYCECY 380


>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
          Length = 764

 Score = 97.1 bits (240), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 54/116 (46%), Positives = 66/116 (56%), Gaps = 13/116 (11%)

Query: 84  RTLEPNG-IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR 141
           R +EP G     K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +
Sbjct: 464 RKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRK 523

Query: 142 QCLERNPNAFRPKIGKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
           Q   RNP AF PK+ +       +G+         RH +GCNCK+S CLK YCECY
Sbjct: 524 QIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCLKKYCECY 579



 Score = 79.0 bits (193), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 53/94 (56%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       +G
Sbjct: 489 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 548

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 549 DDPNKTPASARHKRGCNCKKSNCLKKYCECYQGG 582



 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 46/174 (26%), Positives = 61/174 (35%), Gaps = 59/174 (33%)

Query: 21  NNIEHEDDRHHAIRQCLERN-PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 79
           N ++  +D   A  +   +N P   R K+      EG +R    CNCK+S CLK YCEC+
Sbjct: 438 NGVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKR----CNCKKSKCLKLYCECF 493

Query: 80  EAGARTLE---------------------------------------------------- 87
            AG   +E                                                    
Sbjct: 494 AAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNK 553

Query: 88  -PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA 139
            P   R ++ CNC KS CLK YC+C+  G  C   C C  C N    +D    A
Sbjct: 554 TPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSAQA 607


>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
          Length = 1094

 Score = 96.7 bits (239), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 51/105 (48%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCFA G +C + C C  CFN  E++D      +Q   RNP AF P
Sbjct: 574 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 633

Query: 154 KIGKCLVGE------------GERRHTKGCNCKRSGCLKNYCECY 186
           KI   + G                RH +GCNCK+S CLK YCECY
Sbjct: 634 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECY 678



 Score = 79.0 bits (193), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 42/94 (44%), Positives = 51/94 (54%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
           YCDCFA G +C + C C  CFN  E++D      +Q   RNP AF PKI   + G     
Sbjct: 587 YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINS 646

Query: 56  --------GERRHTKGCNCKRSGCLKNYCECYEA 81
                      RH +GCNCK+S CLK YCECY+A
Sbjct: 647 GEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 680


>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
           CCMP2712]
          Length = 118

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 63/99 (63%), Gaps = 6/99 (6%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           RK CNC  S CLKLYC+CFA+G +C   CNC  C NN  +E  R  AI   LERNP AF+
Sbjct: 3   RKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQ 62

Query: 153 PKIGKCLV-----GEGERRHTKGCNCKRSGCLKNYCECY 186
           PKI             + RHTKGC+CK+SGCLK YCEC+
Sbjct: 63  PKIASSAPPGIEDSPVQGRHTKGCHCKKSGCLKKYCECF 101



 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/89 (51%), Positives = 56/89 (62%), Gaps = 6/89 (6%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV-----G 54
           YC+CFA+G +C   CNC  C NN  +E  R  AI   LERNP AF+PKI           
Sbjct: 17  YCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQPKIASSAPPGIEDS 76

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
             + RHTKGC+CK+SGCLK YCEC++AG 
Sbjct: 77  PVQGRHTKGCHCKKSGCLKKYCECFQAGV 105


>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
          Length = 1001

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 51/105 (48%), Positives = 59/105 (56%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCFA G +C + C C  CFN  E++D      +Q   RNP AF P
Sbjct: 567 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 626

Query: 154 KIGKCLVGE------------GERRHTKGCNCKRSGCLKNYCECY 186
           KI   + G                RH +GCNCK+S CLK YCECY
Sbjct: 627 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECY 671



 Score = 79.0 bits (193), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 42/94 (44%), Positives = 51/94 (54%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
           YCDCFA G +C + C C  CFN  E++D      +Q   RNP AF PKI   + G     
Sbjct: 580 YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINS 639

Query: 56  --------GERRHTKGCNCKRSGCLKNYCECYEA 81
                      RH +GCNCK+S CLK YCECY+A
Sbjct: 640 GEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 673


>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
           [Homo sapiens]
          Length = 529

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + +H
Sbjct: 328 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 387

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 388 NKGCNCRRSGCLKNYCECYEA 408



 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G 
Sbjct: 323 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 382

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + +H KGCNC+RSGCLKNYCECY
Sbjct: 383 VKPQHNKGCNCRRSGCLKNYCECY 406


>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
 gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
 gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
           [Homo sapiens]
          Length = 508

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + +H
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387



 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G 
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 361

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + +H KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPQHNKGCNCRRSGCLKNYCECY 385


>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
          Length = 276

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 54/106 (50%), Positives = 69/106 (65%), Gaps = 7/106 (6%)

Query: 85  TLEPNGI---RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAI 140
            L+PN     +  KPCNC KS CLKLYCDCFA G+ C  +CNC  CFNN  +  +R+  I
Sbjct: 134 ILQPNLFYLNQEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFI 193

Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           ++ +ERNP AF  K+ +    E +  H KGCNC++SGC K YCECY
Sbjct: 194 QKMIERNPQAFNQKVQEV---ESKMTHAKGCNCRKSGCQKKYCECY 236



 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 55/84 (65%), Gaps = 4/84 (4%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA G+ C  +CNC  CFNN  +  +R+  I++ +ERNP AF  K+ +    E +  
Sbjct: 160 YCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQAFNQKVQEV---ESKMT 216

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H KGCNC++SGC K YCECY+ G 
Sbjct: 217 HAKGCNCRKSGCQKKYCECYQMGI 240


>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
          Length = 508

 Score = 96.3 bits (238), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + +H
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387



 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G 
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 361

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + +H KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPQHNKGCNCRRSGCLKNYCECY 385


>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
           [Homo sapiens]
          Length = 463

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + +H
Sbjct: 262 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 321

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 322 NKGCNCRRSGCLKNYCECYEA 342



 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G 
Sbjct: 257 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 316

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + +H KGCNC+RSGCLKNYCECY
Sbjct: 317 VKPQHNKGCNCRRSGCLKNYCECY 340


>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
          Length = 289

 Score = 95.9 bits (237), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 64/81 (79%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFANG+FC  CNC++C NN  HE +R  AI+ CLERNP AF PKIGK  +GE +  H
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLGEIKHHH 210

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCL+NYCEC+EA
Sbjct: 211 NKGCNCKRSGCLENYCECFEA 231



 Score = 92.8 bits (229), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 64/84 (76%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           M  + YCDCFANG+FC  CNC++C NN  HE +R  AI+ CLERNP AF PKIGK  +GE
Sbjct: 146 MTSEGYCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLGE 205

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            +  H KGCNCKRSGCL+NYCEC+
Sbjct: 206 IKHHHNKGCNCKRSGCLENYCECF 229


>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
          Length = 806

 Score = 95.9 bits (237), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +Q   RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562

Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +    V E            RH +GCNCK+S CLK YCEC+
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 606



 Score = 79.0 bits (193), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +    V E  
Sbjct: 516 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 575

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S CLK YCEC++ G
Sbjct: 576 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 609



 Score = 56.2 bits (134), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
            E    K CNCK+S CLK YCEC+ AG   +E                            
Sbjct: 497 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 556

Query: 88  -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
                                    P   R ++ CNC KS CLK YC+CF  G  C   C
Sbjct: 557 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 616

Query: 122 NCNSCFNNIEHED 134
            C  C N    +D
Sbjct: 617 RCEGCKNTFGRKD 629


>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
          Length = 243

 Score = 95.9 bits (237), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 56/81 (69%), Positives = 66/81 (81%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C+NN  HE +R  AI+ CL RNP AF+PKIGK  +GE + RH
Sbjct: 52  YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLGEVKPRH 111

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCEC+EA
Sbjct: 112 NKGCNCKRSGCLKNYCECFEA 132



 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 54/79 (68%), Positives = 64/79 (81%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
           YCDCFA+G+FC  CNCN+C+NN  HE +R  AI+ CL RNP AF+PKIGK  +GE + RH
Sbjct: 52  YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLGEVKPRH 111

Query: 168 TKGCNCKRSGCLKNYCECY 186
            KGCNCKRSGCLKNYCEC+
Sbjct: 112 NKGCNCKRSGCLKNYCECF 130


>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
          Length = 829

 Score = 95.5 bits (236), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +Q   RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562

Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +    V E            RH +GCNCK+S CLK YCEC+
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 606



 Score = 78.6 bits (192), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +    V E  
Sbjct: 516 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 575

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S CLK YCEC++ G
Sbjct: 576 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 609



 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
            E    K CNCK+S CLK YCEC+ AG   +E                            
Sbjct: 497 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 556

Query: 88  -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
                                    P   R ++ CNC KS CLK YC+CF  G  C   C
Sbjct: 557 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 616

Query: 122 NCNSCFNNIEHED 134
            C  C N    +D
Sbjct: 617 RCEGCKNTFGRKD 629


>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
          Length = 593

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 51/104 (49%), Positives = 62/104 (59%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF P
Sbjct: 328 KRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAP 387

Query: 154 KIGK-----CLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +       VGE         RH +GCNCK+S CLK YCECY
Sbjct: 388 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431



 Score = 76.6 bits (187), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 41/93 (44%), Positives = 54/93 (58%), Gaps = 12/93 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF PK+ +       VG
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEA 81
           E         RH +GCNCK+S CLK YCECY++
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQS 433


>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
          Length = 299

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/81 (65%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G  + +H
Sbjct: 98  YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 157

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNC+RSGCLKNYCECYEA
Sbjct: 158 NKGCNCRRSGCLKNYCECYEA 178



 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 65/84 (77%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           + L  YCDCFA+G+FC  CNCN+C NN+ H+ +R  AI+ CL RNP AF+PKIGK  +G 
Sbjct: 93  ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 152

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
            + +H KGCNC+RSGCLKNYCECY
Sbjct: 153 VKPQHNKGCNCRRSGCLKNYCECY 176


>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
          Length = 777

 Score = 95.1 bits (235), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +Q   RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 533

Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +    V E            RH +GCNCK+S CLK YCEC+
Sbjct: 534 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 577



 Score = 78.2 bits (191), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +    V E  
Sbjct: 487 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 546

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S CLK YCEC++ G
Sbjct: 547 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 580



 Score = 55.5 bits (132), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
            E    K CNCK+S CLK YCEC+ AG   +E                            
Sbjct: 468 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 527

Query: 88  -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
                                    P   R ++ CNC KS CLK YC+CF  G  C   C
Sbjct: 528 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 587

Query: 122 NCNSCFNNIEHED 134
            C  C N    +D
Sbjct: 588 RCEGCKNTFGRKD 600


>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1174

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 41/90 (45%), Positives = 49/90 (54%), Gaps = 5/90 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           C C  S CLKLYCDCF+  ++C  C C  C N    E  R  AI     RNPNAF+ K+ 
Sbjct: 862 CKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLN 921

Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
             +       H  GC CK+S CLK YCEC+
Sbjct: 922 ASVT-----EHATGCKCKKSECLKKYCECF 946



 Score = 78.2 bits (191), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCF+  ++C  C C  C N    E  R  AI     RNPNAF+ K+   +       H
Sbjct: 873 YCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNASVT-----EH 927

Query: 61  TKGCNCKRSGCLKNYCECY 79
             GC CK+S CLK YCEC+
Sbjct: 928 ATGCKCKKSECLKKYCECF 946



 Score = 56.6 bits (135), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 33/114 (28%), Positives = 44/114 (38%), Gaps = 37/114 (32%)

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEA---------------------------GARTL 86
           G+    +   C CK S CLK YC+C+ A                             R  
Sbjct: 852 GKENSENESTCKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRAR 911

Query: 87  EPNGIRPR---------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNI 130
            PN  + +           C C KS CLK YC+CFAN   C + C C++C N +
Sbjct: 912 NPNAFKTKLNASVTEHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYV 965


>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
          Length = 783

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 37/46 (80%), Positives = 41/46 (89%)

Query: 36  CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
           CL+RNP AF+PKIGK   GE +RRH+KGCNCKRSGCLKNYCECYEA
Sbjct: 605 CLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEA 650



 Score = 90.5 bits (223), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 35/44 (79%), Positives = 39/44 (88%)

Query: 143 CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           CL+RNP AF+PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 605 CLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 648


>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
          Length = 218

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/97 (52%), Positives = 65/97 (67%), Gaps = 4/97 (4%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           I   KPCNC KS CLKLYCDCFA G+ C  +CNC  CFNN  +  +R+  I + +ERNP 
Sbjct: 79  IEEAKPCNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPE 138

Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           AF  K+ +    + +  H+KGCNC++SGC K YCECY
Sbjct: 139 AFNQKVKEV---DYKLAHSKGCNCRKSGCKKKYCECY 172



 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 55/84 (65%), Gaps = 4/84 (4%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA G+ C  +CNC  CFNN  +  +R+  I + +ERNP AF  K+ +    + +  
Sbjct: 96  YCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKVKEV---DYKLA 152

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H+KGCNC++SGC K YCECY+ G 
Sbjct: 153 HSKGCNCRKSGCKKKYCECYQLGI 176


>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
          Length = 597

 Score = 93.6 bits (231), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK C+C  S CLKLYC+CFAN   C  C+C  C NN  + +    +    LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K  K      + +HTKGC+C++S CLK YCEC+
Sbjct: 401 KF-KQKEETKQHKHTKGCHCRKSECLKKYCECF 432



 Score = 79.3 bits (194), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 1/82 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFAN   C  C+C  C NN  + +    +    LERNP AF PK  K      + +H
Sbjct: 355 YCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNPKF-KQKEETKQHKH 413

Query: 61  TKGCNCKRSGCLKNYCECYEAG 82
           TKGC+C++S CLK YCEC++AG
Sbjct: 414 TKGCHCRKSECLKKYCECFQAG 435


>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
 gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
          Length = 896

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 50/105 (47%), Positives = 56/105 (53%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYCDCFA G +C   C C  C N  E+ +      +Q   RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533

Query: 154 KI------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI             + L      RH +GCNCKRS CLK YCECY
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECY 578



 Score = 75.5 bits (184), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 48/94 (51%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YCDCFA G +C   C C  C N  E+ +      +Q   RNP AF PKI           
Sbjct: 487 YCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAPKIVQPTTDISSHM 546

Query: 49  -GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
             + L      RH +GCNCKRS CLK YCECY+A
Sbjct: 547 DDENLTTPSSARHKRGCNCKRSMCLKKYCECYQA 580


>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
          Length = 503

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/81 (64%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA+G+FC  CNCN+C+NN  HE +R  AI+ CL+RNP AF PKIG+  +G+ +  H
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSELGDIKPHH 366

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCEC+EA
Sbjct: 367 NKGCNCKRSGCLKNYCECFEA 387



 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 70/106 (66%), Gaps = 7/106 (6%)

Query: 81  AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
           A + T +P         N    M    YC+CFA+G+FC  CNCN+C+NN  HE +R  AI
Sbjct: 287 ASSSTFQPES-------NLHVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAI 339

Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + CL+RNP AF PKIG+  +G+ +  H KGCNCKRSGCLKNYCEC+
Sbjct: 340 KVCLDRNPEAFLPKIGQSELGDIKPHHNKGCNCKRSGCLKNYCECF 385


>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
          Length = 622

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/81 (64%), Positives = 65/81 (80%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA+G+FC  CNCN+C+NN  HE +R  AI+ CL+RNP AF PKIG+  +G+ +  H
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLGDIKPHH 486

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
            KGCNCKRSGCLKNYCEC+EA
Sbjct: 487 NKGCNCKRSGCLKNYCECFEA 507



 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/106 (51%), Positives = 69/106 (65%), Gaps = 7/106 (6%)

Query: 81  AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
           A   T +P         N    M    YC+CFA+G+FC  CNCN+C+NN  HE +R  AI
Sbjct: 407 ASTSTFQPES-------NLQVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAI 459

Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + CL+RNP AF PKIG+  +G+ +  H KGCNCKRSGCLKNYCEC+
Sbjct: 460 KVCLDRNPEAFLPKIGQSKLGDIKPHHNKGCNCKRSGCLKNYCECF 505


>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
          Length = 429

 Score = 92.8 bits (229), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 50/93 (53%), Positives = 56/93 (60%), Gaps = 3/93 (3%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC  S CLKLYC+CFANGE+C   CNC SC N    E  R  AI   L +NP+AFRPK 
Sbjct: 218 CNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRPKF 277

Query: 156 G--KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           G  K         H KGC C RS C+K YCEC+
Sbjct: 278 GANKSRQSAAPSAHVKGCRCARSKCIKRYCECF 310



 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 51/85 (60%), Gaps = 3/85 (3%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG--KCLVGEGE 57
           YC+CFANGE+C   CNC SC N    E  R  AI   L +NP+AFRPK G  K       
Sbjct: 229 YCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRPKFGANKSRQSAAP 288

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAG 82
             H KGC C RS C+K YCEC++AG
Sbjct: 289 SAHVKGCRCARSKCIKRYCECFQAG 313


>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 471

 Score = 92.8 bits (229), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 50/96 (52%), Positives = 61/96 (63%), Gaps = 9/96 (9%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           + +K CNC K+ CLKLYCDCFA GE C  +CNC  CF N  + + R  AI   ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCF-NTSNSEKRAGAIVSIMDRQPDA 185

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           F PK+        +  H KGCNC RSGCLK YCECY
Sbjct: 186 FGPKV-------QQNTHKKGCNCTRSGCLKKYCECY 214



 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 50/84 (59%), Gaps = 9/84 (10%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA GE C  +CNC  CFN   + + R  AI   ++R P+AF PK+        +  
Sbjct: 143 YCDCFAAGELCGGECNCFGCFN-TSNSEKRAGAIVSIMDRQPDAFGPKVQ-------QNT 194

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H KGCNC RSGCLK YCECY  G 
Sbjct: 195 HKKGCNCTRSGCLKKYCECYTLGV 218



 Score = 44.3 bits (103), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 54/130 (41%), Gaps = 40/130 (30%)

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAG---------------------------ART 85
           V    ++  K CNCK++ CLK YC+C+ AG                              
Sbjct: 121 VFSSNQKKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFNTSNSEKRAGAIVSIMD 180

Query: 86  LEPNGIRPR-------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
            +P+   P+       K CNCT+S CLK YC+C+  G  C + C C  C N      D +
Sbjct: 181 RQPDAFGPKVQQNTHKKGCNCTRSGCLKKYCECYTLGVNCGEYCKCTQCKN-----VDLN 235

Query: 138 HAIRQCLERN 147
           H+ R  L  N
Sbjct: 236 HSERLALSMN 245


>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 492

 Score = 92.8 bits (229), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 44/68 (64%), Positives = 52/68 (76%), Gaps = 1/68 (1%)

Query: 14  CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLK 73
           C+C +C NN+ +E DR  AI+ CLERNP AF+PKIGK    + ER H KGCNCK+S CLK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIGKGR-ADTERLHNKGCNCKKSSCLK 340

Query: 74  NYCECYEA 81
           NYCECYEA
Sbjct: 341 NYCECYEA 348



 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 42/66 (63%), Positives = 50/66 (75%), Gaps = 1/66 (1%)

Query: 121 CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLK 180
           C+C +C NN+ +E DR  AI+ CLERNP AF+PKIGK    + ER H KGCNCK+S CLK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIGKGR-ADTERLHNKGCNCKKSSCLK 340

Query: 181 NYCECY 186
           NYCECY
Sbjct: 341 NYCECY 346


>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 805

 Score = 92.4 bits (228), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 47/98 (47%), Positives = 61/98 (62%), Gaps = 8/98 (8%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC K+ CLKLYCDCFA GEFC  +CNC  C N + +++ R+  +   LE+NP AF  K 
Sbjct: 480 CNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKD 539

Query: 156 GK-------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +        L  + +    KGCNC+RSGCLK YC+CY
Sbjct: 540 IEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCY 577



 Score = 77.4 bits (189), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 40/90 (44%), Positives = 54/90 (60%), Gaps = 8/90 (8%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-------CL 52
           YCDCFA GEFC  +CNC  C N + +++ R+  +   LE+NP AF  K  +        L
Sbjct: 491 YCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKDIEIEEPSQLSL 550

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
             + +    KGCNC+RSGCLK YC+CY+ G
Sbjct: 551 KAQKKLASKKGCNCRRSGCLKKYCQCYQDG 580



 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 36/125 (28%), Positives = 48/125 (38%), Gaps = 50/125 (40%)

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEAG--------------------ARTLEPNGIRP 93
           G G  + T+ CNCK++ CLK YC+C+ AG                    AR     G+  
Sbjct: 470 GNGSNKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLE 529

Query: 94  RKP-----------------------------CNCTKSMCLKLYCDCFANGEFCYQ-CNC 123
           + P                             CNC +S CLK YC C+ +G  C + C C
Sbjct: 530 KNPFAFNVKDIEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCYQDGLQCGEHCKC 589

Query: 124 NSCFN 128
           N C N
Sbjct: 590 NGCEN 594


>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
          Length = 284

 Score = 92.4 bits (228), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 63/94 (67%), Gaps = 2/94 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C KS CLK+YCDCF +G +C   C C  C+NN +H+ +R  AI+  LE+NP+AF+P
Sbjct: 13  KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQP 72

Query: 154 KIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
           K+       G   +H KGCNC+++ CLK YCEC+
Sbjct: 73  KVATFRDSGGLIVKHNKGCNCRKTACLKKYCECF 106



 Score = 82.8 bits (203), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 2/85 (2%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-R 58
           YCDCF +G +C   C C  C+NN +H+ +R  AI+  LE+NP+AF+PK+       G   
Sbjct: 26  YCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQPKVATFRDSGGLIV 85

Query: 59  RHTKGCNCKRSGCLKNYCECYEAGA 83
           +H KGCNC+++ CLK YCEC+++G 
Sbjct: 86  KHNKGCNCRKTACLKKYCECFQSGV 110



 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 46/112 (41%), Gaps = 44/112 (39%)

Query: 61  TKGCNCKRSGCLKNYCECYEAGA--------------------------RTLE--PNGIR 92
            KGC C +S CLK YC+C+ +G                             LE  P+  +
Sbjct: 12  VKGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQ 71

Query: 93  PR---------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
           P+               K CNC K+ CLK YC+CF +G  C + C C+ C N
Sbjct: 72  PKVATFRDSGGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKN 123


>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
 gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
          Length = 554

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 40/58 (68%), Positives = 47/58 (81%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF
Sbjct: 476 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAF 533



 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 29/44 (65%), Positives = 35/44 (79%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 44
           YCDCFANGEFC  C C  CFNN+++E +R  AIR CL+RNP+AF
Sbjct: 490 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAF 533


>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
           CCMP2712]
          Length = 113

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K C+C KS CLKLYC+CFA GE C  C C  C N+ +HED R  A+    +RN NAF P
Sbjct: 5   KKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFAP 64

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI    + + +  H +GC CK+S CLK YCECY
Sbjct: 65  KIVD-EIQQDKGMHARGCRCKKSHCLKKYCECY 96



 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 41/84 (48%), Positives = 53/84 (63%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA GE C  C C  C N+ +HED R  A+    +RN NAF PKI    + + +  H
Sbjct: 19  YCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFAPKIVD-EIQQDKGMH 77

Query: 61  TKGCNCKRSGCLKNYCECYEAGAR 84
            +GC CK+S CLK YCECY+AG +
Sbjct: 78  ARGCRCKKSHCLKKYCECYQAGVQ 101


>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
          Length = 664

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 45/96 (46%), Positives = 55/96 (57%), Gaps = 7/96 (7%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
           R ++PCNC +SMCLK+YC+CFA G FC   C+C SC N          A    L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           F  K+           H +GC CKRS CLK YCEC+
Sbjct: 320 FEVKV------TAATGHRRGCRCKRSKCLKKYCECF 349



 Score = 72.8 bits (177), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 39/85 (45%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA G FC   C+C SC N          A    L +NPNAF  K+           
Sbjct: 276 YCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNAFEVKV------TAATG 329

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
           H +GC CKRS CLK YCEC+ AGAR
Sbjct: 330 HRRGCRCKRSKCLKKYCECFHAGAR 354



 Score = 58.9 bits (141), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 38/114 (33%), Positives = 46/114 (40%), Gaps = 38/114 (33%)

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAG------------------------ARTL-- 86
           +G  + R  + CNCKRS CLK YCEC+ AG                        AR +  
Sbjct: 254 LGASQAREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVL 313

Query: 87  --EPNGIR--------PRKPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFN 128
              PN            R+ C C +S CLK YC+CF  G  C    C C  C N
Sbjct: 314 AKNPNAFEVKVTAATGHRRGCRCKRSKCLKKYCECFHAGARCNPEVCQCEDCRN 367



 Score = 39.7 bits (91), Expect = 0.60,   Method: Composition-based stats.
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
           +G  + R  + CNCKRS CLK YCEC+
Sbjct: 254 LGASQAREQRPCNCKRSMCLKMYCECF 280


>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
 gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
          Length = 509

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/95 (49%), Positives = 58/95 (61%), Gaps = 10/95 (10%)

Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
           S CLKLYC+CFA G++C  CNC  C N +E+E  R  AI     RNPNAF+PKI    + 
Sbjct: 78  SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIP 137

Query: 162 EGER----------RHTKGCNCKRSGCLKNYCECY 186
              R          +H KGC+CK+SGCLK YCEC+
Sbjct: 138 PSVRKDAGALPSLPKHNKGCHCKKSGCLKKYCECF 172



 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 55/91 (60%), Gaps = 10/91 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA G++C  CNC  C N +E+E  R  AI     RNPNAF+PKI    +    R  
Sbjct: 84  YCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIPPSVRKD 143

Query: 59  --------RHTKGCNCKRSGCLKNYCECYEA 81
                   +H KGC+CK+SGCLK YCEC++A
Sbjct: 144 AGALPSLPKHNKGCHCKKSGCLKKYCECFQA 174


>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
          Length = 523

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 45/60 (75%), Positives = 53/60 (88%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNCTKS+CLKLYCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 455 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514



 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/44 (68%), Positives = 37/44 (84%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 44
           YCDCFANGEFC  CNC +C+NN+EHE++R  AI+ CL+RNP AF
Sbjct: 471 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514


>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
          Length = 735

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 40/88 (45%), Positives = 55/88 (62%), Gaps = 2/88 (2%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           +P K C+C  S CLK+YC+CFA+ + C  C C  C NN +H +   +A    LERNP+AF
Sbjct: 323 QPTKMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAF 382

Query: 152 RPKIGKCLVGEGER--RHTKGCNCKRSG 177
            PK  +   GE  +  +H KGC+C+RSG
Sbjct: 383 GPKFKQSFNGEKPKAEKHLKGCHCRRSG 410



 Score = 72.0 bits (175), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/72 (44%), Positives = 44/72 (61%), Gaps = 2/72 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA+ + C  C C  C NN +H +   +A    LERNP+AF PK  +   GE  +  
Sbjct: 339 YCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPKFKQSFNGEKPKAE 398

Query: 59  RHTKGCNCKRSG 70
           +H KGC+C+RSG
Sbjct: 399 KHLKGCHCRRSG 410


>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
          Length = 466

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/81 (60%), Positives = 63/81 (77%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           +CDCFA+G+F   CNCN+C NN+ HE +R  AI+ CL R+P AF+P+IGK  +GE   RH
Sbjct: 308 HCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLGEVRPRH 367

Query: 61  TKGCNCKRSGCLKNYCECYEA 81
           +KGCNC+R GCL+NYCEC EA
Sbjct: 368 SKGCNCRRLGCLRNYCECQEA 388



 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/82 (58%), Positives = 62/82 (75%)

Query: 104 CLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 163
            L  +CDCFA+G+F   CNCN+C NN+ HE +R  AI+ CL R+P AF+P+IGK  +GE 
Sbjct: 304 TLAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLGEV 363

Query: 164 ERRHTKGCNCKRSGCLKNYCEC 185
             RH+KGCNC+R GCL+NYCEC
Sbjct: 364 RPRHSKGCNCRRLGCLRNYCEC 385


>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
 gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
          Length = 460

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 66/99 (66%), Gaps = 7/99 (7%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  + CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPK
Sbjct: 98  KQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPK 157

Query: 155 IGKCL-----VGEGER--RHTKGCNCKRSGCLKNYCECY 186
           I           +  R  +H KGCNCK+SGCLK YCEC+
Sbjct: 158 IANTQHAIRDSADDMRLGKHNKGCNCKKSGCLKKYCECF 196



 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 47/88 (53%), Positives = 59/88 (67%), Gaps = 7/88 (7%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE 55
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI           +
Sbjct: 111 YCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSAD 170

Query: 56  GER--RHTKGCNCKRSGCLKNYCECYEA 81
             R  +H KGCNCK+SGCLK YCEC++A
Sbjct: 171 DMRLGKHNKGCNCKKSGCLKKYCECFQA 198


>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
 gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
          Length = 461

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/99 (54%), Positives = 66/99 (66%), Gaps = 7/99 (7%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  + CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPK
Sbjct: 98  KQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPK 157

Query: 155 IGKCL-----VGEGER--RHTKGCNCKRSGCLKNYCECY 186
           I           +  R  +H KGCNCK+SGCLK YCEC+
Sbjct: 158 IANTQHAIRDSADDMRLGKHNKGCNCKKSGCLKKYCECF 196



 Score = 79.3 bits (194), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 47/88 (53%), Positives = 59/88 (67%), Gaps = 7/88 (7%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE 55
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI           +
Sbjct: 111 YCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSAD 170

Query: 56  GER--RHTKGCNCKRSGCLKNYCECYEA 81
             R  +H KGCNCK+SGCLK YCEC++A
Sbjct: 171 DMRLGKHNKGCNCKKSGCLKKYCECFQA 198


>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
 gi|223944643|gb|ACN26405.1| unknown [Zea mays]
 gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
          Length = 499

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 46/95 (48%), Positives = 57/95 (60%), Gaps = 10/95 (10%)

Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
           S CLKLYC+CFA G++C  CNC  C N +E+E  R  AI     RNPNAF+PKI    + 
Sbjct: 80  SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIP 139

Query: 162 E----------GERRHTKGCNCKRSGCLKNYCECY 186
                         +H KGC+CK+SGCLK YCEC+
Sbjct: 140 SSVWKDAGALPSHPKHNKGCHCKKSGCLKKYCECF 174



 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 10/91 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE----- 55
           YC+CFA G++C  CNC  C N +E+E  R  AI     RNPNAF+PKI    +       
Sbjct: 86  YCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPSSVWKD 145

Query: 56  -----GERRHTKGCNCKRSGCLKNYCECYEA 81
                   +H KGC+CK+SGCLK YCEC++A
Sbjct: 146 AGALPSHPKHNKGCHCKKSGCLKKYCECFQA 176


>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1040

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 56/140 (40%), Positives = 71/140 (50%), Gaps = 37/140 (26%)

Query: 84  RTLEPNGIRPRK-PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
           +TL P     +K  CNC KS CLKLYCDCF  G+FC  +CNC  C N   ++ +R  AI+
Sbjct: 700 QTLRPQTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERESAIK 759

Query: 142 QCLERNPNAFRPKIG--------------------KCLVGEGERR--------------- 166
           Q  ERNP+AF+PKI                       L+ + + +               
Sbjct: 760 QLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKNQIQYLIPKIIKDDEEILI 819

Query: 167 HTKGCNCKRSGCLKNYCECY 186
           H KGCNCK+SGC K YCECY
Sbjct: 820 HNKGCNCKKSGCEKKYCECY 839



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/184 (30%), Positives = 74/184 (40%), Gaps = 57/184 (30%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---------- 49
           YCDCF  G+FC  +CNC  C N   ++ +R  AI+Q  ERNP+AF+PKI           
Sbjct: 725 YCDCFNQGQFCNSECNCTECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYL 784

Query: 50  ----------KCLVGEGERR---------------HTKGCNCKRSGCLKNYCECYEAGA- 83
                       L+ + + +               H KGCNCK+SGC K YCECY  G  
Sbjct: 785 SKLQIFMLIISSLLSKNQIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVK 844

Query: 84  ----------RTLEPNGI-----RPRKPCNCTKSMCLKL-----YCDCFANGEFCYQCNC 123
                     R  +P+ I         P N   S   +        D + N E  YQ N 
Sbjct: 845 CSDQCKCEGCRNRDPSEIVKINNAQNNPSNGNSSFSNQYSGNNKQVDQYLNNEELYQHNS 904

Query: 124 NSCF 127
            S  
Sbjct: 905 QSML 908


>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
 gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
          Length = 116

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 20/101 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 50
           YC+CFA+G +C  CNC +C+NN E+E  R  A+   LERNPNAFRPKIG           
Sbjct: 1   YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60

Query: 51  ---CLVGEGER-------RHTKGCNCKRSGCLKNYCECYEA 81
               L+ E  R       RH KGC CK+SGCLK YCEC++A
Sbjct: 61  CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQA 101



 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 49/99 (49%), Positives = 59/99 (59%), Gaps = 20/99 (20%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 157
           YC+CFA+G +C  CNC +C+NN E+E  R  A+   LERNPNAFRPKIG           
Sbjct: 1   YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60

Query: 158 ---CLVGEGER-------RHTKGCNCKRSGCLKNYCECY 186
               L+ E  R       RH KGC CK+SGCLK YCEC+
Sbjct: 61  CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECF 99


>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
           [Cucumis sativus]
          Length = 581

 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/102 (50%), Positives = 59/102 (57%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C + C+C  CFN   HE       RQ   RNP AF PK+
Sbjct: 289 CNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKV 348

Query: 156 GK--CLVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
                L+ E            RH +GCNCK+SGCLK YCECY
Sbjct: 349 IMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECY 390



 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 52/95 (54%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--CLVGE-- 55
           YC+CFA G +C + C+C  CFN   HE       RQ   RNP AF PK+     L+ E  
Sbjct: 300 YCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELR 359

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+SGCLK YCECY+ G 
Sbjct: 360 DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGV 394


>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
 gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
          Length = 602

 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/102 (50%), Positives = 59/102 (57%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C + C+C  CFN   HE       RQ   RNP AF PK+
Sbjct: 310 CNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKV 369

Query: 156 GK--CLVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
                L+ E            RH +GCNCK+SGCLK YCECY
Sbjct: 370 IMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECY 411



 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 52/95 (54%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--CLVGE-- 55
           YC+CFA G +C + C+C  CFN   HE       RQ   RNP AF PK+     L+ E  
Sbjct: 321 YCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELR 380

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+SGCLK YCECY+ G 
Sbjct: 381 DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGV 415


>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 546

 Score = 89.0 bits (219), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 7/99 (7%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC +S C+KLYC+C+A G +C  C+C SC NN + E  R+ AI+  L +NP AF PK
Sbjct: 69  KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128

Query: 155 I-------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           I        K +      +H KGC CKRSGC+K YCEC+
Sbjct: 129 ISQKQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECF 167



 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/90 (46%), Positives = 55/90 (61%), Gaps = 7/90 (7%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-------GKCLV 53
           YC+C+A G +C  C+C SC NN + E  R+ AI+  L +NP AF PKI        K + 
Sbjct: 82  YCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPKISQKQETASKVVD 141

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                +H KGC CKRSGC+K YCEC++AG 
Sbjct: 142 PLNLGKHNKGCACKRSGCMKKYCECFQAGV 171


>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
          Length = 485

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 50/115 (43%), Positives = 69/115 (60%), Gaps = 13/115 (11%)

Query: 85  TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
           + E   + P+K   CNC  S CLK+YC+CF   ++C  CNC++C N + +E+ R+ AI  
Sbjct: 58  STEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 117

Query: 143 CLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
             +RNP+AF+PKIG        R           +H KGC+CK+SGCLK YCECY
Sbjct: 118 IRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECY 172



 Score = 75.9 bits (185), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 57/92 (61%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CF   ++C  CNC++C N + +E+ R+ AI    +RNP+AF+PKIG        R  
Sbjct: 83  YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKD 142

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCECY+A
Sbjct: 143 NSGAVPLVAKHHKGCHCKKSGCLKKYCECYQA 174


>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
          Length = 774

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 60/102 (58%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+
Sbjct: 488 CNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKV 547

Query: 156 GK-----CLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
            +       +G+         RH +GCNCK+S CLK YCECY
Sbjct: 548 IRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECY 589



 Score = 75.5 bits (184), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       +G
Sbjct: 499 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 558

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 559 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 593


>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 623

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/104 (49%), Positives = 61/104 (58%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC KS CLKLYC+CFA G +C + C+C  CFN   HED      +Q   RNP AF P
Sbjct: 339 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAP 398

Query: 154 KI---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
           K+      ++  GE         RH +GCNCK+S CLK YCECY
Sbjct: 399 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 442



 Score = 75.5 bits (184), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+      ++  G
Sbjct: 352 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 411

Query: 57  E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 412 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 446


>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
 gi|223943031|gb|ACN25599.1| unknown [Zea mays]
 gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
          Length = 530

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 70/119 (58%), Gaps = 21/119 (17%)

Query: 85  TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
           + E   + P+K   CNC  S CLK+YC+CF   ++C  CNC++C N + +E+ R+ AI  
Sbjct: 58  STEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 117

Query: 143 CLERNPNAFRPKIGK---------------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
             +RNP+AF+PKIG                 LV     +H KGC+CK+SGCLK YCECY
Sbjct: 118 IRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVA----KHHKGCHCKKSGCLKKYCECY 172



 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 42/96 (43%), Positives = 58/96 (60%), Gaps = 19/96 (19%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 50
           YC+CF   ++C  CNC++C N + +E+ R+ AI    +RNP+AF+PKIG           
Sbjct: 83  YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKD 142

Query: 51  -----CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                 LV     +H KGC+CK+SGCLK YCECY+A
Sbjct: 143 NSGAVPLVA----KHHKGCHCKKSGCLKKYCECYQA 174


>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
          Length = 665

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 59/106 (55%), Gaps = 14/106 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC K+ CLKLYCDCFA G +C + C C  CFN  E+ED      +Q   RNP AF P
Sbjct: 487 KRCNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAP 546

Query: 154 KIGKCLVGE-------------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ + +                   RH +GCNCK+S CLK YCECY
Sbjct: 547 KVMQTVTASPTTGKVVEKLSTPSSARHKRGCNCKKSMCLKKYCECY 592



 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 51/95 (53%), Gaps = 14/95 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
           YCDCFA G +C + C C  CFN  E+ED      +Q   RNP AF PK+ + +       
Sbjct: 500 YCDCFAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKVMQTVTASPTTG 559

Query: 56  ---------GERRHTKGCNCKRSGCLKNYCECYEA 81
                       RH +GCNCK+S CLK YCECY+A
Sbjct: 560 KVVEKLSTPSSARHKRGCNCKKSMCLKKYCECYQA 594


>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 510

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 73/147 (49%), Gaps = 14/147 (9%)

Query: 44  FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSM 103
            +P++G  +V           + + +         Y +   T+  +G+ P   CNC +S 
Sbjct: 176 LKPEVGLPVVPSQVASLPPSTSSQVTPSQPPTLPAYGSNVTTM--DGVAPG--CNCKRSR 231

Query: 104 CLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCL 159
           CLKLYC CFA   FC   CNC +C N       R  AIRQ +ERNPNAF+ K    G+  
Sbjct: 232 CLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDA 291

Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
           V      H  GC C++S CLK YCEC+
Sbjct: 292 V------HKMGCKCRKSACLKKYCECF 312



 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 50/88 (56%), Gaps = 10/88 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC CFA   FC   CNC +C N       R  AIRQ +ERNPNAF+ K    G+  V   
Sbjct: 236 YCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDAV--- 292

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
              H  GC C++S CLK YCEC+ AGAR
Sbjct: 293 ---HKMGCKCRKSACLKKYCECFNAGAR 317


>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 6/86 (6%)

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLV 160
           LKLYC+CF  GE+C+  C+C+ C N  ++ D R  AI+  L+RN +AF+PK+   GK + 
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVN 264

Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
            +G  +H +GCNCKRSGCLK YCECY
Sbjct: 265 NKG--KHLRGCNCKRSGCLKKYCECY 288



 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/87 (50%), Positives = 61/87 (70%), Gaps = 6/87 (6%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CF  GE+C+  C+C+ C N  ++ D R  AI+  L+RN +AF+PK+   GK +  +G
Sbjct: 208 YCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVNNKG 267

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGA 83
             +H +GCNCKRSGCLK YCECY+AG 
Sbjct: 268 --KHLRGCNCKRSGCLKKYCECYQAGV 292


>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 511

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 73/147 (49%), Gaps = 14/147 (9%)

Query: 44  FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSM 103
            +P++G  +V           + + +         Y +   T+  +G+ P   CNC +S 
Sbjct: 176 LKPEVGLPVVPSQVASLPPSTSSQVTPSQPPTLPAYGSNVTTM--DGVAPG--CNCKRSR 231

Query: 104 CLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCL 159
           CLKLYC CFA   FC   CNC +C N       R  AIRQ +ERNPNAF+ K    G+  
Sbjct: 232 CLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDA 291

Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
           V      H  GC C++S CLK YCEC+
Sbjct: 292 V------HKMGCKCRKSACLKKYCECF 312



 Score = 75.9 bits (185), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 50/88 (56%), Gaps = 10/88 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC CFA   FC   CNC +C N       R  AIRQ +ERNPNAF+ K    G+  V   
Sbjct: 236 YCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDAV--- 292

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
              H  GC C++S CLK YCEC+ AGAR
Sbjct: 293 ---HKMGCKCRKSACLKKYCECFNAGAR 317


>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 6/86 (6%)

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLV 160
           LKLYC+CF  GE+C+  C+C+ C N  ++ D R  AI+  L+RN +AF+PK+   GK + 
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVN 264

Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
            +G  +H +GCNCKRSGCLK YCECY
Sbjct: 265 NKG--KHLRGCNCKRSGCLKKYCECY 288



 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/87 (50%), Positives = 61/87 (70%), Gaps = 6/87 (6%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CF  GE+C+  C+C+ C N  ++ D R  AI+  L+RN +AF+PK+   GK +  +G
Sbjct: 208 YCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVNNKG 267

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGA 83
             +H +GCNCKRSGCLK YCECY+AG 
Sbjct: 268 --KHLRGCNCKRSGCLKKYCECYQAGV 292


>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 4/98 (4%)

Query: 93  PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           P K CNC  S C+KLYC+C+ N  FC   C+C++CFNN  +   R  A++  LE+NP+AF
Sbjct: 14  PSKKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAF 73

Query: 152 RPKIGKCLVGEGER---RHTKGCNCKRSGCLKNYCECY 186
           +PKI             +H KGC CK+SGC+K YCEC+
Sbjct: 74  QPKITTSDSKPDPLNFGKHNKGCQCKKSGCMKKYCECF 111



 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/87 (43%), Positives = 54/87 (62%), Gaps = 4/87 (4%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+C+ N  FC   C+C++CFNN  +   R  A++  LE+NP+AF+PKI           
Sbjct: 29  YCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQPKITTSDSKPDPLN 88

Query: 59  --RHTKGCNCKRSGCLKNYCECYEAGA 83
             +H KGC CK+SGC+K YCEC++A  
Sbjct: 89  FGKHNKGCQCKKSGCMKKYCECFQAKV 115


>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
 gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
          Length = 729

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/104 (47%), Positives = 60/104 (57%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC +S CLKLYCDCFA G +C + C+C  CFN   HED      +Q   RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509

Query: 154 KIGKCL--VGE---------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +    V +            RH +GCNCK+S CLK YCEC+
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECF 553



 Score = 75.9 bits (185), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 52/95 (54%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGE-- 55
           YCDCFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +    V +  
Sbjct: 463 YCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAPKVIRSTDYVSDFG 522

Query: 56  -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S CLK YCEC++ G 
Sbjct: 523 DETNKTPASARHKRGCNCKKSNCLKKYCECFQGGV 557


>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/110 (45%), Positives = 63/110 (57%), Gaps = 8/110 (7%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
           T +  GI  RK C C +S CLKLYCDCFA+G  C  C+C  C NN ++ D R  A+   L
Sbjct: 54  TTDQEGITWRKRCRCKQSKCLKLYCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVL 113

Query: 145 ERNPNAFRPKI------GKCLVGEGER--RHTKGCNCKRSGCLKNYCECY 186
            RNPNAF  K+       +C      R    ++GC CKR+ CLK YCEC+
Sbjct: 114 GRNPNAFNEKLFSSINDKQCKAAPDTRPGLLSRGCKCKRTKCLKKYCECF 163



 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 51/91 (56%), Gaps = 8/91 (8%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLVG 54
           YCDCFA+G  C  C+C  C NN ++ D R  A+   L RNPNAF  K+       +C   
Sbjct: 77  YCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSSINDKQCKAA 136

Query: 55  EGER--RHTKGCNCKRSGCLKNYCECYEAGA 83
              R    ++GC CKR+ CLK YCEC++A  
Sbjct: 137 PDTRPGLLSRGCKCKRTKCLKKYCECFQANV 167


>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
 gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
          Length = 641

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 46/115 (40%), Positives = 62/115 (53%), Gaps = 12/115 (10%)

Query: 84  RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQ 142
           R L+ +  +  + C+C KS CLKLYC CFA   +C + C+C  C NN  HE+   H  +Q
Sbjct: 346 RKLQNDDGKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQ 405

Query: 143 CLERNPNAFRPKIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
              RNP AF P + +      E            RH +GCNC++S CLK YCEC+
Sbjct: 406 TESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKKYCECF 460



 Score = 70.5 bits (171), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 36/92 (39%), Positives = 48/92 (52%), Gaps = 12/92 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
           YC CFA   +C + C+C  C NN  HE+   H  +Q   RNP AF P + +      E  
Sbjct: 370 YCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELG 429

Query: 58  ---------RRHTKGCNCKRSGCLKNYCECYE 80
                     RH +GCNC++S CLK YCEC++
Sbjct: 430 DDSNNTPASARHKRGCNCRKSSCLKKYCECFQ 461


>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
           distachyon]
          Length = 487

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/104 (50%), Positives = 63/104 (60%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC  S CLKLYC+CFA    C  CNC++C NN E+E  R  AI   L RNP AF+P
Sbjct: 67  RKHCNCKNSRCLKLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQP 126

Query: 154 KI--GKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
           KI  G   V   +          +H KGC+CK+SGCLK YCEC+
Sbjct: 127 KIENGPSTVNVRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCECF 170



 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 55/92 (59%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLVGEGER 58
           YC+CFA    C  CNC++C NN E+E  R  AI   L RNP AF+PKI  G   V   + 
Sbjct: 81  YCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVNVRKD 140

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 141 NSGAVPLRPKHNKGCHCKKSGCLKKYCECFQA 172


>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/97 (50%), Positives = 63/97 (64%), Gaps = 4/97 (4%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           I   KPCNC KS CLKLYCDCFA  + C  +CNC  CFNN  +  +R+    + +ERNP 
Sbjct: 103 IEEVKPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPE 162

Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           AF  K+ +    + +  H+KGCNC++SGC K YCECY
Sbjct: 163 AFNQKVKEV---DQKMAHSKGCNCRKSGCKKKYCECY 196



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 53/84 (63%), Gaps = 4/84 (4%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA  + C  +CNC  CFNN  +  +R+    + +ERNP AF  K+ +    + +  
Sbjct: 120 YCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKVKEV---DQKMA 176

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H+KGCNC++SGC K YCECY+ G 
Sbjct: 177 HSKGCNCRKSGCKKKYCECYQMGI 200


>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
          Length = 260

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 4/105 (3%)

Query: 83  ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR 141
           A  ++      ++ CNC KS CLKLYCDCFA GE+C   C+C  C NN  +++ R  A++
Sbjct: 50  ANNIQKLKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREMALK 109

Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           Q  ERNPNAF  +I   +       H +GCNCK+SGC K YC+CY
Sbjct: 110 QNKERNPNAFTSRI---IQQSQLNNHQRGCNCKKSGCQKKYCQCY 151



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 52/84 (61%), Gaps = 4/84 (4%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA GE+C   C+C  C NN  +++ R  A++Q  ERNPNAF  +I   +       
Sbjct: 75  YCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAFTSRI---IQQSQLNN 131

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H +GCNCK+SGC K YC+CY  G 
Sbjct: 132 HQRGCNCKKSGCQKKYCQCYLDGM 155


>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 444

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 8/99 (8%)

Query: 96  PCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           PCNC +S CLKLYC+CFA+G +C  +C CN C NN ++ +    A  + LERNP AF PK
Sbjct: 317 PCNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPK 376

Query: 155 IGKCLVGEGER-------RHTKGCNCKRSGCLKNYCECY 186
           I +      E         H +GCNCKRS C K YCEC+
Sbjct: 377 ISESTAAVTEEGQVLRVAAHHRGCNCKRSNCRKKYCECF 415



 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 53/91 (58%), Gaps = 8/91 (8%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+CFA+G +C  +C CN C NN ++ +    A  + LERNP AF PKI +      E  
Sbjct: 329 YCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKISESTAAVTEEG 388

Query: 59  ------RHTKGCNCKRSGCLKNYCECYEAGA 83
                  H +GCNCKRS C K YCEC++AG 
Sbjct: 389 QVLRVAAHHRGCNCKRSNCRKKYCECFQAGV 419



 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 39/104 (37%), Gaps = 49/104 (47%)

Query: 64  CNCKRSGCLKNYCECYEAGA--------------------------RTLE--PNGIRPR- 94
           CNCKRS CLK YCEC+ +G+                          RTLE  P    P+ 
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377

Query: 95  --------------------KPCNCTKSMCLKLYCDCFANGEFC 118
                               + CNC +S C K YC+CF  G  C
Sbjct: 378 SESTAAVTEEGQVLRVAAHHRGCNCKRSNCRKKYCECFQAGVPC 421


>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 394

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 6/93 (6%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+C A    C ++CNC  C N  E   +R  A+   L+RNP AF+P
Sbjct: 49  KGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPLAFQP 108

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K     V  G  +H +GCNC++SGC+KNYCEC+
Sbjct: 109 K-----VASGSSQHLRGCNCRKSGCMKNYCECH 136



 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 53/84 (63%), Gaps = 6/84 (7%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+C A    C ++CNC  C N  E   +R  A+   L+RNP AF+PK     V  G  +
Sbjct: 62  YCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPLAFQPK-----VASGSSQ 116

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H +GCNC++SGC+KNYCEC++AG 
Sbjct: 117 HLRGCNCRKSGCMKNYCECHQAGV 140


>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
          Length = 780

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 45/103 (43%), Positives = 56/103 (54%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CF  G FC + C+C  C N   + +       Q   RNP AF P
Sbjct: 477 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAP 536

Query: 154 KIGKCLVGE----------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +   G+             RH +GCNCK+S CLK YCECY
Sbjct: 537 KVIRTEPGQELADDSNKTPASSRHKRGCNCKKSSCLKKYCECY 579



 Score = 72.0 bits (175), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 11/93 (11%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
           YC+CF  G FC + C+C  C N   + +       Q   RNP AF PK+ +   G+    
Sbjct: 490 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQELAD 549

Query: 56  ------GERRHTKGCNCKRSGCLKNYCECYEAG 82
                    RH +GCNCK+S CLK YCECY+ G
Sbjct: 550 DSNKTPASSRHKRGCNCKKSSCLKKYCECYQGG 582


>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
          Length = 407

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/93 (50%), Positives = 58/93 (62%), Gaps = 6/93 (6%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+C A    C + CNC  C N  E   +R  A+   LERNP AF P
Sbjct: 49  KGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPLAFLP 108

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K     V  G  +H +GCNC++SGC+KNYCEC+
Sbjct: 109 K-----VASGSSQHLRGCNCRKSGCMKNYCECH 136



 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 51/84 (60%), Gaps = 6/84 (7%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+C A    C + CNC  C N  E   +R  A+   LERNP AF PK     V  G  +
Sbjct: 62  YCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPLAFLPK-----VASGSSQ 116

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H +GCNC++SGC+KNYCEC++AG 
Sbjct: 117 HLRGCNCRKSGCMKNYCECHQAGV 140


>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1084

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 56/120 (46%), Positives = 68/120 (56%), Gaps = 12/120 (10%)

Query: 79  YEAGARTLEP-NGIRPRKPC-NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDR 136
           Y A   +++P N    RK C NC ++ CLKLYC+CFA    C  C C  CFN  E E  R
Sbjct: 513 YTAQNSSIKPANHAEDRKICCNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMR 572

Query: 137 HHAIRQCLERNPNAFRPKIG-----KCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
             A +  LERN +AF PKI      K L     V   ++ H+KGCNCK+S CLK YCECY
Sbjct: 573 KEARQAILERNNSAFDPKIDSQSKLKQLCNEEGVDNEQKVHSKGCNCKKSNCLKKYCECY 632



 Score = 75.5 bits (184), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 53/94 (56%), Gaps = 10/94 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCL--- 52
           YC+CFA    C  C C  CFN  E E  R  A +  LERN +AF PKI      K L   
Sbjct: 544 YCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKIDSQSKLKQLCNE 603

Query: 53  --VGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
             V   ++ H+KGCNCK+S CLK YCECY+ G +
Sbjct: 604 EGVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVK 637


>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
 gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
          Length = 448

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 11/104 (10%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA G +C  C+C  C N + +E  R  AI    +RNP AF+P
Sbjct: 29  KKHCNCKNSQCLKLYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQP 88

Query: 154 KI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI                     +H KGC+CK+SGCLK YCEC+
Sbjct: 89  KIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECF 132



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
           YC+CFA G +C  C+C  C N + +E  R  AI    +RNP AF+PKI            
Sbjct: 43  YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 102

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 103 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 134


>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
 gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
          Length = 873

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/105 (49%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC K+ CLKLYCDCFA G +C   C C  CFN  E+ED      +Q   RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512

Query: 154 KI---GKCLVGEGE---------RRHTKGCNCKRSGCLKNYCECY 186
           KI    K      E          RH +GCNCK+S CLK YCECY
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECY 557



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 50/94 (53%), Gaps = 13/94 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YCDCFA G +C   C C  CFN  E+ED      +Q   RNP AF PKI    K      
Sbjct: 466 YCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHAKEFAASR 525

Query: 57  E---------RRHTKGCNCKRSGCLKNYCECYEA 81
           E          RH +GCNCK+S CLK YCECY+A
Sbjct: 526 EDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQA 559


>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
 gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
          Length = 727

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 53/143 (37%), Positives = 64/143 (44%), Gaps = 43/143 (30%)

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFAN 114
           EGE    K CNCK+S CLK YCEC+ AG   +EP                          
Sbjct: 428 EGEGEACKRCNCKKSKCLKLYCECFAAGVYCIEP-------------------------- 461

Query: 115 GEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------G 163
                 C+C  CFN   HED      +Q   RNP AF PK+ +       +G+       
Sbjct: 462 ------CSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIGDEFSKTPA 515

Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
             RH +GCNCK+S CLK YCECY
Sbjct: 516 SARHKRGCNCKKSNCLKKYCECY 538



 Score = 78.6 bits (192), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 53/94 (56%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       +G
Sbjct: 448 YCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIG 507

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 508 DEFSKTPASARHKRGCNCKKSNCLKKYCECYQGG 541


>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
 gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
          Length = 348

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/104 (47%), Positives = 57/104 (54%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC +S CLKLYC+CFA G +C   C C  CFN  E+ED      +Q   RNP AF P
Sbjct: 20  KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79

Query: 154 KIGKC-----------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K+ K                   RH +GCNCK+S CLK YCECY
Sbjct: 80  KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECY 123



 Score = 75.9 bits (185), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 50/95 (52%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
           YC+CFA G +C   C C  CFN  E+ED      +Q   RNP AF PK+ K         
Sbjct: 33  YCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAPKVVKQATNSPANE 92

Query: 52  ---LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S CLK YCECY+AG 
Sbjct: 93  EGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGV 127


>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
 gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
          Length = 916

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 35/60 (58%), Positives = 42/60 (70%)

Query: 81  AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
           A + +L+  G   RK CNC+KS CLKLYCDCFANGEFC  C C  CFNN+++E  R  AI
Sbjct: 741 AASASLDVLGGSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800



 Score = 59.3 bits (142), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 21/33 (63%), Positives = 24/33 (72%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 33
           YCDCFANGEFC  C C  CFNN+++E  R  AI
Sbjct: 768 YCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800


>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 707

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 38/64 (59%), Positives = 46/64 (71%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           RPRKPCNC  S CLKLYCDCFA G +C  C+C +C N   ++  R +AIR  LERNP+AF
Sbjct: 99  RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158

Query: 152 RPKI 155
           R K+
Sbjct: 159 RSKV 162



 Score = 76.6 bits (187), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 49/152 (32%), Positives = 70/152 (46%), Gaps = 24/152 (15%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA G +C  C+C +C N   ++  R +AIR  LERNP+AFR K+         +  
Sbjct: 115 YCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAFRSKVVTLEYDTSSKDT 174

Query: 61  TKGCNCKR---------------SGCLKNYCECYEAGARTLEPNGIRPR--------KPC 97
               NC                 S  ++      EA + + +P              K C
Sbjct: 175 ASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPRTTGTTTQRHAQHIKGC 234

Query: 98  NCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
           +C +S+CLK YC+CF NG +C   C C++C N
Sbjct: 235 SCLRSLCLKKYCECFQNGVYCSASCRCSNCRN 266



 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 47/159 (29%), Positives = 62/159 (38%), Gaps = 25/159 (15%)

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA----------------RTLEPNGIRP-- 93
           L G    R  K CNCK S CLK YC+C+ AG                  TL  N IR   
Sbjct: 92  LEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSL 151

Query: 94  -RKPCNCTKSMCLKLYCDC----FANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL-ERN 147
            R P +  +S  + L  D      A+ + C      +   ++  ED       Q   E  
Sbjct: 152 ERNP-HAFRSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEA 210

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +            +   +H KGC+C RS CLK YCEC+
Sbjct: 211 SSTSTQPRTTGTTTQRHAQHIKGCSCLRSLCLKKYCECF 249


>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
           AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
 gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
 gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
 gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 609

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 61/102 (59%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF PK+
Sbjct: 330 CNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKV 389

Query: 156 ---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
                 ++  GE         RH +GCNCK+S CLK YCECY
Sbjct: 390 IRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF PK+      ++  G
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400

Query: 57  E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 435


>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
          Length = 792

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 46/95 (48%), Positives = 53/95 (55%), Gaps = 10/95 (10%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC +S CLKLYC CFA    C   CNC+ C+NN   E  R  AI   LERNP AF  K 
Sbjct: 26  CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKF 85

Query: 156 GKCLVGEGERR----HTKGCNCKRSGCLKNYCECY 186
                 E E+     H  GC C++S CLK YCEC+
Sbjct: 86  -----RETEKHDAVVHKTGCKCRKSACLKKYCECF 115



 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 43/104 (41%), Positives = 49/104 (47%), Gaps = 18/104 (17%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA    C   CNC+ C+NN   E  R  AI   LERNP AF  K       E E+ 
Sbjct: 37  YCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKFR-----ETEKH 91

Query: 60  ----HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
               H  GC C++S CLK YCEC+  G    E         CNC
Sbjct: 92  DAVVHKTGCKCRKSACLKKYCECFNKGVACSE--------KCNC 127


>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
          Length = 609

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 61/102 (59%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF PK+
Sbjct: 330 CNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKV 389

Query: 156 ---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
                 ++  GE         RH +GCNCK+S CLK YCECY
Sbjct: 390 IRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C +CFN   H+D      +Q   RNP AF PK+      ++  G
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400

Query: 57  E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 435


>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1114

 Score = 86.3 bits (212), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 44/108 (40%), Positives = 57/108 (52%), Gaps = 18/108 (16%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC K+ CLKLYC CF  G  C  QC C  C+N I + ++R  AI+   +R P+AF  K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773

Query: 156 G-----KCLVGEGER------------RHTKGCNCKRSGCLKNYCECY 186
                    + + ER             H KGCNCK+S C K YCEC+
Sbjct: 774 NFLPELARKIEDAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECF 821



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 38/102 (37%), Positives = 51/102 (50%), Gaps = 18/102 (17%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCLVG 54
           YC CF  G  C  QC C  C+N I + ++R  AI+   +R P+AF  K+         + 
Sbjct: 725 YCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKVNFLPELARKIE 784

Query: 55  EGER------------RHTKGCNCKRSGCLKNYCECYEAGAR 84
           + ER             H KGCNCK+S C K YCEC+ AG +
Sbjct: 785 DAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECFLAGVK 826


>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
          Length = 176

 Score = 86.3 bits (212), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 56/89 (62%), Gaps = 5/89 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYCDCF  G  C   CNC SC NN EH  +R   I+Q +ERNP+AFRPK+
Sbjct: 85  CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKV 144

Query: 156 GKCLVGEGER----RHTKGCNCKRSGCLK 180
                 E E+    RH KGCNCK+S   K
Sbjct: 145 ESKSNSEDEQDHKPRHFKGCNCKKSNWSK 173



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YCDCF  G  C   CNC SC NN EH  +R   I+Q +ERNP+AFRPK+      E E+ 
Sbjct: 96  YCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKVESKSNSEDEQD 155

Query: 59  ---RHTKGCNCKRSGCLK 73
              RH KGCNCK+S   K
Sbjct: 156 HKPRHFKGCNCKKSNWSK 173


>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 760

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 45/103 (43%), Positives = 56/103 (54%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CF  G FC + C+C  C N   + +       Q   RNP AF P
Sbjct: 457 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAP 516

Query: 154 KIGKCLVGE----------GERRHTKGCNCKRSGCLKNYCECY 186
           K+ +   G+             RH +GCNCK+S CLK YCECY
Sbjct: 517 KVIRTEPGQELADDSNKTPASSRHKRGCNCKKSSCLKKYCECY 559



 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 11/93 (11%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
           YC+CF  G FC + C+C  C N   + +       Q   RNP AF PK+ +   G+    
Sbjct: 470 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQELAD 529

Query: 56  ------GERRHTKGCNCKRSGCLKNYCECYEAG 82
                    RH +GCNCK+S CLK YCECY+ G
Sbjct: 530 DSNKTPASSRHKRGCNCKKSSCLKKYCECYQGG 562


>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
 gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
          Length = 767

 Score = 85.9 bits (211), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA G +C + C+C  C NN  H +      +Q   RNP AF P
Sbjct: 457 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAP 516

Query: 154 KI-----GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
           K+         +G+         RH +GCNCK+S CLK YCEC+
Sbjct: 517 KVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 560



 Score = 71.2 bits (173), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 38/94 (40%), Positives = 51/94 (54%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVG 54
           YC+CFA G +C + C+C  C NN  H +      +Q   RNP AF PK+         +G
Sbjct: 470 YCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSESGLELG 529

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCEC++ G
Sbjct: 530 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 563


>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
 gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
          Length = 497

 Score = 85.9 bits (211), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 19/118 (16%)

Query: 85  TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
           + E   + P+K   CNC  S CLK+YC+CF   ++C  CNC++C N + +E+ R+ AI  
Sbjct: 56  STEVKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 115

Query: 143 CLERNPNAFRPKI--------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
             +RNP+AF+PKI              G   V     +H KGC+CK+SGCLK YCECY
Sbjct: 116 IRQRNPSAFQPKIENGPNTLNVRKDNSGAVPVVA---KHHKGCHCKKSGCLKKYCECY 170



 Score = 73.6 bits (179), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 17/95 (17%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------------ 48
           YC+CF   ++C  CNC++C N + +E+ R+ AI    +RNP+AF+PKI            
Sbjct: 81  YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIENGPNTLNVRKD 140

Query: 49  --GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
             G   V     +H KGC+CK+SGCLK YCECY+A
Sbjct: 141 NSGAVPVVA---KHHKGCHCKKSGCLKKYCECYQA 172


>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
 gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
          Length = 710

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 50/122 (40%), Positives = 57/122 (46%), Gaps = 30/122 (24%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA G +C   C C  CFN  E  D      +Q   RNP AF P
Sbjct: 390 KRCHCKKSKCLKLYCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAP 449

Query: 154 KIGK-----------------------------CLVGEGERRHTKGCNCKRSGCLKNYCE 184
           KI +                             C    G  RH +GCNCK+S CLK YCE
Sbjct: 450 KIVQTSDNTPTEGVGIWFSGHAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCE 509

Query: 185 CY 186
           CY
Sbjct: 510 CY 511



 Score = 72.4 bits (176), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 50/136 (36%), Positives = 61/136 (44%), Gaps = 18/136 (13%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA G +C   C C  CFN  E  D      +Q   RNP AF PKI    V   +  
Sbjct: 403 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKI----VQTSDNT 458

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPN------GIRPRKPCNCTKSMCLKLYCDCFA 113
            T+G     SG               L+PN        R ++ CNC KS+CLK YC+C+ 
Sbjct: 459 PTEGVGIWFSG------HAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQ 512

Query: 114 NGEFCYQ-CNCNSCFN 128
            G  C   C CN C N
Sbjct: 513 AGVGCSDGCKCNGCKN 528


>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
 gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
          Length = 620

 Score = 85.5 bits (210), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 2/82 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G FC++C+C +C N  ++        +Q   RNPNAF  KI     G+G  +H
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKI--VASGDGGGQH 335

Query: 61  TKGCNCKRSGCLKNYCECYEAG 82
            KGC+CK+S CLK YCEC++AG
Sbjct: 336 KKGCHCKKSACLKKYCECFQAG 357



 Score = 82.4 bits (202), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 38/80 (47%), Positives = 51/80 (63%), Gaps = 2/80 (2%)

Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
           LYC+CFA G FC++C+C +C N  ++        +Q   RNPNAF  KI     G+G  +
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKI--VASGDGGGQ 334

Query: 167 HTKGCNCKRSGCLKNYCECY 186
           H KGC+CK+S CLK YCEC+
Sbjct: 335 HKKGCHCKKSACLKKYCECF 354



 Score = 40.4 bits (93), Expect = 0.36,   Method: Composition-based stats.
 Identities = 28/94 (29%), Positives = 34/94 (36%), Gaps = 40/94 (42%)

Query: 75  YCECYEAGARTLE---------------------------PN------------GIRPRK 95
           YCEC+ AG    E                           PN            G + +K
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGGGQHKK 337

Query: 96  PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
            C+C KS CLK YC+CF  G  C   C C+ C N
Sbjct: 338 GCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKN 371


>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas
          reinhardtii]
 gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 91

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 53/91 (58%), Gaps = 10/91 (10%)

Query: 1  YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG----------K 50
          YC+CFA+  +C  CNC  CFNN E+E  R  A+   +ERNPNAF+PKI            
Sbjct: 1  YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60

Query: 51 CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                  RH KGCNCK+S CLK YCEC++ 
Sbjct: 61 VAAAGASGRHLKGCNCKKSFCLKKYCECFQV 91



 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 52/89 (58%), Gaps = 10/89 (11%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG----------K 157
           YC+CFA+  +C  CNC  CFNN E+E  R  A+   +ERNPNAF+PKI            
Sbjct: 1   YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60

Query: 158 CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                   RH KGCNCK+S CLK YCEC+
Sbjct: 61  VAAAGASGRHLKGCNCKKSFCLKKYCECF 89


>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
          Length = 772

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA G +C + C+C  C NN  H +      +Q   RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAP 521

Query: 154 KI------GKCLVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
           K+      G  L     +     RH +GCNCK+S CLK YCEC+
Sbjct: 522 KVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECF 565



 Score = 70.5 bits (171), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 39/94 (41%), Positives = 51/94 (54%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
           YC+CFA G +C + C+C  C NN  H +      +Q   RNP AF PK+      G  L 
Sbjct: 475 YCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELR 534

Query: 54  GEGER-----RHTKGCNCKRSGCLKNYCECYEAG 82
               +     RH +GCNCK+S CLK YCEC++ G
Sbjct: 535 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 568


>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
          Length = 782

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+
Sbjct: 458 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 517

Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
                 G+ + GE         RH +GCNCK+S CLK YCECY
Sbjct: 518 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 560



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+      G+ + 
Sbjct: 469 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 528

Query: 54  GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           GE         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 529 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 564


>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
 gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
 gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
          Length = 782

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+
Sbjct: 470 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 529

Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
                 G+ + GE         RH +GCNCK+S CLK YCECY
Sbjct: 530 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 572



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+      G+ + 
Sbjct: 481 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 540

Query: 54  GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           GE         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 541 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 576


>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
          Length = 794

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+
Sbjct: 470 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 529

Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
                 G+ + GE         RH +GCNCK+S CLK YCECY
Sbjct: 530 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 572



 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+      G+ + 
Sbjct: 481 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 540

Query: 54  GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           GE         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 541 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 576


>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 455

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/193 (34%), Positives = 86/193 (44%), Gaps = 50/193 (25%)

Query: 1   YCDCFANGEFC--YQCNCNSCFNNIEH---EDDRHHAIRQCLERNPNAFRPKIGKCLVGE 55
           YCDCF  G  C    CNC +C N I     +  R +AIR  L RNP AF           
Sbjct: 271 YCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAFV---------- 320

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
                T G                 A A    P G   +  CNC +S CLKLYC CF  G
Sbjct: 321 -----TAGV----------------ASAVQKLPAG---QVACNCIRSRCLKLYCTCFQAG 356

Query: 116 EFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNC 173
           + C    C C +C N++E   +R  AI+  L++ P+AF+ K         +R    GC C
Sbjct: 357 KMCQPGICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK---------DRPVGLGCAC 407

Query: 174 KRSGCLKNYCECY 186
           K + C++ YCEC+
Sbjct: 408 KNNKCIRKYCECF 420



 Score = 62.4 bits (150), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 48/95 (50%), Gaps = 6/95 (6%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEH---EDDRHHAIRQCLERNPNAF 151
           C+C KS C+ LYCDCF  G  C    CNC +C N I     +  R +AIR  L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
               G     +        CNC RS CLK YC C+
Sbjct: 320 V-TAGVASAVQKLPAGQVACNCIRSRCLKLYCTCF 353



 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 11/84 (13%)

Query: 1   YCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
           YC CF  G+ C    C C +C N++E   +R  AI+  L++ P+AF+ K         +R
Sbjct: 349 YCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK---------DR 399

Query: 59  RHTKGCNCKRSGCLKNYCECYEAG 82
               GC CK + C++ YCEC+  G
Sbjct: 400 PVGLGCACKNNKCIRKYCECFRNG 423


>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
          Length = 596

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 57/130 (43%), Positives = 65/130 (50%), Gaps = 37/130 (28%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+GE+C  CNCN C NNIE+E  R   I   LERNPNAFRP
Sbjct: 33  KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92

Query: 154 KIGKCLV-------------------------------------GEGERRHTKGCNCKRS 176
           KI                                          G    +H KGC CK+S
Sbjct: 93  KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152

Query: 177 GCLKNYCECY 186
           GCLK YCEC+
Sbjct: 153 GCLKKYCECF 162



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 50/119 (42%), Positives = 58/119 (48%), Gaps = 37/119 (31%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
           YC+CFA+GE+C  CNCN C NNIE+E  R   I   LERNPNAFRPKI            
Sbjct: 47  YCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRPKIASLPPQSPQITQ 106

Query: 54  ------------------------------GEGERRHTKGCNCKRSGCLKNYCECYEAG 82
                                         G    +H KGC CK+SGCLK YCEC++AG
Sbjct: 107 KVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAG 165


>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 58/100 (58%), Gaps = 14/100 (14%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
           P  I   K CNC KS CLK YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R
Sbjct: 82  PKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINR 141

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +P AF             ++  KGCNCK+SGC K YCEC+
Sbjct: 142 DPGAF-------------KQSFKGCNCKKSGCQKKYCECF 168



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 49/83 (59%), Gaps = 14/83 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R+P AF+             +
Sbjct: 102 YCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFK-------------Q 148

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
             KGCNCK+SGC K YCEC+ +G
Sbjct: 149 SFKGCNCKKSGCQKKYCECFLSG 171


>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
 gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
          Length = 561

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 58/106 (54%), Gaps = 14/106 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K CNC K+ CLK YCDCFA G +C + C C  CFN +E+ED    A +Q    NP AF P
Sbjct: 426 KRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAP 485

Query: 154 KIGK-------------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI +              L      RH  GCNCKRS C+K YCECY
Sbjct: 486 KIVQHVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMCVKKYCECY 531



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 51/95 (53%), Gaps = 14/95 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YCDCFA G +C + C C  CFN +E+ED    A +Q    NP AF PKI +         
Sbjct: 439 YCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAPKIVQHVTEFHAIN 498

Query: 51  ----CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                L      RH  GCNCKRS C+K YCECY+A
Sbjct: 499 VEDASLFTPSSGRHKTGCNCKRSMCVKKYCECYQA 533


>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 611

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/104 (49%), Positives = 64/104 (61%), Gaps = 14/104 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFR 152
           K C C KS CLKLYC+CFA G +C + C+C +CFN   HED    + R+ +E RNP AF 
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKS-REVIEARNPLAFA 489

Query: 153 PKIGKC------LVGEGER-----RHTKGCNCKRSGCLKNYCEC 185
           PK+         L  E  +     RHT+GCNC++SGCLK YCEC
Sbjct: 490 PKVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCEC 533



 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/97 (45%), Positives = 57/97 (58%), Gaps = 14/97 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIGKC------L 52
           YC+CFA G +C + C+C +CFN   HED    + R+ +E RNP AF PK+         L
Sbjct: 444 YCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKS-REVIEARNPLAFAPKVVSTSDTAIDL 502

Query: 53  VGEGER-----RHTKGCNCKRSGCLKNYCECYEAGAR 84
             E  +     RHT+GCNC++SGCLK YCEC   G R
Sbjct: 503 WVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVR 539


>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
          Length = 553

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/83 (49%), Positives = 54/83 (65%), Gaps = 2/83 (2%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG
Sbjct: 162 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 221

Query: 157 KCLVGEGERRHTKGCNCKRSGCL 179
                   R + +  N   +G +
Sbjct: 222 SS--PHANRNNMQAANAAVNGAI 242



 Score = 69.3 bits (168), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 2/72 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA+G +C  CNC++CFNN+++E  R  AI   LERNP+AFRPKIG        R +
Sbjct: 173 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSS--PHANRNN 230

Query: 61  TKGCNCKRSGCL 72
            +  N   +G +
Sbjct: 231 MQAANAAVNGAI 242


>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
          Length = 1430

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 11/101 (10%)

Query: 95   KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH------HAIRQCLERN- 147
            KPCNC KS CLKLYC+C+A G FC  CNC++C N  E + + H      HA+ +    + 
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPE-DGEAHPPYPATHALGRLRHGDV 1070

Query: 148  --PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
              P A      K    + ++R   GC CK+S CLK YCEC+
Sbjct: 1071 AVPTAVLTSAQK-RKPQSKQRQESGCFCKKSKCLKKYCECF 1110



 Score = 64.7 bits (156), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 35/91 (38%), Positives = 49/91 (53%), Gaps = 11/91 (12%)

Query: 1    YCDCFANGEFCYQCNCNSCFNNIEHEDDRH------HAIRQCLERN---PNAFRPKIGKC 51
            YC+C+A G FC  CNC++C N  E + + H      HA+ +    +   P A      K 
Sbjct: 1025 YCECYAAGVFCRDCNCSNCLNKPE-DGEAHPPYPATHALGRLRHGDVAVPTAVLTSAQK- 1082

Query: 52   LVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
               + ++R   GC CK+S CLK YCEC+EAG
Sbjct: 1083 RKPQSKQRQESGCFCKKSKCLKKYCECFEAG 1113



 Score = 55.5 bits (132), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 37/116 (31%), Positives = 42/116 (36%), Gaps = 49/116 (42%)

Query: 62   KGCNCKRSGCLKNYCECYEAGARTLEPN-------------------------------- 89
            K CNCK+S CLK YCECY AG    + N                                
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHALGRLRHGDVA 1071

Query: 90   -------GIRPRKP---------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                     + RKP         C C KS CLK YC+CF  G  C   C C +C N
Sbjct: 1072 VPTAVLTSAQKRKPQSKQRQESGCFCKKSKCLKKYCECFEAGVHCEASCKCENCEN 1127


>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
          Length = 929

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 53/140 (37%), Positives = 66/140 (47%), Gaps = 41/140 (29%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
           YCDCFANG +C   C+C+SC N +E+      +  AI+Q   RNPNAF  KI       G
Sbjct: 541 YCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQ---RNPNAFVQKIEADAALGG 597

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGE 116
           +  H +GCNCK+S C+K YCEC++AG             PC                 GE
Sbjct: 598 Q--HRRGCNCKKSHCMKKYCECFQAGV------------PC-----------------GE 626

Query: 117 FCYQCNCNSCFNNIEHEDDR 136
               C C SC N   H   R
Sbjct: 627 ---HCKCESCHNTAGHSGRR 643



 Score = 79.0 bits (193), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 41/84 (48%), Positives = 53/84 (63%), Gaps = 9/84 (10%)

Query: 107 LYCDCFANGEFC-YQCNCNSCFNNIEHE---DDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           LYCDCFANG +C   C+C+SC N +E+      +  AI+Q   RNPNAF  KI       
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQ---RNPNAFVQKIEADAALG 596

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
           G+  H +GCNCK+S C+K YCEC+
Sbjct: 597 GQ--HRRGCNCKKSHCMKKYCECF 618


>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
          Length = 159

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 58/100 (58%), Gaps = 14/100 (14%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
           P  I   K CNC KS CLK YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R
Sbjct: 53  PKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNR 112

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +P AF             ++  KGCNCK+SGC K YCEC+
Sbjct: 113 DPGAF-------------KQSFKGCNCKKSGCQKKYCECF 139



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 14/83 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R+P AF+             +
Sbjct: 73  YCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFK-------------Q 119

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
             KGCNCK+SGC K YCEC+  G
Sbjct: 120 SFKGCNCKKSGCQKKYCECFLNG 142


>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1022

 Score = 83.2 bits (204), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 56/95 (58%), Gaps = 5/95 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           C C K+ CLKLYC+CF   ++C   NC  C N  E+ED R HA+    +RN +AF PK  
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825

Query: 157 KC----LVGEGERR-HTKGCNCKRSGCLKNYCECY 186
           K         G +  H+KGCNCK+S C K YCECY
Sbjct: 826 KSDQYYKSDSGTKAIHSKGCNCKKSDCRKKYCECY 860



 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 40/89 (44%), Positives = 52/89 (58%), Gaps = 5/89 (5%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC----LVGEG 56
           YC+CF   ++C   NC  C N  E+ED R HA+    +RN +AF PK  K         G
Sbjct: 777 YCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKADKSDQYYKSDSG 836

Query: 57  ERR-HTKGCNCKRSGCLKNYCECYEAGAR 84
            +  H+KGCNCK+S C K YCECY+ GA+
Sbjct: 837 TKAIHSKGCNCKKSDCRKKYCECYQMGAK 865


>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
          Length = 223

 Score = 82.8 bits (203), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/52 (71%), Positives = 43/52 (82%)

Query: 30 HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
           +  + CL+RNP AF+PKIGK   GE +RRH+KGCNCKRSGCLKNYCECYEA
Sbjct: 39 QYVTQACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEA 90



 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 35/50 (70%), Positives = 41/50 (82%)

Query: 137 HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +  + CL+RNP AF+PKIGK   GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 39  QYVTQACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 88


>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
          Length = 384

 Score = 82.8 bits (203), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 49/111 (44%), Positives = 61/111 (54%), Gaps = 25/111 (22%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++  +R  AI + L RNP+AF    
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAF---- 247

Query: 156 GKCLVGEGERRHT--------------------KGCNCKRSGCLKNYCECY 186
            +C   +G+ + +                    KGCNCK+SGC K YCECY
Sbjct: 248 AQCFQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECY 298



 Score = 62.8 bits (151), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 67/146 (45%), Gaps = 19/146 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFAN   C Q CNC  C N I++  +R  AI + L RNP+AF     +C   +G+  
Sbjct: 203 YCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAF----AQCFQQKGQ-- 256

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
            T+    ++   LK             + N    RK CNC KS C K YC+C++    C 
Sbjct: 257 -TQFSVQQQDKPLK----------EPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCS 305

Query: 120 Q-CNCNSCFNNIEHEDDRHHAIRQCL 144
             C C  C N  + E      I Q L
Sbjct: 306 DLCKCEQCLNRTDAEIQAQQDIAQVL 331


>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
           distachyon]
          Length = 771

 Score = 82.8 bits (203), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CF  G FC + C+C  C N   + +       Q   RNP AF P
Sbjct: 465 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIESRNPLAFAP 524

Query: 154 KI------GKCLVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
           K+      G+ L  +  +     RH +GCNCK+S CLK YCEC+
Sbjct: 525 KVIRTSEPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECF 568



 Score = 68.2 bits (165), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 37/94 (39%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
           YC+CF  G FC + C+C  C N   + +       Q   RNP AF PK+      G+ L 
Sbjct: 478 YCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIESRNPLAFAPKVIRTSEPGQELG 537

Query: 54  GEGER-----RHTKGCNCKRSGCLKNYCECYEAG 82
            +  +     RH +GCNCK+S CLK YCEC++ G
Sbjct: 538 EQSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 571


>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
          Length = 551

 Score = 82.4 bits (202), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 55/107 (51%), Positives = 69/107 (64%), Gaps = 17/107 (15%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRP
Sbjct: 94  QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRP 153

Query: 154 KI--------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           KI              G+ L+     +H KGC+CK+SGCLK YCEC+
Sbjct: 154 KIASSPHRTRDIREDAGEILI---LGKHNKGCHCKKSGCLKKYCECF 197



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 17/95 (17%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------------ 48
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI            
Sbjct: 108 YCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHRTRDIRE 167

Query: 49  --GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
             G+ L+     +H KGC+CK+SGCLK YCEC++A
Sbjct: 168 DAGEILI---LGKHNKGCHCKKSGCLKKYCECFQA 199


>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 710

 Score = 82.4 bits (202), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 42/93 (45%), Positives = 55/93 (59%), Gaps = 2/93 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C+KS CLKLYC+CFA G  C   C C  C N  +  D    A  + ++R+P AF  
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEI 584

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           K+ +   G  + +H KGC CK+SGC K YCEC+
Sbjct: 585 KVSQN-SGTDKLQHRKGCTCKKSGCQKGYCECF 616



 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 2/83 (2%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA G  C   C C  C N  +  D    A  + ++R+P AF  K+ +   G  + +
Sbjct: 538 YCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVSQN-SGTDKLQ 596

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
           H KGC CK+SGC K YCEC++ G
Sbjct: 597 HRKGCTCKKSGCQKGYCECFQLG 619



 Score = 45.4 bits (106), Expect = 0.010,   Method: Composition-based stats.
 Identities = 36/130 (27%), Positives = 48/130 (36%), Gaps = 44/130 (33%)

Query: 61  TKGCNCKRSGCLKNYCECYEAG-------------------------------------- 82
           TK C C +S CLK YCEC+ AG                                      
Sbjct: 524 TKKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFE 583

Query: 83  ---ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
              ++    + ++ RK C C KS C K YC+CF  G  C   C C+ C N  +  D    
Sbjct: 584 IKVSQNSGTDKLQHRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSD--QP 641

Query: 139 AIRQCLERNP 148
            ++  LE  P
Sbjct: 642 CLQTSLESKP 651


>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
          Length = 586

 Score = 82.4 bits (202), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 50/99 (50%), Positives = 61/99 (61%), Gaps = 9/99 (9%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
           C+C  S CLKLYC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+ 
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVS 233

Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                    K   G+   +HTKGC CKR+ CLK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTECLKKYCECF 272



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 44/90 (48%), Positives = 55/90 (61%), Gaps = 9/90 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
           YC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+          K 
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNSTAHNCESKA 244

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
             G+   +HTKGC CKR+ CLK YCEC++A
Sbjct: 245 AEGDIVGKHTKGCKCKRTECLKKYCECFKA 274


>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 82.0 bits (201), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 60/102 (58%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++ ++R  AI + L RNP AF    
Sbjct: 164 CNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTIL 223

Query: 152 -----RPKIGKCLVGEGE--RRHTKGCNCKRSGCLKNYCECY 186
                +P+I      + E  +   KGCNCK+S C K YCECY
Sbjct: 224 TNNGQQPQIIPEPKSQKEQSKETKKGCNCKKSECKKKYCECY 265



 Score = 65.9 bits (159), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 52/96 (54%), Gaps = 12/96 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---------RPKIGK 50
           YC+CFAN   C Q CNC  C N I++ ++R  AI + L RNP AF         +P+I  
Sbjct: 175 YCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTILTNNGQQPQIIP 234

Query: 51  CLVGEGE--RRHTKGCNCKRSGCLKNYCECYEAGAR 84
               + E  +   KGCNCK+S C K YCECY    +
Sbjct: 235 EPKSQKEQSKETKKGCNCKKSECKKKYCECYSINQK 270


>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
          Length = 800

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/102 (46%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+
Sbjct: 484 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKV 543

Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
            +        GE         RH +GCNCK+S CLK YCECY
Sbjct: 544 IRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 585



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+ +        G
Sbjct: 495 YCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKVIRMSDAGLETG 554

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 555 EDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 589


>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
          Length = 175

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 56/100 (56%), Gaps = 14/100 (14%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
           P  +   K CNC KS CLK YCDCFANG+ C + CNC  CFNN  +   R  A  Q   R
Sbjct: 69  PKIVSSTKTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTR 128

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +P AF             ++  KGCNCK+SGC K YCEC+
Sbjct: 129 DPRAF-------------KQAFKGCNCKKSGCQKKYCECF 155



 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 14/81 (17%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANG+ C + CNC  CFNN  +   R  A  Q   R+P AF+             +
Sbjct: 89  YCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFK-------------Q 135

Query: 60  HTKGCNCKRSGCLKNYCECYE 80
             KGCNCK+SGC K YCEC++
Sbjct: 136 AFKGCNCKKSGCQKKYCECFQ 156


>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
          Length = 190

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/100 (47%), Positives = 56/100 (56%), Gaps = 14/100 (14%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
           P  +   K CNC KS CLK YCDCFANG+ C + CNC  CFNN  +   R  A  Q   R
Sbjct: 84  PKIVTSTKTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQAR 143

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +P AF             ++  KGCNCK+SGC K YCEC+
Sbjct: 144 DPGAF-------------KQAFKGCNCKKSGCQKKYCECF 170



 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 14/81 (17%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANG+ C + CNC  CFNN  +   R  A  Q   R+P AF+             +
Sbjct: 104 YCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFK-------------Q 150

Query: 60  HTKGCNCKRSGCLKNYCECYE 80
             KGCNCK+SGC K YCEC++
Sbjct: 151 AFKGCNCKKSGCQKKYCECFQ 171


>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 974

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 46/100 (46%), Positives = 56/100 (56%), Gaps = 10/100 (10%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC K+ CLKLYCDCFA GE C Q C C  C NN    + R+ AI   L+RNP AF  K 
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVKP 695

Query: 156 GKCLVGEGERRHT---------KGCNCKRSGCLKNYCECY 186
            +    + E  ++          GCNC++S CLK YC CY
Sbjct: 696 EEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCY 735



 Score = 67.8 bits (164), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 43/111 (38%), Positives = 55/111 (49%), Gaps = 18/111 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA GE C Q C C  C NN    + R+ AI   L+RNP AF  K  +    + E  
Sbjct: 647 YCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVKPEEAKKSKKELN 706

Query: 60  HTK---------GCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTK 101
           ++          GCNC++S CLK YC CY +G +  E         C CT+
Sbjct: 707 NSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSE--------ACKCTE 749


>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
 gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
          Length = 749

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 42/94 (44%), Positives = 53/94 (56%), Gaps = 12/94 (12%)

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL---- 159
           L+LYC+CFA G +C + C C  CFN   HED      +Q   RNP AF PK+ +      
Sbjct: 467 LQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPAP 526

Query: 160 -VGE------GERRHTKGCNCKRSGCLKNYCECY 186
            +G+         RH +GCNCK+S CLK YCECY
Sbjct: 527 EIGDESSKTPASARHKRGCNCKKSSCLKKYCECY 560



 Score = 80.9 bits (198), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C C  CFN   HED      +Q   RNP AF PK+ +       +G
Sbjct: 470 YCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPAPEIG 529

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 530 DESSKTPASARHKRGCNCKKSSCLKKYCECYQGG 563


>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
          Length = 196

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/93 (48%), Positives = 55/93 (59%), Gaps = 14/93 (15%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           + CNC KS CLK YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R+P AF+ 
Sbjct: 97  RSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQ 156

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                          KGCNCK+SGC K YCEC+
Sbjct: 157 SF-------------KGCNCKKSGCQKKYCECF 176



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 14/83 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFANG+ C + CNC  CFNN  + + R  A  Q + R+P AF+              
Sbjct: 110 YCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQSF----------- 158

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
             KGCNCK+SGC K YCEC+ +G
Sbjct: 159 --KGCNCKKSGCQKKYCECFLSG 179


>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 56/102 (54%), Positives = 67/102 (65%), Gaps = 10/102 (9%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K CNC  S CLKLYC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPK
Sbjct: 26  KQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPK 85

Query: 155 IGKCL--------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
           I            VGE     +H KGC+CK+SGCLK YCEC+
Sbjct: 86  IFSSPGIRDSGEDVGEHPLAGKHNKGCHCKKSGCLKKYCECF 127



 Score = 68.9 bits (167), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 48/91 (52%), Positives = 60/91 (65%), Gaps = 10/91 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-------- 52
           YC+CFA+G +C  CNC +C NN+E+E  R  A+   LERNPNAFRPKI            
Sbjct: 39  YCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPKIFSSPGIRDSGED 98

Query: 53  VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
           VGE     +H KGC+CK+SGCLK YCEC++A
Sbjct: 99  VGEHPLAGKHNKGCHCKKSGCLKKYCECFQA 129


>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
          Length = 348

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/102 (46%), Positives = 59/102 (57%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CF N   C Q CNC  C N I++ ++R  AI + L RNP AF P +
Sbjct: 168 CNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPIL 227

Query: 156 G-----KCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
                   ++ E       ++   KGCNCK+S C K YCECY
Sbjct: 228 TNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECY 269



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 51/96 (53%), Gaps = 12/96 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCLVG 54
           YC+CF N   C Q CNC  C N I++ ++R  AI + L RNP AF P +        ++ 
Sbjct: 179 YCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPILTNNGQQPQVIQ 238

Query: 55  EG------ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
           E       ++   KGCNCK+S C K YCECY    +
Sbjct: 239 EQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQK 274


>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
 gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
          Length = 776

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/102 (46%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+
Sbjct: 466 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 525

Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
            +        GE         RH +GCNCK+S CLK YCECY
Sbjct: 526 IRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 567



 Score = 69.3 bits (168), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+ +        G
Sbjct: 477 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGLETG 536

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 537 EDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 571


>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
 gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
          Length = 695

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 49/142 (34%), Positives = 60/142 (42%), Gaps = 43/142 (30%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
           GE    K CNCK+S CLK YCEC+ AG   +EP                           
Sbjct: 395 GEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 427

Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVGEGE 164
                C+C  CFN   HE+      +Q   RNP AF PK+ +                  
Sbjct: 428 -----CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPAS 482

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH +GCNCK+S C+K YCECY
Sbjct: 483 ARHKRGCNCKKSNCMKKYCECY 504



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 38/94 (40%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YC+CFA G +C + C+C  CFN   HE+      +Q   RNP AF PK+ +         
Sbjct: 414 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 473

Query: 51  --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S C+K YCECY+ G
Sbjct: 474 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGG 507


>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
 gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
 gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 695

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 49/142 (34%), Positives = 60/142 (42%), Gaps = 43/142 (30%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
           GE    K CNCK+S CLK YCEC+ AG   +EP                           
Sbjct: 395 GEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 427

Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVGEGE 164
                C+C  CFN   HE+      +Q   RNP AF PK+ +                  
Sbjct: 428 -----CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPAS 482

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH +GCNCK+S C+K YCECY
Sbjct: 483 ARHKRGCNCKKSNCMKKYCECY 504



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 38/94 (40%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YC+CFA G +C + C+C  CFN   HE+      +Q   RNP AF PK+ +         
Sbjct: 414 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 473

Query: 51  --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S C+K YCECY+ G
Sbjct: 474 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGG 507


>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
          Length = 386

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 49/107 (45%), Positives = 58/107 (54%), Gaps = 17/107 (15%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++  +R  AI + L RNP+AF    
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQCF 251

Query: 152 ------------RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                       + K  K    +      KGCNCK+SGC K YCECY
Sbjct: 252 QQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECY 298



 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 50/101 (49%), Gaps = 17/101 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--------------- 44
           YC+CFAN   C Q CNC  C N I++  +R  AI + L RNP+AF               
Sbjct: 203 YCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKSQTQFSVQ 262

Query: 45  -RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
            + K  K    +      KGCNCK+SGC K YCECY    +
Sbjct: 263 QQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLK 303


>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
 gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
           AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
 gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
          Length = 674

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 59/187 (31%), Positives = 75/187 (40%), Gaps = 44/187 (23%)

Query: 11  CYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSG 70
             Q + +   N I  + D+   +   L+   N   PK  +  +  GE    K CNCK+S 
Sbjct: 325 TLQDSLDQTENEIREDADQDVPVEPALQEL-NLSSPKKKRVKLDSGEGESCKRCNCKKSK 383

Query: 71  CLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNI 130
           CLK YCEC+ AG   +EP                                C+C  CFN  
Sbjct: 384 CLKLYCECFAAGVYCIEP--------------------------------CSCIDCFNKP 411

Query: 131 EHEDDRHHAIRQCLERNPNAFRPKI-----------GKCLVGEGERRHTKGCNCKRSGCL 179
            HED      +Q   RNP AF PK+                     RH +GCNCK+S CL
Sbjct: 412 IHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCL 471

Query: 180 KNYCECY 186
           K YCECY
Sbjct: 472 KKYCECY 478



 Score = 72.4 bits (176), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 49/95 (51%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+           
Sbjct: 388 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 447

Query: 49  GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S CLK YCECY+ G 
Sbjct: 448 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 482


>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
          Length = 613

 Score = 80.1 bits (196), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 45/108 (41%), Positives = 57/108 (52%), Gaps = 15/108 (13%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           RK CNC ++ CLKLYCDC A G +C   C C++C N  E+E        +   R+P AF 
Sbjct: 286 RKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFA 345

Query: 153 PKI--------GKCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
           P+I           L  +G        RH +GCNCK+S C K YCECY
Sbjct: 346 PRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECY 393



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 38/97 (39%), Positives = 49/97 (50%), Gaps = 15/97 (15%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
           YCDC A G +C   C C++C N  E+E        +   R+P AF P+I           
Sbjct: 300 YCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNV 359

Query: 52  LVGEG------ERRHTKGCNCKRSGCLKNYCECYEAG 82
           L  +G        RH +GCNCK+S C K YCECY+AG
Sbjct: 360 LQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAG 396


>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 235

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/60 (65%), Positives = 44/60 (73%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC  S CLKLYC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG
Sbjct: 157 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216



 Score = 66.2 bits (160), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 31/49 (63%), Positives = 36/49 (73%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
           YC+CFA+G  C  CNC +CFNN E+E  R  AI   LERNP+AFRPKIG
Sbjct: 168 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216


>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
 gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
          Length = 369

 Score = 79.7 bits (195), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 50/99 (50%), Positives = 61/99 (61%), Gaps = 9/99 (9%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
           C+C  S CLKLYC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+ 
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVL 233

Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                    K   G+   +HTKGC CKR+ CLK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTECLKKYCECF 272



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/92 (47%), Positives = 55/92 (59%), Gaps = 9/92 (9%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
           YC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+          K 
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNSTAHNCESKA 244

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
             G+   +HTKGC CKR+ CLK YCEC++A  
Sbjct: 245 AEGDIVGKHTKGCKCKRTECLKKYCECFKASV 276


>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
          Length = 369

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 62/117 (52%), Gaps = 11/117 (9%)

Query: 80  EAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH 137
           EA  +T E  GI  RK   C C +S CLKLYCDCFA+G  C  C+C  C NN E  D R 
Sbjct: 50  EAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCDARE 108

Query: 138 HAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
            A+   L RNPNAF  K    L                ++GC CKR+ CLK YCEC+
Sbjct: 109 AAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
           YCDCFA+G  C  C+C  C NN E  D R  A+   L RNPNAF  K    L        
Sbjct: 79  YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138

Query: 54  -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
                   ++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167


>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
 gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
 gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 368

 Score = 79.3 bits (194), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 62/117 (52%), Gaps = 11/117 (9%)

Query: 80  EAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH 137
           EA  +T E  GI  RK   C C +S CLKLYCDCFA+G  C  C+C  C NN E  D R 
Sbjct: 50  EAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCDARE 108

Query: 138 HAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
            A+   L RNPNAF  K    L                ++GC CKR+ CLK YCEC+
Sbjct: 109 AAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165



 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
           YCDCFA+G  C  C+C  C NN E  D R  A+   L RNPNAF  K    L        
Sbjct: 79  YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138

Query: 54  -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
                   ++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167


>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
          Length = 356

 Score = 79.3 bits (194), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 63/120 (52%), Gaps = 11/120 (9%)

Query: 77  ECYEAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHED 134
           E  EA  +T E  GI  RK   C C +S CLKLYCDCFA+G  C  C+C  C NN E  D
Sbjct: 47  EHSEAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCD 105

Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
            R  A+   L RNPNAF  K    L                ++GC CKR+ CLK YCEC+
Sbjct: 106 AREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
           YCDCFA+G  C  C+C  C NN E  D R  A+   L RNPNAF  K    L        
Sbjct: 79  YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138

Query: 54  -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
                   ++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167


>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 510

 Score = 79.0 bits (193), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 53/85 (62%), Gaps = 4/85 (4%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
           YC+CFA G FC  C+C  C N  E+E       +Q  +RNP AF  KI   +   G+  R
Sbjct: 208 YCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKI---MADAGDDAR 264

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
           HTKGC+CK+S CLK YCEC++AG +
Sbjct: 265 HTKGCHCKKSACLKKYCECFQAGVK 289



 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 50/81 (61%), Gaps = 4/81 (4%)

Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG-ER 165
           LYC+CFA G FC  C+C  C N  E+E       +Q  +RNP AF  KI   +   G + 
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKI---MADAGDDA 263

Query: 166 RHTKGCNCKRSGCLKNYCECY 186
           RHTKGC+CK+S CLK YCEC+
Sbjct: 264 RHTKGCHCKKSACLKKYCECF 284


>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 651

 Score = 79.0 bits (193), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 54/94 (57%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+    + ++  G
Sbjct: 365 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLETG 424

Query: 57  E--------RRHTKGCNCKRSGCLKNYCECYEAG 82
           +         RH +GCNCK+S CLK YCECY+ G
Sbjct: 425 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 458



 Score = 76.6 bits (187), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 41/93 (44%), Positives = 53/93 (56%), Gaps = 12/93 (12%)

Query: 106 KLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVG 161
           K YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+    + ++ 
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLE 422

Query: 162 EGE--------RRHTKGCNCKRSGCLKNYCECY 186
            G+         RH +GCNCK+S CLK YCECY
Sbjct: 423 TGDDASKTPASARHKRGCNCKKSNCLKKYCECY 455


>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
          Length = 305

 Score = 79.0 bits (193), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 56/90 (62%), Gaps = 2/90 (2%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
           CNC KS CLKLYC+CFA+  FC  CNC +C NN EH  +R    +  ++R+P+ F P + 
Sbjct: 189 CNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV- 247

Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                   R + +GC C+++ C+K YC+C+
Sbjct: 248 -VWDTPATRTNLRGCRCQKTRCVKKYCDCF 276



 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA+  FC  CNC +C NN EH  +R    +  ++R+P+ F P +         R +
Sbjct: 200 YCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV--VWDTPATRTN 257

Query: 61  TKGCNCKRSGCLKNYCECYEAG 82
            +GC C+++ C+K YC+C++ G
Sbjct: 258 LRGCRCQKTRCVKKYCDCFQNG 279


>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
           distachyon]
          Length = 787

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 60/105 (57%), Gaps = 14/105 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +Q   RNP AF P
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 534

Query: 154 KIGKCLVGEGER------------RHTKGCNCKRSGCLKNYCECY 186
           K+ + L   G+             RH +GCNCK+S CLK YCECY
Sbjct: 535 KVIR-LSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYCECY 578



 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 52/96 (54%), Gaps = 14/96 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+CFA G +C + C+C  C N   HE+      +Q   RNP AF PK+ + L   G+  
Sbjct: 488 YCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIR-LSDAGQET 546

Query: 59  -----------RHTKGCNCKRSGCLKNYCECYEAGA 83
                      RH +GCNCK+S CLK YCECY+ G 
Sbjct: 547 QEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 582


>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
          Length = 374

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/104 (47%), Positives = 62/104 (59%), Gaps = 14/104 (13%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++ ++R  AI + L RNP+AF    
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228

Query: 152 -----RPKIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
                +P+I +    + E +      KGCNCK+S C K YCECY
Sbjct: 229 TNNGQQPQIIQEEKSQKESQKDITTRKGCNCKKSECKKKYCECY 272



 Score = 63.2 bits (152), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 76/161 (47%), Gaps = 26/161 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFAN   C Q CNC  C N I++ ++R  AI + L RNP+AF       L   G++ 
Sbjct: 180 YCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAF----AAILTNNGQQP 235

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
                               E  ++      I  RK CNC KS C K YC+C++  + C 
Sbjct: 236 QI----------------IQEEKSQKESQKDITTRKGCNCKKSECKKKYCECYSINQRCT 279

Query: 120 Q-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL 159
             C C +C N ++ ++D   AI Q L++  N  +  + + L
Sbjct: 280 DLCKCENCLNKVQPQED---AIEQ-LQKQENPKKESLNQNL 316


>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
          Length = 671

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 39/84 (46%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YCDCFA G FC  C+C SC N     +       Q   RNPNAF  KI        + +H
Sbjct: 359 YCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVD--DDSVDAKH 416

Query: 61  TKGCNCKRSGCLKNYCECYEAGAR 84
            KGC+CK+S CLK YCEC++A  R
Sbjct: 417 AKGCHCKKSACLKKYCECFQANVR 440



 Score = 75.9 bits (185), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 38/80 (47%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
           LYCDCFA G FC  C+C SC N     +       Q   RNPNAF  KI        + +
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVD--DDSVDAK 415

Query: 167 HTKGCNCKRSGCLKNYCECY 186
           H KGC+CK+S CLK YCEC+
Sbjct: 416 HAKGCHCKKSACLKKYCECF 435


>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
          Length = 612

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 49/99 (49%), Positives = 60/99 (60%), Gaps = 9/99 (9%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
           C+C  S CLKLYC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+ 
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVL 233

Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                    K   G+   +HTKGC CKR+  LK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTEYLKKYCECF 272



 Score = 65.5 bits (158), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 43/90 (47%), Positives = 54/90 (60%), Gaps = 9/90 (10%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
           YC+CF  G +C  CNC +C NN+ HE+ R  AI   LERNP AF PK+          K 
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNSTAHNCESKA 244

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
             G+   +HTKGC CKR+  LK YCEC++A
Sbjct: 245 AEGDIVGKHTKGCKCKRTEYLKKYCECFKA 274


>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
 gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
 gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
 gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 639

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C KS CLKLYC+CF+ G FC + C+C +CFN   HED    +      RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512

Query: 154 KIGKC------LVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
           K+         L  E  +     RH +GCNC++SGC K YCEC+
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECF 556



 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 53
           YC+CF+ G FC + C+C +CFN   HED    +      RNP AF PK+         L 
Sbjct: 466 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 525

Query: 54  GEGER-----RHTKGCNCKRSGCLKNYCECYEAGA 83
            E  +     RH +GCNC++SGC K YCEC+  G 
Sbjct: 526 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGV 560


>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 766

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF P
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 530

Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
           K+    G  L  GE         RH +GCNCKRS C+K YCEC+
Sbjct: 531 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 574



 Score = 66.6 bits (161), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
           YC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF PK+    G  L  G
Sbjct: 484 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 543

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCKRS C+K YCEC+++G 
Sbjct: 544 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 578


>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 757

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 521

Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
           K+    G  L  GE         RH +GCNCKRS C+K YCEC+
Sbjct: 522 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 565



 Score = 66.6 bits (161), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
           YC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF PK+    G  L  G
Sbjct: 475 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 534

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCKRS C+K YCEC+++G 
Sbjct: 535 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 569


>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
          Length = 644

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF P
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 408

Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
           K+    G  L  GE         RH +GCNCKRS C+K YCEC+
Sbjct: 409 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 452



 Score = 66.6 bits (161), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
           YC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF PK+    G  L  G
Sbjct: 362 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 421

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCKRS C+K YCEC+++G 
Sbjct: 422 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 456


>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
 gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
          Length = 646

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 38/71 (53%), Positives = 49/71 (69%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           L+ N ++ +K CNC  S CLKLYC+C+A G +C  CNC +C NN+ +E  R  AI   LE
Sbjct: 139 LKDNTLKKQKRCNCKNSKCLKLYCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLE 198

Query: 146 RNPNAFRPKIG 156
           +NPNAFRPKI 
Sbjct: 199 KNPNAFRPKIA 209



 Score = 59.3 bits (142), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 27/49 (55%), Positives = 34/49 (69%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
           YC+C+A G +C  CNC +C NN+ +E  R  AI   LE+NPNAFRPKI 
Sbjct: 161 YCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIA 209


>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 886

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/238 (29%), Positives = 102/238 (42%), Gaps = 65/238 (27%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
           YCDCF +  FC + C C  C N +E+     +R  A+   LER P AF  + G       
Sbjct: 22  YCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAFGRESGAF----A 77

Query: 57  ERRHTK--GCNCKRSGCLKNYCECYEAGAR-------------------------TLEPN 89
           ERR+ K  GC C ++ CLK YC C+ A  +                         T+E  
Sbjct: 78  ERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGTIRIEDGVTCTIEGG 137

Query: 90  GIRPRK------------------PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNI 130
            +   K                   C+CT++ CLKLYCDCF     C   C+C  C N+ 
Sbjct: 138 KVNALKGAAFVPIRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSE 197

Query: 131 EHE---DDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
           E+     +R   I+  LE+ P+ F+           +R+   GC+C+++ C   YC+C
Sbjct: 198 EYSGPMGERTLVIKDILEKRPHIFQKT--------AKRKAGGGCSCQKNKCRAGYCDC 247



 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 10/96 (10%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHE---DDRHHAIRQCLERNPNAFR 152
           C CTK+ CL LYCDCF +  FC + C C  C N +E+     +R  A+   LER P AF 
Sbjct: 11  CKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAFG 70

Query: 153 PKIGKCLVGEGERRHTK--GCNCKRSGCLKNYCECY 186
            + G       ERR+ K  GC C ++ CLK YC C+
Sbjct: 71  RESGAF----AERRNKKDGGCTCAKNSCLKKYCVCF 102



 Score = 38.9 bits (89), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 12/88 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
           YCDCF     C   C+C  C N+ E+     +R   I+  LE+ P+ F+           
Sbjct: 174 YCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKRPHIFQKT--------A 225

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
           +R+   GC+C+++ C   YC+C  +G +
Sbjct: 226 KRKAGGGCSCQKNKCRAGYCDCKGSGGK 253


>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 782

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 57/104 (54%), Gaps = 12/104 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA G +C + C+C  C N   HE+      +    RNP AF P
Sbjct: 473 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAP 532

Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
           K+ +      E            RH +GCNCK+S CLK YCECY
Sbjct: 533 KVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYCECY 576



 Score = 65.9 bits (159), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 49/95 (51%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+CFA G +C + C+C  C N   HE+      +    RNP AF PK+ +      E  
Sbjct: 486 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQ 545

Query: 59  ----------RHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S CLK YCECY+ G 
Sbjct: 546 EDPKNTPASARHKRGCNCKKSSCLKKYCECYQGGV 580


>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 766

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 57/102 (55%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CF  G FC + C+C  C N   + +      +Q   RNP AF PK+
Sbjct: 461 CSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIESRNPLAFAPKV 520

Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
            +       +GE         RH +GCNCK+S CLK YCECY
Sbjct: 521 IRTSEPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECY 562



 Score = 65.9 bits (159), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CF  G FC + C+C  C N   + +      +Q   RNP AF PK+ +       +G
Sbjct: 472 YCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIESRNPLAFAPKVIRTSEPGQELG 531

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 532 EYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGV 566


>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
          Length = 662

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 44/115 (38%), Positives = 58/115 (50%), Gaps = 12/115 (10%)

Query: 84  RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQ 142
           R L+ +     + CNC +S CLKLYC CFA   +C   C+C  C NN  HE+      ++
Sbjct: 354 RKLQKDDGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKR 413

Query: 143 CLERNPNAFRPKI----GKCL-------VGEGERRHTKGCNCKRSGCLKNYCECY 186
               NP AF P +    G  L             RH +GCNC++S CLK YCEC+
Sbjct: 414 TESHNPLAFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECF 468



 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 44/131 (33%), Positives = 57/131 (43%), Gaps = 24/131 (18%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA   +C   C+C  C NN  HE+      ++    NP AF P +           
Sbjct: 378 YCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNPLAFAPTV----------T 427

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG-EFC 118
           HT G             E  +   +T  P   R ++ CNC KS CLK YC+CF +G   C
Sbjct: 428 HTYG----------TVLEFGDDSNKT--PASARHKRGCNCRKSSCLKKYCECFQSGVGCC 475

Query: 119 YQCNCNSCFNN 129
             C C SC N+
Sbjct: 476 ISCRCMSCKNS 486



 Score = 47.8 bits (112), Expect = 0.002,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 44/112 (39%), Gaps = 36/112 (32%)

Query: 7   NGEFCYQCN-------------------------CNSCFNNIEHEDDRHHAIRQCLERNP 41
           +GE C  CN                         C  C NN  HE+      ++    NP
Sbjct: 360 DGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNP 419

Query: 42  NAFRPKI----GKCL-------VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
            AF P +    G  L             RH +GCNC++S CLK YCEC+++G
Sbjct: 420 LAFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSG 471


>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
 gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
          Length = 808

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 59/123 (47%), Gaps = 31/123 (25%)

Query: 95  KPCNCTKSMCLKL-------------------YCDCFANGEFCYQ-CNCNSCFNNIEHED 134
           K CNC KS CLKL                   YC+CFA G +C + C+C  CFN   HED
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHED 547

Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYC 183
                 +Q   RNP AF PK+ +      E            RH +GCNCK+S CLK YC
Sbjct: 548 TVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYC 607

Query: 184 ECY 186
           ECY
Sbjct: 608 ECY 610



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 52/98 (53%), Gaps = 12/98 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +      E  
Sbjct: 520 YCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRSADSVPETG 579

Query: 59  ----------RHTKGCNCKRSGCLKNYCECYEAGARTL 86
                     RH +GCNCK+S CLK YCECY+    T+
Sbjct: 580 IDPNKTPASARHKRGCNCKKSNCLKKYCECYQVLILTI 617


>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
 gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
          Length = 587

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 57/102 (55%), Gaps = 12/102 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA G +C + C+C  C NN  H +      +Q   RNP AF PK+
Sbjct: 462 CSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKV 521

Query: 156 ------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
                 G  L           RH +GCNCK+S CLK YCEC+
Sbjct: 522 IHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 563



 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 50/95 (52%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL- 52
           YC+CFA G +C + C+C  C NN  H +      +Q   RNP AF PK+      G  L 
Sbjct: 473 YCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKVIHTSEPGMDLG 532

Query: 53  ----VGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S CLK YCEC++ G 
Sbjct: 533 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGV 567


>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 978

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 45/98 (45%), Positives = 64/98 (65%), Gaps = 12/98 (12%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC K+ CLKLYC+CFA   +C + C+C+ C N  E+ED+R +AI++   RN +AF+PK 
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464

Query: 156 GKCLVGEGERRH-------TKGCNCKRSGCLKNYCECY 186
                 E ER++        KGCNC+++ CLK YCEC+
Sbjct: 465 E----SEQERKNKANNEQMKKGCNCRKTKCLKKYCECF 498



 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 57/91 (62%), Gaps = 12/91 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA   +C + C+C+ C N  E+ED+R +AI++   RN +AF+PK       E ER+
Sbjct: 416 YCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKTE----SEQERK 471

Query: 60  HT-------KGCNCKRSGCLKNYCECYEAGA 83
           +        KGCNC+++ CLK YCEC+ AG 
Sbjct: 472 NKANNEQMKKGCNCRKTKCLKKYCECFNAGT 502


>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
           gaditana CCMP526]
          Length = 365

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/171 (34%), Positives = 71/171 (41%), Gaps = 42/171 (24%)

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
           RR       +RS  L         G  T      R  K CNC KS CLKLYC+CFA G +
Sbjct: 123 RRQNPSQEARRSPPLNGSGGGTPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAY 182

Query: 118 CY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------------------- 155
           C   C C  C N +  E  R  A+   L RNP+AF  KI                     
Sbjct: 183 CRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSL 242

Query: 156 --GKCLVGE-GERR-----------------HTKGCNCKRSGCLKNYCECY 186
             G+C VG+ GE                   H +GCNCK+S C+K YCEC+
Sbjct: 243 PGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIKKYCECF 293



 Score = 69.7 bits (169), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 65/151 (43%), Gaps = 18/151 (11%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
           YC+CFA G +C   C C  C N +  E  R  A+   L RNP+AF  KI        G  
Sbjct: 173 YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPV 232

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPR----KPCNCTKSMCLKL 107
            +G   R    G  C     + +  E    G       G  P     + CNC KS+C+K 
Sbjct: 233 AMGTHRRTSLPGGQCS----VGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIKK 288

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
           YC+CF  G +C + C C  C NN E     H
Sbjct: 289 YCECFFAGVYCGENCQCEGCQNNPEASAGLH 319


>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
          Length = 658

 Score = 76.6 bits (187), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 41/94 (43%), Positives = 51/94 (54%), Gaps = 12/94 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +      E  
Sbjct: 372 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 431

Query: 59  ----------RHTKGCNCKRSGCLKNYCECYEAG 82
                     RH +GCNCK+S CLK YCECY+ G
Sbjct: 432 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 465



 Score = 74.3 bits (181), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 41/93 (44%), Positives = 50/93 (53%), Gaps = 12/93 (12%)

Query: 106 KLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
           K YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +      E
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQE 429

Query: 165 R-----------RHTKGCNCKRSGCLKNYCECY 186
                       RH +GCNCK+S CLK YCECY
Sbjct: 430 TGDDASKTPASARHKRGCNCKKSNCLKKYCECY 462


>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
          Length = 265

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 50/112 (44%), Positives = 62/112 (55%), Gaps = 22/112 (19%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++ ++R  A  + L RNP+AF    
Sbjct: 52  CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAIL 111

Query: 152 -----RPKI--GKCLVGEGERRHT----------KGCNCKRSGCLKNYCECY 186
                +P+I   K L+   E +            KGCNCK+S C K YCECY
Sbjct: 112 TNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECY 163



 Score = 60.8 bits (146), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 22/106 (20%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---------RPKI-- 48
           YC+CFAN   C Q CNC  C N I++ ++R  A  + L RNP+AF         +P+I  
Sbjct: 63  YCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAILTNNGQQPQIIQ 122

Query: 49  GKCLVGEGERRHTK----------GCNCKRSGCLKNYCECYEAGAR 84
            K L+   E +  K          GCNCK+S C K YCECY    +
Sbjct: 123 DKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECYSINQK 168


>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
          Length = 717

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 63/142 (44%), Gaps = 43/142 (30%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
           GE    K CNCK+S CLK YCEC+ AG   +EP                           
Sbjct: 417 GETEGCKRCNCKKSKCLKLYCECFAAGVYCVEP--------------------------- 449

Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------GE 164
                C+C  CFN   HED      +Q   RNP AF PK+ +       VG+        
Sbjct: 450 -----CSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPAS 504

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH +GCNCK+S CLK YCECY
Sbjct: 505 ARHKRGCNCKKSSCLKKYCECY 526



 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       VG
Sbjct: 436 YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVG 495

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 496 DEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 530


>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
          Length = 738

 Score = 76.3 bits (186), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 63/186 (33%), Positives = 77/186 (41%), Gaps = 45/186 (24%)

Query: 14  CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG--KCLVGEGERRHTKGCNCKRSGC 71
            N +   N+ + E D      Q LE    A    I   + L   GE    K CNCK+S C
Sbjct: 394 VNKSLALNSSDRETDPAENGFQALEDASQASAYAISEERRLEHGGETEGCKRCNCKKSKC 453

Query: 72  LKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIE 131
           LK YCEC+ AG   +EP                                C+C  CFN   
Sbjct: 454 LKLYCECFAAGVYCVEP--------------------------------CSCQECFNKPI 481

Query: 132 HEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------GERRHTKGCNCKRSGCLK 180
           HED      +Q   RNP AF PK+ +       VG+         RH +GCNCK+S CLK
Sbjct: 482 HEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLK 541

Query: 181 NYCECY 186
            YCECY
Sbjct: 542 KYCECY 547



 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       VG
Sbjct: 457 YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVG 516

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 517 DEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 551


>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
          Length = 373

 Score = 75.9 bits (185), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 49/105 (46%), Positives = 58/105 (55%), Gaps = 15/105 (14%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA----F 151
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++  +R  AI + L RN +A    F
Sbjct: 188 CNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQCF 247

Query: 152 RPKIGKCLVGEGERRHT----------KGCNCKRSGCLKNYCECY 186
             K G   V + +              KGCNCK+SGC K YCECY
Sbjct: 248 STKGGAQFVQQDKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECY 292



 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 50/99 (50%), Gaps = 15/99 (15%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF----RPKIGKCLVGE 55
           YC+CFAN   C Q CNC  C N I++  +R  AI + L RN +AF      K G   V +
Sbjct: 199 YCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQCFSTKGGAQFVQQ 258

Query: 56  GERRHT----------KGCNCKRSGCLKNYCECYEAGAR 84
            +              KGCNCK+SGC K YCECY    R
Sbjct: 259 DKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIR 297


>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
          Length = 774

 Score = 75.9 bits (185), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 63/142 (44%), Gaps = 43/142 (30%)

Query: 56  GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
           GE    K CNCK+S CLK YCEC+ AG   +EP                           
Sbjct: 480 GETEGCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 512

Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVGE------GE 164
                C+C  CFN   HED      +Q   RNP AF PK+ +       +G+        
Sbjct: 513 -----CSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPAS 567

Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
            RH +GCNCK+S CLK YCECY
Sbjct: 568 ARHKRGCNCKKSSCLKKYCECY 589



 Score = 75.5 bits (184), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA G +C + C+C  CFN   HED      +Q   RNP AF PK+ +       +G
Sbjct: 499 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 558

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 559 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 593


>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 365

 Score = 75.5 bits (184), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 71/171 (41%), Gaps = 42/171 (24%)

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
           RR       +RS  L         G  T      R  K CNC KS CLKLYC+CFA G +
Sbjct: 123 RRQNPSQEARRSPPLNGSGGGTPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAY 182

Query: 118 CY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------------------- 155
           C   C C  C N +  E  R  A+   L RNP+AF  KI                     
Sbjct: 183 CRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSL 242

Query: 156 --GKCLVGE-GERR-----------------HTKGCNCKRSGCLKNYCECY 186
             G+C VG+ GE                   H +GCNCK+S C++ YCEC+
Sbjct: 243 PGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIQKYCECF 293



 Score = 68.6 bits (166), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 65/151 (43%), Gaps = 18/151 (11%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
           YC+CFA G +C   C C  C N +  E  R  A+   L RNP+AF  KI        G  
Sbjct: 173 YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPV 232

Query: 52  LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPR----KPCNCTKSMCLKL 107
            +G   R    G  C     + +  E    G       G  P     + CNC KS+C++ 
Sbjct: 233 AMGTHRRTSLPGGQCS----VGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIQK 288

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
           YC+CF  G +C + C C  C NN E     H
Sbjct: 289 YCECFFAGVYCGENCQCEGCQNNPEASAGLH 319


>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
 gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
          Length = 473

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 59/122 (48%), Gaps = 29/122 (23%)

Query: 94  RKPCNCTKSMCLKL------------------YCDCFANGEFCYQCNCNSCFNNIEHEDD 135
           +K CNC  S CLKL                  YC+CFA G +C  C+C  C N + +E  
Sbjct: 80  KKHCNCKNSQCLKLACSSILDLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESA 139

Query: 136 RHHAIRQCLERNPNAFRPKI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCE 184
           R  AI    +RNP AF+PKI                     +H KGC+CK+SGCLK YCE
Sbjct: 140 RQEAINSTKQRNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCE 199

Query: 185 CY 186
           C+
Sbjct: 200 CF 201



 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
           YC+CFA G +C  C+C  C N + +E  R  AI    +RNP AF+PKI            
Sbjct: 112 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 171

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 172 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 203


>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
 gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 84

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 50/83 (60%), Gaps = 1/83 (1%)

Query: 104 CLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 163
           CLKLYCDCFA G+FC  C+C SC N  E+ D           R+P AF  KI     G G
Sbjct: 1   CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMD-APGGG 59

Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
             +H +GCNCKRS CLK YCEC+
Sbjct: 60  GGKHKRGCNCKRSHCLKKYCECF 82



 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 1  YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
          YCDCFA G+FC  C+C SC N  E+ D           R+P AF  KI     G G  +H
Sbjct: 5  YCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMD-APGGGGGKH 63

Query: 61 TKGCNCKRSGCLKNYCECYEA 81
           +GCNCKRS CLK YCEC++ 
Sbjct: 64 KRGCNCKRSHCLKKYCECFQV 84


>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
          Length = 387

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/116 (41%), Positives = 58/116 (50%), Gaps = 30/116 (25%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFAN   C Q CNC  C N I++  +R  AI + L RN +AF    
Sbjct: 195 CNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAF---- 250

Query: 156 GKCLVGEGERRHT-------------------------KGCNCKRSGCLKNYCECY 186
            +C   +G  +                           KGCNCK+SGC K YCECY
Sbjct: 251 AQCFTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECY 306



 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 50/110 (45%), Gaps = 30/110 (27%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFAN   C Q CNC  C N I++  +R  AI + L RN +AF     +C   +G  +
Sbjct: 206 YCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAF----AQCFTTKGAPQ 261

Query: 60  HT-------------------------KGCNCKRSGCLKNYCECYEAGAR 84
                                      KGCNCK+SGC K YCECY    +
Sbjct: 262 FVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECYSQNLK 311


>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
          subellipsoidea C-169]
          Length = 90

 Score = 73.9 bits (180), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/90 (53%), Positives = 58/90 (64%), Gaps = 10/90 (11%)

Query: 1  YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
          YC+CFA+G +C  CNC +C NN E E  R  A+   LERNPNAFRPKI         G  
Sbjct: 1  YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60

Query: 52 LVGEG-ERRHTKGCNCKRSGCLKNYCECYE 80
          +  +G   RH KGCNCK+SGCLK YCEC++
Sbjct: 61 VAIQGAAARHNKGCNCKKSGCLKKYCECFQ 90



 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 10/89 (11%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 158
           YC+CFA+G +C  CNC +C NN E E  R  A+   LERNPNAFRPKI         G  
Sbjct: 1   YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60

Query: 159 LVGEG-ERRHTKGCNCKRSGCLKNYCECY 186
           +  +G   RH KGCNCK+SGCLK YCEC+
Sbjct: 61  VAIQGAAARHNKGCNCKKSGCLKKYCECF 89


>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 902

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 58/100 (58%), Gaps = 12/100 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K C+C  S CLK+YC+CFA    C  C C  C NN  + +    A  Q LERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419

Query: 155 I--------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           I         + ++     +H+KGC+C++S CLK YCEC+
Sbjct: 420 IKPSNSKINNQIIID----KHSKGCHCRKSSCLKKYCECF 455



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 51/90 (56%), Gaps = 12/90 (13%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKCL 52
           YC+CFA    C  C C  C NN  + +    A  Q LERNP+AF PKI         + +
Sbjct: 373 YCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPKIKPSNSKINNQII 432

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
           +     +H+KGC+C++S CLK YCEC++A 
Sbjct: 433 ID----KHSKGCHCRKSSCLKKYCECFQAS 458


>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 684

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 75/173 (43%), Gaps = 46/173 (26%)

Query: 26  EDDRHHAIRQCLERNPNAFR-PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
           ++    A  + +E  P A   P+      GEGE    K CNCK+S CLK YCEC+ AG  
Sbjct: 355 QETSDQAENEPVEEIPKALAFPERKSEQAGEGE--SCKRCNCKKSKCLKLYCECFAAGVY 412

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
            +EP                                C+C  CFN   HE+      +Q  
Sbjct: 413 CIEP--------------------------------CSCIDCFNKPIHEETVLATRKQIE 440

Query: 145 ERNPNAFRPKI---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
            RNP AF PK+      ++  G+         RH +GCNCK+S CLK YCECY
Sbjct: 441 SRNPLAFAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECY 493



 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
           YC+CFA G +C + C+C  CFN   HE+      +Q   RNP AF PK+      ++  G
Sbjct: 403 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRSADSIMEAG 462

Query: 57  E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 463 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 497


>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
 gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
          Length = 414

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 52/95 (54%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
           YC+CFA G +C + C+C  C NN  HED      RQ   RNP AF PK+ +        G
Sbjct: 141 YCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVSEFG 200

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           E         RH +GCNCK+S CLK YCEC++ G 
Sbjct: 201 EETNKTPASARHKRGCNCKKSSCLKKYCECFQGGV 235



 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----C 158
           +K YC+CFA G +C + C+C  C NN  HED      RQ   RNP AF PK+ +      
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197

Query: 159 LVGE------GERRHTKGCNCKRSGCLKNYCECY 186
             GE         RH +GCNCK+S CLK YCEC+
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECF 231


>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score = 73.2 bits (178), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/109 (44%), Positives = 61/109 (55%), Gaps = 17/109 (15%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAF 151
           RK CNC ++ CLKLYCDC A G +C   C C++C N  E+E      IR+ +E R+P AF
Sbjct: 14  RKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAF 72

Query: 152 RPKI--------GKCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
            P+I           L  +G        RH +GCNCK+S C K YCECY
Sbjct: 73  APRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECY 121



 Score = 57.4 bits (137), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/97 (40%), Positives = 51/97 (52%), Gaps = 17/97 (17%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKI--------GK 50
           YCDC A G +C   C C++C N  E+E      IR+ +E R+P AF P+I          
Sbjct: 28  YCDCLAAGVYCTDSCACSNCLNKSENEG-VVQIIREKIESRDPLAFAPRIVNPDTDTTDN 86

Query: 51  CLVGEG------ERRHTKGCNCKRSGCLKNYCECYEA 81
            L  +G        RH +GCNCK+S C K YCECY+ 
Sbjct: 87  VLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQV 123


>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
          Length = 396

 Score = 73.2 bits (178), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 41/91 (45%), Positives = 59/91 (64%), Gaps = 6/91 (6%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CF + E+C  +CNC +C+NN +HE +R  A+++   RNP+ F    
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCF---- 206

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +  V E +  +  GC C+R+ C K YCEC+
Sbjct: 207 -ENHVDEKKGVNKSGCRCRRTHCDKKYCECF 236



 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 51/84 (60%), Gaps = 6/84 (7%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CF + E+C  +CNC +C+NN +HE +R  A+++   RNP+ F     +  V E +  
Sbjct: 162 YCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCF-----ENHVDEKKGV 216

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           +  GC C+R+ C K YCEC+  G 
Sbjct: 217 NKSGCRCRRTHCDKKYCECFSHGV 240


>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
          Length = 437

 Score = 72.8 bits (177), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
           YC+CFA G +C  C+C  C N + +E  R  AI    +RNP AF+PKI            
Sbjct: 76  YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+CK+SGCLK YCEC++A
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 167



 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 49/90 (54%), Gaps = 11/90 (12%)

Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 156
           YC+CFA G +C  C+C  C N + +E  R  AI    +RNP AF+PKI            
Sbjct: 76  YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135

Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                    +H KGC+CK+SGCLK YCEC+
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECF 165


>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 742

 Score = 72.4 bits (176), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 44/93 (47%), Positives = 54/93 (58%), Gaps = 8/93 (8%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           KPCNC  S CLK YCDCFA G +C  +C C  C N  E+ED+R  AI++  +R  +AF P
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLP 559

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            I           H KGC+CK S C K YC C+
Sbjct: 560 VI-------QNNSHIKGCHCKNSHCQKKYCVCH 585



 Score = 59.3 bits (142), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 8/84 (9%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA G +C  +C C  C N  E+ED+R  AI++  +R  +AF P I           
Sbjct: 513 YCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLPVIQN-------NS 565

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
           H KGC+CK S C K YC C++ G 
Sbjct: 566 HIKGCHCKNSHCQKKYCVCHQNGV 589


>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 932

 Score = 72.0 bits (175), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 51/85 (60%), Gaps = 8/85 (9%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE--GE 57
           YCDCFA G FC  +C C  C N+ ++++    A +   +R+  +     G+ L GE    
Sbjct: 822 YCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTS-----GEALNGEPSAS 876

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAG 82
            R ++GC CKR+GCLK YCEC++AG
Sbjct: 877 ERSSRGCRCKRTGCLKRYCECFQAG 901



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 35/85 (41%), Positives = 50/85 (58%), Gaps = 8/85 (9%)

Query: 105 LKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE- 162
           LK YCDCFA G FC  +C C  C N+ ++++    A +   +R+  +     G+ L GE 
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTS-----GEALNGEP 873

Query: 163 -GERRHTKGCNCKRSGCLKNYCECY 186
               R ++GC CKR+GCLK YCEC+
Sbjct: 874 SASERSSRGCRCKRTGCLKRYCECF 898



 Score = 39.3 bits (90), Expect = 0.73,   Method: Composition-based stats.
 Identities = 27/97 (27%), Positives = 34/97 (35%), Gaps = 40/97 (41%)

Query: 72  LKNYCECYEAG--------------------------------------ARTLEPNGI-R 92
           LK YC+C+ AG                                      A   EP+   R
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTSGEALNGEPSASER 878

Query: 93  PRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
             + C C ++ CLK YC+CF  G  C  QC C  C N
Sbjct: 879 SSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLN 915


>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
          Length = 760

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 67/153 (43%), Gaps = 44/153 (28%)

Query: 46  PKIGKC-LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMC 104
           PK  +C L   GE    K CNCK+S CLK YCEC+ AG   +EP                
Sbjct: 458 PKKKRCKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP---------------- 501

Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL----- 159
                           C C+ CFN   H +      +Q   RNP AF PK+ +       
Sbjct: 502 ----------------CACHDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSE 545

Query: 160 VGE------GERRHTKGCNCKRSGCLKNYCECY 186
           +G+         RH +GCNCK+S CLK YCECY
Sbjct: 546 IGDDPNKTPASARHKRGCNCKKSSCLKKYCECY 578



 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C C+ CFN   H +      +Q   RNP AF PK+ +       +G
Sbjct: 488 YCECFAAGVYCIEPCACHDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIG 547

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 548 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 582


>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
           sativa Japonica Group]
          Length = 508

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 16/104 (15%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           RK CNC  S     YC+CFA   +C  C+C+ C N IE+E+ R  AI   L RNP AF+P
Sbjct: 122 RKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQP 176

Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
           KI         R           +H KGC+C++S CLK YCEC+
Sbjct: 177 KIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECF 220



 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 53/92 (57%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA   +C  C+C+ C N IE+E+ R  AI   L RNP AF+PKI         R  
Sbjct: 131 YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKD 190

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+C++S CLK YCEC++A
Sbjct: 191 NSEAIPPIPKHNKGCHCRKSECLKKYCECFQA 222


>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
 gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
          Length = 144

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/102 (42%), Positives = 50/102 (49%), Gaps = 19/102 (18%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YC+CFA G +C   C C  CFN  E  D      +Q   RNP AF PKI +         
Sbjct: 1   YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60

Query: 51  ---------CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                    C    G  RH +GCNCK+S CLK YCECY+AG 
Sbjct: 61  VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGV 102



 Score = 66.2 bits (160), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 47/98 (47%), Gaps = 19/98 (19%)

Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 157
           YC+CFA G +C   C C  CFN  E  D      +Q   RNP AF PKI +         
Sbjct: 1   YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60

Query: 158 ---------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                    C    G  RH +GCNCK+S CLK YCECY
Sbjct: 61  VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECY 98


>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
          Length = 274

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 53/92 (57%), Gaps = 11/92 (11%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
           YC+CFA   +C  C+C+ C N IE+E+ R  AI   L RNP AF+PKI         R  
Sbjct: 88  YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKD 147

Query: 59  ---------RHTKGCNCKRSGCLKNYCECYEA 81
                    +H KGC+C++S CLK YCEC++A
Sbjct: 148 NSEAIPPIPKHNKGCHCRKSECLKKYCECFQA 179



 Score = 68.6 bits (166), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 11/95 (11%)

Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           +    YC+CFA   +C  C+C+ C N IE+E+ R  AI   L RNP AF+PKI       
Sbjct: 83  LVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTV 142

Query: 163 GER-----------RHTKGCNCKRSGCLKNYCECY 186
             R           +H KGC+C++S CLK YCEC+
Sbjct: 143 TVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECF 177


>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
 gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
          Length = 155

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/115 (38%), Positives = 57/115 (49%), Gaps = 25/115 (21%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA+G +C   C C +C N  EHED      +    RNP AF P++
Sbjct: 2   CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV 61

Query: 156 G---------KCLVGEGE---------------RRHTKGCNCKRSGCLKNYCECY 186
                     +  V                    +H +GCNCK+S CLK YCEC+
Sbjct: 62  ISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECF 116



 Score = 52.4 bits (124), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA+G +C   C C +C N  EHED      +    RNP AF P++    +   E  
Sbjct: 13  YCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV----ISPAEV- 67

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
             + C    S    N        A  +     + ++ CNC KS+CLK YC+CF +   C 
Sbjct: 68  -VRVCQAFVSS---NSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEVGCS 123

Query: 120 Q-CNCNSCFNNI 130
             C C  C N  
Sbjct: 124 DACKCRGCKNTF 135


>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
 gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
          Length = 141

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/115 (38%), Positives = 57/115 (49%), Gaps = 25/115 (21%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC+CFA+G +C   C C +C N  EHED      +    RNP AF P++
Sbjct: 2   CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV 61

Query: 156 G---------KCLVGEGE---------------RRHTKGCNCKRSGCLKNYCECY 186
                     +  V                    +H +GCNCK+S CLK YCEC+
Sbjct: 62  ISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECF 116



 Score = 52.4 bits (124), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 59/132 (44%), Gaps = 11/132 (8%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA+G +C   C C +C N  EHED      +    RNP AF P++    +   E  
Sbjct: 13  YCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV----ISPAE-- 66

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
             + C    S    N        A  +     + ++ CNC KS+CLK YC+CF +   C 
Sbjct: 67  VVRVCQAFVSS---NSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEVGCS 123

Query: 120 Q-CNCNSCFNNI 130
             C C  C N  
Sbjct: 124 DACKCRGCKNTF 135


>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 62/145 (42%), Gaps = 45/145 (31%)

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
            GEGE    K CNCK+S CLK YCEC+ AG   +EP                        
Sbjct: 225 AGEGE--SCKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 258

Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVG 161
                   C+C  CFN   HE+      +Q   RNP AF PK+ +               
Sbjct: 259 --------CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKT 310

Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
               RH +GCNCK+S C+K YCECY
Sbjct: 311 PASARHKRGCNCKKSNCMKKYCECY 335



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YC+CFA G +C + C+C  CFN   HE+      +Q   RNP AF PK+ +         
Sbjct: 245 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 304

Query: 51  --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S C+K YCECY+ G 
Sbjct: 305 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGGV 339


>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 62/145 (42%), Gaps = 45/145 (31%)

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
            GEGE    K CNCK+S CLK YCEC+ AG   +EP                        
Sbjct: 225 AGEGE--SCKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 258

Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVG 161
                   C+C  CFN   HE+      +Q   RNP AF PK+ +               
Sbjct: 259 --------CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKT 310

Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
               RH +GCNCK+S C+K YCECY
Sbjct: 311 PASARHKRGCNCKKSNCMKKYCECY 335



 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
           YC+CFA G +C + C+C  CFN   HE+      +Q   RNP AF PK+ +         
Sbjct: 245 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 304

Query: 51  --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                     RH +GCNCK+S C+K YCECY+ G 
Sbjct: 305 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGGV 339


>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
          Length = 524

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/48 (66%), Positives = 36/48 (75%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           ++PCNCTKS CLKLYCDCFANGEFCY CNC  C N  + +  R  AIR
Sbjct: 326 KRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIR 373



 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 23/34 (67%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 34
           YCDCFANGEFCY CNC  C N  + +  R  AIR
Sbjct: 340 YCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIR 373


>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
          Length = 774

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 63/145 (43%), Gaps = 45/145 (31%)

Query: 53  VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
            GEGE    K CNCK+S CLK YCEC+ AG   +EP                        
Sbjct: 481 AGEGES--CKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 514

Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE----- 162
                   C C  CFN   H +      +Q   RNP AF PK+ +       +G+     
Sbjct: 515 --------CACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDLNKT 566

Query: 163 -GERRHTKGCNCKRSGCLKNYCECY 186
               RH +GCNCK+S CLK YCECY
Sbjct: 567 PASARHKRGCNCKKSSCLKKYCECY 591



 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
           YC+CFA G +C + C C  CFN   H +      +Q   RNP AF PK+ +       +G
Sbjct: 501 YCECFAAGVYCIEPCACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIG 560

Query: 55  E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
           +         RH +GCNCK+S CLK YCECY+ G 
Sbjct: 561 DDLNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 595


>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
          Length = 142

 Score = 70.5 bits (171), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 57/99 (57%), Gaps = 3/99 (3%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           IR  + C C KS CL+LYC+CF NG FC + C C +C N   ++     A ++   RNP+
Sbjct: 9   IRKVEFCKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPD 68

Query: 150 AFRPK--IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           AF  K  + K    EG   H KGCNC ++ C K YCEC+
Sbjct: 69  AFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECF 107



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK--IGKCLVGEGE 57
           YC+CF NG FC + C C +C N   ++     A ++   RNP+AF  K  + K    EG 
Sbjct: 26  YCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPDAFSQKFLVVKQNQYEGI 85

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAGARTLE 87
             H KGCNC ++ C K YCEC+ AG +  E
Sbjct: 86  VSHKKGCNCTKTQCTKKYCECFNAGIKCTE 115


>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 490

 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 50/98 (51%), Gaps = 8/98 (8%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C K+ CLKLYC CF    +C   C C  C N    ED R +A++  L RNP+AF  K 
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276

Query: 156 GKCLVGEGERR-------HTKGCNCKRSGCLKNYCECY 186
            K  V             H  GC C++S C+K YCECY
Sbjct: 277 QKTPVDGATVETPSKLLTHKLGCKCRKSACMKKYCECY 314



 Score = 55.8 bits (133), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/87 (39%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF    +C   C C  C N    ED R +A++  L RNP+AF  K  K  V      
Sbjct: 228 YCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKFQKTPVDGATVE 287

Query: 60  -------HTKGCNCKRSGCLKNYCECY 79
                  H  GC C++S C+K YCECY
Sbjct: 288 TPSKLLTHKLGCKCRKSACMKKYCECY 314


>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
           trifallax]
          Length = 600

 Score = 68.9 bits (167), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 7/90 (7%)

Query: 104 CLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDR-HHAIRQCLERNPNAFRPKI-----G 156
           CLKLYCDCFA G  C   C C +C N  ++++     A ++ + RNP AF+PK+      
Sbjct: 455 CLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGPA 514

Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +    + +++H KGCNCK SGC KNYCEC+
Sbjct: 515 QMSQNDADKQHRKGCNCKNSGCQKNYCECF 544



 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 7/89 (7%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDR-HHAIRQCLERNPNAFRPKI-----GKCLV 53
           YCDCFA G  C   C C +C N  ++++     A ++ + RNP AF+PK+      +   
Sbjct: 459 YCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGPAQMSQ 518

Query: 54  GEGERRHTKGCNCKRSGCLKNYCECYEAG 82
            + +++H KGCNCK SGC KNYCEC++ G
Sbjct: 519 NDADKQHRKGCNCKNSGCQKNYCECFQFG 547


>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 113

 Score = 68.9 bits (167), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 41/95 (43%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1  YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
          YC+CFA   FC   C C +CFN  E+E    +  +Q   R+P AF PKI +         
Sbjct: 1  YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60

Query: 52 ---LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
             L      RH +GCNCK+S CLK YCECY+AG 
Sbjct: 61 DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGV 95



 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 48/91 (52%), Gaps = 12/91 (13%)

Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 158
           YC+CFA   FC   C C +CFN  E+E    +  +Q   R+P AF PKI +         
Sbjct: 1   YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60

Query: 159 ---LVGEGERRHTKGCNCKRSGCLKNYCECY 186
              L      RH +GCNCK+S CLK YCECY
Sbjct: 61  DDALDTPASARHKRGCNCKKSLCLKKYCECY 91


>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
          Length = 377

 Score = 68.9 bits (167), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 32/50 (64%), Positives = 38/50 (76%)

Query: 92  RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
           RPRK C+CTKS CLKLYCDCF++G  C  CNC +C NN E+E  RH AI+
Sbjct: 325 RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAENESQRHEAIK 374



 Score = 44.7 bits (104), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 24/34 (70%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 34
           YCDCF++G  C  CNC +C NN E+E  RH AI+
Sbjct: 341 YCDCFSSGMMCSSCNCINCHNNAENESQRHEAIK 374


>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
 gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
          Length = 781

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/105 (45%), Positives = 58/105 (55%), Gaps = 16/105 (15%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC KS CLKLYC+CFA G +C   C+C +C N  E+ED      +Q   RNP AF PK+
Sbjct: 443 CNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAPKV 502

Query: 156 --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                         GK +      RH KGCNCK+S C K YCEC+
Sbjct: 503 VNNATNSPANMMEEGKWM-KTSSSRHKKGCNCKKSKCSKKYCECF 546



 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 51/98 (52%), Gaps = 16/98 (16%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC+CFA G +C   C+C +C N  E+ED      +Q   RNP AF PK+           
Sbjct: 454 YCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAPKVVNNATNSPANM 513

Query: 49  ---GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
              GK +      RH KGCNCK+S C K YCEC++ G 
Sbjct: 514 MEEGKWM-KTSSSRHKKGCNCKKSKCSKKYCECFQGGV 550


>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 11/96 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
           +PC+CTK+ CLK+YC CF NG+ C + C C  C N  E+ + RH A   I++   RN   
Sbjct: 80  QPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEYIKKKAHRNKKV 139

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + K+        E +   GCNCK++ CLK YCEC+
Sbjct: 140 TQEKL-------FETKEVWGCNCKKTRCLKRYCECF 168



 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 11/83 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
           YC CF NG+ C + C C  C N  E+ + RH A   I++   RN    + K+        
Sbjct: 93  YCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEYIKKKAHRNKKVTQEKLF------- 145

Query: 57  ERRHTKGCNCKRSGCLKNYCECY 79
           E +   GCNCK++ CLK YCEC+
Sbjct: 146 ETKEVWGCNCKKTRCLKRYCECF 168


>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 81

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 6/84 (7%)

Query: 104 CLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
           CLKLYCDCFA G FC   C C  C N  +  +    A +  + ++P AF+PK     V +
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPK-----VLD 55

Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
               H KGC C++S CLK YCEC+
Sbjct: 56  ASGGHVKGCACRKSRCLKKYCECF 79



 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1  YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
          YCDCFA G FC   C C  C N  +  +    A +  + ++P AF+PK     V +    
Sbjct: 5  YCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPK-----VLDASGG 59

Query: 60 HTKGCNCKRSGCLKNYCECY 79
          H KGC C++S CLK YCEC+
Sbjct: 60 HVKGCACRKSRCLKKYCECF 79


>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 120

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 51/102 (50%), Gaps = 19/102 (18%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC+CFA G +C   C C  CFN  E+ +   +  +Q   RNP AF PKI           
Sbjct: 1   YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60

Query: 49  -------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                   + L      RH +GCNCK+S CLK YCECY+AG 
Sbjct: 61  VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGV 102



 Score = 63.2 bits (152), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 48/98 (48%), Gaps = 19/98 (19%)

Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 155
           YC+CFA G +C   C C  CFN  E+ +   +  +Q   RNP AF PKI           
Sbjct: 1   YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60

Query: 156 -------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                   + L      RH +GCNCK+S CLK YCECY
Sbjct: 61  VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECY 98


>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 495

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 63/121 (52%), Gaps = 10/121 (8%)

Query: 76  CECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
           C+  E  +  +  +  + R+ C C  S CLKLYC+CFA G +C   C C  C N  E E+
Sbjct: 235 CQNGENDSDNINDSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECEN 294

Query: 135 DRHHAIRQCLERNPNAFRPK---IGKCLVGE------GERRHTKGCNCKRSGCLKNYCEC 185
           +   A +  L RNP+AF  K   +G  +  E      G   + KGC CKR+ C K YCEC
Sbjct: 295 EIKEARKVTLTRNPDAFTSKLEVVGTIVQDEDIEANRGMLGYKKGCKCKRTYCKKKYCEC 354

Query: 186 Y 186
           Y
Sbjct: 355 Y 355



 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 50/94 (53%), Gaps = 10/94 (10%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK---IGKCLVGE- 55
           YC+CFA G +C   C C  C N  E E++   A +  L RNP+AF  K   +G  +  E 
Sbjct: 267 YCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTLTRNPDAFTSKLEVVGTIVQDED 326

Query: 56  -----GERRHTKGCNCKRSGCLKNYCECYEAGAR 84
                G   + KGC CKR+ C K YCECY AG +
Sbjct: 327 IEANRGMLGYKKGCKCKRTYCKKKYCECYNAGVK 360


>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
          Length = 713

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 12/101 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR--- 152
           C C K+ CLK+YC CF++  FC+  C C+ C N  +H+ +R  AI+  L  +P AF    
Sbjct: 536 CGC-KTGCLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFAS 594

Query: 153 -PKIGKC------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
            P+          L  +      +GC CK+S CLK YCEC+
Sbjct: 595 LPQSASTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECF 635



 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 19/110 (17%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR----PKIGKC---- 51
           YC CF++  FC+  C C+ C N  +H+ +R  AI+  L  +P AF     P+        
Sbjct: 546 YCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFASLPQSASTTGFL 605

Query: 52  --LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
             L  +      +GC CK+S CLK YCEC++        NGI     C C
Sbjct: 606 HLLPQKSSAVVMRGCRCKKSKCLKKYCECFQ--------NGIACTSHCRC 647


>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 11/96 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
           +PC+C+K+ CLK+YC CF NG+ C  QC C  C N  E+ + RH+A   IR+   RN   
Sbjct: 80  QPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKKAHRNKKV 139

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + K+        E +   GCNCK++ C K YCEC+
Sbjct: 140 TQEKL-------FETKDVWGCNCKKTRCQKRYCECF 168



 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 44/83 (53%), Gaps = 11/83 (13%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
           YC CF NG+ C  QC C  C N  E+ + RH+A   IR+   RN    + K+        
Sbjct: 93  YCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKKAHRNKKVTQEKLF------- 145

Query: 57  ERRHTKGCNCKRSGCLKNYCECY 79
           E +   GCNCK++ C K YCEC+
Sbjct: 146 ETKDVWGCNCKKTRCQKRYCECF 168


>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 712

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 54/130 (41%), Gaps = 40/130 (30%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C KS CLKLYC CF+    C   C C  C N  E    R  A++  L+RNPNAFR K 
Sbjct: 338 CTCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKF 397

Query: 156 GKCLVGE--------GER-------------------------------RHTKGCNCKRS 176
            K   GE         ER                                H  GCNC++S
Sbjct: 398 VKVTGGEEFAKEWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKS 457

Query: 177 GCLKNYCECY 186
            CLK YCEC+
Sbjct: 458 FCLKRYCECF 467



 Score = 55.8 bits (133), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 66/163 (40%), Gaps = 18/163 (11%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG-ER 58
           YC CF+    C   C C  C N  E    R  A++  L+RNPNAFR K  K   GE   +
Sbjct: 349 YCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAK 408

Query: 59  RHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKP-----CNCTKSMCLKLYCDCFA 113
              K    K+   L       +     +E   +    P     CNC KS CLK YC+CF 
Sbjct: 409 EWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFG 468

Query: 114 NGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
               C   C C  C N    + D  H +      +P  +RP +
Sbjct: 469 AQTPCGLNCKCLGCQN----QPDYVHVL------DPRGYRPPL 501


>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
 gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
          Length = 141

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C+KS C+KLYCDCF  G  C   CNC  C N  E  ++R   +++ L+ NP AF    
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSEDS 94

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            K            GC C  S CLK YC C+
Sbjct: 95  DKFNTKRQRMSRGNGCACPSSHCLKKYCSCF 125



 Score = 55.8 bits (133), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 39/84 (46%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCF  G  C   CNC  C N  E  ++R   +++ L+ NP AF     K         
Sbjct: 46  YCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSEDSDKFNTKRQRMS 105

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
              GC C  S CLK YC C+ A A
Sbjct: 106 RGNGCACPSSHCLKKYCSCFGADA 129


>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
          Length = 259

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 6/100 (6%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
           P      KPC+C+K+ CL+LYC CF N   C Q C C+ CFN+ +H D+   AI Q   +
Sbjct: 134 PTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCSECFNDGQHSDEVLKAIEQIKIK 193

Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
              A    +        + R   GC CK++ C+K YCEC+
Sbjct: 194 EQRASHHDL-----DSFDTRQVWGCKCKKTQCIKGYCECF 228



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 6/80 (7%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF N   C Q C C+ CFN+ +H D+   AI Q   +   A    +        + R
Sbjct: 154 YCSCFHNRRQCTQECKCSECFNDGQHSDEVLKAIEQIKIKEQRASHHDLD-----SFDTR 208

Query: 60  HTKGCNCKRSGCLKNYCECY 79
              GC CK++ C+K YCEC+
Sbjct: 209 QVWGCKCKKTQCIKGYCECF 228


>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
           niloticus]
          Length = 310

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/48 (64%), Positives = 33/48 (68%)

Query: 34  RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
           + CL  NP+AFRPKI     G     H KGCNCKRSGCLK YCECYEA
Sbjct: 169 KWCLGHNPDAFRPKIAGVKSGAVRGWHNKGCNCKRSGCLKRYCECYEA 216



 Score = 62.0 bits (149), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/46 (63%), Positives = 31/46 (67%)

Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + CL  NP+AFRPKI     G     H KGCNCKRSGCLK YCECY
Sbjct: 169 KWCLGHNPDAFRPKIAGVKSGAVRGWHNKGCNCKRSGCLKRYCECY 214


>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
          Length = 214

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           R  C+C +S C +LYC CF +  FC   CNC+ C+N   +ED           +NPNAF 
Sbjct: 7   RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66

Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P+I                            + G  +R+H KGC+C++S C K YCEC+
Sbjct: 67  PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125



 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC CF +  FC   CNC+ C+N   +ED           +NPNAF P+I           
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 49  --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
                            + G  +R+H KGC+C++S C K YCEC++
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126


>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 18/103 (17%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQC 143
           T++    + ++ C C KS CLK+YCDC A GE+C  +C+C +C N+ +H + R   + Q 
Sbjct: 30  TIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQM 89

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +E+N +A                    CNC++S C K YCECY
Sbjct: 90  VEKNSSA-----------------QLKCNCRKSNCQKKYCECY 115



 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 18/85 (21%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDC A GE+C  +C+C +C N+ +H + R   + Q +E+N +A                
Sbjct: 53  YCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQMVEKNSSA---------------- 96

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
               CNC++S C K YCECY AG +
Sbjct: 97  -QLKCNCRKSNCQKKYCECYNAGVK 120



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 28/111 (25%)

Query: 48  IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA------------------------ 83
           I +  + +  ++  + C CK+S CLK YC+C   G                         
Sbjct: 26  ITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKV 85

Query: 84  --RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
             + +E N     K CNC KS C K YC+C+  G  C + C C+ C N+++
Sbjct: 86  LEQMVEKNSSAQLK-CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKNSVK 135


>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
          Length = 601

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 53
           YC+CF+ G FC + C+C +CFN   HED    +      RNP AF PK+         L 
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487

Query: 54  GEGER-----RHTKGCNCKRSGCLKNYCECYEAGA 83
            E  +     RH +GCNC++SGC K YCEC+  G 
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGV 522



 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/91 (41%), Positives = 50/91 (54%), Gaps = 12/91 (13%)

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 160
           YC+CF+ G FC + C+C +CFN   HED    +      RNP AF PK+         L 
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487

Query: 161 GEGER-----RHTKGCNCKRSGCLKNYCECY 186
            E  +     RH +GCNC++SGC K YCEC+
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECF 518


>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
           [Oryza sativa Japonica Group]
          Length = 194

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           R  C+C +S C +LYC CF +  FC   CNC+ C+N   +ED           +NPNAF 
Sbjct: 7   RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66

Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P+I                            + G  +R+H KGC+C++S C K YCEC+
Sbjct: 67  PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125



 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC CF +  FC   CNC+ C+N   +ED           +NPNAF P+I           
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 49  --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
                            + G  +R+H KGC+C++S C K YCEC++
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126


>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
          Length = 223

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
           R  C+C +S C +LYC CF +  FC   CNC+ C+N   +ED           +NPNAF 
Sbjct: 7   RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66

Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           P+I                            + G  +R+H KGC+C++S C K YCEC+
Sbjct: 67  PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125



 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
           YC CF +  FC   CNC+ C+N   +ED           +NPNAF P+I           
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 49  --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
                            + G  +R+H KGC+C++S C K YCEC++
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126


>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 65.1 bits (157), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 11/102 (10%)

Query: 89  NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQC----NCNSCFNNIEHEDDRHHAIRQCL 144
           N I   +PC+CTK+ CLK+YC CF NG+ C +     +C +  NN+   D     I++  
Sbjct: 74  NEIAKWQPCSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKA 133

Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            RN    +  +        E +   GCNCK++ CLK YCECY
Sbjct: 134 HRNKKVSQETL-------FETKDIWGCNCKKTRCLKRYCECY 168



 Score = 45.4 bits (106), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 11/83 (13%)

Query: 1   YCDCFANGEFCYQC----NCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
           YC CF NG+ C +     +C +  NN+   D     I++   RN    +  +        
Sbjct: 93  YCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKAHRNKKVSQETLF------- 145

Query: 57  ERRHTKGCNCKRSGCLKNYCECY 79
           E +   GCNCK++ CLK YCECY
Sbjct: 146 ETKDIWGCNCKKTRCLKRYCECY 168



 Score = 37.0 bits (84), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 41/174 (23%), Positives = 68/174 (39%), Gaps = 48/174 (27%)

Query: 23  IEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
           IE+ D  H  I + ++ +         K    + E    + C+C ++ CLK YC C+  G
Sbjct: 48  IEYNDQLHMVINKQIDIDE-------KKVESPKNEIAKWQPCSCTKTNCLKMYCSCFHNG 100

Query: 83  ARTLEP----------NGIRPRKP----------------------------CNCTKSMC 104
              +E           N +  R                              CNC K+ C
Sbjct: 101 QTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKAHRNKKVSQETLFETKDIWGCNCKKTRC 160

Query: 105 LKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 157
           LK YC+C+   + C  +CNC  C N    ++D ++ IR+  E+   + R + G+
Sbjct: 161 LKRYCECYIRQKACTVECNCTHCENG--KDEDLYNEIRRQNEQPKQSKRQRSGR 212


>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 18/103 (17%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQC 143
           T++    + ++ C C KS CLK+YCDC A GE+C  +C+C +C N+ +H + R+  + Q 
Sbjct: 30  TIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQM 89

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +E+N  A                    CNC++S C K YCECY
Sbjct: 90  VEKNTPA-----------------QIKCNCRKSNCQKKYCECY 115



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 18/85 (21%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDC A GE+C  +C+C +C N+ +H + R+  + Q +E+N  A                
Sbjct: 53  YCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQMVEKNTPA---------------- 96

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
               CNC++S C K YCECY AG +
Sbjct: 97  -QIKCNCRKSNCQKKYCECYNAGVK 120



 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 28/111 (25%)

Query: 48  IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA------------------------ 83
           I +  + +  ++  + C CK+S CLK YC+C   G                         
Sbjct: 26  ITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKV 85

Query: 84  --RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
             + +E N     K CNC KS C K YC+C+  G  C + C C+ C N+++
Sbjct: 86  LEQMVEKNTPAQIK-CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKNSVK 135


>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
           CCMP2712]
          Length = 125

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 52/96 (54%), Gaps = 6/96 (6%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C ++ CLK YC CF N   C   C C+ C N+ +HE+ R  AIR     NP+AF+   
Sbjct: 13  CRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTA 72

Query: 156 -----GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                 +    +G ++  +GC CKRS C K YCEC+
Sbjct: 73  LELEDQEVTTPKGGKKTVRGCRCKRSKCQKKYCECF 108



 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVG 54
           YC CF N   C   C C+ C N+ +HE+ R  AIR     NP+AF+         +    
Sbjct: 24  YCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTALELEDQEVTTP 83

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAG 82
           +G ++  +GC CKRS C K YCEC+ AG
Sbjct: 84  KGGKKTVRGCRCKRSKCQKKYCECFSAG 111


>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
 gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
          Length = 163

 Score = 63.9 bits (154), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/62 (51%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           CNC +S CLKLYC+CFA G +C   C+C +CFN  E+ D   H  +Q   RNP AF PK+
Sbjct: 83  CNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAPKV 142

Query: 156 GK 157
            K
Sbjct: 143 VK 144



 Score = 48.9 bits (115), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 50
           YC+CFA G +C   C+C +CFN  E+ D   H  +Q   RNP AF PK+ K
Sbjct: 94  YCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAPKVVK 144


>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 53/96 (55%), Gaps = 11/96 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
           +PC+CTK+ CLK+YC CF NG+ C + C C  C N   H   R  A   I++   RN   
Sbjct: 80  QPCSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKKAHRNKKV 139

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + K+        E +   GCNCK++ CLK YCECY
Sbjct: 140 PQEKL-------FETKDIWGCNCKKTRCLKRYCECY 168



 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 41/83 (49%), Gaps = 11/83 (13%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
           YC CF NG+ C + C C  C N   H   R  A   I++   RN    + K+        
Sbjct: 93  YCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKKAHRNKKVPQEKLF------- 145

Query: 57  ERRHTKGCNCKRSGCLKNYCECY 79
           E +   GCNCK++ CLK YCECY
Sbjct: 146 ETKDIWGCNCKKTRCLKRYCECY 168


>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
 gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
          Length = 649

 Score = 63.5 bits (153), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/74 (45%), Positives = 45/74 (60%), Gaps = 3/74 (4%)

Query: 86  LEPNGIRPRKPCNCTKS---MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
           L+ N ++ +K CNC  S        YC+C+A G +C  CNC +C NN+ +E  R  AI  
Sbjct: 139 LKDNTLKKQKRCNCKNSKSDFATARYCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGM 198

Query: 143 CLERNPNAFRPKIG 156
            LE+NPNAFRPKI 
Sbjct: 199 TLEKNPNAFRPKIA 212



 Score = 59.3 bits (142), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 27/49 (55%), Positives = 34/49 (69%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
           YC+C+A G +C  CNC +C NN+ +E  R  AI   LE+NPNAFRPKI 
Sbjct: 164 YCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIA 212


>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
 gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
          Length = 777

 Score = 63.5 bits (153), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 12/101 (11%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR--- 152
           C C K+ CLK+YC CF++  FC+  C C+ C N    + +R  AI+  L  +P AF    
Sbjct: 540 CGC-KTGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAFSFAS 598

Query: 153 -PKIGKC------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
            P+          L  +      +GC CK+S CLK YCEC+
Sbjct: 599 LPQDTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECF 639



 Score = 55.8 bits (133), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 19/110 (17%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR----PKIGKC---- 51
           YC CF++  FC+  C C+ C N    + +R  AI+  L  +P AF     P+        
Sbjct: 550 YCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAFSFASLPQDTNTTGFL 609

Query: 52  --LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
             L  +      +GC CK+S CLK YCEC++        NGI     C C
Sbjct: 610 HLLPQKSSAVVMRGCRCKKSKCLKKYCECFQ--------NGIACTSHCRC 651


>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 617

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/85 (42%), Positives = 49/85 (57%), Gaps = 3/85 (3%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CF  G +C  QC C  C NN ++ ++R  AI +   RN +AF  K    +       
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQ--ITSNSTVT 414

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
           H KGC+CKR+ CLK YCEC+ AG +
Sbjct: 415 HKKGCHCKRTHCLKKYCECFNAGLK 439



 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 3/80 (3%)

Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
           YC+CF  G +C  QC C  C NN ++ ++R  AI +   RN +AF  K    +       
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQ--ITSNSTVT 414

Query: 167 HTKGCNCKRSGCLKNYCECY 186
           H KGC+CKR+ CLK YCEC+
Sbjct: 415 HKKGCHCKRTHCLKKYCECF 434


>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 625

 Score = 63.2 bits (152), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 3/82 (3%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
           YC+CFA G FC  C+C +C N  ++         Q   RNP AF  KI   +  +GE +H
Sbjct: 281 YCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKI---VATDGEEKH 337

Query: 61  TKGCNCKRSGCLKNYCECYEAG 82
            KGC+CK+S CLK YCEC++AG
Sbjct: 338 KKGCHCKKSACLKKYCECFQAG 359



 Score = 60.1 bits (144), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 37/80 (46%), Positives = 48/80 (60%), Gaps = 3/80 (3%)

Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
           LYC+CFA G FC  C+C +C N  ++         Q   RNP AF  KI   +  +GE +
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKI---VATDGEEK 336

Query: 167 HTKGCNCKRSGCLKNYCECY 186
           H KGC+CK+S CLK YCEC+
Sbjct: 337 HKKGCHCKKSACLKKYCECF 356


>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
          Length = 496

 Score = 63.2 bits (152), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 6/89 (6%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLVG 54
           YC CF     C  C+C  C+N+ +HE +R  AI      +P AF  ++          V 
Sbjct: 300 YCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASVSVE 359

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
              ++H +GC CK S CLK YCEC+E G 
Sbjct: 360 SKPKQHVRGCKCKNSKCLKKYCECFEFGV 388



 Score = 60.1 bits (144), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 6/88 (6%)

Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKC 158
           ++ YC CF     C  C+C  C+N+ +HE +R  AI      +P AF  ++         
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASV 356

Query: 159 LVGEGERRHTKGCNCKRSGCLKNYCECY 186
            V    ++H +GC CK S CLK YCEC+
Sbjct: 357 SVESKPKQHVRGCKCKNSKCLKKYCECF 384


>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 159

 Score = 62.4 bits (150), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 11/132 (8%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA G +C   C C  CFN  E+ +   +  +Q   RNP AF PKI    V   E  
Sbjct: 1   YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKI----VQGAEPS 56

Query: 60  HTKGCNCKRS-GCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC 118
              G     +  C++   +  +    T  P   R ++ CNC KS+CLK YC+C+  G  C
Sbjct: 57  PVPGVTLPAAEACVQLVVQ--DEALDT--PASARHKRGCNCKKSLCLKKYCECYQAGVGC 112

Query: 119 YQ-CNCNSCFNN 129
            + C C  C N 
Sbjct: 113 SEGCRCEGCMNK 124



 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 48/106 (45%), Gaps = 27/106 (25%)

Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 155
           YC+CFA G +C   C C  CFN  E+ +   +  +Q   RNP AF PKI           
Sbjct: 1   YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAEPSPVPG 60

Query: 156 ---------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                           + L      RH +GCNCK+S CLK YCECY
Sbjct: 61  VTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECY 106


>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 313

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 45/92 (48%), Gaps = 3/92 (3%)

Query: 96  PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           PC C K  CLKLYCDCFA   FC   C C  C N +        A R  LERNP+AF  +
Sbjct: 200 PCTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAFH-R 257

Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                   G      GC C RS CLK YC C+
Sbjct: 258 AHSLETRPGSVTSGAGCGCTRSHCLKKYCVCF 289



 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 48/134 (35%), Gaps = 37/134 (27%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YCDCFA   FC   C C  C N +        A R  LERNP+AF               
Sbjct: 212 YCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAF--------------- 255

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC- 118
                              + A +    P  +     C CT+S CLK YC CF     C 
Sbjct: 256 -------------------HRAHSLETRPGSVTSGAGCGCTRSHCLKKYCVCFQKSLLCG 296

Query: 119 YQCNCNSCFNNIEH 132
             C C  C N +E+
Sbjct: 297 AGCKCEGCLNTVEN 310


>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
          Length = 482

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 56/106 (52%), Gaps = 14/106 (13%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C +S C+KLYC+CF +  FC   C CN C N  E+E++      + L RNP AF  
Sbjct: 357 KGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEK 416

Query: 154 KI----GKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
           K+        +G G +         RH KGC CKRS C  NYCEC+
Sbjct: 417 KLQIEEASQPLGMGPKAASAPSSNLRHIKGCTCKRSKCSNNYCECH 462



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 51/98 (52%), Gaps = 14/98 (14%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCLVGE 55
           YC+CF +  FC   C CN C N  E+E++      + L RNP AF  K+        +G 
Sbjct: 370 YCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEKKLQIEEASQPLGM 429

Query: 56  GER---------RHTKGCNCKRSGCLKNYCECYEAGAR 84
           G +         RH KGC CKRS C  NYCEC++ GA+
Sbjct: 430 GPKAASAPSSNLRHIKGCTCKRSKCSNNYCECHQHGAK 467


>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|194699308|gb|ACF83738.1| unknown [Zea mays]
 gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|224028923|gb|ACN33537.1| unknown [Zea mays]
 gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 359

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 53/123 (43%), Gaps = 21/123 (17%)

Query: 85  TLEPNGIRPR----------KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHE 133
            + P+  RPR          K C C KS CLKLYC CF+ G +C  +C C  CFN     
Sbjct: 47  AIAPDPKRPRVEEAADGNGCKHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFA 106

Query: 134 DDRHHAIRQCLERN----------PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYC 183
           +  H   +  L R           P A    +             +GCNCK+S CLK YC
Sbjct: 107 ETVHTTRKVLLSRQKRMSMKINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYC 166

Query: 184 ECY 186
           +CY
Sbjct: 167 DCY 169



 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERN----------PNAFRPKIG 49
           YC CF+ G +C  +C C  CFN     +  H   +  L R           P A    + 
Sbjct: 80  YCPCFSGGGYCSDKCGCQPCFNKEAFAETVHTTRKVLLSRQKRMSMKINRRPEANTEPME 139

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                       +GCNCK+S CLK YC+CY+ G 
Sbjct: 140 DAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGT 173


>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
 gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score = 60.1 bits (144), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 38/76 (50%), Gaps = 13/76 (17%)

Query: 19  CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------------LVGEGERRHTKGCN 65
           CFN  E+ED      +Q   RNP AF PKI +              L      RH  GCN
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHVTEFQAIDVEDVDLFTPYSGRHKTGCN 503

Query: 66  CKRSGCLKNYCECYEA 81
           CKRS C+K YCECY+A
Sbjct: 504 CKRSMCVKKYCECYQA 519



 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 32/74 (43%), Positives = 36/74 (48%), Gaps = 13/74 (17%)

Query: 126 CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------------LVGEGERRHTKGCN 172
           CFN  E+ED      +Q   RNP AF PKI +              L      RH  GCN
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHVTEFQAIDVEDVDLFTPYSGRHKTGCN 503

Query: 173 CKRSGCLKNYCECY 186
           CKRS C+K YCECY
Sbjct: 504 CKRSMCVKKYCECY 517


>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 375

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 49/104 (47%), Gaps = 13/104 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C KS CLKLYC CFA G +C + C C  CFN  +  +      +  L R       
Sbjct: 80  KHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSRQ-KRMSL 138

Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
           KI + L    E               +GCNCK+S CLK YC+CY
Sbjct: 139 KINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCY 182



 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 49/181 (27%), Positives = 69/181 (38%), Gaps = 59/181 (32%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA G +C + C C  CFN  +  +      +  L R                 ++R
Sbjct: 93  YCPCFAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSR-----------------QKR 135

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLE-----PNGIRPRKPCNCTKSMCLKLYCDCFAN 114
            +   N +            EA A T+E      +   PR+ CNC KS CLK YCDC+ +
Sbjct: 136 MSLKINRR-----------LEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184

Query: 115 GEFCYQ-CNCNSCFN---------------------NIEHEDDRHHAIRQCLERNPNAFR 152
           G  C   C C  C N                     +++H +D H  I   +ER+P    
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNEGIMADESKRFLYTGADLDHSEDEHDFI---VERSPRLQS 241

Query: 153 P 153
           P
Sbjct: 242 P 242


>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 103

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 42/80 (52%), Gaps = 13/80 (16%)

Query: 3  DCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE-GERRHT 61
          D  A G      +C+  F  I             L ++P   +PKIG     E   RRHT
Sbjct: 15 DVSALGRLLPAGDCSELFGRITAG----------LGQSP--VQPKIGATSAAEDATRRHT 62

Query: 62 KGCNCKRSGCLKNYCECYEA 81
          KGCNCKRSGCLKNYCECYEA
Sbjct: 63 KGCNCKRSGCLKNYCECYEA 82



 Score = 58.9 bits (141), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 43/88 (48%), Gaps = 13/88 (14%)

Query: 100 TKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL 159
           T S  +    D  A G      +C+  F  I             L ++P   +PKIG   
Sbjct: 5   TSSATITGITDVSALGRLLPAGDCSELFGRITAG----------LGQSP--VQPKIGATS 52

Query: 160 VGE-GERRHTKGCNCKRSGCLKNYCECY 186
             E   RRHTKGCNCKRSGCLKNYCECY
Sbjct: 53  AAEDATRRHTKGCNCKRSGCLKNYCECY 80


>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 319

 Score = 59.7 bits (143), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/54 (51%), Positives = 35/54 (64%)

Query: 28 DRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
          DR  AI   LERNP AF+PK+           H +GCNC++S CLKNYCEC++A
Sbjct: 19 DRSQAIASILERNPLAFQPKVVANGSETNNLHHLRGCNCRKSNCLKNYCECHQA 72



 Score = 57.4 bits (137), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 27/52 (51%), Positives = 33/52 (63%)

Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           DR  AI   LERNP AF+PK+           H +GCNC++S CLKNYCEC+
Sbjct: 19  DRSQAIASILERNPLAFQPKVVANGSETNNLHHLRGCNCRKSNCLKNYCECH 70


>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
          Length = 286

 Score = 58.9 bits (141), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 52/94 (55%), Gaps = 8/94 (8%)

Query: 98  NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--RPKI 155
           NC+ + C+K YC CF   ++C  CN   C NN E+E++R  AI   L +NP AF  + K 
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203

Query: 156 GKCLVGEGERR----HTKGCNCKRSGCLKNYCEC 185
            K  + + ++     HT  CNC +SGC   YC C
Sbjct: 204 RKNFIKKSKKDSNLIHT--CNCSKSGCNNKYCVC 235



 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 8/88 (9%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--RPKIGKCLVGEGER 58
           YC CF   ++C  CN   C NN E+E++R  AI   L +NP AF  + K  K  + + ++
Sbjct: 154 YCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKTRKNFIKKSKK 213

Query: 59  R----HTKGCNCKRSGCLKNYCECYEAG 82
                HT  CNC +SGC   YC C + G
Sbjct: 214 DSNLIHT--CNCSKSGCNNKYCVCLKEG 239


>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 356

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 47/103 (45%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
           K C C KS CLKLYC CFA G +C  +C C  CFN     +           R   +   
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 121

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + R P A    +             +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164



 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
           YC CFA G +C  +C C  CFN     +           R   +   + R P A    + 
Sbjct: 75  YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                       +GCNCK+S CLK YC+CY+ G 
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 168



 Score = 37.0 bits (84), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 53/151 (35%), Gaps = 57/151 (37%)

Query: 37  LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
           L   P+  R ++ +   G G     K C CK+S CLK YC C+                 
Sbjct: 41  LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFN 96

Query: 80  ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
                                         EA A  +E      +   PR+ CNC KS C
Sbjct: 97  KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 156

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
           LK YCDC+ +G  C   C C+ C N     D
Sbjct: 157 LKKYCDCYQDGTGCSLFCRCDDCQNPFGKND 187


>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
          Length = 341

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 47/103 (45%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
           K C C KS CLKLYC CFA G +C  +C C  CFN     +           R   +   
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 121

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + R P A    +             +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164



 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 39/91 (42%), Gaps = 11/91 (12%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
           YC CFA G +C  +C C  CFN     +           R   +   + R P A    + 
Sbjct: 75  YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
                       +GCNCK+S CLK YC+CY+
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165


>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
          Length = 434

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 51/101 (50%), Gaps = 12/101 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C ++ CLK YC CF N   C   C C  C N+ +HE  R  A+R     +P AF+ 
Sbjct: 15  KGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFK- 73

Query: 154 KIGKCLVGEGERRHT--------KGCNCKRSGCLKNYCECY 186
             G  L  E +  HT        +GC C+RS C K YCEC+
Sbjct: 74  --GTSLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECF 112



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 46/93 (49%), Gaps = 12/93 (12%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF N   C   C C  C N+ +HE  R  A+R     +P AF+   G  L  E +  
Sbjct: 28  YCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFK---GTSLELENQEV 84

Query: 60  HT--------KGCNCKRSGCLKNYCECYEAGAR 84
           HT        +GC C+RS C K YCEC+ AG +
Sbjct: 85  HTPNGTLKTVRGCRCRRSKCQKKYCECFGAGLK 117


>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 11/96 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
           +PC+C K+ CLK+YC CF NG FC   C C  C N  E +  R  A   +++   RN   
Sbjct: 87  QPCSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKV 146

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + KI + +          GCNC ++ C+K YCEC+
Sbjct: 147 PKEKIFETV-------EIWGCNCSKTRCVKKYCECF 175



 Score = 43.9 bits (102), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 11/88 (12%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAI---RQCLERNPNAFRPKIGKCLVGEG 56
           YC CF NG FC   C C  C N  E +  R  A+   ++   RN    + KI + +    
Sbjct: 100 YCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKVPKEKIFETV---- 155

Query: 57  ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
                 GCNC ++ C+K YCEC+  G +
Sbjct: 156 ---EIWGCNCSKTRCVKKYCECFIRGKK 180


>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 12/97 (12%)

Query: 91  IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
           +   +PCNC+KS C K+YC+C A G  C   C C +C N   ++    H I++ L+ N N
Sbjct: 66  LSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNK-KVLHVIKE-LDLNKN 123

Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
             + K          +R   GC CK+S CLK YCEC+
Sbjct: 124 LKKIK---------SKRFKDGCTCKKSMCLKKYCECF 151



 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 12/83 (14%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+C A G  C   C C +C N   ++    H I++ L+ N N  + K          +R
Sbjct: 83  YCECLAKGRLCSSACRCENCQNRTSNKK-VLHVIKE-LDLNKNLKKIK---------SKR 131

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
              GC CK+S CLK YCEC+ +G
Sbjct: 132 FKDGCTCKKSMCLKKYCECFHSG 154



 Score = 43.5 bits (101), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 44/102 (43%), Gaps = 32/102 (31%)

Query: 64  CNCKRSGCLKNYCECYEAG------------------------ARTLEPN-------GIR 92
           CNC +S C K YCEC   G                         + L+ N         R
Sbjct: 72  CNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNKKVLHVIKELDLNKNLKKIKSKR 131

Query: 93  PRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHE 133
            +  C C KSMCLK YC+CF +G+ C Y CNC +C N +  E
Sbjct: 132 FKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDE 173


>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
           distachyon]
          Length = 377

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 49/104 (47%), Gaps = 13/104 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C KS CLKLYC CFA G +C + C C  CFN  ++ +      +  L R       
Sbjct: 89  KHCACKKSKCLKLYCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSRQ-KRMSL 147

Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
           KI +      E               +GCNCK+S CLK YC+CY
Sbjct: 148 KINRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCY 191



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 57/135 (42%), Gaps = 35/135 (25%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA G +C + C C  CFN  ++ +      +  L R                 ++R
Sbjct: 102 YCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSR-----------------QKR 144

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLE-----PNGIRPRKPCNCTKSMCLKLYCDCFAN 114
            +   N +RS          EA A  +E      +   PR+ CNC KS CLK YCDC+ +
Sbjct: 145 MSLKIN-RRS----------EANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 193

Query: 115 GEFCYQ-CNCNSCFN 128
           G  C   C C  C N
Sbjct: 194 GTGCSLFCRCEDCKN 208


>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
 gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
          Length = 359

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 47/103 (45%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
           K C C KS CLKLYC CF+ G +C  +C C  CFN     +           R   +   
Sbjct: 67  KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 126

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + R P A    +             +GCNCK+S CLK YC+CY
Sbjct: 127 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 169



 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 40/94 (42%), Gaps = 11/94 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
           YC CF+ G +C  +C C  CFN     +           R   +   + R P A    + 
Sbjct: 80  YCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 139

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                       +GCNCK+S CLK YC+CY+ G 
Sbjct: 140 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 173



 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 52/145 (35%), Gaps = 57/145 (39%)

Query: 37  LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
           L   P+  R ++ +   G G     K C CK+S CLK YC C+                 
Sbjct: 46  LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFN 101

Query: 80  ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
                                         EA A  +E      +   PR+ CNC KS C
Sbjct: 102 KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 161

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFN 128
           LK YCDC+ +G  C   C C+ C N
Sbjct: 162 LKKYCDCYQDGTGCSLFCRCDDCQN 186


>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
          Length = 356

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 47/103 (45%), Gaps = 11/103 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
           K C C KS CLKLYC CFA G +C  +C C  CF+     +           R   +   
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQKRMSMK 121

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           + R P A    +             +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164



 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 40/94 (42%), Gaps = 11/94 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
           YC CFA G +C  +C C  CF+     +           R   +   + R P A    + 
Sbjct: 75  YCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                       +GCNCK+S CLK YC+CY+ G 
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 168



 Score = 37.0 bits (84), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 53/151 (35%), Gaps = 57/151 (37%)

Query: 37  LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
           L   P+  R ++ +   G G     K C CK+S CLK YC C+                 
Sbjct: 41  LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFS 96

Query: 80  ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
                                         EA A  +E      +   PR+ CNC KS C
Sbjct: 97  KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 156

Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
           LK YCDC+ +G  C   C C+ C N     D
Sbjct: 157 LKKYCDCYQDGTGCSLFCRCDDCQNPFGKND 187


>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
          Length = 374

 Score = 57.0 bits (136), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 48/104 (46%), Gaps = 13/104 (12%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C C KS CLKLYC CFA G +C + C C  CFN   + +      +  L R       
Sbjct: 86  KHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSRQ-KRMSL 144

Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
           KI +      E               +GCNCK+S CLK YC+CY
Sbjct: 145 KINRRSEANAEAVEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 188



 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 25/130 (19%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA G +C + C C  CFN   + +      +  L R                 ++R
Sbjct: 99  YCPCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSR-----------------QKR 141

Query: 60  HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
            +   N +     +   + + + + T       PR+ CNC KS CLK YCDC+ +G  C 
Sbjct: 142 MSLKINRRSEANAEAVEDAHHSSSST------PPRRGCNCKKSSCLKKYCDCYQDGTGCS 195

Query: 120 Q-CNCNSCFN 128
             C C  C N
Sbjct: 196 LFCRCEDCRN 205


>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
          Length = 184

 Score = 56.2 bits (134), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 25/25 (100%), Positives = 25/25 (100%)

Query: 57 ERRHTKGCNCKRSGCLKNYCECYEA 81
          ERRHTKGCNCKRSGCLKNYCECYEA
Sbjct: 1  ERRHTKGCNCKRSGCLKNYCECYEA 25



 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 23/23 (100%), Positives = 23/23 (100%)

Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
           ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 1   ERRHTKGCNCKRSGCLKNYCECY 23


>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
 gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
          Length = 466

 Score = 55.5 bits (132), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 40/67 (59%), Gaps = 5/67 (7%)

Query: 19  CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCE 77
           C N +E+         Q   RNPNAF  KI      +G+  RHTKGC+CK+S CLK YCE
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKIED----DGDDARHTKGCHCKKSACLKKYCE 226

Query: 78  CYEAGAR 84
           C++AG +
Sbjct: 227 CFQAGVK 233



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 67/154 (43%), Gaps = 27/154 (17%)

Query: 56  GERRHTKGCNCKRSG----CLKNY-CECYEAGARTLEPNGIRPRKPCNCTK------SMC 104
           G+R  T G + + +      LKN+  +    G R+     + PR P    +      S+ 
Sbjct: 79  GKRSSTTGASTEWTAGFTKTLKNFDVDSPRVGTRSARRTVVPPRTPSKAVRRPRSLRSIP 138

Query: 105 -LKLYC-------DCFANGEFCYQCNCNS---CFNNIEHEDDRHHAIRQCLERNPNAFRP 153
            ++L         +  A       C  N+   C N +E+         Q   RNPNAF  
Sbjct: 139 RIRLLAVNQLRARNATAKKASVSSCTANASRRCQNTVENVSVVEKTRSQIEARNPNAFLS 198

Query: 154 KIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
           KI      +G+  RHTKGC+CK+S CLK YCEC+
Sbjct: 199 KIED----DGDDARHTKGCHCKKSACLKKYCECF 228


>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
 gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
          Length = 478

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/95 (38%), Positives = 48/95 (50%), Gaps = 26/95 (27%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           K C+C  S CLK+YC+CFAN   C  C C  C NN                    A   K
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNN-------------------EANIEK 143

Query: 155 IG---KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           IG     L+     +H++GC+C++SGCLK YCEC+
Sbjct: 144 IGNDQSVLID----KHSRGCHCRKSGCLKKYCECF 174



 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 42/86 (48%), Gaps = 26/86 (30%)

Query: 1   YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---KCLVGEGE 57
           YC+CFAN   C  C C  C NN                    A   KIG     L+    
Sbjct: 116 YCECFANKILCNNCQCFGCQNN-------------------EANIEKIGNDQSVLID--- 153

Query: 58  RRHTKGCNCKRSGCLKNYCECYEAGA 83
            +H++GC+C++SGCLK YCEC++AG 
Sbjct: 154 -KHSRGCHCRKSGCLKKYCECFQAGI 178


>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
 gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
          Length = 113

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF P
Sbjct: 19  KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 78

Query: 154 KI 155
           K+
Sbjct: 79  KV 80



 Score = 39.7 bits (91), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 1  YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 48
          YC+CFA+  +C + C+C  CFN+  HE+       +   RNP AF PK+
Sbjct: 32 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKV 80


>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 50/93 (53%), Gaps = 5/93 (5%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +PCNC++S CLK YC CF +G  C  +C C  C N  E  ++R  AI    ++     + 
Sbjct: 149 QPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKKCHRDKKV 208

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + + L      + + GC CK +GC K YCEC+
Sbjct: 209 PVNELL----SLQMSYGCKCKSTGCQKKYCECF 237



 Score = 40.4 bits (93), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 5/83 (6%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF +G  C  +C C  C N  E  ++R  AI    ++     +  + + L      +
Sbjct: 162 YCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKKCHRDKKVPVNELL----SLQ 217

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
            + GC CK +GC K YCEC++ G
Sbjct: 218 MSYGCKCKSTGCQKKYCECFKRG 240


>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|219884557|gb|ACL52653.1| unknown [Zea mays]
 gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
          Length = 394

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC CFA+G  C + C C  C N  + +     A    L +    F  
Sbjct: 87  KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL-KPVQTF-- 143

Query: 154 KIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
           + G+ +V +  R       + C CK+SGCLK YC+CY
Sbjct: 144 EAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCY 180



 Score = 43.5 bits (101), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA+G  C + C C  C N  + +     A    L +    F  + G+ +V +  R 
Sbjct: 100 YCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL-KPVQTF--EAGQDIVEQVIRS 156

Query: 60  HT----KGCNCKRSGCLKNYCECYEAGA 83
                 + C CK+SGCLK YC+CY+ GA
Sbjct: 157 PMDLIRRKCTCKKSGCLKKYCDCYQGGA 184



 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 41/111 (36%), Gaps = 44/111 (39%)

Query: 62  KGCNCKRSGCLKNYCECYEAGAR------------------TLEPNGIRPRKP------- 96
           K C+CK+S CLK YC C+ +G                    TL    + P KP       
Sbjct: 87  KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPLKPVQTFEAG 146

Query: 97  ------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                             C C KS CLK YCDC+  G  C   C C+ C N
Sbjct: 147 QDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 197


>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 5/91 (5%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C+KS CL++YC CF NG  C   C C  C N  ++   R  A+    ++     +   
Sbjct: 89  CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPE 148

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           GK      E     GCNC ++ C+KNYCEC+
Sbjct: 149 GKIF----ETTEIWGCNCSKTRCVKNYCECF 175



 Score = 42.0 bits (97), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF NG  C   C C  C N  ++   R  A+    ++     +   GK      E  
Sbjct: 100 YCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPEGKIF----ETT 155

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
              GCNC ++ C+KNYCEC+  G +
Sbjct: 156 EIWGCNCSKTRCVKNYCECFIRGKK 180


>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 622

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 1/59 (1%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
           + PC C KS+CLKLYC+CFA GE C + C C  C N+  H + R+ AI+    +NP+AF
Sbjct: 326 KTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384


>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score = 53.5 bits (127), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 11/108 (10%)

Query: 80  EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
           +  ++  EPN      PC CTKS CLKLYC CF     C + C C +C N  +H   R  
Sbjct: 188 DLHSQQQEPNTT----PCKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQT 243

Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           A+ +         + +  K      +R    GC C++S C KNYCECY
Sbjct: 244 ALEKI------KIKSQRQKHDDDLFDRSRVWGCKCQKSQCQKNYCECY 285


>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
          Length = 204

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC C+A+G  C + C C  C N           +             
Sbjct: 78  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +I + L+        + C CK+SGCLK YC+CY
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 170



 Score = 42.4 bits (98), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC C+A+G  C + C C  C N           +             +I + L+      
Sbjct: 91  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 150

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
             + C CK+SGCLK YC+CY+ GA
Sbjct: 151 IRRKCTCKKSGCLKKYCDCYQGGA 174



 Score = 41.2 bits (95), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 53/134 (39%), Gaps = 47/134 (35%)

Query: 38  ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------- 87
           E N N+ R +  K   G+G     K C+CK+S CLK YC CY +G+   E          
Sbjct: 58  EANGNSDRKRRRKADDGDG----CKACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNK 113

Query: 88  -PNG------IRPRKP-------------------------CNCTKSMCLKLYCDCFANG 115
            P G      + P KP                         C C KS CLK YCDC+  G
Sbjct: 114 PPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGG 173

Query: 116 EFC-YQCNCNSCFN 128
             C   C C+ C N
Sbjct: 174 AGCSINCKCDDCRN 187


>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
 gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
          Length = 409

 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS C+KLYC CFA+G  C + C C  C N    +     A+   L+    +   
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTS--- 158

Query: 154 KIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
           + G+ +V +  R       + C CK+SGCLK YC+CY
Sbjct: 159 EAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCY 195



 Score = 43.1 bits (100), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA+G  C + C C  C N    +     A+   L+    +   + G+ +V +  R 
Sbjct: 115 YCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTS---EAGQDIVEQVIRS 171

Query: 60  HT----KGCNCKRSGCLKNYCECYEAGA 83
                 + C CK+SGCLK YC+CY+ GA
Sbjct: 172 PMDLIRRKCTCKKSGCLKKYCDCYQGGA 199



 Score = 39.3 bits (90), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 44/111 (39%), Gaps = 44/111 (39%)

Query: 62  KGCNCKRSGCLKNYCECYEAGARTLE-----------PNGIRPR-------KP------- 96
           K C+CK+S C+K YC C+ +G+R  E           P  + PR       KP       
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTSEAG 161

Query: 97  ------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                             C C KS CLK YCDC+  G  C   C C+ C N
Sbjct: 162 QDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 212


>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
          Length = 151

 Score = 53.1 bits (126), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 9/99 (9%)

Query: 89  NGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           N    ++ C C K+ CLKLYC CF  G+ C  QC+C+ C N++    +R+ AI + + + 
Sbjct: 19  NSEETKEGCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKY 78

Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +  + K        G++    GC CK+S C  NYCEC+
Sbjct: 79  KDESKLKFS------GDK--GSGCCCKKSKCQLNYCECF 109



 Score = 39.7 bits (91), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 9/83 (10%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF  G+ C  QC+C+ C N++    +R+ AI + + +  +  + K        G++ 
Sbjct: 38  YCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKYKDESKLKFS------GDK- 90

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
              GC CK+S C  NYCEC+  G
Sbjct: 91  -GSGCCCKKSKCQLNYCECFIKG 112


>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
          Length = 123

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 52/103 (50%), Gaps = 11/103 (10%)

Query: 85  TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQC 143
           TL P  IR    C CTK+ CLKLYC+C   G  C   C C  C N  +   +R  A++  
Sbjct: 7   TLPPPSIR----CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFP-ERKKALKHI 61

Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + NP AF  ++   L G  + R    C C++S C K YC CY
Sbjct: 62  KKTNPKAFL-RVSN-LKGHEDSR---CCACQKSLCNKKYCSCY 99



 Score = 37.4 bits (85), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+C   G  C   C C  C N  +  + R  A++   + NP AF  ++   L G  + R
Sbjct: 26  YCECLKAGRTCTDLCACVDCLNRAKFPE-RKKALKHIKKTNPKAFL-RVSN-LKGHEDSR 82

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
               C C++S C K YC CY++G
Sbjct: 83  ---CCACQKSLCNKKYCSCYQSG 102


>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 5/93 (5%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +PCNC++S CLK YC CF +G  C  +C C  C N  +  ++R  AI    ++     + 
Sbjct: 149 QPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKKCHRDKKV 208

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            + + L      + + GC CK +GC K YCEC+
Sbjct: 209 PVNELL----SLQMSYGCKCKTTGCQKKYCECF 237



 Score = 38.9 bits (89), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 5/83 (6%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF +G  C  +C C  C N  +  ++R  AI    ++     +  + + L      +
Sbjct: 162 YCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKKCHRDKKVPVNELL----SLQ 217

Query: 60  HTKGCNCKRSGCLKNYCECYEAG 82
            + GC CK +GC K YCEC++ G
Sbjct: 218 MSYGCKCKTTGCQKKYCECFKRG 240


>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
          Length = 391

 Score = 52.8 bits (125), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC C+A+G  C + C C  C N           +             
Sbjct: 76  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 135

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +I + L+        + C CK+SGCLK YC+CY
Sbjct: 136 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 168



 Score = 41.6 bits (96), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC C+A+G  C + C C  C N           +             +I + L+      
Sbjct: 89  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 148

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
             + C CK+SGCLK YC+CY+ GA
Sbjct: 149 IRRKCTCKKSGCLKKYCDCYQGGA 172



 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 43/110 (39%), Gaps = 43/110 (39%)

Query: 62  KGCNCKRSGCLKNYCECYEAGARTLE-----------PNG------IRPRKP-------- 96
           K C+CK+S CLK YC CY +G+   E           P G      + P KP        
Sbjct: 76  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 135

Query: 97  -----------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                            C C KS CLK YCDC+  G  C   C C+ C N
Sbjct: 136 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 185


>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
 gi|194698372|gb|ACF83270.1| unknown [Zea mays]
 gi|194708474|gb|ACF88321.1| unknown [Zea mays]
 gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
          Length = 394

 Score = 52.8 bits (125), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           K C+C KS CLKLYC C+A+G  C + C C  C N           +             
Sbjct: 78  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +I + L+        + C CK+SGCLK YC+CY
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 170



 Score = 41.6 bits (96), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC C+A+G  C + C C  C N           +             +I + L+      
Sbjct: 91  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 150

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
             + C CK+SGCLK YC+CY+ GA
Sbjct: 151 IRRKCTCKKSGCLKKYCDCYQGGA 174



 Score = 41.2 bits (95), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 43/110 (39%), Gaps = 43/110 (39%)

Query: 62  KGCNCKRSGCLKNYCECYEAGARTLE-----------PNG------IRPRKP-------- 96
           K C+CK+S CLK YC CY +G+   E           P G      + P KP        
Sbjct: 78  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137

Query: 97  -----------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                            C C KS CLK YCDC+  G  C   C C+ C N
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 187


>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 405

 Score = 52.4 bits (124), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C+C KS CLKLYC CFA+G  C + C C+ C+N   H   ++      L+    A   + 
Sbjct: 97  CSCKKSRCLKLYCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQN 156

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                        + C CK+SGCLK YC+CY
Sbjct: 157 SAEFARYQMDFFRRKCTCKKSGCLKKYCDCY 187



 Score = 42.0 bits (97), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CFA+G  C + C C+ C+N   H   ++      L+    A   +            
Sbjct: 108 YCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQNSAEFARYQMDF 167

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
             + C CK+SGCLK YC+CY+ GA
Sbjct: 168 FRRKCTCKKSGCLKKYCDCYQGGA 191



 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 54/140 (38%), Gaps = 47/140 (33%)

Query: 32  AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLE---- 87
           A+    E N ++ R K  K   GEG R     C+CK+S CLK YC C+ +G+   E    
Sbjct: 69  AVSPPAEANGSSDRRKKRKAEDGEGCRT----CSCKKSRCLKLYCVCFASGSHCSELCGC 124

Query: 88  ---------------PNGIRP-----------------------RKPCNCTKSMCLKLYC 109
                          P G+                         R+ C C KS CLK YC
Sbjct: 125 DPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQNSAEFARYQMDFFRRKCTCKKSGCLKKYC 184

Query: 110 DCFANGEFC-YQCNCNSCFN 128
           DC+  G  C   C C+ C N
Sbjct: 185 DCYQGGAGCSINCKCDDCKN 204


>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 255

 Score = 52.4 bits (124), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 7/91 (7%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C CTKS CLKLYC CF   + C   C C  C N  EH + R +A+ +  ++     R K 
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQK---LHRQKN 192

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
              L    +R    GC C++S C KNYCEC+
Sbjct: 193 DDDLF---DRSKIWGCKCQKSQCQKNYCECF 220



 Score = 36.6 bits (83), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF   + C   C C  C N  EH + R +A+ +  ++     R K    L    +R 
Sbjct: 147 YCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQK---LHRQKNDDDLF---DRS 200

Query: 60  HTKGCNCKRSGCLKNYCECY 79
              GC C++S C KNYCEC+
Sbjct: 201 KIWGCKCQKSQCQKNYCECF 220


>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
          Length = 289

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/50 (50%), Positives = 34/50 (68%), Gaps = 1/50 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
           KPC+CTK+ CL+LYC CF N   C  +C CN C+N+ +HE+D   A+ Q 
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYNDGQHEEDVLKAVEQI 205


>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
          Length = 280

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 34/50 (68%), Gaps = 1/50 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
           KPC+CTK+ CL+LYC CF N   C  +C CN C+N+ +HE++   A+ Q 
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEEEVQKAVEQI 198


>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
          Length = 326

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 29/45 (64%), Gaps = 4/45 (8%)

Query: 44  FRPKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECYEAGAR 84
           FRPK+      E E+    RH KGCNCK+S CLK YCECY+ G +
Sbjct: 244 FRPKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVK 288



 Score = 45.8 bits (107), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/40 (57%), Positives = 26/40 (65%), Gaps = 4/40 (10%)

Query: 151 FRPKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECY 186
           FRPK+      E E+    RH KGCNCK+S CLK YCECY
Sbjct: 244 FRPKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECY 283


>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 45/107 (42%), Gaps = 29/107 (27%)

Query: 81  AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA 139
              + +E   +     C C KS C K YC+CFA G+ C   CNC  C N I  +  +   
Sbjct: 62  TKIKQIEKPILNTITKCKCMKSKCQKSYCECFAAGKACNMDCNCQGCNNTICSQSIK--- 118

Query: 140 IRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                                    +  T GCNCK++GCLK YCECY
Sbjct: 119 -------------------------KVQTGGCNCKKTGCLKKYCECY 140



 Score = 45.4 bits (106), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 24/130 (18%)

Query: 37  LERNPNAFRP--------KIGKCLVGEGERRHT-KGCNCKRSGCLKNYCECYEAGAR--- 84
           ++ NP  F P        KI K    E    +T   C C +S C K+YCEC+ AG     
Sbjct: 42  IDDNPKFFSPIRPFPIQQKITKIKQIEKPILNTITKCKCMKSKCQKSYCECFAAGKACNM 101

Query: 85  ---------TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEH 132
                    T+    I+  +   CNC K+ CLK YC+C+   + C + CNC  C N  + 
Sbjct: 102 DCNCQGCNNTICSQSIKKVQTGGCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDS 161

Query: 133 EDDRHHAIRQ 142
           E ++ +   Q
Sbjct: 162 ESEKENEQLQ 171



 Score = 42.0 bits (97), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 36/85 (42%), Gaps = 29/85 (34%)

Query: 1   YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CFA G+ C   CNC  C N I  +  +                            + 
Sbjct: 89  YCECFAAGKACNMDCNCQGCNNTICSQSIK----------------------------KV 120

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
            T GCNCK++GCLK YCECY    R
Sbjct: 121 QTGGCNCKKTGCLKKYCECYLKKQR 145


>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 78

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 34/54 (62%), Gaps = 1/54 (1%)

Query: 103 MCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           MCLKLYC CFA    C   C C +C+N+ EH  +R  A+R+ L R+P+AF  K 
Sbjct: 1   MCLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKF 54



 Score = 40.0 bits (92), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 1/49 (2%)

Query: 1  YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 48
          YC CFA    C   C C +C+N+ EH  +R  A+R+ L R+P+AF  K 
Sbjct: 6  YCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKF 54


>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
 gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 100 TKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC 158
           T+S  L  YC+CFA G +C + C+C  CFNN  HE        Q   RNP AF PK+ + 
Sbjct: 3   TRSCYLYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRN 62

Query: 159 LVGEGERRHTKGCN 172
           L    E R   GC+
Sbjct: 63  LDSASEFRGGVGCS 76



 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 1  YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
          YC+CFA G +C + C+C  CFNN  HE        Q   RNP AF PK+ + L    E R
Sbjct: 11 YCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSASEFR 70

Query: 60 HTKGCN 65
             GC+
Sbjct: 71 GGVGCS 76


>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
 gi|194696492|gb|ACF82330.1| unknown [Zea mays]
          Length = 278

 Score = 49.7 bits (117), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 11/75 (14%)

Query: 20 FNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKR 68
           NN  H +      +Q   RNP AF PK+      G  L           RH +GCNCK+
Sbjct: 1  MNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKK 60

Query: 69 SGCLKNYCECYEAGA 83
          S CLK YCEC++ G 
Sbjct: 61 SSCLKKYCECFQGGV 75



 Score = 46.2 bits (108), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 34/71 (47%), Gaps = 11/71 (15%)

Query: 127 FNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKR 175
            NN  H +      +Q   RNP AF PK+      G  L           RH +GCNCK+
Sbjct: 1   MNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKK 60

Query: 176 SGCLKNYCECY 186
           S CLK YCEC+
Sbjct: 61  SSCLKKYCECF 71


>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
          Length = 285

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 51/98 (52%), Gaps = 12/98 (12%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNC-NSCFNNIEHEDDRHHAIRQCLE---RNP 148
           ++PC C  S CLK YC CF +G  C + C C   C NN +H + R+ AI+   E   RN 
Sbjct: 157 QQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNK 216

Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           N  +  + K        +   GCNCK++ C   YCEC+
Sbjct: 217 NVPKDALFKL-------KDCFGCNCKKTRCQTGYCECF 247



 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 42/84 (50%), Gaps = 12/84 (14%)

Query: 1   YCDCFANGEFCYQ-CNC-NSCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
           YC CF +G  C + C C   C NN +H + R+ AI+   E   RN N  +  + K     
Sbjct: 171 YCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNKNVPKDALFKL---- 226

Query: 56  GERRHTKGCNCKRSGCLKNYCECY 79
              +   GCNCK++ C   YCEC+
Sbjct: 227 ---KDCFGCNCKKTRCQTGYCECF 247


>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
          Length = 278

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 12/98 (12%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCY-QCNC-NSCFNNIEHEDDRHHAIRQCLE---RNP 148
           ++PC C  S CLK YC CF +G  C  +C C   C NN +H ++R+ AI+   E   RN 
Sbjct: 150 QQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNK 209

Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           N  +  + K        +   GC+CK+S C   YCEC+
Sbjct: 210 NIPKDALFKL-------KDCFGCSCKKSRCQTGYCECF 240



 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 12/84 (14%)

Query: 1   YCDCFANGEFCY-QCNC-NSCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
           YC CF +G  C  +C C   C NN +H ++R+ AI+   E   RN N  +  + K     
Sbjct: 164 YCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNKNIPKDALFKL---- 219

Query: 56  GERRHTKGCNCKRSGCLKNYCECY 79
              +   GC+CK+S C   YCEC+
Sbjct: 220 ---KDCFGCSCKKSRCQTGYCECF 240


>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 25/101 (24%)

Query: 96  PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           PC C+KS CLKLYC CF     C + C C+ C N  ++   R  A+              
Sbjct: 224 PCKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE------------- 270

Query: 155 IGKCLVGEGERRH---------TKGCNCKRSGCLKNYCECY 186
             K  V +  R+H           GC C++S C KNYCEC+
Sbjct: 271 --KVKVKQQRRKHDDDLFDKDTVWGCQCRKSQCKKNYCECF 309


>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 49.3 bits (116), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 70/161 (43%), Gaps = 44/161 (27%)

Query: 31  HAIRQCLERNPNAFRP-KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPN 89
           H +++  E    AF+  +I +    E ++   + CNCK SGCLK YC C+ +G       
Sbjct: 114 HTLQKKQENEEQAFKYFQILQNDPSEIQKLQQQPCNCKNSGCLKRYCRCFHSGR------ 167

Query: 90  GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCN-SCFNNIEHEDDRHHAIRQCLE--- 145
                        MCLK             +C C+  C NN +H++ R++AI    +   
Sbjct: 168 -------------MCLK-------------ECQCSEDCQNNEQHQEQRNNAIIHVDQKCY 201

Query: 146 RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           RN    R  + K  V         GC+C +S C K YCECY
Sbjct: 202 RNRRMPRDALFKLDV-------IYGCSCTKSKCRKRYCECY 235



 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 12/84 (14%)

Query: 1   YCDCFANGEFCY-QCNCN-SCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
           YC CF +G  C  +C C+  C NN +H++ R++AI    +   RN    R  + K  V  
Sbjct: 159 YCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAIIHVDQKCYRNRRMPRDALFKLDV-- 216

Query: 56  GERRHTKGCNCKRSGCLKNYCECY 79
                  GC+C +S C K YCECY
Sbjct: 217 -----IYGCSCTKSKCRKRYCECY 235


>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
           C C KSMCLK YCDCFANG  C  QC C  C N  +++++R  AI +   +N
Sbjct: 58  CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQN 109


>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
          Length = 303

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 7/92 (7%)

Query: 96  PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           PCNC+KS CLKLYC CF   + C + C C  C N   H   R  A+ +   ++    R K
Sbjct: 182 PCNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALEKIKMKSQ---RQK 238

Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
               L    +     GC C++S C KNYCEC+
Sbjct: 239 HDNDLFDLSK---VWGCKCQKSQCQKNYCECF 267


>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 87  EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLE 145
           +   I     C C KSMCLK YCDCFANG  C  QC C  C N  ++ ++R  AI +   
Sbjct: 48  QSQNIPKSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKM 107

Query: 146 RN 147
           +N
Sbjct: 108 QN 109


>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
 gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
          Length = 590

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 20/92 (21%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           C+C K+ C + YCDC  NG+ C    C+C SC N IE+++                 R +
Sbjct: 470 CSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQN----------------LRLE 513

Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           I +    +G+ +  +GC+CK S C K YCEC+
Sbjct: 514 IQEKKQKQGQAK--EGCSCKNSQCQKRYCECF 543



 Score = 39.3 bits (90), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 31/119 (26%)

Query: 64  CNCKRSGCLKNYCECYEAGA-------------RTLEPNGIR------------PRKPCN 98
           C+C+++ C + YC+C + G               T+E   +R             ++ C+
Sbjct: 470 CSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQNLRLEIQEKKQKQGQAKEGCS 529

Query: 99  CTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C  S C K YC+CF NG  C   +C C  C NN+ H     H++ Q        F+ KI
Sbjct: 530 CKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNLPHS----HSVHQMRPIVQQLFQSKI 584


>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
          Length = 275

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 11/95 (11%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE---RNPNA 150
           +PCNC++S CLK YC CF +G  C  +C C  C N     ++R  AI    +   R+ N 
Sbjct: 151 QPCNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREAAINYVFKKCNRDKNV 210

Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
              ++    +       + GC CK +GC K YCEC
Sbjct: 211 PANELLSLQI-------SYGCKCKSTGCQKKYCEC 238


>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
 gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
          Length = 421

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 11/89 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 54
           YC CFA+G  C + C C+ C+N   H   R          Q LE   ++   ++ +  + 
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 186

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
            G R+    C CK+SGCLK YC+CY+ GA
Sbjct: 187 FGRRK----CTCKKSGCLKKYCDCYQGGA 211



 Score = 43.5 bits (101), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 161
           YC CFA+G  C + C C+ C+N   H   R          Q LE   ++   ++ +  + 
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 186

Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
            G R+    C CK+SGCLK YC+CY
Sbjct: 187 FGRRK----CTCKKSGCLKKYCDCY 207


>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
 gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
          Length = 407

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 11/89 (12%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 54
           YC CFA+G  C + C C+ C+N   H   R          Q LE   ++   ++ +  + 
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 172

Query: 55  EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
            G R+    C CK+SGCLK YC+CY+ GA
Sbjct: 173 FGRRK----CTCKKSGCLKKYCDCYQGGA 197



 Score = 43.5 bits (101), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 161
           YC CFA+G  C + C C+ C+N   H   R          Q LE   ++   ++ +  + 
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 172

Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
            G R+    C CK+SGCLK YC+CY
Sbjct: 173 FGRRK----CTCKKSGCLKKYCDCY 193


>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
          Length = 710

 Score = 47.4 bits (111), Expect = 0.003,   Method: Composition-based stats.
 Identities = 38/135 (28%), Positives = 51/135 (37%), Gaps = 35/135 (25%)

Query: 46  PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCL 105
           PK  +     G+    K C+CK+S CLK YCEC+ AG    EP                 
Sbjct: 423 PKKKRHKFDNGDGTSCKRCSCKKSKCLKLYCECFHAGVFCSEP----------------- 465

Query: 106 KLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 165
                          C+C  C N   + +       Q   RNP AF PK+ +   G+ E 
Sbjct: 466 ---------------CSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQ-EL 509

Query: 166 RHTKGC--NCKRSGC 178
               GC  +C+  GC
Sbjct: 510 AGGVGCSVSCRCEGC 524



 Score = 41.2 bits (95), Expect = 0.19,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 4/74 (5%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC+CF  G FC + C+C  C N   + +       Q   RNP AF PK+ +   G+ E  
Sbjct: 452 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQ-ELA 510

Query: 60  HTKGC--NCKRSGC 71
              GC  +C+  GC
Sbjct: 511 GGVGCSVSCRCEGC 524


>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
          Length = 246

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 12/112 (10%)

Query: 80  EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED-DRH 137
           +   + L    ++ ++PCNC  S C+K YC CF +G+ C  +C C     N EH + +R 
Sbjct: 119 QKKTKILNITSLKCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERS 178

Query: 138 HAIRQCLE---RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            AI+   E   RN    +  + K  V         GC+C +S C K YCEC+
Sbjct: 179 EAIKHVNEKCYRNRKIPKEALFKLDV-------IYGCSCTKSKCRKRYCECF 223


>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
          Length = 298

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
           YC CFA G +C  +C C  CFN     +           R   +   + R P A    + 
Sbjct: 17  YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 76

Query: 50  KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
                       +GCNCK+S CLK YC+CY+ G 
Sbjct: 77  DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 110



 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 11/98 (11%)

Query: 100 TKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNP 148
           T ++C+  YC CFA G +C  +C C  CFN     +           R   +   + R P
Sbjct: 9   TITVCICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRP 68

Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            A    +             +GCNCK+S CLK YC+CY
Sbjct: 69  EANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 106


>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
          Length = 96

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 2/54 (3%)

Query: 32 AIRQCLERNPNAFRPKIGKCLVGEGERR--HTKGCNCKRSGCLKNYCECYEAGA 83
          A+     RNPNAF  K       + ++   H KGC CK+S CLK YCEC+ AG 
Sbjct: 2  AVNDAKNRNPNAFNIKFQIIEEKQDDKIVIHKKGCTCKKSNCLKKYCECFNAGV 55



 Score = 42.7 bits (99), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERR--HTKGCNCKRSGCLKNYCECY 186
           A+     RNPNAF  K       + ++   H KGC CK+S CLK YCEC+
Sbjct: 2   AVNDAKNRNPNAFNIKFQIIEEKQDDKIVIHKKGCTCKKSNCLKKYCECF 51



 Score = 36.6 bits (83), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
           +K C C KS CLK YC+CF  G  C Q C C+SC N  + E+++  
Sbjct: 33  KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKNMQKEENNQQQ 78


>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
           +PC+C K+ CL+LYC CF N   C  +C CN C+N+ +HED+   A+ Q 
Sbjct: 147 RPCHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAVEQI 196


>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
           K C+C+K+ CL+LYC CF N   C  +C CN CFN+ +HED+   A+ Q 
Sbjct: 124 KACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFNDGKHEDEVIKAVEQI 173


>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
 gi|238005872|gb|ACR33971.1| unknown [Zea mays]
          Length = 272

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 35/64 (54%), Gaps = 12/64 (18%)

Query: 32 AIRQCLE-RNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 79
          + RQ +E RNP AF PK+      G  L           RH +GCNCK+S CLK YCEC+
Sbjct: 6  STRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 65

Query: 80 EAGA 83
          + G 
Sbjct: 66 QGGV 69



 Score = 43.5 bits (101), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 12/60 (20%)

Query: 139 AIRQCLE-RNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
           + RQ +E RNP AF PK+      G  L           RH +GCNCK+S CLK YCEC+
Sbjct: 6   STRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 65


>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
          Length = 330

 Score = 46.6 bits (109), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 19/95 (20%)

Query: 93  PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
            ++ C C K+ CLK YC CF  G+FC + C C  C NN ++++     I Q         
Sbjct: 212 SKQGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE----LIEQT-------- 259

Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           R +I + L  E +      C CK+S CLK YCECY
Sbjct: 260 RSQINRNLNDEVK------CFCKKSKCLKKYCECY 288



 Score = 40.0 bits (92), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 28/105 (26%)

Query: 62  KGCNCKRSGCLKNYCECYEAG------------------------ARTLEPNGIRPRKPC 97
           +GC CK++ CLK YC+C++ G                         R+     +     C
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKELIEQTRSQINRNLNDEVKC 273

Query: 98  NCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
            C KS CLK YC+C+  G+ C   C C  C N   +E + H A R
Sbjct: 274 FCKKSKCLKKYCECYNAGKKCGVDCKCEGCMN---YEINLHEAQR 315



 Score = 38.9 bits (89), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 43/85 (50%), Gaps = 19/85 (22%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC CF  G+FC + C C  C NN ++++     I Q         R +I + L  E +  
Sbjct: 227 YCQCFQQGKFCGETCECLDCCNNEQNKE----LIEQT--------RSQINRNLNDEVK-- 272

Query: 60  HTKGCNCKRSGCLKNYCECYEAGAR 84
               C CK+S CLK YCECY AG +
Sbjct: 273 ----CFCKKSKCLKKYCECYNAGKK 293


>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
          Length = 573

 Score = 46.6 bits (109), Expect = 0.005,   Method: Composition-based stats.
 Identities = 15/23 (65%), Positives = 20/23 (86%)

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
           +H KGC CK+SGC+K YCEC++A
Sbjct: 452 KHNKGCQCKKSGCMKKYCECFQA 474



 Score = 44.3 bits (103), Expect = 0.024,   Method: Composition-based stats.
 Identities = 14/21 (66%), Positives = 18/21 (85%)

Query: 166 RHTKGCNCKRSGCLKNYCECY 186
           +H KGC CK+SGC+K YCEC+
Sbjct: 452 KHNKGCQCKKSGCMKKYCECF 472



 Score = 38.1 bits (87), Expect = 1.6,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
           K C C KS C+K YC+CF     C  QC C  C N
Sbjct: 455 KGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489


>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 381

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 45/91 (49%), Gaps = 17/91 (18%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDD---RHHAIRQCLERNPN 149
           KPC C  + CLKLYC CF  G FC    C C  C+N  E  +    R  A+ + L R  +
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETGGKRQEAVSEILLRRID 336

Query: 150 AF--RPKIGKCLVGEGERRHTKGCNCKRSGC 178
           AF  RPK          ++  +GC CK++ C
Sbjct: 337 AFESRPK----------KKTGEGCACKKNRC 357


>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C KS C+KLYC CF+   +C Q C C +C N   +ED     +     +NP AF PKI
Sbjct: 9   CRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKI 68

Query: 156 GKC 158
            + 
Sbjct: 69  VRV 71


>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
          Length = 176

 Score = 46.2 bits (108), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/24 (79%), Positives = 21/24 (87%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEF 117
           RKPCNCTKS CLKLYC+CFA G+ 
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQL 167


>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
          Length = 628

 Score = 45.8 bits (107), Expect = 0.009,   Method: Composition-based stats.
 Identities = 16/23 (69%), Positives = 21/23 (91%)

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
           +H KGC+CK+SGCLK YCEC++A
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQA 247



 Score = 43.5 bits (101), Expect = 0.040,   Method: Composition-based stats.
 Identities = 15/21 (71%), Positives = 19/21 (90%)

Query: 166 RHTKGCNCKRSGCLKNYCECY 186
           +H KGC+CK+SGCLK YCEC+
Sbjct: 225 KHNKGCHCKKSGCLKKYCECF 245



 Score = 38.9 bits (89), Expect = 1.00,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDR 136
           K C+C KS CLK YC+CF     C + C C  C  N E  ++R
Sbjct: 228 KGCHCKKSGCLKKYCECFQANILCSENCKCMDC-KNFEGSEER 269


>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 39/64 (60%), Gaps = 1/64 (1%)

Query: 93  PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
            ++ C C K+ CLKLYC CF +G+ C Q C CN C N++    +R+ AI + + +  + +
Sbjct: 18  TKEGCQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNKAIGKIIHKYKDDY 77

Query: 152 RPKI 155
           + K+
Sbjct: 78  KQKL 81


>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
          Length = 264

 Score = 44.7 bits (104), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 51/93 (54%), Gaps = 6/93 (6%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           KPC+C+K+ CL+LYC CF N   C Q C C  C+N+ +H D+   A+ Q   +   A   
Sbjct: 146 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEVLKAVEQIKIKEQRASHH 205

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
            +        + R   GC CK++ C+K YCEC+
Sbjct: 206 DLDSF-----DTRQVWGCKCKKTKCVKGYCECF 233


>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 379

 Score = 43.9 bits (102), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 7/97 (7%)

Query: 97  CNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           CNC  + C+K YC C+AN  FC    C C SC N  +   +     +  L  +  +    
Sbjct: 247 CNCQTTQCIKKYCSCYANNNFCVPSICKCQSCMNTFQQSYNSTSQKKPKLISSQKSLSQV 306

Query: 155 IGKCLVGEGERR-----HTKGCNCKRSGCLKNYCECY 186
           I   +    + +      T  C C++S C K YC+CY
Sbjct: 307 ILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCY 343



 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 7/91 (7%)

Query: 1   YCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
           YC C+AN  FC    C C SC N  +   +     +  L  +  +    I   +    + 
Sbjct: 258 YCSCYANNNFCVPSICKCQSCMNTFQQSYNSTSQKKPKLISSQKSLSQVILYQIQNINQL 317

Query: 59  R-----HTKGCNCKRSGCLKNYCECYEAGAR 84
           +      T  C C++S C K YC+CY++G +
Sbjct: 318 KIQKENATISCTCQKSFCKKQYCDCYKSGKK 348


>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 43.9 bits (102), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 43/92 (46%), Gaps = 12/92 (13%)

Query: 96  PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
           PCNC++S C K+YC+C A G  C   C C +C     H    +  +   +E        K
Sbjct: 71  PCNCSRSGCRKMYCECLAKGRLCSSFCRCQNC-----HNKTSNKLVLNVIEELDQNRNRK 125

Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
                     RR   GC C++S CLK YCEC+
Sbjct: 126 ------RFRSRRFKDGCTCQKSMCLKKYCECF 151



 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 54/139 (38%), Gaps = 38/139 (27%)

Query: 30  HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG------- 82
            H I   +  +P   + K G  L+   +      CNC RSGC K YCEC   G       
Sbjct: 44  QHTIPDNILISPEPSKKK-GSILLPSDD-----PCNCSRSGCRKMYCECLAKGRLCSSFC 97

Query: 83  ---------ARTLEPNGIRPRKP---------------CNCTKSMCLKLYCDCFANGEFC 118
                    +  L  N I                    C C KSMCLK YC+CF +G+ C
Sbjct: 98  RCQNCHNKTSNKLVLNVIEELDQNRNRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSC 157

Query: 119 YQ-CNCNSCFNNIEHEDDR 136
              C C +C N +  E ++
Sbjct: 158 GSGCKCVNCENYVLDEVEK 176


>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
          Length = 754

 Score = 43.5 bits (101), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 30/67 (44%), Positives = 37/67 (55%), Gaps = 7/67 (10%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAI----RQCLERNP 148
           K C+C+K+ CLKLYC CFAN   C +  C C+ C  N E E     AI    R  L +NP
Sbjct: 232 KGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDC-KNCEAESTEDGAILKARRSILRKNP 290

Query: 149 NAFRPKI 155
            AF+ K 
Sbjct: 291 KAFKSKF 297


>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
          Length = 244

 Score = 43.5 bits (101), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 26/110 (23%)

Query: 86  LEPNGIRPRKP---CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
           +EP+ +   KP   C C+KS C + YC+CFA G+ C   C C +C N             
Sbjct: 83  IEPSEVMSPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQN------------- 129

Query: 142 QCLERNPNAFRPKIGKCLVGEG-----ERRHTKGCNCKRSGCLKNYCECY 186
                  N    KI K LV +        R+ KGC C +S C  N+C C+
Sbjct: 130 ----MQMNKKLGKIQKRLVKQKRIKQRTPRYLKGCTCGKSKCQNNFCSCH 175


>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
          Length = 261

 Score = 43.5 bits (101), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 29/61 (47%), Positives = 36/61 (59%), Gaps = 5/61 (8%)

Query: 88  PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLER 146
           P  +R    C C KSMCLK YCDCFANG  C  QC C  C N+ E+ ++R  AI +   +
Sbjct: 85  PKSLR----CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHNSEEYMEEREEAISKLKIQ 140

Query: 147 N 147
           N
Sbjct: 141 N 141


>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
          Length = 677

 Score = 42.7 bits (99), Expect = 0.063,   Method: Composition-based stats.
 Identities = 15/17 (88%), Positives = 17/17 (100%)

Query: 92  RPRKPCNCTKSMCLKLY 108
           RPRKPCNCTKS+CLKL+
Sbjct: 520 RPRKPCNCTKSLCLKLW 536


>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
          Length = 192

 Score = 42.4 bits (98), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 35/108 (32%)

Query: 86  LEPNGIRPRK------PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHH 138
           ++ N ++P++       C C KS C K YC+CFA G  C   CNC  C N+         
Sbjct: 57  IQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVGRSCSIDCNCLGCNNS--------- 107

Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
               C +        KI K   G        GCNCK++GC+K YCECY
Sbjct: 108 ---NCSQ--------KIKKSQCG--------GCNCKKTGCVKKYCECY 136



 Score = 35.8 bits (81), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 20/111 (18%)

Query: 38  ERNPNAFRPK----IGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECYEAGAR-TLEPNGI 91
           + +PN    K    I K +V   E  +    C C +S C K+YCEC+  G   +++ N +
Sbjct: 43  DESPNILSAKKPFPIQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVGRSCSIDCNCL 102

Query: 92  ---------RPRKP----CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
                    + +K     CNC K+ C+K YC+C+   + C + CNC  C N
Sbjct: 103 GCNNSNCSQKIKKSQCGGCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCN 153


>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
          Length = 262

 Score = 42.0 bits (97), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 54/137 (39%), Gaps = 21/137 (15%)

Query: 18  SCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTK-GCNCKRSGCLKNYC 76
           S  N +         I    + +    +PK    L  + + +  K GC+C +  C   YC
Sbjct: 84  SLVNQMYESQVFSQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTKRKCSSKYC 143

Query: 77  ECYEAG-------------------ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
            C + G                   ++ ++    + +K C C K+ CLK YCDCFA G  
Sbjct: 144 FCAKVGQSCSNLCKCVDCSNNKSFMSQQIQVTNAQEQKGCKCKKNFCLKGYCDCFAKGMQ 203

Query: 118 CYQ-CNCNSCFNNIEHE 133
           C   C C SC N +  E
Sbjct: 204 CSSNCKCISCKNMLTIE 220


>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 42.0 bits (97), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 6/93 (6%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           KPC+C+K+ CL+LYC CF   + C  +C C  CFN+  H+ +   A+++   +       
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKVEMVKALQKVKLK-----EY 199

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           ++ +  +   + R   GC CK++ C K YCEC+
Sbjct: 200 RVSQSDLDSFDTRQVLGCKCKKTHCKKGYCECF 232


>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
          Length = 372

 Score = 41.2 bits (95), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)

Query: 1   YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
           YC C+A+G  C + C C  C N           +             +I + L+      
Sbjct: 69  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 128

Query: 60  HTKGCNCKRSGCLKNYCECYEAGA 83
             + C CK+SGCLK YC+CY+ GA
Sbjct: 129 IRRKCTCKKSGCLKKYCDCYQGGA 152



 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)

Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
           YC C+A+G  C + C C  C N           +             +I + L+      
Sbjct: 69  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 128

Query: 167 HTKGCNCKRSGCLKNYCECY 186
             + C CK+SGCLK YC+CY
Sbjct: 129 IRRKCTCKKSGCLKKYCDCY 148


>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 40.8 bits (94), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 13/102 (12%)

Query: 86  LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCL 144
           +E   ++P+  C C KS C + YC+CFA G+ C + C C +C N            +Q  
Sbjct: 86  IEVKDLQPQIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQN------------KQMN 133

Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           ++     +  I +  + +   R+ KGC C +S C   +C C+
Sbjct: 134 KKLEKIQKKLIKQKRIKQRTSRYLKGCTCGKSKCQNKFCSCH 175


>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
 gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
          Length = 250

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 24/30 (80%)

Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
          G+ ++   KGC+CKRS C+KNYC+CY++ A
Sbjct: 16 GQAQQPTIKGCSCKRSQCIKNYCDCYQSMA 45



 Score = 38.9 bits (89), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
           G+ ++   KGC+CKRS C+KNYC+CY
Sbjct: 16  GQAQQPTIKGCSCKRSQCIKNYCDCY 41


>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
          Length = 426

 Score = 40.8 bits (94), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 16/23 (69%), Positives = 21/23 (91%)

Query: 59  RHTKGCNCKRSGCLKNYCECYEA 81
           +H KGC+CK+SGCLK YCEC++A
Sbjct: 134 KHNKGCHCKKSGCLKKYCECFQA 156



 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 39/93 (41%), Gaps = 18/93 (19%)

Query: 94  RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
           +K CNC  S CLKL            Q       N I+         +  +E   NA   
Sbjct: 80  KKHCNCKNSQCLKL------------QEAITVRSNGIQKPS------KLKIENGSNALNL 121

Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
           +           +H KGC+CK+SGCLK YCEC+
Sbjct: 122 RKDDAGAPASLPKHNKGCHCKKSGCLKKYCECF 154


>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
 gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
          Length = 154

 Score = 40.4 bits (93), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 16/23 (69%), Positives = 21/23 (91%)

Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
          +H KGC+CK+SGCLK YCEC++A
Sbjct: 11 KHNKGCHCKKSGCLKKYCECFQA 33



 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/21 (71%), Positives = 19/21 (90%)

Query: 166 RHTKGCNCKRSGCLKNYCECY 186
           +H KGC+CK+SGCLK YCEC+
Sbjct: 11  KHNKGCHCKKSGCLKKYCECF 31


>gi|357450559|ref|XP_003595556.1| Pentatricopeptide repeat protein [Medicago truncatula]
 gi|355484604|gb|AES65807.1| Pentatricopeptide repeat protein [Medicago truncatula]
          Length = 1040

 Score = 39.3 bits (90), Expect = 0.71,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 66  CKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCN 124
           C+ +   K  C   +A  R+ E  G    + C C  S     YCD FA G +C + C C 
Sbjct: 750 CQTAPKRKRKCLTLQAQVRSTEERG--SCRNCKCKISN----YCDAFATGVYCTEPCLCL 803

Query: 125 SCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
            C N   ++D      R+    N + F PK+
Sbjct: 804 ECLNKPINDDTVLQTRRKIESCNSHVFFPKV 834


>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
          Length = 514

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 3/59 (5%)

Query: 100 TKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIG 156
           T    L  YCDC A G +C   C C++C N  E+E      IR+ +E R+P AF P+I 
Sbjct: 239 TLVFVLSSYCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAFAPRIA 296



 Score = 36.2 bits (82), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 30/51 (58%), Gaps = 3/51 (5%)

Query: 1   YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIG 49
           YCDC A G +C   C C++C N  E+E      IR+ +E R+P AF P+I 
Sbjct: 247 YCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAFAPRIA 296


>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
 gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
          Length = 459

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 62  KGCNCKRSGCLKNYCECYEAGAR 84
           KGCNCK++GCLK YCEC++   R
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKR 218



 Score = 37.0 bits (84), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 14/18 (77%), Positives = 17/18 (94%)

Query: 169 KGCNCKRSGCLKNYCECY 186
           KGCNCK++GCLK YCEC+
Sbjct: 196 KGCNCKKTGCLKRYCECF 213


>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
 gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
          Length = 240

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 15/23 (65%), Positives = 20/23 (86%)

Query: 61 TKGCNCKRSGCLKNYCECYEAGA 83
           KGC+CKRS C+KNYC+CY++ A
Sbjct: 22 VKGCSCKRSQCIKNYCDCYQSMA 44



 Score = 35.8 bits (81), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 14/19 (73%), Positives = 17/19 (89%)

Query: 168 TKGCNCKRSGCLKNYCECY 186
            KGC+CKRS C+KNYC+CY
Sbjct: 22  VKGCSCKRSQCIKNYCDCY 40


>gi|345314646|ref|XP_001520178.2| PREDICTED: protein lin-54 homolog, partial [Ornithorhynchus
           anatinus]
          Length = 145

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/16 (93%), Positives = 16/16 (100%)

Query: 92  RPRKPCNCTKSMCLKL 107
           RPRKPCNCTKS+CLKL
Sbjct: 130 RPRKPCNCTKSLCLKL 145


>gi|330945164|ref|XP_003306504.1| hypothetical protein PTT_19658 [Pyrenophora teres f. teres 0-1]
 gi|311315956|gb|EFQ85390.1| hypothetical protein PTT_19658 [Pyrenophora teres f. teres 0-1]
          Length = 1707

 Score = 37.0 bits (84), Expect = 3.8,   Method: Composition-based stats.
 Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 14/142 (9%)

Query: 26   EDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGART 85
            E +RH  +RQCLER+   ++    KC   E  + H K    +    ++N  E      + 
Sbjct: 888  EMERHKKLRQCLERDGARYK----KCF--ENAKAHLKKMAAEHRSKMQNAAEMRTCLVKD 941

Query: 86   LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
             E   I   K C+  +  CLK  CD ++  E      C    N +E   D+   +++ L 
Sbjct: 942  CE-GKINELKRCSEAEVSCLK--CD-YSEKE----TECKELSNKLEASADQLSQMKKRLA 993

Query: 146  RNPNAFRPKIGKCLVGEGERRH 167
               ++      + LVGE + +H
Sbjct: 994  GLQSSLEDMRKQQLVGETKIKH 1015


>gi|395855486|ref|XP_003800190.1| PREDICTED: usherin [Otolemur garnettii]
          Length = 5202

 Score = 36.6 bits (83), Expect = 4.8,   Method: Composition-based stats.
 Identities = 40/132 (30%), Positives = 55/132 (41%), Gaps = 22/132 (16%)

Query: 64  CNCKR--SGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC--- 118
           CNCKR  SG   N C+      + L+P+G RP   CNC  S  +     C  N   C   
Sbjct: 662 CNCKRHVSGRQCNQCQNGFYNLQELDPDGCRP---CNCNTSGTVDGDITCHQNSGQCKCK 718

Query: 119 -----YQCN-CNSCFNNIEHEDDRHHAIRQC-LERNPNAF-RPKIGKCLVGEGERRHTKG 170
                 +C+ CN  F  ++  +D      QC L  + N F  P  G+C      R+  KG
Sbjct: 719 ANVIGLRCDHCNFGFKFLQSFNDDGCEPCQCNLHGSVNKFCNPLSGQCEC----RKEAKG 774

Query: 171 CNCKRSGCLKNY 182
             C    C +N+
Sbjct: 775 LQCDT--CRENF 784


>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
 gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
          Length = 296

 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 13/20 (65%), Positives = 18/20 (90%)

Query: 62 KGCNCKRSGCLKNYCECYEA 81
          KGC CKR+ C+KNYC+CY++
Sbjct: 38 KGCACKRTSCIKNYCDCYQS 57


>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
 gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
          Length = 296

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 13/20 (65%), Positives = 18/20 (90%)

Query: 62 KGCNCKRSGCLKNYCECYEA 81
          KGC CKR+ C+KNYC+CY++
Sbjct: 38 KGCACKRTSCIKNYCDCYQS 57


>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
 gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
          Length = 140

 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 14/23 (60%), Positives = 20/23 (86%)

Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
          +H KGC+C++S CLK YCEC++A
Sbjct: 23 KHNKGCHCRKSECLKKYCECFQA 45


>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
 gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
          Length = 225

 Score = 35.8 bits (81), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 23/29 (79%)

Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
          + +++  KGC CKRS C+KNYC+CY++ A
Sbjct: 18 QPKQQAPKGCCCKRSQCIKNYCDCYQSMA 46


>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 957

 Score = 35.4 bits (80), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 12/69 (17%)

Query: 95  KPCNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNN---IEHEDDRHHAI-------RQ 142
           K C C K+ CLKLYC CF++   C    C+C  C N    +  + D   A+       + 
Sbjct: 316 KGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARDGDNGMAMSAVVTARKS 375

Query: 143 CLERNPNAF 151
            LERNP AF
Sbjct: 376 ILERNPRAF 384


>gi|145529912|ref|XP_001450739.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418361|emb|CAK83342.1| unnamed protein product [Paramecium tetraurelia]
          Length = 226

 Score = 35.4 bits (80), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 37/90 (41%), Gaps = 26/90 (28%)

Query: 97  CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
           C C KS C+K YC C AN + C ++C C +C N                         K 
Sbjct: 129 CRCRKSKCIKNYCVCSANNQECTFKCECYNCSN-------------------------KQ 163

Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
            K  V +       GCNC+R  C K YCEC
Sbjct: 164 PKSAVSKQSGSEYLGCNCRRQDCSKGYCEC 193


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.142    0.516 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,176,436,644
Number of Sequences: 23463169
Number of extensions: 132990450
Number of successful extensions: 318072
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 541
Number of HSP's successfully gapped in prelim test: 618
Number of HSP's that attempted gapping in prelim test: 311890
Number of HSP's gapped (non-prelim): 4654
length of query: 186
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 52
effective length of database: 9,215,130,721
effective search space: 479186797492
effective search space used: 479186797492
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 72 (32.3 bits)