BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy17547
(186 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
Length = 695
Score = 191 bits (486), Expect = 9e-47, Method: Composition-based stats.
Identities = 82/104 (78%), Positives = 88/104 (84%), Gaps = 3/104 (2%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
LE NGIRPRKPCNCTKS CLKLYCDCFANGEFCY CNC +CFNN+E+ED R AI+ CLE
Sbjct: 474 LESNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLE 533
Query: 146 RNPNAFRPKIGKCLVGEGE---RRHTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKIGK G+ R+HTKGCNCKRSGCLKNYCECY
Sbjct: 534 RNPNAFRPKIGKAKDVAGDSSIRKHTKGCNCKRSGCLKNYCECY 577
Score = 150 bits (380), Expect = 2e-34, Method: Composition-based stats.
Identities = 64/84 (76%), Positives = 70/84 (83%), Gaps = 3/84 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YCDCFANGEFCY CNC +CFNN+E+ED R AI+ CLERNPNAFRPKIGK G+
Sbjct: 496 YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSSI 555
Query: 58 RRHTKGCNCKRSGCLKNYCECYEA 81
R+HTKGCNCKRSGCLKNYCECYEA
Sbjct: 556 RKHTKGCNCKRSGCLKNYCECYEA 579
>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
Length = 752
Score = 189 bits (479), Expect = 6e-46, Method: Composition-based stats.
Identities = 82/104 (78%), Positives = 88/104 (84%), Gaps = 3/104 (2%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
LE NGIRPRKPCNCTKS CLKLYCDCFANGEFCY CNC +CFNN+E+ED R AI+ CLE
Sbjct: 531 LESNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLE 590
Query: 146 RNPNAFRPKIGKCLVGEGE---RRHTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKIGK G+ R+HTKGCNCKRSGCLKNYCECY
Sbjct: 591 RNPNAFRPKIGKAKDVAGDSSIRKHTKGCNCKRSGCLKNYCECY 634
Score = 148 bits (374), Expect = 8e-34, Method: Composition-based stats.
Identities = 64/84 (76%), Positives = 70/84 (83%), Gaps = 3/84 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YCDCFANGEFCY CNC +CFNN+E+ED R AI+ CLERNPNAFRPKIGK G+
Sbjct: 553 YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSSI 612
Query: 58 RRHTKGCNCKRSGCLKNYCECYEA 81
R+HTKGCNCKRSGCLKNYCECYEA
Sbjct: 613 RKHTKGCNCKRSGCLKNYCECYEA 636
>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
Length = 750
Score = 179 bits (455), Expect = 3e-43, Method: Composition-based stats.
Identities = 75/95 (78%), Positives = 85/95 (89%), Gaps = 1/95 (1%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ +KPCNCT+S CLKLYCDCFANGEFCYQCNCNSC+NN+EHE+DR AIR L+RNPNAF
Sbjct: 535 KTKKPCNCTRSQCLKLYCDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAF 594
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RPKI K VG+ ER+HTKGCNCKRSGCLKNYCEC+
Sbjct: 595 RPKI-KSFVGQAERQHTKGCNCKRSGCLKNYCECF 628
Score = 156 bits (394), Expect = 4e-36, Method: Composition-based stats.
Identities = 66/81 (81%), Positives = 73/81 (90%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFCYQCNCNSC+NN+EHE+DR AIR L+RNPNAFRPKI K VG+ ER+H
Sbjct: 551 YCDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKI-KSFVGQAERQH 609
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCEC+EA
Sbjct: 610 TKGCNCKRSGCLKNYCECFEA 630
>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
Length = 916
Score = 169 bits (429), Expect = 4e-40, Method: Composition-based stats.
Identities = 79/95 (83%), Positives = 85/95 (89%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+CFNN+EHE++R AI CLERNPNAF
Sbjct: 661 RPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAF 720
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RPKIGK G+ ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 721 RPKIGKGKEGDHERRHTKGCNCKRSGCLKNYCECY 755
Score = 137 bits (345), Expect = 2e-30, Method: Composition-based stats.
Identities = 66/81 (81%), Positives = 72/81 (88%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC+ CNCN+CFNN+EHE++R AI CLERNPNAFRPKIGK G+ ERRH
Sbjct: 677 YCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAFRPKIGKGKEGDHERRH 736
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 737 TKGCNCKRSGCLKNYCECYEA 757
>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
Length = 822
Score = 169 bits (427), Expect = 6e-40, Method: Composition-based stats.
Identities = 80/103 (77%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T+EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 593 TVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 652
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 653 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 695
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 616 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 675
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 676 HNKGCNCKRSGCLKNYCECYEA 697
>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
Length = 834
Score = 169 bits (427), Expect = 6e-40, Method: Composition-based stats.
Identities = 80/102 (78%), Positives = 88/102 (86%), Gaps = 1/102 (0%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
+EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLE
Sbjct: 606 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 665
Query: 146 RNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKIGK GE RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 RNPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 707
Score = 125 bits (314), Expect = 7e-27, Method: Composition-based stats.
Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK GE RR
Sbjct: 628 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 687
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 688 HNKGCNCKRSGCLKNYCECYEA 709
>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
Length = 804
Score = 168 bits (425), Expect = 1e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 598 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 657
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 658 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 700
Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 621 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 680
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 681 HNKGCNCKRSGCLKNYCECYEA 702
>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
Length = 802
Score = 167 bits (424), Expect = 1e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 597 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 656
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 657 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 699
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 679
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 680 HNKGCNCKRSGCLKNYCECYEA 701
>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
Length = 859
Score = 167 bits (424), Expect = 1e-39, Method: Composition-based stats.
Identities = 80/102 (78%), Positives = 88/102 (86%), Gaps = 1/102 (0%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
+EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLE
Sbjct: 631 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 690
Query: 146 RNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKIGK GE RRH KGCNCKRSGCLKNYCECY
Sbjct: 691 RNPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 732
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK GE RR
Sbjct: 653 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 712
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 713 HNKGCNCKRSGCLKNYCECYEA 734
>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
Length = 835
Score = 167 bits (424), Expect = 1e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 606 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 665
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 708
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 629 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 688
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 689 HNKGCNCKRSGCLKNYCECYEA 710
>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
Length = 835
Score = 167 bits (424), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 606 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 665
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 666 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 708
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 629 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 688
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 689 HNKGCNCKRSGCLKNYCECYEA 710
>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
Length = 829
Score = 167 bits (423), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 600 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 659
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 660 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 702
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 623 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 682
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 683 HNKGCNCKRSGCLKNYCECYEA 704
>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
Length = 829
Score = 167 bits (423), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 600 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 659
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 660 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 702
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 623 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 682
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 683 HNKGCNCKRSGCLKNYCECYEA 704
>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
Length = 827
Score = 167 bits (422), Expect = 2e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 598 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 657
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 658 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 700
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 621 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 680
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 681 HNKGCNCKRSGCLKNYCECYEA 702
>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
Length = 826
Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats.
Identities = 79/103 (76%), Positives = 89/103 (86%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
++EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CL
Sbjct: 597 SVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCL 656
Query: 145 ERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
ERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 657 ERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 699
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 679
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 680 HNKGCNCKRSGCLKNYCECYEA 701
>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
Length = 825
Score = 166 bits (421), Expect = 3e-39, Method: Composition-based stats.
Identities = 80/108 (74%), Positives = 90/108 (83%), Gaps = 1/108 (0%)
Query: 80 EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
+ G +EPNGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R A
Sbjct: 591 QRGYGNIEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRA 650
Query: 140 IRQCLERNPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
I+ CLERNPNAFRPKIGK G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 651 IKSCLERNPNAFRPKIGKGRETGDDIRRHNKGCNCKRSGCLKNYCECY 698
Score = 122 bits (307), Expect = 5e-26, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK G+ RR
Sbjct: 619 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIRR 678
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 679 HNKGCNCKRSGCLKNYCECYEA 700
>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
occidentalis]
Length = 609
Score = 166 bits (420), Expect = 4e-39, Method: Composition-based stats.
Identities = 72/93 (77%), Positives = 80/93 (86%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPCNCTKS CLKLYCDCFANGEFC CNC C+NN+EHE++R A++ CLERNPNAF P
Sbjct: 392 KKPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHP 451
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KIGK G+ ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 452 KIGKYKPGDKERRHTKGCNCKRSGCLKNYCECY 484
Score = 142 bits (359), Expect = 4e-32, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 69/81 (85%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC C+NN+EHE++R A++ CLERNPNAF PKIGK G+ ERRH
Sbjct: 406 YCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHPKIGKYKPGDKERRH 465
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 466 TKGCNCKRSGCLKNYCECYEA 486
>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
Length = 725
Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats.
Identities = 68/100 (68%), Positives = 78/100 (78%), Gaps = 1/100 (1%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
P+ ++PCNCTKS CLKLYCDCFANGE+CY CNC CFN +H+++R AIR LERN
Sbjct: 449 PDEAYKKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERN 508
Query: 148 PNAFRPKIGKC-LVGEGERRHTKGCNCKRSGCLKNYCECY 186
PNAF+PKIG +G R HTKGCNCKRSGCLKNYCECY
Sbjct: 509 PNAFKPKIGSIGSTDDGTRLHTKGCNCKRSGCLKNYCECY 548
Score = 138 bits (347), Expect = 1e-30, Method: Composition-based stats.
Identities = 57/82 (69%), Positives = 64/82 (78%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-LVGEGERR 59
YCDCFANGE+CY CNC CFN +H+++R AIR LERNPNAF+PKIG +G R
Sbjct: 469 YCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFKPKIGSIGSTDDGTRL 528
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
HTKGCNCKRSGCLKNYCECYE
Sbjct: 529 HTKGCNCKRSGCLKNYCECYEG 550
>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
Length = 818
Score = 163 bits (413), Expect = 2e-38, Method: Composition-based stats.
Identities = 78/101 (77%), Positives = 86/101 (85%), Gaps = 1/101 (0%)
Query: 87 EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLER 146
+ NGIRPRKPCNCTKS CLKLYCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLER
Sbjct: 590 DANGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLER 649
Query: 147 NPNAFRPKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
NPNAFRPKIGK GE RRH KGCNCKRSGCLKNYCECY
Sbjct: 650 NPNAFRPKIGKGRETGEDIRRHNKGCNCKRSGCLKNYCECY 690
Score = 125 bits (314), Expect = 7e-27, Method: Composition-based stats.
Identities = 62/82 (75%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC+ CNCN+C NN+ +E++R AI+ CLERNPNAFRPKIGK GE RR
Sbjct: 611 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIRR 670
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 671 HNKGCNCKRSGCLKNYCECYEA 692
>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
Length = 804
Score = 163 bits (412), Expect = 4e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 575 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 634
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 635 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 669
Score = 129 bits (325), Expect = 4e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 591 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 650
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 651 SKGCNCKRSGCLKNYCECYEA 671
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 571 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 630
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 631 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 689
Query: 133 EDDR 136
+R
Sbjct: 690 SPER 693
>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
Length = 827
Score = 162 bits (411), Expect = 4e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 598 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 657
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 658 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 692
Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 614 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 673
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 674 SKGCNCKRSGCLKNYCECYEA 694
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 594 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 653
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 654 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 712
Query: 133 EDDR 136
+R
Sbjct: 713 SPER 716
>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
Length = 709
Score = 162 bits (411), Expect = 4e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
Length = 748
Score = 162 bits (410), Expect = 6e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (323), Expect = 7e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 162 bits (410), Expect = 6e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (323), Expect = 7e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
Length = 749
Score = 162 bits (410), Expect = 6e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (323), Expect = 7e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 162 bits (410), Expect = 6e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (323), Expect = 7e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
boliviensis]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 129 bits (323), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)
Query: 45 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
RP + E R K CNC +S CLK YC+C+ G LE
Sbjct: 506 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 565
Query: 88 ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
P +P+ K CNC +S CLK YC+C+ C C
Sbjct: 566 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 625
Query: 123 CNSCFNNIEHEDDR 136
C C N E +R
Sbjct: 626 CVGC-KNFEESPER 638
>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
gorilla]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)
Query: 45 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
RP + E R K CNC +S CLK YC+C+ G LE
Sbjct: 506 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 565
Query: 88 ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
P +P+ K CNC +S CLK YC+C+ C C
Sbjct: 566 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 625
Query: 123 CNSCFNNIEHEDDR 136
C C N E +R
Sbjct: 626 CVGC-KNFEESPER 638
>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
domain-containing protein 1
gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
Length = 749
Score = 162 bits (409), Expect = 7e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
familiaris]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
carolinensis]
Length = 749
Score = 162 bits (409), Expect = 8e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 516 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
Length = 749
Score = 161 bits (408), Expect = 9e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
Length = 748
Score = 161 bits (408), Expect = 9e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 519 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 578
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 579 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 613
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 535 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 594
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 595 SKGCNCKRSGCLKNYCECYEA 615
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 575 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 633
Query: 133 EDDR 136
+R
Sbjct: 634 SPER 637
>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
Length = 748
Score = 161 bits (408), Expect = 9e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 519 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 578
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 579 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 613
Score = 128 bits (322), Expect = 9e-28, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 535 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 594
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 595 SKGCNCKRSGCLKNYCECYEA 615
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 575 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 633
Query: 133 EDDR 136
+R
Sbjct: 634 SPER 637
>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
Length = 749
Score = 161 bits (408), Expect = 9e-38, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
Length = 749
Score = 161 bits (408), Expect = 1e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 580 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 614
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 536 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 595
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 596 SKGCNCKRSGCLKNYCECYEA 616
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 576 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 634
Query: 133 EDDR 136
+R
Sbjct: 635 SPER 638
>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
Length = 751
Score = 161 bits (408), Expect = 1e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 522 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 581
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 582 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 616
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 538 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 597
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 598 SKGCNCKRSGCLKNYCECYEA 618
Score = 53.1 bits (126), Expect = 5e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 518 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 577
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 578 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 636
Query: 133 EDDR 136
+R
Sbjct: 637 SPER 640
>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
Length = 757
Score = 161 bits (408), Expect = 1e-37, Method: Composition-based stats.
Identities = 69/103 (66%), Positives = 81/103 (78%), Gaps = 1/103 (0%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T+ P+ ++PCNCTKS CLKLYCDCFANGEFCY CNC C+NN+++E++R AIR L
Sbjct: 480 TIFPDEAYKKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATL 539
Query: 145 ERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
ERNP+AF+PKIG E R HTKGCNCKRSGCLKNYCECY
Sbjct: 540 ERNPSAFKPKIGAVSADEDALRLHTKGCNCKRSGCLKNYCECY 582
Score = 135 bits (341), Expect = 5e-30, Method: Composition-based stats.
Identities = 58/82 (70%), Positives = 66/82 (80%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
YCDCFANGEFCY CNC C+NN+++E++R AIR LERNP+AF+PKIG E R
Sbjct: 503 YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFKPKIGAVSADEDALRL 562
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
HTKGCNCKRSGCLKNYCECYEA
Sbjct: 563 HTKGCNCKRSGCLKNYCECYEA 584
>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
Length = 793
Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 564 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 623
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 624 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 658
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 580 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 639
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 640 SKGCNCKRSGCLKNYCECYEA 660
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 620 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 678
Query: 133 EDDR 136
+R
Sbjct: 679 SPER 682
>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
Length = 752
Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 523 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 582
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 583 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 617
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 539 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 598
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 599 SKGCNCKRSGCLKNYCECYEA 619
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 519 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 578
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 579 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 637
Query: 133 EDDR 136
+R
Sbjct: 638 SPER 641
>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
Length = 756
Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 527 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 586
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 587 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 621
Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 543 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 602
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 603 SKGCNCKRSGCLKNYCECYEA 623
Score = 53.1 bits (126), Expect = 5e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 582
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 583 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 641
Query: 133 EDDR 136
+R
Sbjct: 642 SPER 645
>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
Length = 741
Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 512 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 571
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 572 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 606
Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 528 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 587
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 588 SKGCNCKRSGCLKNYCECYEA 608
Score = 54.3 bits (129), Expect = 2e-05, Method: Composition-based stats.
Identities = 38/137 (27%), Positives = 49/137 (35%), Gaps = 43/137 (31%)
Query: 42 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--- 87
N R + + E R K CNC +S CLK YC+C+ G LE
Sbjct: 495 NQPRGPLNGIISSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHEN 554
Query: 88 -------------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ 120
P +P+ K CNC +S CLK YC+C+ C
Sbjct: 555 ERQKAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSS 614
Query: 121 -CNCNSCFNNIEHEDDR 136
C C C N E +R
Sbjct: 615 ICKCIGC-KNFEESPER 630
>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
Length = 757
Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 528 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 587
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 588 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 622
Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 544 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 603
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 604 SKGCNCKRSGCLKNYCECYEA 624
Score = 52.4 bits (124), Expect = 7e-05, Method: Composition-based stats.
Identities = 34/124 (27%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC +S CLK YC+C+ G
Sbjct: 524 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 583
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 584 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 642
Query: 133 EDDR 136
+R
Sbjct: 643 SPER 646
>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
Length = 763
Score = 160 bits (405), Expect = 2e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 534 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 593
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 594 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 628
Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 550 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 609
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 610 SKGCNCKRSGCLKNYCECYEA 630
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 530 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 589
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 590 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 648
Query: 133 EDDR 136
+R
Sbjct: 649 SPER 652
>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
Length = 813
Score = 160 bits (404), Expect = 3e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 584 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 643
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 644 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 678
Score = 127 bits (318), Expect = 3e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 600 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 659
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 660 SKGCNCKRSGCLKNYCECYEA 680
Score = 53.5 bits (127), Expect = 4e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 580 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 639
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 640 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 698
Query: 133 EDDR 136
+R
Sbjct: 699 SPER 702
>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
boliviensis]
Length = 660
Score = 159 bits (402), Expect = 4e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (317), Expect = 3e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)
Query: 45 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
RP + E R K CNC +S CLK YC+C+ G LE
Sbjct: 417 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 476
Query: 88 ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
P +P+ K CNC +S CLK YC+C+ C C
Sbjct: 477 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 536
Query: 123 CNSCFNNIEHEDDR 136
C C N E +R
Sbjct: 537 CVGC-KNFEESPER 549
>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Saccoglossus kowalevskii]
Length = 946
Score = 159 bits (402), Expect = 5e-37, Method: Composition-based stats.
Identities = 77/105 (73%), Positives = 85/105 (80%)
Query: 82 GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
G+ +E G RPRKPCNCTKS CLKLYCDCFANGEFC CNC +CFNN+EHE +R AI+
Sbjct: 490 GSVQVEAPGTRPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIK 549
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
CLERNP AF PKIGK G+ +RRH KGCNCKRSGCLKNYCECY
Sbjct: 550 SCLERNPLAFHPKIGKGKEGQADRRHNKGCNCKRSGCLKNYCECY 594
Score = 122 bits (305), Expect = 8e-26, Method: Composition-based stats.
Identities = 61/81 (75%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +CFNN+EHE +R AI+ CLERNP AF PKIGK G+ +RRH
Sbjct: 516 YCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAFHPKIGKGKEGQADRRH 575
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 576 NKGCNCKRSGCLKNYCECYEA 596
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/120 (30%), Positives = 43/120 (35%), Gaps = 43/120 (35%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEAGART-------------------------LEPNGI-- 91
R K CNC +S CLK YC+C+ G LE N +
Sbjct: 500 RPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAF 559
Query: 92 --------------RPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDR 136
R K CNC +S CLK YC+C+ C C C C N E DR
Sbjct: 560 HPKIGKGKEGQADRRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEESPDR 618
>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
gorilla]
gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
Length = 660
Score = 159 bits (401), Expect = 6e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
Length = 660
Score = 159 bits (401), Expect = 6e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
Length = 660
Score = 159 bits (401), Expect = 6e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 55.8 bits (133), Expect = 7e-06, Method: Composition-based stats.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 43/134 (32%)
Query: 45 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE------ 87
RP + E R K CNC +S CLK YC+C+ G LE
Sbjct: 417 RPPFNGIIPSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQ 476
Query: 88 ----------PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CN 122
P +P+ K CNC +S CLK YC+C+ C C
Sbjct: 477 KAIKACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICK 536
Query: 123 CNSCFNNIEHEDDR 136
C C N E +R
Sbjct: 537 CVGC-KNFEESPER 549
>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
Length = 660
Score = 159 bits (401), Expect = 6e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 4e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
Length = 660
Score = 159 bits (401), Expect = 7e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 5e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
familiaris]
Length = 660
Score = 159 bits (401), Expect = 7e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 5e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
Length = 756
Score = 159 bits (401), Expect = 7e-37, Method: Composition-based stats.
Identities = 75/95 (78%), Positives = 85/95 (89%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNCN+CFNN+EHE++R AI+ CL+RNP AF
Sbjct: 527 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAF 586
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 587 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 621
Score = 125 bits (314), Expect = 8e-27, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 71/81 (87%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 543 YCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 602
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 603 SKGCNCKRSGCLKNYCECYEA 623
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 582
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 583 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 641
Query: 133 EDDR 136
+R
Sbjct: 642 SPER 645
>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
Length = 660
Score = 159 bits (401), Expect = 7e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 431 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 490
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 491 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 525
Score = 126 bits (316), Expect = 5e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 447 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 506
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 507 SKGCNCKRSGCLKNYCECYEA 527
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 487 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 545
Query: 133 EDDR 136
+R
Sbjct: 546 SPER 549
>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
Length = 659
Score = 159 bits (401), Expect = 7e-37, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 430 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 489
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 490 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 524
Score = 126 bits (316), Expect = 5e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 446 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 505
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 506 SKGCNCKRSGCLKNYCECYEA 526
Score = 52.8 bits (125), Expect = 6e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 426 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 485
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 486 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 544
Query: 133 EDDR 136
+R
Sbjct: 545 SPER 548
>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
Length = 667
Score = 157 bits (398), Expect = 1e-36, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF
Sbjct: 438 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAF 497
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 498 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 532
Score = 125 bits (314), Expect = 8e-27, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN++HE+DR AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 454 YCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 513
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 514 SKGCNCKRSGCLKNYCECYEA 534
>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
Length = 921
Score = 156 bits (394), Expect = 4e-36, Method: Composition-based stats.
Identities = 65/97 (67%), Positives = 74/97 (76%), Gaps = 1/97 (1%)
Query: 90 GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
G RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+
Sbjct: 703 GASRRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPS 762
Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF+PKI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 763 AFKPKITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 798
Score = 134 bits (336), Expect = 2e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 721 YCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 779
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 780 NKGCNCKRSGCLKNYCECYEA 800
>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
Length = 821
Score = 156 bits (394), Expect = 4e-36, Method: Composition-based stats.
Identities = 75/95 (78%), Positives = 83/95 (87%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS CLKLYCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF
Sbjct: 592 RPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 651
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 652 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 686
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 608 YCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 667
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 668 SKGCNCKRSGCLKNYCECYEA 688
Score = 55.5 bits (132), Expect = 9e-06, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 45/125 (36%), Gaps = 43/125 (34%)
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 587 AETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDR 646
Query: 88 -PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 647 NPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFE 705
Query: 132 HEDDR 136
+R
Sbjct: 706 ESPER 710
>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
Length = 807
Score = 155 bits (393), Expect = 5e-36, Method: Composition-based stats.
Identities = 75/95 (78%), Positives = 83/95 (87%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS CLKLYCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF
Sbjct: 578 RPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 637
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 638 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 672
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 594 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 653
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 654 SKGCNCKRSGCLKNYCECYEA 674
Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 574 ETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 633
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 634 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 692
Query: 133 EDDR 136
+R
Sbjct: 693 SPER 696
>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
Length = 775
Score = 155 bits (393), Expect = 6e-36, Method: Composition-based stats.
Identities = 75/95 (78%), Positives = 83/95 (87%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS CLKLYCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF
Sbjct: 546 RPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAF 605
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 606 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 640
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 562 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 621
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 622 SKGCNCKRSGCLKNYCECYEA 642
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 45/125 (36%), Gaps = 43/125 (34%)
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE--------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 541 AETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDR 600
Query: 88 -PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 601 NPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFE 659
Query: 132 HEDDR 136
+R
Sbjct: 660 ESPER 664
>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
Length = 621
Score = 155 bits (392), Expect = 7e-36, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 392 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 451
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 452 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 486
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 408 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 467
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 468 SKGCNCKRSGCLKNYCECYEA 488
Score = 50.8 bits (120), Expect = 2e-04, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 388 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 447
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 448 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 506
Query: 133 EDDR 136
+R
Sbjct: 507 SPER 510
>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
Length = 978
Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 765 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 824
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 825 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 856
Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 779 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 837
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 838 NKGCNCKRSGCLKNYCECYEA 858
>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
Length = 950
Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829
Score = 133 bits (334), Expect = 3e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831
>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
Length = 705
Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats.
Identities = 74/105 (70%), Positives = 85/105 (80%)
Query: 82 GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
G T + +G RPRKPCNCTKS CLKLYCDCFANGEFC CNCN+C NN++HED+R A++
Sbjct: 470 GMLTGDGSGSRPRKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVK 529
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
CL+RNP+AF PKIGK + RRH KGCNCKRSGCLKNYCECY
Sbjct: 530 ACLDRNPHAFHPKIGKGHGSQTNRRHNKGCNCKRSGCLKNYCECY 574
Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats.
Identities = 58/81 (71%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+C NN++HED+R A++ CL+RNP+AF PKIGK + RRH
Sbjct: 496 YCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHPKIGKGHGSQTNRRH 555
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 556 NKGCNCKRSGCLKNYCECYEA 576
Score = 52.4 bits (124), Expect = 7e-05, Method: Composition-based stats.
Identities = 38/128 (29%), Positives = 48/128 (37%), Gaps = 44/128 (34%)
Query: 52 LVGEGE-RRHTKGCNCKRSGCLKNYCECYEAGA-------------------RT------ 85
L G+G R K CNC +S CLK YC+C+ G RT
Sbjct: 472 LTGDGSGSRPRKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKAC 531
Query: 86 --LEPNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
P+ P+ K CNC +S CLK YC+C+ C C C C
Sbjct: 532 LDRNPHAFHPKIGKGHGSQTNRRHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGC-K 590
Query: 129 NIEHEDDR 136
N E +R
Sbjct: 591 NFEESPER 598
>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
Length = 980
Score = 154 bits (390), Expect = 1e-35, Method: Composition-based stats.
Identities = 67/101 (66%), Positives = 75/101 (74%), Gaps = 1/101 (0%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
L+ G RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E R AIR CLE
Sbjct: 758 LDALGGSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLE 817
Query: 146 RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RNP+AF+PKI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 818 RNPSAFKPKITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 857
Score = 133 bits (334), Expect = 4e-29, Method: Composition-based stats.
Identities = 56/81 (69%), Positives = 62/81 (76%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E R AIR CLERNP+AF+PKI G+ R H
Sbjct: 780 YCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFKPKITAPNSGD-MRLH 838
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 839 NKGCNCKRSGCLKNYCECYEA 859
>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 560
Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 331 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 390
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 391 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 425
Score = 122 bits (307), Expect = 5e-26, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 347 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 406
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 407 SKGCNCKRSGCLKNYCECYEA 427
Score = 50.4 bits (119), Expect = 3e-04, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 45/124 (36%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG-----------ARTLE---------------- 87
E R K CNC +S CLK YC+C+ G LE
Sbjct: 327 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 386
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC +S CLK YC+C+ C C C C N E
Sbjct: 387 PEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNFEE 445
Query: 133 EDDR 136
+R
Sbjct: 446 SPER 449
>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 995
Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats.
Identities = 76/103 (73%), Positives = 86/103 (83%)
Query: 84 RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
R LE NGIR +KPCNCT+S CLKLYC+CFANGEFC+QCNCN+C+NNIEHE+ R +I+ C
Sbjct: 771 RVLEANGIRMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSC 830
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
LERNP AFRPKI E ERRH KGC+CKRSGCLKNYCECY
Sbjct: 831 LERNPCAFRPKIQIGETEEDERRHNKGCHCKRSGCLKNYCECY 873
Score = 119 bits (297), Expect = 8e-25, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 68/81 (83%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFANGEFC+QCNCN+C+NNIEHE+ R +I+ CLERNP AFRPKI E ERRH
Sbjct: 795 YCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSCLERNPCAFRPKIQIGETEEDERRH 854
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGC+CKRSGCLKNYCECYEA
Sbjct: 855 NKGCHCKRSGCLKNYCECYEA 875
>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
Length = 944
Score = 154 bits (388), Expect = 2e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 732 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSAFQP 791
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 792 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 823
Score = 132 bits (333), Expect = 5e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 746 YCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSAFQPKITAPNSGD-MRLH 804
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 805 NKGCNCKRSGCLKNYCECYEA 825
>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
Length = 952
Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 740 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 799
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 800 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 831
Score = 132 bits (332), Expect = 7e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 754 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 812
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 813 NKGCNCKRSGCLKNYCECYEA 833
>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
protein of 120 kDa
gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
Length = 950
Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829
Score = 132 bits (332), Expect = 7e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831
>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
Length = 949
Score = 153 bits (386), Expect = 4e-35, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 737 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 796
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 797 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 828
Score = 132 bits (331), Expect = 9e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 751 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 809
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 810 NKGCNCKRSGCLKNYCECYEA 830
>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
Length = 950
Score = 152 bits (384), Expect = 6e-35, Method: Composition-based stats.
Identities = 63/93 (67%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC++S CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKP 797
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829
Score = 132 bits (331), Expect = 7e-29, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831
>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
Length = 771
Score = 152 bits (383), Expect = 7e-35, Method: Composition-based stats.
Identities = 75/105 (71%), Positives = 86/105 (81%), Gaps = 1/105 (0%)
Query: 82 GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
G T E N RPRKPCNCT+S CLKLYCDCFANGEFC CNC +CFNN++HE +R AI+
Sbjct: 534 GLSTSEANS-RPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIK 592
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
CL+RNP AF+PKIGK GE +RRH+KGCNCK+SGCLKNYCECY
Sbjct: 593 ACLDRNPVAFKPKIGKGKEGESDRRHSKGCNCKKSGCLKNYCECY 637
Score = 119 bits (298), Expect = 6e-25, Method: Composition-based stats.
Identities = 59/81 (72%), Positives = 69/81 (85%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +CFNN++HE +R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 559 YCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRRH 618
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCK+SGCLKNYCECYEA
Sbjct: 619 SKGCNCKKSGCLKNYCECYEA 639
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 36/124 (29%), Positives = 43/124 (34%), Gaps = 43/124 (34%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTL---------------------------E 87
E R K CNC RS CLK YC+C+ G
Sbjct: 539 EANSRPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRN 598
Query: 88 PNGIRPR--------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEH 132
P +P+ K CNC KS CLK YC+C+ C C C C N E
Sbjct: 599 PVAFKPKIGKGKEGESDRRHSKGCNCKKSGCLKNYCECYEAKIMCSSICKCMGC-KNFEE 657
Query: 133 EDDR 136
+R
Sbjct: 658 SPER 661
>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
Length = 879
Score = 152 bits (383), Expect = 8e-35, Method: Composition-based stats.
Identities = 65/93 (69%), Positives = 72/93 (77%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E R AIR CLERNP+AF+P
Sbjct: 664 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFKP 723
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 724 KITAPNSGD-IRLHNKGCNCKRSGCLKNYCECY 755
Score = 130 bits (328), Expect = 2e-28, Method: Composition-based stats.
Identities = 56/81 (69%), Positives = 62/81 (76%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E R AIR CLERNP+AF+PKI G+ R H
Sbjct: 678 YCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFKPKITAPNSGD-IRLH 736
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 737 NKGCNCKRSGCLKNYCECYEA 757
>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
Length = 950
Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats.
Identities = 63/93 (67%), Positives = 72/93 (77%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C C NN+++E +R AIR CL+RNP+AF+P
Sbjct: 738 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKP 797
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 798 KITAPNSGD-MRLHNKGCNCKRSGCLKNYCECY 829
Score = 129 bits (324), Expect = 6e-28, Method: Composition-based stats.
Identities = 54/81 (66%), Positives = 62/81 (76%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C C NN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 752 YCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSGD-MRLH 810
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 811 NKGCNCKRSGCLKNYCECYEA 831
>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
Length = 209
Score = 149 bits (375), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 76/104 (73%), Positives = 85/104 (81%), Gaps = 5/104 (4%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
LE NG++PRKPCNCTKS CLKLYCDCFANGEFC+ CNC C NN+EHE+ R AIR CL
Sbjct: 13 VLESNGLKPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCL 72
Query: 145 ERNPNAFRPKIGKCLVGEG--ERRHTKGCNCKRSGCLKNYCECY 186
+RNP+AF+PKIG VG G RRH KGC+CKRSGCLKNYCECY
Sbjct: 73 DRNPHAFKPKIG---VGWGPEPRRHNKGCHCKRSGCLKNYCECY 113
Score = 115 bits (288), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 60/83 (72%), Positives = 67/83 (80%), Gaps = 5/83 (6%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
YCDCFANGEFC+ CNC C NN+EHE+ R AIR CL+RNP+AF+PKIG VG G R
Sbjct: 36 YCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCLDRNPHAFKPKIG---VGWGPEPR 92
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CKRSGCLKNYCECYEA
Sbjct: 93 RHNKGCHCKRSGCLKNYCECYEA 115
>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
Length = 605
Score = 148 bits (374), Expect = 8e-34, Method: Composition-based stats.
Identities = 67/93 (72%), Positives = 74/93 (79%), Gaps = 2/93 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKSMCLKLYC+CFANG FC CNC +C NN+E + DR AI+ CLERNP AFRP
Sbjct: 377 RKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAFRP 436
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG+ + R H KGCNCKRSGCLKNYCECY
Sbjct: 437 KIGRGR--DANRTHQKGCNCKRSGCLKNYCECY 467
Score = 121 bits (304), Expect = 1e-25, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 62/81 (76%), Gaps = 2/81 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFANG FC CNC +C NN+E + DR AI+ CLERNP AFRPKIG+ + R H
Sbjct: 391 YCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAFRPKIGRGR--DANRTH 448
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 449 QKGCNCKRSGCLKNYCECYEA 469
>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
boliviensis]
Length = 528
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
Length = 528
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
Length = 528
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 521
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 292 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 351
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 352 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 386
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 308 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 367
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 368 SKGCNCKRSGCLKNYCECYEA 388
>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
gorilla]
Length = 528
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
Length = 527
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 298 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 357
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 358 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 392
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 314 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 373
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 374 SKGCNCKRSGCLKNYCECYEA 394
>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
Length = 528
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
familiaris]
gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
Length = 528
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
Length = 528
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
Length = 515
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 286 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 345
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 346 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 380
Score = 119 bits (298), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 302 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 361
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 362 SKGCNCKRSGCLKNYCECYEA 382
>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
Length = 528
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 119 bits (298), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
magnipapillata]
Length = 530
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 70/94 (74%), Positives = 78/94 (82%), Gaps = 1/94 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPCNCTKS CLKLYC+CFANGEFC CNC CFNNI HE +R AI+ CL+RNP AF P
Sbjct: 318 KKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHP 377
Query: 154 KIGKCLV-GEGERRHTKGCNCKRSGCLKNYCECY 186
KIGK + G+ ERRHTKGCNC+RSGCLKNYCECY
Sbjct: 378 KIGKGKIKGDTERRHTKGCNCRRSGCLKNYCECY 411
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/82 (73%), Positives = 67/82 (81%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV-GEGERR 59
YC+CFANGEFC CNC CFNNI HE +R AI+ CL+RNP AF PKIGK + G+ ERR
Sbjct: 332 YCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHPKIGKGKIKGDTERR 391
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
HTKGCNC+RSGCLKNYCECYEA
Sbjct: 392 HTKGCNCRRSGCLKNYCECYEA 413
>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
Length = 494
Score = 147 bits (370), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 265 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 324
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 325 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 359
Score = 118 bits (296), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 281 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 340
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 341 SKGCNCKRSGCLKNYCECYEA 361
>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
Length = 495
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 266 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 325
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 326 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 360
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 282 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 341
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 342 SKGCNCKRSGCLKNYCECYEA 362
>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
Length = 614
Score = 146 bits (368), Expect = 4e-33, Method: Composition-based stats.
Identities = 75/107 (70%), Positives = 85/107 (79%), Gaps = 1/107 (0%)
Query: 81 AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
AG+ + RPRK CNCTKS CLKLYCDCFANGEFC CNC +CFNN+EHE++R +I
Sbjct: 380 AGSAAEQNAAARPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSI 439
Query: 141 RQCLERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
+ CLERNP AF PKIGK G+G+ RRH KGCNCKRSGCLKNYCECY
Sbjct: 440 KTCLERNPQAFHPKIGKSKAGQGDIRRHNKGCNCKRSGCLKNYCECY 486
Score = 117 bits (292), Expect = 3e-24, Method: Composition-based stats.
Identities = 61/82 (74%), Positives = 69/82 (84%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
YCDCFANGEFC CNC +CFNN+EHE++R +I+ CLERNP AF PKIGK G+G+ RR
Sbjct: 407 YCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAFHPKIGKSKAGQGDIRR 466
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCKRSGCLKNYCECYEA
Sbjct: 467 HNKGCNCKRSGCLKNYCECYEA 488
Score = 53.1 bits (126), Expect = 4e-05, Method: Composition-based stats.
Identities = 35/121 (28%), Positives = 46/121 (38%), Gaps = 44/121 (36%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEAG---------------------ARTLE------PNGI 91
R K CNC +S CLK YC+C+ G +R+++ P
Sbjct: 391 RPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAF 450
Query: 92 RPR---------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDD 135
P+ K CNC +S CLK YC+C+ C C C C N E D
Sbjct: 451 HPKIGKSKAGQGDIRRHNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVGC-KNFEESPD 509
Query: 136 R 136
R
Sbjct: 510 R 510
>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
Length = 419
Score = 145 bits (367), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 190 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 249
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 250 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 284
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 206 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 265
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 266 SKGCNCKRSGCLKNYCECYEA 286
>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
Length = 438
Score = 145 bits (367), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 209 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 268
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 269 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 303
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 225 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 284
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 285 SKGCNCKRSGCLKNYCECYEA 305
>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
Length = 419
Score = 145 bits (366), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 190 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 249
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 250 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 284
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 206 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 265
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 266 SKGCNCKRSGCLKNYCECYEA 286
>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
Length = 348
Score = 144 bits (362), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/95 (76%), Positives = 84/95 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 119 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 178
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 179 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 213
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 194
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 195 SKGCNCKRSGCLKNYCECYEA 215
>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
Length = 528
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 72/95 (75%), Positives = 83/95 (87%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDC ANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 359 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 393
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/81 (72%), Positives = 69/81 (85%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDC ANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 315 YCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 375 SKGCNCKRSGCLKNYCECYEA 395
>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
Length = 844
Score = 142 bits (357), Expect = 7e-32, Method: Composition-based stats.
Identities = 79/109 (72%), Positives = 86/109 (78%), Gaps = 2/109 (1%)
Query: 80 EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
E T E G+RPRK CNCTKS CLKLYCDCFANGEFC +CNCN+C NN+E+E+ R A
Sbjct: 614 ETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNLENEELRQKA 673
Query: 140 IRQCLERNPNAFRPKIGKCLVGEGE--RRHTKGCNCKRSGCLKNYCECY 186
IR CL+RNPNAFRPKIGK G E RRH KGCNCKRSGCLKNYCECY
Sbjct: 674 IRGCLDRNPNAFRPKIGKSKAGGPEIIRRHNKGCNCKRSGCLKNYCECY 722
Score = 108 bits (271), Expect = 7e-22, Method: Composition-based stats.
Identities = 73/152 (48%), Positives = 87/152 (57%), Gaps = 40/152 (26%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--R 58
YCDCFANGEFC +CNCN+C NN+E+E+ R AIR CL+RNPNAFRPKIGK G E R
Sbjct: 642 YCDCFANGEFCNRCNCNNCHNNLENEELRQKAIRGCLDRNPNAFRPKIGKSKAGGPEIIR 701
Query: 59 RHTKGCNCKRSGCLKNYCECYEA-----------GARTLE-----------PNGIRP--- 93
RH KGCNCKRSGCLKNYCECYEA G R +E P ++P
Sbjct: 702 RHNKGCNCKRSGCLKNYCECYEAKIACSSICKCVGCRNVEETLERGRRRDAPRALQPLAG 761
Query: 94 -------------RKPCNCTKSMCLKLYCDCF 112
++PC+ S ++ C C
Sbjct: 762 PPTYRPHVALAHAKQPCSFMTSEVIEAVCQCL 793
>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
Length = 407
Score = 142 bits (357), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 80/147 (54%), Positives = 90/147 (61%), Gaps = 17/147 (11%)
Query: 45 RPKIGKCLVGEGERRHTKGCNCKRSGC----LKNYCECYEAGARTLEPNGIRPRKPCNCT 100
RP K VG E+R K N R +K + Y G R KPCNCT
Sbjct: 106 RPPKSKSEVGGIEQR--KAFNVMREQVNQVRMKTKKKVYAPGQR----------KPCNCT 153
Query: 101 KSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL- 159
KS CLKLYCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+PKIG
Sbjct: 154 KSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARG 213
Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
+ ER H KGC+CK+SGCLKNYCECY
Sbjct: 214 TADIERLHQKGCHCKKSGCLKNYCECY 240
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/82 (68%), Positives = 63/82 (76%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGERR 59
YCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+PKIG + ER
Sbjct: 161 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGTADIERL 220
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLKNYCECYEA
Sbjct: 221 HQKGCHCKKSGCLKNYCECYEA 242
>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
Length = 458
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/105 (75%), Positives = 85/105 (80%)
Query: 82 GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
G E G RPRKPCNCTKS CLKLYCDCFANGEFC CNCN+CFNN+EHE +R +I+
Sbjct: 223 GTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIK 282
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
CLERNP AF PKIGK GEG+RRH KGCNCKRSGCLKNYCECY
Sbjct: 283 ACLERNPYAFHPKIGKGKEGEGDRRHNKGCNCKRSGCLKNYCECY 327
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/81 (77%), Positives = 69/81 (85%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE +R +I+ CLERNP AF PKIGK GEG+RRH
Sbjct: 249 YCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIKACLERNPYAFHPKIGKGKEGEGDRRH 308
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 309 NKGCNCKRSGCLKNYCECYEA 329
>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
Length = 360
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/95 (78%), Positives = 85/95 (89%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNCN+CFNN+EHE++R AI+ CL+RNP AF
Sbjct: 131 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAF 190
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 191 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 225
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 62/81 (76%), Positives = 71/81 (87%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 147 YCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 206
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 207 SKGCNCKRSGCLKNYCECYEA 227
>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
Length = 437
Score = 140 bits (353), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/148 (53%), Positives = 90/148 (60%), Gaps = 13/148 (8%)
Query: 41 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCT 100
P A RP G+ L GE + + +K + Y G R KPCNCT
Sbjct: 132 PGASRPAGGE-LTGEQRKAFIEMKEQVNQVRMKTKKKVYAPGQR----------KPCNCT 180
Query: 101 KSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV 160
KS CLKLYCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+PKIG
Sbjct: 181 KSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARG 240
Query: 161 GEG--ERRHTKGCNCKRSGCLKNYCECY 186
G ER H KGC+CK+SGCLKNYCECY
Sbjct: 241 GTADIERLHQKGCHCKKSGCLKNYCECY 268
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/83 (68%), Positives = 63/83 (75%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
YCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+PKIG G ER
Sbjct: 188 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTADIER 247
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLKNYCECYEA
Sbjct: 248 LHQKGCHCKKSGCLKNYCECYEA 270
>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
Length = 429
Score = 140 bits (353), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/95 (72%), Positives = 75/95 (78%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKS CLKLYCDCFANGEFC CNC C NNIE++ R AIRQ LERNPNAF+P
Sbjct: 168 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKP 227
Query: 154 KIGKCLVG--EGERRHTKGCNCKRSGCLKNYCECY 186
KIG G + ER H KGC+CK+SGCLKNYCECY
Sbjct: 228 KIGIARGGITDIERLHQKGCHCKKSGCLKNYCECY 262
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/83 (69%), Positives = 64/83 (77%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG--EGER 58
YCDCFANGEFC CNC C NNIE++ R AIRQ LERNPNAF+PKIG G + ER
Sbjct: 182 YCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITDIER 241
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLKNYCECYEA
Sbjct: 242 LHQKGCHCKKSGCLKNYCECYEA 264
>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
protein 54
gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
Length = 435
Score = 140 bits (353), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 68/95 (71%), Positives = 75/95 (78%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKS CLKLYCDCFANGEFC CNC C NNIE++ R AIRQ LERNPNAF+P
Sbjct: 174 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKP 233
Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG + + ER H KGC+CK+SGCLKNYCECY
Sbjct: 234 KIGIARGGITDIERLHQKGCHCKKSGCLKNYCECY 268
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/83 (68%), Positives = 64/83 (77%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
YCDCFANGEFC CNC C NNIE++ R AIRQ LERNPNAF+PKIG + + ER
Sbjct: 188 YCDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITDIER 247
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLKNYCECYEA
Sbjct: 248 LHQKGCHCKKSGCLKNYCECYEA 270
>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
Length = 587
Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 73/93 (78%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+P
Sbjct: 365 RKHCNCSKSQCLKLYCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKP 424
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ R H KGCNCKRSGCLKNYCECY
Sbjct: 425 KITAPASGD-MRLHNKGCNCKRSGCLKNYCECY 456
Score = 120 bits (302), Expect = 2e-25, Method: Composition-based stats.
Identities = 55/81 (67%), Positives = 63/81 (77%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF+PKI G+ R H
Sbjct: 379 YCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASGD-MRLH 437
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 438 NKGCNCKRSGCLKNYCECYEA 458
>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
Length = 408
Score = 139 bits (350), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 67/96 (69%), Positives = 77/96 (80%), Gaps = 2/96 (2%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
+PRKPCNC S CLKLYCDCFANGEFC CNC +C N+ +HE DR A++ CLERNPNA
Sbjct: 176 KPRKPCNCKNSQCLKLYCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNA 235
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
F+PK+G VG+ ERRH KGCNCK+S CLKNYCECY
Sbjct: 236 FKPKVGHGRVGD-ERRHIKGCNCKKSSCLKNYCECY 270
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/82 (69%), Positives = 66/82 (80%), Gaps = 2/82 (2%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANGEFC CNC +C N+ +HE DR A++ CLERNPNAF+PK+G VG+ ERR
Sbjct: 192 YCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNAFKPKVGHGRVGD-ERR 250
Query: 60 HTKGCNCKRSGCLKNYCECYEA 81
H KGCNCK+S CLKNYCECYEA
Sbjct: 251 HIKGCNCKKSSCLKNYCECYEA 272
>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
Length = 348
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 71/95 (74%), Positives = 83/95 (87%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKP NCTKS+CL+LYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 119 RPRKPRNCTKSLCLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 178
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 179 KPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 213
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/81 (74%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRRH 194
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 195 SKGCNCKRSGCLKNYCECYEA 215
>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
Length = 433
Score = 139 bits (349), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 68/95 (71%), Positives = 74/95 (77%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKS CLKLYCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+P
Sbjct: 172 RKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKP 231
Query: 154 KIGKCLVGEG--ERRHTKGCNCKRSGCLKNYCECY 186
KIG G ER H KGC+CK+SGCLKNYCECY
Sbjct: 232 KIGIARGGTADIERLHQKGCHCKKSGCLKNYCECY 266
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/83 (68%), Positives = 63/83 (75%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG--ER 58
YCDCFANGEFC CNC C NNI+++ R AIRQ LERNPNAF+PKIG G ER
Sbjct: 186 YCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTADIER 245
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLKNYCECYEA
Sbjct: 246 LHQKGCHCKKSGCLKNYCECYEA 268
>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
Length = 226
Score = 138 bits (348), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 75/95 (78%), Positives = 80/95 (84%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS CLKLYCDCFA GEFC CNC +C NNIEHE +R AI+ CLERNP+AF
Sbjct: 2 RPRKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAF 61
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
PKIGK VGE ERRH KGC+CKRSGCLKNYCECY
Sbjct: 62 HPKIGKGKVGESERRHNKGCHCKRSGCLKNYCECY 96
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/81 (76%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA GEFC CNC +C NNIEHE +R AI+ CLERNP+AF PKIGK VGE ERRH
Sbjct: 18 YCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHPKIGKGKVGESERRH 77
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGC+CKRSGCLKNYCECYEA
Sbjct: 78 NKGCHCKRSGCLKNYCECYEA 98
>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
Length = 448
Score = 138 bits (347), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/100 (67%), Positives = 78/100 (78%), Gaps = 2/100 (2%)
Query: 89 NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNP 148
+GI +KPCNCTKSMCLKLYCDCFANGEFC CNC C NN+++E +R AI+ LERNP
Sbjct: 216 SGIGIKKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNP 275
Query: 149 NAFRPKIGKCLVG--EGERRHTKGCNCKRSGCLKNYCECY 186
NAF+PKIG G + ER H KGC+CK+S CLKNYCECY
Sbjct: 276 NAFKPKIGVASRGRVDSERLHQKGCHCKKSNCLKNYCECY 315
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 54/83 (65%), Positives = 63/83 (75%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG--EGER 58
YCDCFANGEFC CNC C NN+++E +R AI+ LERNPNAF+PKIG G + ER
Sbjct: 235 YCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKPKIGVASRGRVDSER 294
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+S CLKNYCECYEA
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEA 317
>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
Length = 460
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 74/95 (77%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ ++PCNCTKS CLKLYC+CFANGEFC C C +CFNN EH +R A++ CL+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RPKIG G + HTKGCNCKRSGCLKNYCECY
Sbjct: 322 RPKIGSRKQGNVKGCHTKGCNCKRSGCLKNYCECY 356
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 62/81 (76%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFANGEFC C C +CFNN EH +R A++ CL+RNP AFRPKIG G + H
Sbjct: 278 YCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAFRPKIGSRKQGNVKGCH 337
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 338 TKGCNCKRSGCLKNYCECYEA 358
>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
Length = 452
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPC+CT+SMCLKLYCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+P
Sbjct: 221 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 280
Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG E ER H KGC+CK+S CLKNYCECY
Sbjct: 281 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 315
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
YCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+PKIG E ER
Sbjct: 235 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 294
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+S CLKNYCECYEA
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEA 317
>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
Length = 495
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPC+CT+SMCLKLYCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+P
Sbjct: 264 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 323
Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG E ER H KGC+CK+S CLKNYCECY
Sbjct: 324 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 358
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
YCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+PKIG E ER
Sbjct: 278 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 337
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+S CLKNYCECYEA
Sbjct: 338 LHQKGCHCKKSNCLKNYCECYEA 360
>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
Length = 469
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPC+CT+SMCLKLYCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+P
Sbjct: 238 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 297
Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG E ER H KGC+CK+S CLKNYCECY
Sbjct: 298 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 332
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
YCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+PKIG E ER
Sbjct: 252 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 311
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+S CLKNYCECYEA
Sbjct: 312 LHQKGCHCKKSNCLKNYCECYEA 334
>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
garnettii]
Length = 544
Score = 130 bits (326), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 60/98 (61%), Positives = 73/98 (74%)
Query: 89 NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNP 148
+ I P KPC C S+CLKL CDCFANGEF CNC S +N++EHE++R AI+ CL+RNP
Sbjct: 339 SSIWPGKPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNP 398
Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF+PK GK + + H+KGCNCK SGCLKNYCECY
Sbjct: 399 EAFKPKTGKGKEEDSDXHHSKGCNCKXSGCLKNYCECY 436
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 51/80 (63%), Positives = 62/80 (77%)
Query: 2 CDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHT 61
CDCFANGEF CNC S +N++EHE++R AI+ CL+RNP AF+PK GK + + H+
Sbjct: 359 CDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPKTGKGKEEDSDXHHS 418
Query: 62 KGCNCKRSGCLKNYCECYEA 81
KGCNCK SGCLKNYCECYEA
Sbjct: 419 KGCNCKXSGCLKNYCECYEA 438
>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
Length = 391
Score = 130 bits (326), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 73/95 (76%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+KPC+CT+SMCLKLYCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+P
Sbjct: 234 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 293
Query: 154 KIGKCL--VGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG E ER H KGC+CK+S CLKNYCECY
Sbjct: 294 KIGVATNRQVEPERLHQKGCHCKKSNCLKNYCECY 328
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/83 (66%), Positives = 61/83 (73%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGEGER 58
YCDCFANGEFC C+C C N IE+E +R AIR LERNPNAF+PKIG E ER
Sbjct: 248 YCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATNRQVEPER 307
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+S CLKNYCECYEA
Sbjct: 308 LHQKGCHCKKSNCLKNYCECYEA 330
>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 450
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/151 (51%), Positives = 91/151 (60%), Gaps = 22/151 (14%)
Query: 36 CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRK 95
L NPN +P G + G G SG L N RK
Sbjct: 105 ILTLNPNIVQPMTGPPGLTPGGVNQLGGS----SGPLTNLTNS---------------RK 145
Query: 96 PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
PCNCTKS CLKLYC+CFA G+ C CNC++C NN+ +E++R AI+ LERNP AF PKI
Sbjct: 146 PCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHPKI 205
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
G+ GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 206 GR---GEGERKHTKGCNCKRSGCLKNYCECY 233
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 66/81 (81%), Gaps = 3/81 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G+ C CNC++C NN+ +E++R AI+ LERNP AF PKIG+ GEGER+H
Sbjct: 158 YCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 214
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 215 TKGCNCKRSGCLKNYCECYEA 235
>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
Length = 764
Score = 124 bits (312), Expect = 1e-26, Method: Composition-based stats.
Identities = 60/81 (74%), Positives = 69/81 (85%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC+ CNCN+C NN++HE++R AI+ CL+RNP AF PKIGK GE +RRH
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRRH 588
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 589 NKGCNCKRSGCLKNYCECYEA 609
Score = 120 bits (302), Expect = 2e-25, Method: Composition-based stats.
Identities = 58/79 (73%), Positives = 67/79 (84%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
YCDCFANGEFC+ CNCN+C NN++HE++R AI+ CL+RNP AF PKIGK GE +RRH
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRRH 588
Query: 168 TKGCNCKRSGCLKNYCECY 186
KGCNCKRSGCLKNYCECY
Sbjct: 589 NKGCNCKRSGCLKNYCECY 607
>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
Length = 363
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/93 (72%), Positives = 77/93 (82%), Gaps = 3/93 (3%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKS CLKLYC+CFA G+ C CNCN+C NN+ +E++R AI+ LERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG+ GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 204 KIGR---GEGERKHTKGCNCKRSGCLKNYCECY 233
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 66/81 (81%), Gaps = 3/81 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G+ C CNCN+C NN+ +E++R AI+ LERNP AF PKIG+ GEGER+H
Sbjct: 158 YCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 214
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 215 TKGCNCKRSGCLKNYCECYEA 235
>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
Length = 364
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/93 (72%), Positives = 77/93 (82%), Gaps = 3/93 (3%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RKPCNCTKS CLKLYC+CFA G+ C CNCN+C NN+ +E++R AI+ LERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KIG+ GEGER+HTKGCNCKRSGCLKNYCECY
Sbjct: 205 KIGR---GEGERKHTKGCNCKRSGCLKNYCECY 234
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 66/81 (81%), Gaps = 3/81 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G+ C CNCN+C NN+ +E++R AI+ LERNP AF PKIG+ GEGER+H
Sbjct: 159 YCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHPKIGR---GEGERKH 215
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
TKGCNCKRSGCLKNYCECYEA
Sbjct: 216 TKGCNCKRSGCLKNYCECYEA 236
>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
Length = 702
Score = 124 bits (310), Expect = 2e-26, Method: Composition-based stats.
Identities = 58/95 (61%), Positives = 67/95 (70%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C+NN EHE R A+ LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212
Query: 154 KI--GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G G RH KGCNCK+SGCLK YCEC+
Sbjct: 213 KIAPGADNRRAGGARHNKGCNCKKSGCLKKYCECF 247
Score = 109 bits (272), Expect = 5e-22, Method: Composition-based stats.
Identities = 51/84 (60%), Positives = 60/84 (71%), Gaps = 2/84 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLVGEGER 58
YC+CFA+G +C CNC +C+NN EHE R A+ LERNPNAFRPKI G G
Sbjct: 167 YCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRPKIAPGADNRRAGGA 226
Query: 59 RHTKGCNCKRSGCLKNYCECYEAG 82
RH KGCNCK+SGCLK YCEC++AG
Sbjct: 227 RHNKGCNCKKSGCLKKYCECFQAG 250
Score = 55.1 bits (131), Expect = 1e-05, Method: Composition-based stats.
Identities = 39/111 (35%), Positives = 44/111 (39%), Gaps = 44/111 (39%)
Query: 62 KGCNCKRSGCLKNYCECYEAG-------------------------ARTLE--PNGIRPR 94
K CNCK S CLK YCEC+ +G LE PN RP+
Sbjct: 154 KHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRPK 213
Query: 95 ----------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
K CNC KS CLK YC+CF G C + C C C N
Sbjct: 214 IAPGADNRRAGGARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCKN 264
>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
Length = 798
Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats.
Identities = 62/81 (76%), Positives = 70/81 (86%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF+PKIGK GE +RRH
Sbjct: 590 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRRH 649
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 650 SKGCNCKRSGCLKNYCECYEA 670
Score = 120 bits (302), Expect = 2e-25, Method: Composition-based stats.
Identities = 62/84 (73%), Positives = 70/84 (83%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
M L YCDCFANGEFC CNCN+CFNN+EHE +R AI+ CL+RNP AF+PKIGK GE
Sbjct: 585 MPLNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGE 644
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+RRH+KGCNCKRSGCLKNYCECY
Sbjct: 645 SDRRHSKGCNCKRSGCLKNYCECY 668
>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
Length = 559
Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 69/106 (65%), Gaps = 11/106 (10%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ +K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAF
Sbjct: 89 KKQKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAF 148
Query: 152 RPKIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
RPKI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 149 RPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECF 194
Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 105 YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 164
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 165 DAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 196
>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
Length = 559
Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats.
Identities = 56/106 (52%), Positives = 69/106 (65%), Gaps = 11/106 (10%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ +K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAF
Sbjct: 87 KKQKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAF 146
Query: 152 RPKIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
RPKI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 147 RPKIASSPHGTRDSREEAGEVLILGKHNKGCHCKKSGCLKKYCECF 192
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 103 YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 162
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 163 EAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 194
>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
tropicalis]
Length = 183
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/97 (64%), Positives = 78/97 (80%)
Query: 90 GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
G + +KPCNCT+S CLKLYCDCFANG+ C CNC++C+NN EHE +R A++ CL RNP+
Sbjct: 7 GSKTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPD 66
Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF+PKI K +G+ + RH KGCNCKRSGC KNYCECY
Sbjct: 67 AFQPKIEKGKIGDVKPRHNKGCNCKRSGCFKNYCECY 103
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/126 (47%), Positives = 77/126 (61%), Gaps = 18/126 (14%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+ C CNC++C+NN EHE +R A++ CL RNP+AF+PKI K +G+ + RH
Sbjct: 25 YCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAFQPKIEKGKIGDVKPRH 84
Query: 61 TKGCNCKRSGCLKNYCECYEA--------------GARTLEPNGIRPRKPCNCTKSMCLK 106
KGCNCKRSGC KNYCECYEA ++L+ I P CT ++
Sbjct: 85 NKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVKSLDSGSIIP----TCTSREVIE 140
Query: 107 LYCDCF 112
C C
Sbjct: 141 ATCACL 146
>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
Length = 321
Score = 115 bits (288), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 57/95 (60%), Positives = 70/95 (73%), Gaps = 2/95 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
RK CNC S CLKLYC+CFA+G++C Q CNC C NN+E+ R A+ LERNPNAFR
Sbjct: 20 RKTCNCRNSRCLKLYCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERNPNAFR 79
Query: 153 PKIGKCL-VGEGERRHTKGCNCKRSGCLKNYCECY 186
PK+ L GEG+ +H KGC+CK+SGCLK YCEC+
Sbjct: 80 PKVLPTLNGGEGDEKHNKGCHCKKSGCLKKYCECF 114
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/84 (57%), Positives = 62/84 (73%), Gaps = 2/84 (2%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-VGEGER 58
YC+CFA+G++C Q CNC C NN+E+ R A+ LERNPNAFRPK+ L GEG+
Sbjct: 34 YCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE 93
Query: 59 RHTKGCNCKRSGCLKNYCECYEAG 82
+H KGC+CK+SGCLK YCEC++A
Sbjct: 94 KHNKGCHCKKSGCLKKYCECFQAS 117
>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
Length = 1133
Score = 115 bits (288), Expect = 8e-24, Method: Composition-based stats.
Identities = 49/93 (52%), Positives = 56/93 (60%), Gaps = 11/93 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
YC+CFA+G +C CNC CFNN EHE R A+ LERNPNAFRPKI
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
RH KGCNCK+S CLK YCEC++AG
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAG 236
Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats.
Identities = 48/96 (50%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------ 155
+M + YC+CFA+G +C CNC CFNN EHE R A+ LERNPNAFRPKI
Sbjct: 138 NMTAQSYCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQA 197
Query: 156 -----GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RH KGCNCK+S CLK YCEC+
Sbjct: 198 PAAAAAVVNNAAAPGRHLKGCNCKKSSCLKKYCECF 233
>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 586
Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats.
Identities = 43/68 (63%), Positives = 55/68 (80%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T+ P+ ++PCNCTKS CLKLYCDCFANGEFCY CNC C+NN+++E++R AIR L
Sbjct: 519 TIFPDEAYKKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATL 578
Query: 145 ERNPNAFR 152
ERNP+AF+
Sbjct: 579 ERNPSAFK 586
Score = 84.7 bits (208), Expect = 2e-14, Method: Composition-based stats.
Identities = 30/45 (66%), Positives = 38/45 (84%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 45
YCDCFANGEFCY CNC C+NN+++E++R AIR LERNP+AF+
Sbjct: 542 YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586
>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
Length = 766
Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats.
Identities = 54/98 (55%), Positives = 63/98 (64%), Gaps = 8/98 (8%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY--QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
K CNC S CLKLYC+CFA G C +C C +C N +EH +R AI L+RNPNAFR
Sbjct: 16 KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFR 75
Query: 153 PKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECY 186
PKI V R RH +GC+CKRSGCLK YCEC+
Sbjct: 76 PKIAA--VASPSRDVALRHNRGCHCKRSGCLKKYCECF 111
Score = 98.6 bits (244), Expect = 1e-18, Method: Composition-based stats.
Identities = 46/88 (52%), Positives = 56/88 (63%), Gaps = 8/88 (9%)
Query: 1 YCDCFANGEFCY--QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
YC+CFA G C +C C +C N +EH +R AI L+RNPNAFRPKI V R
Sbjct: 29 YCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFRPKIAA--VASPSR 86
Query: 59 ----RHTKGCNCKRSGCLKNYCECYEAG 82
RH +GC+CKRSGCLK YCEC++A
Sbjct: 87 DVALRHNRGCHCKRSGCLKKYCECFQAA 114
>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
Length = 1772
Score = 113 bits (282), Expect = 4e-23, Method: Composition-based stats.
Identities = 49/93 (52%), Positives = 57/93 (61%), Gaps = 6/93 (6%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
KPCNC S CLKLYCDCFA G FC + C C C N+ H DR AI+ LE+NP AFR
Sbjct: 331 KPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRA 390
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K V H GC+CK++ CLK YCEC+
Sbjct: 391 K-----VDAEATTHQNGCHCKKTKCLKKYCECF 418
Score = 95.5 bits (236), Expect = 9e-18, Method: Composition-based stats.
Identities = 42/83 (50%), Positives = 50/83 (60%), Gaps = 6/83 (7%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G FC + C C C N+ H DR AI+ LE+NP AFR K V
Sbjct: 344 YCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRAK-----VDAEATT 398
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
H GC+CK++ CLK YCEC+EAG
Sbjct: 399 HQNGCHCKKTKCLKKYCECFEAG 421
>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
Length = 316
Score = 112 bits (281), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 62/81 (76%), Positives = 69/81 (85%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFANGEFC CNC +C NNIEHE++R AI+ CLERNP+AFRPKIGK G ERRH
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEG-NERRH 196
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 197 SKGCNCKRSGCLKNYCECYEA 217
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 60/79 (75%), Positives = 67/79 (84%), Gaps = 1/79 (1%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
YC+CFANGEFC CNC +C NNIEHE++R AI+ CLERNP+AFRPKIGK G ERRH
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEG-NERRH 196
Query: 168 TKGCNCKRSGCLKNYCECY 186
+KGCNCKRSGCLKNYCECY
Sbjct: 197 SKGCNCKRSGCLKNYCECY 215
>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 638
Score = 112 bits (279), Expect = 8e-23, Method: Composition-based stats.
Identities = 50/100 (50%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 87 EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLER 146
+ + + + C+C++S CLKLYC+CFA G FC C+C C+N +HE+ R AI + + R
Sbjct: 155 QQDSVLTSRRCSCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNR 214
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+AF+ KI L E H +GCNCKRSGCLKNYCECY
Sbjct: 215 KPDAFQSKIAHSL--ENATVHARGCNCKRSGCLKNYCECY 252
Score = 99.4 bits (246), Expect = 6e-19, Method: Composition-based stats.
Identities = 46/84 (54%), Positives = 56/84 (66%), Gaps = 2/84 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G FC C+C C+N +HE+ R AI + + R P+AF+ KI L E H
Sbjct: 176 YCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSKIAHSL--ENATVH 233
Query: 61 TKGCNCKRSGCLKNYCECYEAGAR 84
+GCNCKRSGCLKNYCECYEA R
Sbjct: 234 ARGCNCKRSGCLKNYCECYEARVR 257
>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
Length = 629
Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats.
Identities = 54/115 (46%), Positives = 69/115 (60%), Gaps = 20/115 (17%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ ++ CNC S CLKLYC+CFA+G +C CNC +CFNN+++E R A+ LERNPNAF
Sbjct: 125 KKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAF 184
Query: 152 RPKIGKCLVGEGERR--------------------HTKGCNCKRSGCLKNYCECY 186
RPKI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 185 RPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECF 239
Score = 96.3 bits (238), Expect = 5e-18, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 60/101 (59%), Gaps = 20/101 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +CFNN+++E R A+ LERNPNAFRPKI G + R
Sbjct: 141 YCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPQGARDSRV 200
Query: 60 -------------------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 201 SRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECFQA 241
>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 151
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/118 (54%), Positives = 73/118 (61%), Gaps = 13/118 (11%)
Query: 82 GARTLEP-NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
AR P + RK CNC S CLKLYC+CFA+G++C CNC C NN EH +R AI
Sbjct: 17 AARASTPRDSASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNNDEHAHERQSAI 76
Query: 141 RQCLERNPNAFRPKI-----GKCLVGEG-------ERRHTKGCNCKRSGCLKNYCECY 186
Q LERNPNAFRPKI L G G E RH +GC+CKRSGCLK YCEC+
Sbjct: 77 EQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGCLKKYCECF 134
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 12/99 (12%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVGE 55
YC+CFA+G++C CNC C NN EH +R AI Q LERNPNAFRPKI L G
Sbjct: 44 YCECFASGKYCDACNCAGCKNNDEHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLDGA 103
Query: 56 G-------ERRHTKGCNCKRSGCLKNYCECYEAGARTLE 87
G E RH +GC+CKRSGCLK YCEC++A +E
Sbjct: 104 GAGDGATTEARHNRGCHCKRSGCLKKYCECFQAAIYCVE 142
>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
Length = 371
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G
Sbjct: 273 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 328
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGCNCKRSGCLKNYCECY
Sbjct: 329 AKLRHSKGCNCKRSGCLKNYCECY 352
>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
Length = 682
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/107 (54%), Positives = 69/107 (64%), Gaps = 13/107 (12%)
Query: 93 PRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
P+KP CNC SMCLKLYC+CFA G C CNC +C NN+E+E +R A+ L+RNPNA
Sbjct: 105 PKKPKQCNCKNSMCLKLYCECFAAGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNA 164
Query: 151 FRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
FRPKI G + +H KGCNCKRSGCLK YCEC+
Sbjct: 165 FRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGCLKKYCECF 211
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 57/91 (62%), Gaps = 11/91 (12%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA G C CNC +C NN+E+E +R A+ L+RNPNAFRPKI G +
Sbjct: 122 YCECFAAGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTE 181
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYE 80
+H KGCNCKRSGCLK YCEC++
Sbjct: 182 DAQEVQMLGKHNKGCNCKRSGCLKKYCECFQ 212
>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
Length = 536
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRP
Sbjct: 83 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142
Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
KI G + RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G +
Sbjct: 97 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156
Query: 58 --------RRHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188
>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
Length = 532
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRP
Sbjct: 83 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142
Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
KI G + RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G +
Sbjct: 97 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156
Query: 58 --------RRHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188
>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific
Length = 475
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G
Sbjct: 273 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 328
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGCNCKRSGCLKNYCECY
Sbjct: 329 AKLRHSKGCNCKRSGCLKNYCECY 352
>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
norvegicus]
Length = 467
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 270 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 325
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 326 SKGCNCKRSGCLKNYCECYEA 346
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G
Sbjct: 265 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 320
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGCNCKRSGCLKNYCECY
Sbjct: 321 AKLRHSKGCNCKRSGCLKNYCECY 344
>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
Length = 602
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/103 (55%), Positives = 66/103 (64%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC S CLKLYC+CFA+G +C CNC +C NN+EHE R A+ LERNPNAFRPK
Sbjct: 128 KLCNCKHSRCLKLYCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPK 187
Query: 155 IGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
I G E R H KGC+CK+SGCLK YCEC+
Sbjct: 188 IANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECF 230
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/92 (53%), Positives = 59/92 (64%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C NN+EHE R A+ LERNPNAFRPKI G E R
Sbjct: 141 YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRD 200
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 201 EIGELVMLGKHNKGCHCKKSGCLKKYCECFQA 232
>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
Length = 475
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
P+G + P T L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RN
Sbjct: 262 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 313
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P AF+PK+GK +G + RH+KGCNCKRSGCLKNYCECY
Sbjct: 314 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 352
>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 571
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRP
Sbjct: 118 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 177
Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
KI G + RH KGC+CK+SGCLK YCEC+
Sbjct: 178 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 221
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G +
Sbjct: 132 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 191
Query: 58 --------RRHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CK+SGCLK YCEC++A
Sbjct: 192 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 223
>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 578
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRP
Sbjct: 118 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 177
Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
KI G + RH KGC+CK+SGCLK YCEC+
Sbjct: 178 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 221
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G +
Sbjct: 132 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 191
Query: 58 --------RRHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CK+SGCLK YCEC++A
Sbjct: 192 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 223
>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
Length = 543
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRP
Sbjct: 83 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRP 142
Query: 154 KIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
KI G + RH KGC+CK+SGCLK YCEC+
Sbjct: 143 KIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECF 186
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE--- 57
YC+CFA+G +C CNC +CFNN+E+E R A+ LERNPNAFRPKI G +
Sbjct: 97 YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNRE 156
Query: 58 --------RRHTKGCNCKRSGCLKNYCECYEA 81
RH KGC+CK+SGCLK YCEC++A
Sbjct: 157 EVGDVVMLARHNKGCHCKKSGCLKKYCECFQA 188
>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
Length = 380
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC C+CN N+ HE +R AI+ C++RNP AF+PK+GK +G + RH
Sbjct: 183 YCDCFASGDFCTSCSCN----NLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSKLRH 238
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 239 SKGCNCKRSGCLKNYCECYEA 259
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC C+CN N+ HE +R AI+ C++RNP AF+PK+GK +G
Sbjct: 178 IALAGYCDCFASGDFCTSCSCN----NLRHELERFKAIKACIDRNPEAFQPKMGKGRLGV 233
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGCNCKRSGCLKNYCECY
Sbjct: 234 SKLRHSKGCNCKRSGCLKNYCECY 257
>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
Length = 475
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 278 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 333
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 334 SKGCNCKRSGCLKNYCECYEA 354
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
P+G + P T L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RN
Sbjct: 262 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 313
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P AF+PK+GK +G + RH+KGCNCKRSGCLKNYCECY
Sbjct: 314 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 352
>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
Length = 584
Score = 109 bits (273), Expect = 4e-22, Method: Composition-based stats.
Identities = 48/100 (48%), Positives = 65/100 (65%), Gaps = 1/100 (1%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLER 146
P G P + CNC KS CLKLYC+C+A G +C QC C +C NN+E+E R A+ L++
Sbjct: 400 PVGRPPLRSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDK 459
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
NP+AFRPKI + + H +GC C +S CL+ YCEC+
Sbjct: 460 NPDAFRPKIKQETTTNRKANHLRGCKCTKSKCLQRYCECF 499
Score = 90.5 bits (223), Expect = 2e-16, Method: Composition-based stats.
Identities = 38/83 (45%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+C+A G +C QC C +C NN+E+E R A+ L++NP+AFRPKI + +
Sbjct: 420 YCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRPKIKQETTTNRKAN 479
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
H +GC C +S CL+ YCEC++AG
Sbjct: 480 HLRGCKCTKSKCLQRYCECFQAG 502
>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
Length = 360
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 163 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 218
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 219 SKGCNCKRSGCLKNYCECYEA 239
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
P+G + P T L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RN
Sbjct: 147 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 198
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P AF+PK+GK +G + RH+KGCNCKRSGCLKNYCECY
Sbjct: 199 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 237
>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
Length = 295
Score = 108 bits (271), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 98 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 153
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 154 SKGCNCKRSGCLKNYCECYEA 174
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 69/99 (69%), Gaps = 8/99 (8%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
P+G + P T L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RN
Sbjct: 82 PSGPALQGPPKIT----LSGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRN 133
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P AF+PK+GK +G + RH+KGCNCKRSGCLKNYCECY
Sbjct: 134 PEAFQPKMGKGRLGAAKLRHSKGCNCKRSGCLKNYCECY 172
>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
Length = 191
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 64/81 (79%), Gaps = 4/81 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G + RH
Sbjct: 98 YCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLRH 153
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNCKRSGCLKNYCECYEA
Sbjct: 154 SKGCNCKRSGCLKNYCECYEA 174
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 64/84 (76%), Gaps = 4/84 (4%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCF++G+FC C+CN N+ HE +R AI+ CL+RNP AF+PK+GK +G
Sbjct: 93 IALAGYCDCFSSGDFCNSCSCN----NLRHELERFKAIKACLDRNPEAFQPKMGKGRLGA 148
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGCNCKRSGCLKNYCECY
Sbjct: 149 AKLRHSKGCNCKRSGCLKNYCECY 172
>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
Length = 503
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/81 (62%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC C+ ++C NN+ HE +R AI+ CL+RNP AF+PKIG +G+ + RH
Sbjct: 302 YCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLGDVKPRH 361
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 362 NKGCNCRRSGCLKNYCECYEA 382
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 50/84 (59%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC C+ ++C NN+ HE +R AI+ CL+RNP AF+PKIG +G+
Sbjct: 297 LTLAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLGD 356
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNC+RSGCLKNYCECY
Sbjct: 357 VKPRHNKGCNCRRSGCLKNYCECY 380
>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
Length = 601
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRP
Sbjct: 120 QKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 179
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 180 KIASSPHGTRDSREENGEGLILGKHNKGCHCKKSGCLKKYCECF 223
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 134 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 193
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 194 ENGEGLILGKHNKGCHCKKSGCLKKYCECFQA 225
>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
Length = 519
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRP
Sbjct: 112 QKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 171
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 172 KIASSPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECF 215
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 126 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 185
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 186 ETGEGLVFVKHNKGCHCKKSGCLKKYCECFQA 217
>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
[Cucumis sativus]
Length = 556
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/103 (54%), Positives = 65/103 (63%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC S CLKLYC+CFA+G +C CNC +C NN+EHE R A+ LERNPNAFRPK
Sbjct: 128 KLCNCKHSRCLKLYCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPK 187
Query: 155 IGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
G E R H KGC+CK+SGCLK YCEC+
Sbjct: 188 XANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECF 230
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/92 (52%), Positives = 58/92 (63%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C NN+EHE R A+ LERNPNAFRPK G E R
Sbjct: 141 YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRD 200
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 201 EIGELVMLGKHNKGCHCKKSGCLKKYCECFQA 232
>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 568
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/104 (55%), Positives = 69/104 (66%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C NN+E+E +R AI LERNPNAFRP
Sbjct: 116 KKQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRP 175
Query: 154 KIGKCL---------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
KI VGE RH KGC+CK+SGCLK YCEC+
Sbjct: 176 KIAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECF 219
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/92 (54%), Positives = 61/92 (66%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-------- 52
YC+CFA+G +C CNC +C NN+E+E +R AI LERNPNAFRPKI
Sbjct: 130 YCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPKIAASPHGGRDNRE 189
Query: 53 -VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
VGE RH KGC+CK+SGCLK YCEC++A
Sbjct: 190 EVGEVVLLGRHNKGCHCKKSGCLKKYCECFQA 221
>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
Length = 561
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRP
Sbjct: 92 QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 151
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 152 KIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECF 195
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 106 YCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 165
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 166 DAGEVLILGKHNKGCHCKKSGCLKKYCECFQA 197
>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 615
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 68/105 (64%), Gaps = 19/105 (18%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC +CFNN+E+E R AI LERNP+AFRPKIG
Sbjct: 163 CNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPDAFRPKIG 222
Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
L+G +H KGC+CK+SGCLK YCEC+
Sbjct: 223 SSPHTNRNNMEVAGDLPLIG----KHNKGCHCKKSGCLKKYCECF 263
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 62/96 (64%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC +CFNN+E+E R AI LERNP+AFRPKIG
Sbjct: 174 YCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNME 233
Query: 52 ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
L+G +H KGC+CK+SGCLK YCEC++A
Sbjct: 234 VAGDLPLIG----KHNKGCHCKKSGCLKKYCECFQA 265
>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 619
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 162 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 221
Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 222 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 262
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 173 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 232
Query: 52 ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 233 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 264
>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
Length = 576
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 114 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 173
Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 174 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 214
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 125 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 184
Query: 52 ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 185 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 216
>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
CCMP2712]
Length = 138
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/114 (53%), Positives = 74/114 (64%), Gaps = 15/114 (13%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLER 146
P RKPCNC S CLKLYC+CFA+G +C QCNC+SC NN+ E DR A+ LER
Sbjct: 8 PGTPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLER 67
Query: 147 NPNAFRPKIGKC----LV---GEGER-------RHTKGCNCKRSGCLKNYCECY 186
NP+AFRPKI + +V +G R RH KGC+CK+SGCLK YCEC+
Sbjct: 68 NPHAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECF 121
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 64/98 (65%), Gaps = 15/98 (15%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC----LV-- 53
YC+CFA+G +C QCNC+SC NN+ E DR A+ LERNP+AFRPKI + +V
Sbjct: 28 YCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRPKITQTGQQGIVDS 87
Query: 54 -GEGER-------RHTKGCNCKRSGCLKNYCECYEAGA 83
+G R RH KGC+CK+SGCLK YCEC++A
Sbjct: 88 PAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAV 125
>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
Length = 529
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 69/105 (65%), Gaps = 19/105 (18%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 68 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 127
Query: 157 KC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 128 SSPHANRNNMEAAGDLPLVG----KHNKGCHCKKSGCLKKYCECF 168
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 79 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHANRNNME 138
Query: 52 ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 139 AAGDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 170
>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
Length = 545
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
++ CNC S CLKLYC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRP
Sbjct: 133 QRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 192
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 193 KIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCLKKYCECF 236
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C+NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 147 YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSRE 206
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 207 ETGDGLVFVKHNKGCHCKKSGCLKKYCECFQA 238
>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
Length = 601
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+++E R A+ LERNP AFRP
Sbjct: 127 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRP 186
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G ++R H KGC+CK+SGCLK YCEC+
Sbjct: 187 KIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECF 230
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +CFNN+++E R A+ LERNP AFRPKI G ++R
Sbjct: 141 YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGVRDKRE 200
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 201 DIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 232
>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+++E R A+ LERNP AFRP
Sbjct: 131 KKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRP 190
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G ++R H KGC+CK+SGCLK YCEC+
Sbjct: 191 KIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECF 234
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +CFNN+++E R A+ LERNP AFRPKI G ++R
Sbjct: 145 YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGRDKRE 204
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 205 DIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 236
>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1852
Score = 106 bits (264), Expect = 5e-21, Method: Composition-based stats.
Identities = 47/103 (45%), Positives = 58/103 (56%), Gaps = 4/103 (3%)
Query: 88 PNGIRP----RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
PNGI P R CNC KS CLKLYC+CF+ C CNC C N R A+ C
Sbjct: 1498 PNGITPMSTPRNRCNCKKSQCLKLYCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDC 1557
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
L +N +AF+P+I + H+ GC CK+S CLK YCEC+
Sbjct: 1558 LNKNAHAFKPRIAAAPGAADLQGHSMGCKCKKSECLKKYCECF 1600
Score = 82.4 bits (202), Expect = 8e-14, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 46/82 (56%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CF+ C CNC C N R A+ CL +N +AF+P+I + H
Sbjct: 1522 YCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAPGAADLQGH 1581
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
+ GC CK+S CLK YCEC++AG
Sbjct: 1582 SMGCKCKKSECLKKYCECFQAG 1603
Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/107 (32%), Positives = 41/107 (38%), Gaps = 42/107 (39%)
Query: 64 CNCKRSGCLKNYCECYEA----------GARTL-----------------EPNGIRPRKP 96
CNCK+S CLK YCEC+ A R L + +PR
Sbjct: 1511 CNCKKSQCLKLYCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIA 1570
Query: 97 --------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
C C KS CLK YC+CF G C +C C SC N
Sbjct: 1571 AAPGAADLQGHSMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKN 1617
>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
Length = 592
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 67/104 (64%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRP
Sbjct: 106 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 165
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 166 KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECF 209
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 120 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSRE 179
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 180 ESGEALVLGKHNKGCHCKKSGCLKKYCECFQA 211
>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
Length = 524
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 67/104 (64%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRP
Sbjct: 38 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 97
Query: 154 KIGKCLVGEGERR-----------HTKGCNCKRSGCLKNYCECY 186
KI G + R H KGC+CK+SGCLK YCEC+
Sbjct: 98 KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECF 141
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/92 (51%), Positives = 59/92 (64%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI G + R
Sbjct: 52 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSRE 111
Query: 60 ----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 112 ESGEALVLGKHNKGCHCKKSGCLKKYCECFQA 143
>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
Length = 550
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/104 (54%), Positives = 66/104 (63%), Gaps = 18/104 (17%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 114 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 173
Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 174 SSPHTNRNNEVSSDLPLVG----KHNKGCHCKKSGCLKKYCECF 213
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 50/95 (52%), Positives = 60/95 (63%), Gaps = 18/95 (18%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 125 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNEV 184
Query: 52 -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 185 SSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 215
>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
Length = 728
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 68/81 (83%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNCN+C+NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDIKPRH 586
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 587 NKGCNCKRSGCLKNYCECYEA 607
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/102 (59%), Positives = 74/102 (72%), Gaps = 4/102 (3%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T P+G + P T L YCDCFANG+FC CNCN+C+NN+ HE +R AI+ CL
Sbjct: 508 TTFPSGSNLQGPAKIT----LAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACL 563
Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+RNP AF+PKIGK +G+ + RH KGCNCKRSGCLKNYCECY
Sbjct: 564 DRNPEAFQPKIGKGKLGDIKPRHNKGCNCKRSGCLKNYCECY 605
>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 412
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/104 (54%), Positives = 66/104 (63%), Gaps = 18/104 (17%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 157 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216
Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 217 SSPHTNRNNEVSSDLPLVG----KHNKGCHCKKSGCLKKYCECF 256
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 50/95 (52%), Positives = 60/95 (63%), Gaps = 18/95 (18%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 168 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRNNEV 227
Query: 52 -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 228 SSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 258
>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
Length = 511
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/84 (66%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFANG+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 305 ITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388
>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
Length = 511
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/84 (66%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFANG+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 305 ITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388
>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
Length = 512
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C N++ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 311 YCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 370
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 371 NKGCNCKRSGCLKNYCECYEA 391
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C N++ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 306 ITLAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 365
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 366 VKPRHNKGCNCKRSGCLKNYCECY 389
>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
Length = 220
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 68/104 (65%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +CFNN+++E R A+ LERNPNAFRP
Sbjct: 82 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRP 141
Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
KI G + +H KGC+CK+SGCLK YCEC+
Sbjct: 142 KIASSPHGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECF 185
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA+G +C CNC +CFNN+++E R A+ LERNPNAFRPKI G +
Sbjct: 96 YCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 155
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 156 ETGEVKVLVKHNKGCHCKKSGCLKKYCECFQA 187
>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 1042
Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats.
Identities = 52/94 (55%), Positives = 62/94 (65%), Gaps = 8/94 (8%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
+ CNC KS CLKLYCDCFA G C CNC C N E ++R A+ Q ERNPNAF+
Sbjct: 651 KTTCNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANT-EGNEERKQAMDQITERNPNAFK 709
Query: 153 PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
PKI + + H+KGC+CK+SGCLK YCECY
Sbjct: 710 PKI------QEKGFHSKGCHCKKSGCLKKYCECY 737
Score = 87.8 bits (216), Expect = 2e-15, Method: Composition-based stats.
Identities = 44/83 (53%), Positives = 55/83 (66%), Gaps = 8/83 (9%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G C CNC C N E ++R A+ Q ERNPNAF+PKI + +
Sbjct: 665 YCDCFAYGLGCSPDCNCADCANT-EGNEERKQAMDQITERNPNAFKPKI------QEKGF 717
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
H+KGC+CK+SGCLK YCECY++G
Sbjct: 718 HSKGCHCKKSGCLKKYCECYQSG 740
Score = 53.9 bits (128), Expect = 3e-05, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 43/101 (42%), Gaps = 36/101 (35%)
Query: 64 CNCKRSGCLKNYCECYEAG------------ART---------------LEPNGIRPR-- 94
CNCK+S CLK YC+C+ G A T PN +P+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERKQAMDQITERNPNAFKPKIQ 713
Query: 95 ------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
K C+C KS CLK YC+C+ +G C C C C N
Sbjct: 714 EKGFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754
>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
Length = 1987
Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats.
Identities = 51/105 (48%), Positives = 60/105 (57%), Gaps = 6/105 (5%)
Query: 82 GARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
G RT E + K C C KS CLKLYCDCFA G++C C+C SC N EH D
Sbjct: 779 GRRTSENSS----KSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRRE 834
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
R+P AF KI L G +H +GCNC++S CLK YCECY
Sbjct: 835 DIAARDPQAFTRKIQ--LAPNGNGKHKRGCNCRKSHCLKKYCECY 877
Score = 88.2 bits (217), Expect = 1e-15, Method: Composition-based stats.
Identities = 39/84 (46%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA G++C C+C SC N EH D R+P AF KI L G +H
Sbjct: 801 YCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAARDPQAFTRKIQ--LAPNGNGKH 858
Query: 61 TKGCNCKRSGCLKNYCECYEAGAR 84
+GCNC++S CLK YCECY+ G +
Sbjct: 859 KRGCNCRKSHCLKKYCECYQGGVK 882
Score = 58.9 bits (141), Expect = 8e-07, Method: Composition-based stats.
Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 40/108 (37%)
Query: 61 TKGCNCKRSGCLKNYCECYEAG-----------------------------AR------- 84
+K C CK+S CLK YC+C+ AG AR
Sbjct: 787 SKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAARDPQAFTR 846
Query: 85 --TLEPNGI-RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
L PNG + ++ CNC KS CLK YC+C+ G C QC C C N
Sbjct: 847 KIQLAPNGNGKHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECEN 894
>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats.
Identities = 55/137 (40%), Positives = 66/137 (48%), Gaps = 43/137 (31%)
Query: 93 PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
P+K PCNC KS CLKLYC+CFA+G +C + CNC C N ED R AI LE+NPNA
Sbjct: 559 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNA 618
Query: 151 FRPKIGK-----CLVGEGERR------------------------------------HTK 169
F+PKIG + G RR H
Sbjct: 619 FKPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKH 678
Query: 170 GCNCKRSGCLKNYCECY 186
GC+CK+S C K YCEC+
Sbjct: 679 GCHCKKSACQKKYCECF 695
Score = 83.6 bits (205), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/124 (36%), Positives = 56/124 (45%), Gaps = 42/124 (33%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA+G +C + CNC C N ED R AI LE+NPNAF+PKIG +
Sbjct: 575 YCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTS 634
Query: 55 EGERR------------------------------------HTKGCNCKRSGCLKNYCEC 78
G RR H GC+CK+S C K YCEC
Sbjct: 635 GGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHGCHCKKSACQKKYCEC 694
Query: 79 YEAG 82
++AG
Sbjct: 695 FQAG 698
>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats.
Identities = 55/137 (40%), Positives = 66/137 (48%), Gaps = 43/137 (31%)
Query: 93 PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
P+K PCNC KS CLKLYC+CFA+G +C + CNC C N ED R AI LE+NPNA
Sbjct: 559 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNA 618
Query: 151 FRPKIGK-----CLVGEGERR------------------------------------HTK 169
F+PKIG + G RR H
Sbjct: 619 FKPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKH 678
Query: 170 GCNCKRSGCLKNYCECY 186
GC+CK+S C K YCEC+
Sbjct: 679 GCHCKKSACQKKYCECF 695
Score = 83.6 bits (205), Expect = 3e-14, Method: Composition-based stats.
Identities = 45/124 (36%), Positives = 56/124 (45%), Gaps = 42/124 (33%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA+G +C + CNC C N ED R AI LE+NPNAF+PKIG +
Sbjct: 575 YCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTS 634
Query: 55 EGERR------------------------------------HTKGCNCKRSGCLKNYCEC 78
G RR H GC+CK+S C K YCEC
Sbjct: 635 GGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHGCHCKKSACQKKYCEC 694
Query: 79 YEAG 82
++AG
Sbjct: 695 FQAG 698
>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
Length = 548
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 68/81 (83%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNCN+C+NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDIKPRH 406
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 407 NKGCNCKRSGCLKNYCECYEA 427
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/102 (59%), Positives = 74/102 (72%), Gaps = 4/102 (3%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T P+G + P T L YCDCFANG+FC CNCN+C+NN+ HE +R AI+ CL
Sbjct: 328 TTFPSGSNLQGPAKIT----LAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACL 383
Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+RNP AF+PKIGK +G+ + RH KGCNCKRSGCLKNYCECY
Sbjct: 384 DRNPEAFQPKIGKGKLGDIKPRHNKGCNCKRSGCLKNYCECY 425
>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
Length = 509
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 308 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 367
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 368 NKGCNCKRSGCLKNYCECYEA 388
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 303 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 362
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 363 VKPRHNKGCNCKRSGCLKNYCECY 386
>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
Length = 511
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 310 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 369
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 370 NKGCNCKRSGCLKNYCECYEA 390
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 305 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 364
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 365 VKPRHNKGCNCKRSGCLKNYCECY 388
>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
Length = 356
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 54/104 (51%), Positives = 67/104 (64%), Gaps = 18/104 (17%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC +CFNN ++E R A+ L+RNP+AFRPKIG
Sbjct: 152 CNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIG 211
Query: 157 KC--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 212 SSPHMHRNNEVPSDLPLVG----KHNKGCHCKKSGCLKKYCECF 251
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 18/95 (18%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC +CFNN ++E R A+ L+RNP+AFRPKIG
Sbjct: 163 YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRNNEV 222
Query: 52 -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 223 PSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 253
>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
Length = 656
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 56/113 (49%), Positives = 70/113 (61%), Gaps = 15/113 (13%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
L+ + +K CNC S CLKLYC+CFA G +C CNC +C NN+++E R A+ LE
Sbjct: 193 LKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLE 252
Query: 146 RNPNAFRPKIGKCLVGEGERR------------HTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKI + ERR H KGC+CK+SGCLK YCEC+
Sbjct: 253 RNPNAFRPKIASSPL---ERRDSKECEIQVIGKHNKGCHCKKSGCLKKYCECF 302
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 59/93 (63%), Gaps = 15/93 (16%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR- 59
YC+CFA G +C CNC +C NN+++E R A+ LERNPNAFRPKI + ERR
Sbjct: 215 YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPL---ERRD 271
Query: 60 -----------HTKGCNCKRSGCLKNYCECYEA 81
H KGC+CK+SGCLK YCEC++A
Sbjct: 272 SKECEIQVIGKHNKGCHCKKSGCLKKYCECFQA 304
>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
Length = 383
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 182 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 241
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 242 NKGCNCKRSGCLKNYCECYEA 262
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 177 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD 236
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 237 VKPRHNKGCNCKRSGCLKNYCECY 260
>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
Length = 515
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+ + RH
Sbjct: 314 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 373
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 374 NKGCNCKRSGCLKNYCECYEA 394
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+
Sbjct: 309 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 368
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 369 VKPRHNKGCNCKRSGCLKNYCECY 392
>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
Length = 712
Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats.
Identities = 55/131 (41%), Positives = 67/131 (51%), Gaps = 39/131 (29%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFAN +FC C CN C N+ EH+D+R HA Q L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508
Query: 154 KI--------------------------------------GKCLVGEGERRHTKGCNCKR 175
K+ G V + ++RH +GCNCK+
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568
Query: 176 SGCLKNYCECY 186
S C K YCEC+
Sbjct: 569 SNCQKKYCECF 579
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 46/121 (38%), Positives = 59/121 (48%), Gaps = 39/121 (32%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC+CFAN +FC C CN C N+ EH+D+R HA Q L RNP AFRPK+
Sbjct: 462 YCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRPKVETQQDETSLFS 521
Query: 49 ---------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
G V + ++RH +GCNCK+S C K YCEC++
Sbjct: 522 KDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKKSNCQKKYCECFQQ 581
Query: 82 G 82
G
Sbjct: 582 G 582
>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
gi|224030123|gb|ACN34137.1| unknown [Zea mays]
gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
Length = 607
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/103 (51%), Positives = 66/103 (64%), Gaps = 18/103 (17%)
Query: 98 NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 157
NC S CLKLYC+CFA+G +C CNC +CFNN ++E R A+ L+RNP+AFRPKIG
Sbjct: 153 NCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGS 212
Query: 158 C--------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 213 SPHMHRNNEVPSDLPLVG----KHNKGCHCKKSGCLKKYCECF 251
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 18/95 (18%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA+G +C CNC +CFNN ++E R A+ L+RNP+AFRPKIG
Sbjct: 163 YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRNNEV 222
Query: 52 -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 223 PSDLPLVG----KHNKGCHCKKSGCLKKYCECFQA 253
>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
Length = 510
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 309 YCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 368
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 369 NKGCNCRRSGCLKNYCECYEA 389
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 304 ITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLGD 363
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNC+RSGCLKNYCECY
Sbjct: 364 VKPRHNKGCNCRRSGCLKNYCECY 387
>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 912
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/104 (53%), Positives = 68/104 (65%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC S CLKLYC+CFA+G +C CNC +C NN EHE R A+ LERNPNAFRPK
Sbjct: 469 KQCNCKNSRCLKLYCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPK 528
Query: 155 IGKC--LVGEGER----------RHTKGCNCKRSGCLKNYCECY 186
I ++ +G+ +H KGC+CK+SGCLK YCECY
Sbjct: 529 IANSPRILHDGKEDVVDRNGSAAKHNKGCHCKKSGCLKKYCECY 572
Score = 84.3 bits (207), Expect = 2e-14, Method: Composition-based stats.
Identities = 48/93 (51%), Positives = 61/93 (65%), Gaps = 12/93 (12%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGEGER 58
YC+CFA+G +C CNC +C NN EHE R A+ LERNPNAFRPKI ++ +G+
Sbjct: 482 YCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKE 541
Query: 59 ----------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCECY+A
Sbjct: 542 DVVDRNGSAAKHNKGCHCKKSGCLKKYCECYQA 574
>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1223
Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats.
Identities = 51/104 (49%), Positives = 59/104 (56%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA EFC C C +CFN E+E + +Q RNP AF P
Sbjct: 737 KRCNCKKSKCLKLYCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAP 796
Query: 154 KI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI + L RH +GCNCK+S CLK YCECY
Sbjct: 797 KIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLKKYCECY 840
Score = 84.7 bits (208), Expect = 2e-14, Method: Composition-based stats.
Identities = 43/94 (45%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC+CFA EFC C C +CFN E+E + +Q RNP AF PKI
Sbjct: 750 YCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPG 809
Query: 49 GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
+ L RH +GCNCK+S CLK YCECY+AG
Sbjct: 810 DEALDTPASARHKRGCNCKKSLCLKKYCECYQAG 843
Score = 36.6 bits (83), Expect = 4.9, Method: Composition-based stats.
Identities = 15/26 (57%), Positives = 17/26 (65%)
Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
GE K CNCK+S CLK YCEC+
Sbjct: 729 GEKPGDGCKRCNCKKSKCLKLYCECF 754
>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
Length = 238
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/66 (69%), Positives = 53/66 (80%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
I R+PCNCTKS CLKLYCDCFANGEFC C+C +C NN+ +E DR AI+ CLERNP A
Sbjct: 52 INNRRPCNCTKSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMA 111
Query: 151 FRPKIG 156
F+PKIG
Sbjct: 112 FQPKIG 117
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 33/49 (67%), Positives = 39/49 (79%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
YCDCFANGEFC C+C +C NN+ +E DR AI+ CLERNP AF+PKIG
Sbjct: 69 YCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG 117
>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/95 (58%), Positives = 63/95 (66%), Gaps = 5/95 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYCDCFA G C CNC SC NN +H +R + I+Q +ERNP AFRPK+
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV 263
Query: 156 GKCLVGEGE----RRHTKGCNCKRSGCLKNYCECY 186
E E RH KGCNCK+S CLK YCECY
Sbjct: 264 DSRSNSEDEIDHKPRHFKGCNCKKSNCLKKYCECY 298
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 5/89 (5%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
YCDCFA G C CNC SC NN +H +R + I+Q +ERNP AFRPK+ E E
Sbjct: 215 YCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKVDSRSNSEDEID 274
Query: 58 --RRHTKGCNCKRSGCLKNYCECYEAGAR 84
RH KGCNCK+S CLK YCECY+ G +
Sbjct: 275 HKPRHFKGCNCKKSNCLKKYCECYQMGVK 303
>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Oryctolagus cuniculus]
Length = 450
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 64/81 (79%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN HE +R AI+ CL+RNP AF+PKIGK G + RH
Sbjct: 250 YCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAKPRH 309
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 310 NKGCNCKRSGCLKNYCECYEA 330
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 64/84 (76%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN HE +R AI+ CL+RNP AF+PKIGK G
Sbjct: 245 IALSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGS 304
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 305 AKPRHNKGCNCKRSGCLKNYCECY 328
>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
Length = 516
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE R AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLGDVKPRH 374
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 375 NKGCNCRRSGCLKNYCECYEA 395
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE R AI+ CL+RNP AF+PKIGK +G+
Sbjct: 310 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLGD 369
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNC+RSGCLKNYCECY
Sbjct: 370 VKPRHNKGCNCRRSGCLKNYCECY 393
>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
Length = 515
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 69/110 (62%), Gaps = 9/110 (8%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
L+ + +K CNC S CLKLYC+CFA G +C CNC +C NN+++E R A+ LE
Sbjct: 122 LKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLE 181
Query: 146 RNPNAFRPKIGKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
RNPNAFRPKI + + +H KGC+CK+SGCLK YCEC+
Sbjct: 182 RNPNAFRPKIASSPQEQRDSKECEIKVIGKHNKGCHCKKSGCLKKYCECF 231
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 9/90 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA G +C CNC +C NN+++E R A+ LERNPNAFRPKI + +
Sbjct: 144 YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPQEQRDSKE 203
Query: 59 -------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 204 CEIKVIGKHNKGCHCKKSGCLKKYCECFQA 233
>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
Length = 391
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK G+ + RH
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGKGRSGDVKPRH 245
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 246 NKGCNCKRSGCLKNYCECYEA 266
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK G+
Sbjct: 181 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGKGRSGD 240
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 241 VKPRHNKGCNCKRSGCLKNYCECY 264
>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 459
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 73/118 (61%), Gaps = 13/118 (11%)
Query: 80 EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHA 139
E ++ PN + +K CNC S CLKLYC+CFA+G +C CNC +C N +E+E R A
Sbjct: 82 EVESKENTPN--KQQKHCNCKNSKCLKLYCECFASGSYCNGCNCVNCHNKLENESSRQVA 139
Query: 140 IRQCLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
I LERNP+AF+PKI G + +H+KGC+C++SGCLK YCECY
Sbjct: 140 ISGILERNPDAFKPKIAGSPHGMKDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECY 197
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 59/92 (64%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA+G +C CNC +C N +E+E R AI LERNP+AF+PKI G +
Sbjct: 108 YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQE 167
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H+KGC+C++SGCLK YCECY+A
Sbjct: 168 NVQQVLLIGKHSKGCHCRKSGCLKKYCECYQA 199
>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
Length = 508
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+ + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 361
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPRHNKGCNCRRSGCLKNYCECY 385
>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
Length = 377
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 68/81 (83%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNCN+C+NN +HE +R +AI+ CL+RNP AF PKIGK +G+ + RH
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGDIKPRH 235
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCEC+EA
Sbjct: 236 NKGCNCKRSGCLKNYCECFEA 256
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/79 (68%), Positives = 66/79 (83%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
YCDCFANG+FC CNCN+C+NN +HE +R +AI+ CL+RNP AF PKIGK +G+ + RH
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGDIKPRH 235
Query: 168 TKGCNCKRSGCLKNYCECY 186
KGCNCKRSGCLKNYCEC+
Sbjct: 236 NKGCNCKRSGCLKNYCECF 254
>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/95 (58%), Positives = 63/95 (66%), Gaps = 5/95 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYCDCFA G C CNC SC NN +H +R + I+Q +ERNP AFRPK+
Sbjct: 195 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKV 254
Query: 156 GKCLVGEGE----RRHTKGCNCKRSGCLKNYCECY 186
E E RH KGCNCK+S CLK YCECY
Sbjct: 255 ESRSNSEDEIDQKPRHFKGCNCKKSNCLKKYCECY 289
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 5/89 (5%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
YCDCFA G C CNC SC NN +H +R + I+Q +ERNP AFRPK+ E E
Sbjct: 206 YCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEID 265
Query: 58 --RRHTKGCNCKRSGCLKNYCECYEAGAR 84
RH KGCNCK+S CLK YCECY+ G +
Sbjct: 266 QKPRHFKGCNCKKSNCLKKYCECYQMGVK 294
>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
fascicularis]
Length = 490
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+ + RH
Sbjct: 289 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 348
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 349 NKGCNCRRSGCLKNYCECYEA 369
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+
Sbjct: 284 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 343
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNC+RSGCLKNYCECY
Sbjct: 344 VKPRHNKGCNCRRSGCLKNYCECY 367
>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
Length = 676
Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 11/96 (11%)
Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
S CLKLYC+CFA+G +C C+C++C NN+ +E R A+ L+RNPNAFRPKI G
Sbjct: 199 SRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHG 258
Query: 162 E---GER--------RHTKGCNCKRSGCLKNYCECY 186
GE +H KGC+CK+SGCLK YCEC+
Sbjct: 259 SHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECF 294
Score = 93.6 bits (231), Expect = 3e-17, Method: Composition-based stats.
Identities = 45/92 (48%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---GE 57
YC+CFA+G +C C+C++C NN+ +E R A+ L+RNPNAFRPKI G GE
Sbjct: 205 YCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGE 264
Query: 58 R--------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 265 EAGKVPLVGKHNKGCHCKKSGCLKKYCECFQA 296
>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
Length = 897
Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats.
Identities = 54/105 (51%), Positives = 59/105 (56%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCFA G FC C+C CFN E+ + +Q RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568
Query: 154 KIGKC------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI K L RHT+GCNCKRS CLK YCECY
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECY 613
Score = 81.6 bits (200), Expect = 1e-13, Method: Composition-based stats.
Identities = 44/94 (46%), Positives = 51/94 (54%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
YCDCFA G FC C+C CFN E+ + +Q RNP AF PKI K
Sbjct: 522 YCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNM 581
Query: 52 ----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
L RHT+GCNCKRS CLK YCECY++
Sbjct: 582 EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQS 615
>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
Length = 629
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/107 (50%), Positives = 64/107 (59%), Gaps = 19/107 (17%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC S CLKLYC+CFA G C CNC +C NN+E+E R A+ LERNP AF+PK
Sbjct: 147 KHCNCKNSRCLKLYCECFAAGVHCTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPK 206
Query: 155 IGKC---------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
I LVG +H KGC+CK+SGCLK YCEC+
Sbjct: 207 IASSPHGSRDAKEDAMEVQLVG----KHNKGCHCKKSGCLKKYCECF 249
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 57/96 (59%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--------- 51
YC+CFA G C CNC +C NN+E+E R A+ LERNP AF+PKI
Sbjct: 160 YCECFAAGVHCTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKE 219
Query: 52 ------LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LVG +H KGC+CK+SGCLK YCEC++A
Sbjct: 220 DAMEVQLVG----KHNKGCHCKKSGCLKKYCECFQA 251
>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
distachyon]
Length = 540
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/111 (51%), Positives = 65/111 (58%), Gaps = 13/111 (11%)
Query: 89 NGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
NG RK C C S CLKLYCDCFA+G +C CNC +C N++ HE R AI +ERN
Sbjct: 143 NGTPRRKNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERN 202
Query: 148 PNAFRPKIGKCL----------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
P AF PKIG EG R +H KGC CKRS CLK YCEC+
Sbjct: 203 PVAFMPKIGNIPRHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECF 253
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 56/94 (59%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL------- 52
YCDCFA+G +C CNC +C N++ HE R AI +ERNP AF PKIG
Sbjct: 162 YCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIPRHAAQNR 221
Query: 53 ---VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
EG R +H KGC CKRS CLK YCEC+++
Sbjct: 222 EYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQS 255
>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 542
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/97 (51%), Positives = 62/97 (63%), Gaps = 2/97 (2%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ RK C C SMCLKLYC+CFA G+ C C+C +C N+ +HE + A L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287
Query: 152 RPKIGKCLVGEGER--RHTKGCNCKRSGCLKNYCECY 186
PK+ +G +H KGCNCKRSGC KNYCEC+
Sbjct: 288 EPKMTAVPKEDGALAIKHQKGCNCKRSGCQKNYCECF 324
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 53/85 (62%), Gaps = 2/85 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA G+ C C+C +C N+ +HE + A L RNP AF PK+ +G
Sbjct: 244 YCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAFEPKMTAVPKEDGALAI 303
Query: 59 RHTKGCNCKRSGCLKNYCECYEAGA 83
+H KGCNCKRSGC KNYCEC+ AG
Sbjct: 304 KHQKGCNCKRSGCQKNYCECFHAGV 328
>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
Length = 263
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 67/81 (82%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN++HE +R AI+ CL RNP AF+PKIG +G+ + RH
Sbjct: 53 YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGDVKPRH 112
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGC+C+RSGCLKNYCECYEA
Sbjct: 113 SKGCSCRRSGCLKNYCECYEA 133
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 67/84 (79%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN++HE +R AI+ CL RNP AF+PKIG +G+
Sbjct: 48 ITLAGYCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGD 107
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH+KGC+C+RSGCLKNYCECY
Sbjct: 108 VKPRHSKGCSCRRSGCLKNYCECY 131
>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
Length = 497
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 76/116 (65%), Gaps = 12/116 (10%)
Query: 83 ARTLEPNGI-RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
ART NG + + C+C +S CLKLYC+CFA+G +C +C+C +C NN+E+ED R A
Sbjct: 80 ARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYCDECSCANCHNNVENEDVRREAAE 139
Query: 142 QCLERNPNAFRPKI--GKCLV---GEGER------RHTKGCNCKRSGCLKNYCECY 186
LERNPNAF+PKI C G+ + +H KGC+CKR+GCLK YCEC+
Sbjct: 140 CILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCKRTGCLKKYCECF 195
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/92 (50%), Positives = 62/92 (67%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLV---GE 55
YC+CFA+G +C +C+C +C NN+E+ED R A LERNPNAF+PKI C G+
Sbjct: 106 YCECFASGSYCDECSCANCHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGD 165
Query: 56 GER------RHTKGCNCKRSGCLKNYCECYEA 81
+ +H KGC+CKR+GCLK YCEC++A
Sbjct: 166 AAKDVLVMAKHIKGCHCKRTGCLKKYCECFQA 197
>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
both male and female reproductiv [Ectocarpus
siliculosus]
Length = 735
Score = 99.0 bits (245), Expect = 8e-19, Method: Composition-based stats.
Identities = 47/90 (52%), Positives = 56/90 (62%), Gaps = 3/90 (3%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC KS CLKLYC+CF ++C CNC C N ED R AI+ +ERNP AF K
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFE 499
Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ G+R H GCNCK+S CLK YCEC+
Sbjct: 500 R---RAGKRSHNAGCNCKKSACLKKYCECF 526
Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats.
Identities = 40/82 (48%), Positives = 50/82 (60%), Gaps = 3/82 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CF ++C CNC C N ED R AI+ +ERNP AF K + G+R H
Sbjct: 451 YCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFER---RAGKRSH 507
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
GCNCK+S CLK YCEC++AG
Sbjct: 508 NAGCNCKKSACLKKYCECFQAG 529
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 38/120 (31%), Positives = 45/120 (37%), Gaps = 39/120 (32%)
Query: 48 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYE-------------------AGARTLEP 88
+G G G + CNCK+S CLK YCEC++ R L
Sbjct: 424 VGMAGCGGGRDPLSVTCNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAI 483
Query: 89 NGIRPRKP-------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
G R P CNC KS CLK YC+CF G C C C +C N
Sbjct: 484 QGTIERNPQAFVSKFERRAGKRSHNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKN 543
>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
[Cucumis sativus]
Length = 788
Score = 99.0 bits (245), Expect = 8e-19, Method: Composition-based stats.
Identities = 56/117 (47%), Positives = 65/117 (55%), Gaps = 14/117 (11%)
Query: 84 RTLEPNGIRPR--KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAI 140
R LEP G K CNC KS CLKLYC+CFA G +C + C+C CFN HED
Sbjct: 480 RRLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATR 539
Query: 141 RQCLERNPNAFRPKI---GKCLVGEGER--------RHTKGCNCKRSGCLKNYCECY 186
+Q RNP AF PK+ L G+ RH +GCNCK+S CLK YCECY
Sbjct: 540 KQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECY 596
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ L G
Sbjct: 506 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPG 565
Query: 57 ER--------RHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 566 DESNKTPASARHKRGCNCKKSSCLKKYCECYQGG 599
>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
Length = 999
Score = 99.0 bits (245), Expect = 8e-19, Method: Composition-based stats.
Identities = 53/139 (38%), Positives = 66/139 (47%), Gaps = 45/139 (32%)
Query: 93 PRK-PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
P+K PCNC KS CLKLYC+CFA+G +C + CNC C N E+ R AI LE+NPNA
Sbjct: 562 PKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNA 621
Query: 151 FRPKIG-----------KCLVGE--------------------------------GERRH 167
F+PKIG + VG + H
Sbjct: 622 FKPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMH 681
Query: 168 TKGCNCKRSGCLKNYCECY 186
GC+CK+S C K YCEC+
Sbjct: 682 KHGCHCKKSACQKKYCECF 700
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 43/126 (34%), Positives = 56/126 (44%), Gaps = 44/126 (34%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---------- 49
YC+CFA+G +C + CNC C N E+ R AI LE+NPNAF+PKIG
Sbjct: 578 YCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAFKPKIGATPGMTPGSA 637
Query: 50 -KCLVGE--------------------------------GERRHTKGCNCKRSGCLKNYC 76
+ VG + H GC+CK+S C K YC
Sbjct: 638 RRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHKHGCHCKKSACQKKYC 697
Query: 77 ECYEAG 82
EC++AG
Sbjct: 698 ECFQAG 703
Score = 37.0 bits (84), Expect = 3.8, Method: Composition-based stats.
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 149 NAFRPKIGKCLVGEGERRHTKG-CNCKRSGCLKNYCECY 186
+A + +G VG + K CNCK+S CLK YCEC+
Sbjct: 544 SASKAPLGPAPVGMATQVPKKAPCNCKKSKCLKLYCECF 582
>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
Length = 788
Score = 99.0 bits (245), Expect = 9e-19, Method: Composition-based stats.
Identities = 56/117 (47%), Positives = 65/117 (55%), Gaps = 14/117 (11%)
Query: 84 RTLEPNGIRPR--KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAI 140
R LEP G K CNC KS CLKLYC+CFA G +C + C+C CFN HED
Sbjct: 480 RRLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATR 539
Query: 141 RQCLERNPNAFRPKI---GKCLVGEGER--------RHTKGCNCKRSGCLKNYCECY 186
+Q RNP AF PK+ L G+ RH +GCNCK+S CLK YCECY
Sbjct: 540 KQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECY 596
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ L G
Sbjct: 506 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPG 565
Query: 57 ER--------RHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 566 DESNKTPASARHKRGCNCKKSSCLKKYCECYQGG 599
>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
Length = 517
Score = 99.0 bits (245), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/81 (66%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+ + RH
Sbjct: 320 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGDVKPRH 379
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGC+C+RSGCLKNYCECYEA
Sbjct: 380 NKGCSCRRSGCLKNYCECYEA 400
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 53/84 (63%), Positives = 66/84 (78%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ HE +R AI+ CL RNP AF+PKIGK +G+
Sbjct: 315 IALARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLGD 374
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGC+C+RSGCLKNYCECY
Sbjct: 375 VKPRHNKGCSCRRSGCLKNYCECY 398
>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
Length = 821
Score = 98.6 bits (244), Expect = 9e-19, Method: Composition-based stats.
Identities = 54/105 (51%), Positives = 59/105 (56%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCFA G FC C+C CFN E+ + +Q RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492
Query: 154 KIGKC------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI K L RHT+GCNCKRS CLK YCECY
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECY 537
Score = 80.5 bits (197), Expect = 3e-13, Method: Composition-based stats.
Identities = 44/94 (46%), Positives = 51/94 (54%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
YCDCFA G FC C+C CFN E+ + +Q RNP AF PKI K
Sbjct: 446 YCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNM 505
Query: 52 ----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
L RHT+GCNCKRS CLK YCECY++
Sbjct: 506 EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQS 539
>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
Length = 921
Score = 98.2 bits (243), Expect = 1e-18, Method: Composition-based stats.
Identities = 49/106 (46%), Positives = 55/106 (51%), Gaps = 14/106 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCF G FC C C C N +E +D +Q RNP AF P
Sbjct: 520 KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAP 579
Query: 154 KIGKC-------------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI C + RH +GCNCKRS C K YCEC+
Sbjct: 580 KIVPCAADVPPNNMEDVNMTTPASARHKRGCNCKRSKCTKKYCECF 625
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 40/95 (42%), Positives = 47/95 (49%), Gaps = 14/95 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
YCDCF G FC C C C N +E +D +Q RNP AF PKI C
Sbjct: 533 YCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAPKIVPCAADVPPNN 592
Query: 52 -----LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+ RH +GCNCKRS C K YCEC++A
Sbjct: 593 MEDVNMTTPASARHKRGCNCKRSKCTKKYCECFQA 627
>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
Length = 484
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/103 (54%), Positives = 73/103 (70%), Gaps = 5/103 (4%)
Query: 89 NGIRPRKPCNCTKS-----MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQC 143
N I + P T+S + ++ YCDCFA+G+ C CNCN+C NN+ HE +R AI+ C
Sbjct: 274 NLITQQSPAEVTESSMDSDVGVRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKAC 333
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
L+RNP AF+PKIG +G+ + RH +GCNCKRSGCLKNYCECY
Sbjct: 334 LDRNPEAFQPKIGPGSLGDAKPRHNRGCNCKRSGCLKNYCECY 376
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/80 (65%), Positives = 64/80 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+ C CNCN+C NN+ HE +R AI+ CL+RNP AF+PKIG +G+ + RH
Sbjct: 298 YCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGPGSLGDAKPRH 357
Query: 61 TKGCNCKRSGCLKNYCECYE 80
+GCNCKRSGCLKNYCECYE
Sbjct: 358 NRGCNCKRSGCLKNYCECYE 377
>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
Length = 579
Score = 98.2 bits (243), Expect = 1e-18, Method: Composition-based stats.
Identities = 49/96 (51%), Positives = 63/96 (65%), Gaps = 11/96 (11%)
Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
S CLKLYC+CFA+G +C C+C++C NN+ +E R A+ L+RNPNAFRPKI G
Sbjct: 118 SRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHG 177
Query: 162 E---GER--------RHTKGCNCKRSGCLKNYCECY 186
GE +H KGC+CK+SGCLK YCEC+
Sbjct: 178 SHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECF 213
Score = 91.7 bits (226), Expect = 1e-16, Method: Composition-based stats.
Identities = 45/92 (48%), Positives = 60/92 (65%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---GE 57
YC+CFA+G +C C+C++C NN+ +E R A+ L+RNPNAFRPKI G GE
Sbjct: 124 YCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGE 183
Query: 58 R--------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 184 EAGKVPLVGKHNKGCHCKKSGCLKKYCECFQA 215
>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
Length = 283
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 68/81 (83%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC C+CN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK +G+ + RH
Sbjct: 77 YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLGDVKPRH 136
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 137 NKGCNCKRSGCLKNYCECYEA 157
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 68/84 (80%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC C+CN+C NN+ HE +R +AI+ CL+RNP AF+PKIGK +G+
Sbjct: 72 ITLAGYCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLGD 131
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ RH KGCNCKRSGCLKNYCECY
Sbjct: 132 VKPRHNKGCNCKRSGCLKNYCECY 155
>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/94 (53%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
+K CNC S CLKLYC+CFA+G++C C C SC NN +++ R A+ LERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196
Query: 153 PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
PKI E RH+KGC+C++S C K YCECY
Sbjct: 197 PKINSAAT-ESWARHSKGCHCRKSNCQKRYCECY 229
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 2/85 (2%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA+G++C C C SC NN +++ R A+ LERNP AFRPKI E R
Sbjct: 151 YCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFRPKINSAAT-ESWAR 209
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
H+KGC+C++S C K YCECY+AG +
Sbjct: 210 HSKGCHCRKSNCQKRYCECYQAGIQ 234
>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
Length = 508
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/81 (66%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPRH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/82 (64%), Positives = 64/82 (78%)
Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G +
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVK 363
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH KGCNC+RSGCLKNYCECY
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECY 385
>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
Length = 508
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/81 (66%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + RH
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPRH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/82 (64%), Positives = 64/82 (78%)
Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G +
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVK 363
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH KGCNC+RSGCLKNYCECY
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECY 385
>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 983
Score = 97.8 bits (242), Expect = 2e-18, Method: Composition-based stats.
Identities = 51/131 (38%), Positives = 65/131 (49%), Gaps = 35/131 (26%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
+ + PCNC KS CLKLYC+CFANG +C + CNC C N +E D+R A+ LE+NPN
Sbjct: 581 VSKKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPN 640
Query: 150 AFRPKI----------------------------------GKCLVGEGERRHTKGCNCKR 175
AF+PKI G +R H GC+CK+
Sbjct: 641 AFKPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKK 700
Query: 176 SGCLKNYCECY 186
S C K YCEC+
Sbjct: 701 SACQKKYCECF 711
Score = 84.0 bits (206), Expect = 3e-14, Method: Composition-based stats.
Identities = 58/172 (33%), Positives = 77/172 (44%), Gaps = 6/172 (3%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFANG +C + CNC C N +E D+R A+ LE+NPNAF+PKI +
Sbjct: 598 YCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAFKPKILPSQSSAVKSL 657
Query: 60 HTKGCNCKR-SGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC 118
+ + K S + G RT + C+C KS C K YC+CF G C
Sbjct: 658 YEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSACQKKYCECFQAGVLC 717
Query: 119 Y-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTK 169
C C C N E + A L P +F P G ++HTK
Sbjct: 718 GDNCRCIDCRNV---EKKKEIAATDALLSTPESFLPSTPLSQAIYGSKKHTK 766
>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 255
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 64/99 (64%), Gaps = 7/99 (7%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G C CNC +C NN+ R A+ LERNPNAFRP
Sbjct: 6 KKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAFRP 65
Query: 154 KIGKCLVG------EGERRHTKGCNCKRSGCLKNYCECY 186
KI VG EG RH +GC+CK+S CLK YCEC+
Sbjct: 66 KIAPG-VGPLEKRREGASRHNRGCHCKKSSCLKKYCECF 103
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/87 (52%), Positives = 56/87 (64%), Gaps = 7/87 (8%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG------ 54
YC+CFA+G C CNC +C NN+ R A+ LERNPNAFRPKI VG
Sbjct: 20 YCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAFRPKIAPG-VGPLEKRR 78
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEA 81
EG RH +GC+CK+S CLK YCEC++A
Sbjct: 79 EGASRHNRGCHCKKSSCLKKYCECFQA 105
>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
Length = 502
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC+ CNCN+C NN+ HE R A++ CL+RNP AF+PKIGK G+ RH
Sbjct: 302 YCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGKGRPGDVTPRH 361
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+GCNC+RSGCLKNYCECYEA
Sbjct: 362 NRGCNCRRSGCLKNYCECYEA 382
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC+ CNCN+C NN+ HE R A++ CL+RNP AF+PKIGK G+
Sbjct: 297 ISLAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGKGRPGD 356
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
RH +GCNC+RSGCLKNYCECY
Sbjct: 357 VTPRHNRGCNCRRSGCLKNYCECY 380
>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
Length = 764
Score = 97.1 bits (240), Expect = 3e-18, Method: Composition-based stats.
Identities = 54/116 (46%), Positives = 66/116 (56%), Gaps = 13/116 (11%)
Query: 84 RTLEPNG-IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR 141
R +EP G K CNC KS CLKLYC+CFA G +C + C+C CFN HED +
Sbjct: 464 RKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRK 523
Query: 142 QCLERNPNAFRPKIGKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
Q RNP AF PK+ + +G+ RH +GCNCK+S CLK YCECY
Sbjct: 524 QIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCLKKYCECY 579
Score = 79.0 bits (193), Expect = 8e-13, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 53/94 (56%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + +G
Sbjct: 489 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 548
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 549 DDPNKTPASARHKRGCNCKKSNCLKKYCECYQGG 582
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 46/174 (26%), Positives = 61/174 (35%), Gaps = 59/174 (33%)
Query: 21 NNIEHEDDRHHAIRQCLERN-PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 79
N ++ +D A + +N P R K+ EG +R CNCK+S CLK YCEC+
Sbjct: 438 NGVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKR----CNCKKSKCLKLYCECF 493
Query: 80 EAGARTLE---------------------------------------------------- 87
AG +E
Sbjct: 494 AAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNK 553
Query: 88 -PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA 139
P R ++ CNC KS CLK YC+C+ G C C C C N +D A
Sbjct: 554 TPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSAQA 607
>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
Length = 1094
Score = 96.7 bits (239), Expect = 4e-18, Method: Composition-based stats.
Identities = 51/105 (48%), Positives = 59/105 (56%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCFA G +C + C C CFN E++D +Q RNP AF P
Sbjct: 574 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 633
Query: 154 KIGKCLVGE------------GERRHTKGCNCKRSGCLKNYCECY 186
KI + G RH +GCNCK+S CLK YCECY
Sbjct: 634 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECY 678
Score = 79.0 bits (193), Expect = 8e-13, Method: Composition-based stats.
Identities = 42/94 (44%), Positives = 51/94 (54%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
YCDCFA G +C + C C CFN E++D +Q RNP AF PKI + G
Sbjct: 587 YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINS 646
Query: 56 --------GERRHTKGCNCKRSGCLKNYCECYEA 81
RH +GCNCK+S CLK YCECY+A
Sbjct: 647 GEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 680
>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
CCMP2712]
Length = 118
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 63/99 (63%), Gaps = 6/99 (6%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
RK CNC S CLKLYC+CFA+G +C CNC C NN +E R AI LERNP AF+
Sbjct: 3 RKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQ 62
Query: 153 PKIGKCLV-----GEGERRHTKGCNCKRSGCLKNYCECY 186
PKI + RHTKGC+CK+SGCLK YCEC+
Sbjct: 63 PKIASSAPPGIEDSPVQGRHTKGCHCKKSGCLKKYCECF 101
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/89 (51%), Positives = 56/89 (62%), Gaps = 6/89 (6%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV-----G 54
YC+CFA+G +C CNC C NN +E R AI LERNP AF+PKI
Sbjct: 17 YCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQPKIASSAPPGIEDS 76
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RHTKGC+CK+SGCLK YCEC++AG
Sbjct: 77 PVQGRHTKGCHCKKSGCLKKYCECFQAGV 105
>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
Length = 1001
Score = 96.3 bits (238), Expect = 5e-18, Method: Composition-based stats.
Identities = 51/105 (48%), Positives = 59/105 (56%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCFA G +C + C C CFN E++D +Q RNP AF P
Sbjct: 567 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 626
Query: 154 KIGKCLVGE------------GERRHTKGCNCKRSGCLKNYCECY 186
KI + G RH +GCNCK+S CLK YCECY
Sbjct: 627 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECY 671
Score = 79.0 bits (193), Expect = 9e-13, Method: Composition-based stats.
Identities = 42/94 (44%), Positives = 51/94 (54%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
YCDCFA G +C + C C CFN E++D +Q RNP AF PKI + G
Sbjct: 580 YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINS 639
Query: 56 --------GERRHTKGCNCKRSGCLKNYCECYEA 81
RH +GCNCK+S CLK YCECY+A
Sbjct: 640 GEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 673
>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
[Homo sapiens]
Length = 529
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + +H
Sbjct: 328 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 387
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 388 NKGCNCRRSGCLKNYCECYEA 408
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G
Sbjct: 323 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 382
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGCNC+RSGCLKNYCECY
Sbjct: 383 VKPQHNKGCNCRRSGCLKNYCECY 406
>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
[Homo sapiens]
Length = 508
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + +H
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 361
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPQHNKGCNCRRSGCLKNYCECY 385
>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
Length = 276
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/106 (50%), Positives = 69/106 (65%), Gaps = 7/106 (6%)
Query: 85 TLEPNGI---RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAI 140
L+PN + KPCNC KS CLKLYCDCFA G+ C +CNC CFNN + +R+ I
Sbjct: 134 ILQPNLFYLNQEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFI 193
Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
++ +ERNP AF K+ + E + H KGCNC++SGC K YCECY
Sbjct: 194 QKMIERNPQAFNQKVQEV---ESKMTHAKGCNCRKSGCQKKYCECY 236
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 55/84 (65%), Gaps = 4/84 (4%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G+ C +CNC CFNN + +R+ I++ +ERNP AF K+ + E +
Sbjct: 160 YCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQAFNQKVQEV---ESKMT 216
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H KGCNC++SGC K YCECY+ G
Sbjct: 217 HAKGCNCRKSGCQKKYCECYQMGI 240
>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
Length = 508
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + +H
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 367 NKGCNCRRSGCLKNYCECYEA 387
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G
Sbjct: 302 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 361
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGCNC+RSGCLKNYCECY
Sbjct: 362 VKPQHNKGCNCRRSGCLKNYCECY 385
>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
[Homo sapiens]
Length = 463
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + +H
Sbjct: 262 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 321
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 322 NKGCNCRRSGCLKNYCECYEA 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G
Sbjct: 257 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 316
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGCNC+RSGCLKNYCECY
Sbjct: 317 VKPQHNKGCNCRRSGCLKNYCECY 340
>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
Length = 289
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 64/81 (79%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFANG+FC CNC++C NN HE +R AI+ CLERNP AF PKIGK +GE + H
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLGEIKHHH 210
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCL+NYCEC+EA
Sbjct: 211 NKGCNCKRSGCLENYCECFEA 231
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 64/84 (76%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
M + YCDCFANG+FC CNC++C NN HE +R AI+ CLERNP AF PKIGK +GE
Sbjct: 146 MTSEGYCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLGE 205
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ H KGCNCKRSGCL+NYCEC+
Sbjct: 206 IKHHHNKGCNCKRSGCLENYCECF 229
>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
Length = 806
Score = 95.9 bits (237), Expect = 7e-18, Method: Composition-based stats.
Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA G +C + C+C CFN HED +Q RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562
Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + V E RH +GCNCK+S CLK YCEC+
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 606
Score = 79.0 bits (193), Expect = 9e-13, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + V E
Sbjct: 516 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 575
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCEC++ G
Sbjct: 576 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 609
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
E K CNCK+S CLK YCEC+ AG +E
Sbjct: 497 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 556
Query: 88 -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
P R ++ CNC KS CLK YC+CF G C C
Sbjct: 557 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 616
Query: 122 NCNSCFNNIEHED 134
C C N +D
Sbjct: 617 RCEGCKNTFGRKD 629
>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
Length = 243
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/81 (69%), Positives = 66/81 (81%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C+NN HE +R AI+ CL RNP AF+PKIGK +GE + RH
Sbjct: 52 YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLGEVKPRH 111
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCEC+EA
Sbjct: 112 NKGCNCKRSGCLKNYCECFEA 132
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/79 (68%), Positives = 64/79 (81%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 167
YCDCFA+G+FC CNCN+C+NN HE +R AI+ CL RNP AF+PKIGK +GE + RH
Sbjct: 52 YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLGEVKPRH 111
Query: 168 TKGCNCKRSGCLKNYCECY 186
KGCNCKRSGCLKNYCEC+
Sbjct: 112 NKGCNCKRSGCLKNYCECF 130
>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 95.5 bits (236), Expect = 9e-18, Method: Composition-based stats.
Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA G +C + C+C CFN HED +Q RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562
Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + V E RH +GCNCK+S CLK YCEC+
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 606
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + V E
Sbjct: 516 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 575
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCEC++ G
Sbjct: 576 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 609
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
E K CNCK+S CLK YCEC+ AG +E
Sbjct: 497 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 556
Query: 88 -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
P R ++ CNC KS CLK YC+CF G C C
Sbjct: 557 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 616
Query: 122 NCNSCFNNIEHED 134
C C N +D
Sbjct: 617 RCEGCKNTFGRKD 629
>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
Length = 593
Score = 95.1 bits (235), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/104 (49%), Positives = 62/104 (59%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA G +C + C+C +CFN H+D +Q RNP AF P
Sbjct: 328 KRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAP 387
Query: 154 KIGK-----CLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + VGE RH +GCNCK+S CLK YCECY
Sbjct: 388 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431
Score = 76.6 bits (187), Expect = 4e-12, Method: Composition-based stats.
Identities = 41/93 (44%), Positives = 54/93 (58%), Gaps = 12/93 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA G +C + C+C +CFN H+D +Q RNP AF PK+ + VG
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEA 81
E RH +GCNCK+S CLK YCECY++
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQS 433
>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
Length = 299
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/81 (65%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G + +H
Sbjct: 98 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPQH 157
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNC+RSGCLKNYCECYEA
Sbjct: 158 NKGCNCRRSGCLKNYCECYEA 178
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 65/84 (77%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ L YCDCFA+G+FC CNCN+C NN+ H+ +R AI+ CL RNP AF+PKIGK +G
Sbjct: 93 ITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGN 152
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGCNC+RSGCLKNYCECY
Sbjct: 153 VKPQHNKGCNCRRSGCLKNYCECY 176
>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
Length = 777
Score = 95.1 bits (235), Expect = 1e-17, Method: Composition-based stats.
Identities = 50/104 (48%), Positives = 60/104 (57%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA G +C + C+C CFN HED +Q RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 533
Query: 154 KIGKC--LVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + V E RH +GCNCK+S CLK YCEC+
Sbjct: 534 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECF 577
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC--LVGE-- 55
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + V E
Sbjct: 487 YCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFG 546
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCEC++ G
Sbjct: 547 DESNKTPASARHKRGCNCKKSSCLKKYCECFQGG 580
Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats.
Identities = 38/133 (28%), Positives = 45/133 (33%), Gaps = 54/133 (40%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------------------------- 87
E K CNCK+S CLK YCEC+ AG +E
Sbjct: 468 AENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRN 527
Query: 88 -------------------------PNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQC 121
P R ++ CNC KS CLK YC+CF G C C
Sbjct: 528 PLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISC 587
Query: 122 NCNSCFNNIEHED 134
C C N +D
Sbjct: 588 RCEGCKNTFGRKD 600
>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1174
Score = 94.4 bits (233), Expect = 2e-17, Method: Composition-based stats.
Identities = 41/90 (45%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
C C S CLKLYCDCF+ ++C C C C N E R AI RNPNAF+ K+
Sbjct: 862 CKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLN 921
Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ H GC CK+S CLK YCEC+
Sbjct: 922 ASVT-----EHATGCKCKKSECLKKYCECF 946
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 34/79 (43%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCF+ ++C C C C N E R AI RNPNAF+ K+ + H
Sbjct: 873 YCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNASVT-----EH 927
Query: 61 TKGCNCKRSGCLKNYCECY 79
GC CK+S CLK YCEC+
Sbjct: 928 ATGCKCKKSECLKKYCECF 946
Score = 56.6 bits (135), Expect = 4e-06, Method: Composition-based stats.
Identities = 33/114 (28%), Positives = 44/114 (38%), Gaps = 37/114 (32%)
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEA---------------------------GARTL 86
G+ + C CK S CLK YC+C+ A R
Sbjct: 852 GKENSENESTCKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRAR 911
Query: 87 EPNGIRPR---------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNI 130
PN + + C C KS CLK YC+CFAN C + C C++C N +
Sbjct: 912 NPNAFKTKLNASVTEHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYV 965
>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
Length = 783
Score = 94.4 bits (233), Expect = 2e-17, Method: Composition-based stats.
Identities = 37/46 (80%), Positives = 41/46 (89%)
Query: 36 CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
CL+RNP AF+PKIGK GE +RRH+KGCNCKRSGCLKNYCECYEA
Sbjct: 605 CLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEA 650
Score = 90.5 bits (223), Expect = 3e-16, Method: Composition-based stats.
Identities = 35/44 (79%), Positives = 39/44 (88%)
Query: 143 CLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
CL+RNP AF+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 605 CLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 648
>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
Length = 218
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/97 (52%), Positives = 65/97 (67%), Gaps = 4/97 (4%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
I KPCNC KS CLKLYCDCFA G+ C +CNC CFNN + +R+ I + +ERNP
Sbjct: 79 IEEAKPCNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPE 138
Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF K+ + + + H+KGCNC++SGC K YCECY
Sbjct: 139 AFNQKVKEV---DYKLAHSKGCNCRKSGCKKKYCECY 172
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 55/84 (65%), Gaps = 4/84 (4%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G+ C +CNC CFNN + +R+ I + +ERNP AF K+ + + +
Sbjct: 96 YCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKVKEV---DYKLA 152
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H+KGCNC++SGC K YCECY+ G
Sbjct: 153 HSKGCNCRKSGCKKKYCECYQLGI 176
>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
Length = 597
Score = 93.6 bits (231), Expect = 4e-17, Method: Composition-based stats.
Identities = 45/93 (48%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK C+C S CLKLYC+CFAN C C+C C NN + + + LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K K + +HTKGC+C++S CLK YCEC+
Sbjct: 401 KF-KQKEETKQHKHTKGCHCRKSECLKKYCECF 432
Score = 79.3 bits (194), Expect = 6e-13, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFAN C C+C C NN + + + LERNP AF PK K + +H
Sbjct: 355 YCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNPKF-KQKEETKQHKH 413
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
TKGC+C++S CLK YCEC++AG
Sbjct: 414 TKGCHCRKSECLKKYCECFQAG 435
>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
Length = 896
Score = 93.2 bits (230), Expect = 4e-17, Method: Composition-based stats.
Identities = 50/105 (47%), Positives = 56/105 (53%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYCDCFA G +C C C C N E+ + +Q RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533
Query: 154 KI------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI + L RH +GCNCKRS CLK YCECY
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECY 578
Score = 75.5 bits (184), Expect = 8e-12, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 48/94 (51%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YCDCFA G +C C C C N E+ + +Q RNP AF PKI
Sbjct: 487 YCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAPKIVQPTTDISSHM 546
Query: 49 -GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+ L RH +GCNCKRS CLK YCECY+A
Sbjct: 547 DDENLTTPSSARHKRGCNCKRSMCLKKYCECYQA 580
>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
Length = 503
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/81 (64%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA+G+FC CNCN+C+NN HE +R AI+ CL+RNP AF PKIG+ +G+ + H
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSELGDIKPHH 366
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCEC+EA
Sbjct: 367 NKGCNCKRSGCLKNYCECFEA 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 70/106 (66%), Gaps = 7/106 (6%)
Query: 81 AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
A + T +P N M YC+CFA+G+FC CNCN+C+NN HE +R AI
Sbjct: 287 ASSSTFQPES-------NLHVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAI 339
Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ CL+RNP AF PKIG+ +G+ + H KGCNCKRSGCLKNYCEC+
Sbjct: 340 KVCLDRNPEAFLPKIGQSELGDIKPHHNKGCNCKRSGCLKNYCECF 385
>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
Length = 622
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/81 (64%), Positives = 65/81 (80%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA+G+FC CNCN+C+NN HE +R AI+ CL+RNP AF PKIG+ +G+ + H
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLGDIKPHH 486
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCEC+EA
Sbjct: 487 NKGCNCKRSGCLKNYCECFEA 507
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 69/106 (65%), Gaps = 7/106 (6%)
Query: 81 AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
A T +P N M YC+CFA+G+FC CNCN+C+NN HE +R AI
Sbjct: 407 ASTSTFQPES-------NLQVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAI 459
Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ CL+RNP AF PKIG+ +G+ + H KGCNCKRSGCLKNYCEC+
Sbjct: 460 KVCLDRNPEAFLPKIGQSKLGDIKPHHNKGCNCKRSGCLKNYCECF 505
>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
Length = 429
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 56/93 (60%), Gaps = 3/93 (3%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC S CLKLYC+CFANGE+C CNC SC N E R AI L +NP+AFRPK
Sbjct: 218 CNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRPKF 277
Query: 156 G--KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
G K H KGC C RS C+K YCEC+
Sbjct: 278 GANKSRQSAAPSAHVKGCRCARSKCIKRYCECF 310
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/85 (51%), Positives = 51/85 (60%), Gaps = 3/85 (3%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG--KCLVGEGE 57
YC+CFANGE+C CNC SC N E R AI L +NP+AFRPK G K
Sbjct: 229 YCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRPKFGANKSRQSAAP 288
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAG 82
H KGC C RS C+K YCEC++AG
Sbjct: 289 SAHVKGCRCARSKCIKRYCECFQAG 313
>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 471
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/96 (52%), Positives = 61/96 (63%), Gaps = 9/96 (9%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
+ +K CNC K+ CLKLYCDCFA GE C +CNC CF N + + R AI ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCF-NTSNSEKRAGAIVSIMDRQPDA 185
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
F PK+ + H KGCNC RSGCLK YCECY
Sbjct: 186 FGPKV-------QQNTHKKGCNCTRSGCLKKYCECY 214
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 50/84 (59%), Gaps = 9/84 (10%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA GE C +CNC CFN + + R AI ++R P+AF PK+ +
Sbjct: 143 YCDCFAAGELCGGECNCFGCFN-TSNSEKRAGAIVSIMDRQPDAFGPKVQ-------QNT 194
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H KGCNC RSGCLK YCECY G
Sbjct: 195 HKKGCNCTRSGCLKKYCECYTLGV 218
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 54/130 (41%), Gaps = 40/130 (30%)
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAG---------------------------ART 85
V ++ K CNCK++ CLK YC+C+ AG
Sbjct: 121 VFSSNQKKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFNTSNSEKRAGAIVSIMD 180
Query: 86 LEPNGIRPR-------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
+P+ P+ K CNCT+S CLK YC+C+ G C + C C C N D +
Sbjct: 181 RQPDAFGPKVQQNTHKKGCNCTRSGCLKKYCECYTLGVNCGEYCKCTQCKN-----VDLN 235
Query: 138 HAIRQCLERN 147
H+ R L N
Sbjct: 236 HSERLALSMN 245
>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
Length = 492
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/68 (64%), Positives = 52/68 (76%), Gaps = 1/68 (1%)
Query: 14 CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLK 73
C+C +C NN+ +E DR AI+ CLERNP AF+PKIGK + ER H KGCNCK+S CLK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIGKGR-ADTERLHNKGCNCKKSSCLK 340
Query: 74 NYCECYEA 81
NYCECYEA
Sbjct: 341 NYCECYEA 348
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/66 (63%), Positives = 50/66 (75%), Gaps = 1/66 (1%)
Query: 121 CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLK 180
C+C +C NN+ +E DR AI+ CLERNP AF+PKIGK + ER H KGCNCK+S CLK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIGKGR-ADTERLHNKGCNCKKSSCLK 340
Query: 181 NYCECY 186
NYCECY
Sbjct: 341 NYCECY 346
>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 805
Score = 92.4 bits (228), Expect = 7e-17, Method: Composition-based stats.
Identities = 47/98 (47%), Positives = 61/98 (62%), Gaps = 8/98 (8%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC K+ CLKLYCDCFA GEFC +CNC C N + +++ R+ + LE+NP AF K
Sbjct: 480 CNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKD 539
Query: 156 GK-------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ L + + KGCNC+RSGCLK YC+CY
Sbjct: 540 IEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCY 577
Score = 77.4 bits (189), Expect = 3e-12, Method: Composition-based stats.
Identities = 40/90 (44%), Positives = 54/90 (60%), Gaps = 8/90 (8%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-------CL 52
YCDCFA GEFC +CNC C N + +++ R+ + LE+NP AF K + L
Sbjct: 491 YCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKDIEIEEPSQLSL 550
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
+ + KGCNC+RSGCLK YC+CY+ G
Sbjct: 551 KAQKKLASKKGCNCRRSGCLKKYCQCYQDG 580
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 36/125 (28%), Positives = 48/125 (38%), Gaps = 50/125 (40%)
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAG--------------------ARTLEPNGIRP 93
G G + T+ CNCK++ CLK YC+C+ AG AR G+
Sbjct: 470 GNGSNKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLE 529
Query: 94 RKP-----------------------------CNCTKSMCLKLYCDCFANGEFCYQ-CNC 123
+ P CNC +S CLK YC C+ +G C + C C
Sbjct: 530 KNPFAFNVKDIEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCYQDGLQCGEHCKC 589
Query: 124 NSCFN 128
N C N
Sbjct: 590 NGCEN 594
>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
Length = 284
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 63/94 (67%), Gaps = 2/94 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C KS CLK+YCDCF +G +C C C C+NN +H+ +R AI+ LE+NP+AF+P
Sbjct: 13 KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQP 72
Query: 154 KIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
K+ G +H KGCNC+++ CLK YCEC+
Sbjct: 73 KVATFRDSGGLIVKHNKGCNCRKTACLKKYCECF 106
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 57/85 (67%), Gaps = 2/85 (2%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-R 58
YCDCF +G +C C C C+NN +H+ +R AI+ LE+NP+AF+PK+ G
Sbjct: 26 YCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQPKVATFRDSGGLIV 85
Query: 59 RHTKGCNCKRSGCLKNYCECYEAGA 83
+H KGCNC+++ CLK YCEC+++G
Sbjct: 86 KHNKGCNCRKTACLKKYCECFQSGV 110
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 46/112 (41%), Gaps = 44/112 (39%)
Query: 61 TKGCNCKRSGCLKNYCECYEAGA--------------------------RTLE--PNGIR 92
KGC C +S CLK YC+C+ +G LE P+ +
Sbjct: 12 VKGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQ 71
Query: 93 PR---------------KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
P+ K CNC K+ CLK YC+CF +G C + C C+ C N
Sbjct: 72 PKVATFRDSGGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKN 123
>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
Length = 554
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/58 (68%), Positives = 47/58 (81%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF
Sbjct: 476 RKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAF 533
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 35/44 (79%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 44
YCDCFANGEFC C C CFNN+++E +R AIR CL+RNP+AF
Sbjct: 490 YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAF 533
>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
CCMP2712]
Length = 113
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K C+C KS CLKLYC+CFA GE C C C C N+ +HED R A+ +RN NAF P
Sbjct: 5 KKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFAP 64
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI + + + H +GC CK+S CLK YCECY
Sbjct: 65 KIVD-EIQQDKGMHARGCRCKKSHCLKKYCECY 96
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 53/84 (63%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA GE C C C C N+ +HED R A+ +RN NAF PKI + + + H
Sbjct: 19 YCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFAPKIVD-EIQQDKGMH 77
Query: 61 TKGCNCKRSGCLKNYCECYEAGAR 84
+GC CK+S CLK YCECY+AG +
Sbjct: 78 ARGCRCKKSHCLKKYCECYQAGVQ 101
>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
Length = 664
Score = 91.3 bits (225), Expect = 2e-16, Method: Composition-based stats.
Identities = 45/96 (46%), Positives = 55/96 (57%), Gaps = 7/96 (7%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNA 150
R ++PCNC +SMCLK+YC+CFA G FC C+C SC N A L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
F K+ H +GC CKRS CLK YCEC+
Sbjct: 320 FEVKV------TAATGHRRGCRCKRSKCLKKYCECF 349
Score = 72.8 bits (177), Expect = 6e-11, Method: Composition-based stats.
Identities = 39/85 (45%), Positives = 45/85 (52%), Gaps = 7/85 (8%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G FC C+C SC N A L +NPNAF K+
Sbjct: 276 YCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNAFEVKV------TAATG 329
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
H +GC CKRS CLK YCEC+ AGAR
Sbjct: 330 HRRGCRCKRSKCLKKYCECFHAGAR 354
Score = 58.9 bits (141), Expect = 8e-07, Method: Composition-based stats.
Identities = 38/114 (33%), Positives = 46/114 (40%), Gaps = 38/114 (33%)
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAG------------------------ARTL-- 86
+G + R + CNCKRS CLK YCEC+ AG AR +
Sbjct: 254 LGASQAREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVL 313
Query: 87 --EPNGIR--------PRKPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFN 128
PN R+ C C +S CLK YC+CF G C C C C N
Sbjct: 314 AKNPNAFEVKVTAATGHRRGCRCKRSKCLKKYCECFHAGARCNPEVCQCEDCRN 367
Score = 39.7 bits (91), Expect = 0.60, Method: Composition-based stats.
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
+G + R + CNCKRS CLK YCEC+
Sbjct: 254 LGASQAREQRPCNCKRSMCLKMYCECF 280
>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
Length = 509
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/95 (49%), Positives = 58/95 (61%), Gaps = 10/95 (10%)
Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
S CLKLYC+CFA G++C CNC C N +E+E R AI RNPNAF+PKI +
Sbjct: 78 SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIP 137
Query: 162 EGER----------RHTKGCNCKRSGCLKNYCECY 186
R +H KGC+CK+SGCLK YCEC+
Sbjct: 138 PSVRKDAGALPSLPKHNKGCHCKKSGCLKKYCECF 172
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 55/91 (60%), Gaps = 10/91 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA G++C CNC C N +E+E R AI RNPNAF+PKI + R
Sbjct: 84 YCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIPPSVRKD 143
Query: 59 --------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 144 AGALPSLPKHNKGCHCKKSGCLKKYCECFQA 174
>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
Length = 523
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/60 (75%), Positives = 53/60 (88%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNCTKS+CLKLYCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 455 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/44 (68%), Positives = 37/44 (84%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 44
YCDCFANGEFC CNC +C+NN+EHE++R AI+ CL+RNP AF
Sbjct: 471 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514
>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
Length = 735
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 40/88 (45%), Positives = 55/88 (62%), Gaps = 2/88 (2%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+P K C+C S CLK+YC+CFA+ + C C C C NN +H + +A LERNP+AF
Sbjct: 323 QPTKMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAF 382
Query: 152 RPKIGKCLVGEGER--RHTKGCNCKRSG 177
PK + GE + +H KGC+C+RSG
Sbjct: 383 GPKFKQSFNGEKPKAEKHLKGCHCRRSG 410
Score = 72.0 bits (175), Expect = 1e-10, Method: Composition-based stats.
Identities = 32/72 (44%), Positives = 44/72 (61%), Gaps = 2/72 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA+ + C C C C NN +H + +A LERNP+AF PK + GE +
Sbjct: 339 YCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPKFKQSFNGEKPKAE 398
Query: 59 RHTKGCNCKRSG 70
+H KGC+C+RSG
Sbjct: 399 KHLKGCHCRRSG 410
>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
Length = 466
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/81 (60%), Positives = 63/81 (77%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
+CDCFA+G+F CNCN+C NN+ HE +R AI+ CL R+P AF+P+IGK +GE RH
Sbjct: 308 HCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLGEVRPRH 367
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+KGCNC+R GCL+NYCEC EA
Sbjct: 368 SKGCNCRRLGCLRNYCECQEA 388
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/82 (58%), Positives = 62/82 (75%)
Query: 104 CLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 163
L +CDCFA+G+F CNCN+C NN+ HE +R AI+ CL R+P AF+P+IGK +GE
Sbjct: 304 TLAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLGEV 363
Query: 164 ERRHTKGCNCKRSGCLKNYCEC 185
RH+KGCNC+R GCL+NYCEC
Sbjct: 364 RPRHSKGCNCRRLGCLRNYCEC 385
>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
Length = 460
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 66/99 (66%), Gaps = 7/99 (7%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC + CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPK
Sbjct: 98 KQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPK 157
Query: 155 IGKCL-----VGEGER--RHTKGCNCKRSGCLKNYCECY 186
I + R +H KGCNCK+SGCLK YCEC+
Sbjct: 158 IANTQHAIRDSADDMRLGKHNKGCNCKKSGCLKKYCECF 196
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 59/88 (67%), Gaps = 7/88 (7%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE 55
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI +
Sbjct: 111 YCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSAD 170
Query: 56 GER--RHTKGCNCKRSGCLKNYCECYEA 81
R +H KGCNCK+SGCLK YCEC++A
Sbjct: 171 DMRLGKHNKGCNCKKSGCLKKYCECFQA 198
>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
Length = 461
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 66/99 (66%), Gaps = 7/99 (7%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC + CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPK
Sbjct: 98 KQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPK 157
Query: 155 IGKCL-----VGEGER--RHTKGCNCKRSGCLKNYCECY 186
I + R +H KGCNCK+SGCLK YCEC+
Sbjct: 158 IANTQHAIRDSADDMRLGKHNKGCNCKKSGCLKKYCECF 196
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 47/88 (53%), Positives = 59/88 (67%), Gaps = 7/88 (7%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE 55
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI +
Sbjct: 111 YCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSAD 170
Query: 56 GER--RHTKGCNCKRSGCLKNYCECYEA 81
R +H KGCNCK+SGCLK YCEC++A
Sbjct: 171 DMRLGKHNKGCNCKKSGCLKKYCECFQA 198
>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
gi|223944643|gb|ACN26405.1| unknown [Zea mays]
gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
Length = 499
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/95 (48%), Positives = 57/95 (60%), Gaps = 10/95 (10%)
Query: 102 SMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVG 161
S CLKLYC+CFA G++C CNC C N +E+E R AI RNPNAF+PKI +
Sbjct: 80 SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIP 139
Query: 162 E----------GERRHTKGCNCKRSGCLKNYCECY 186
+H KGC+CK+SGCLK YCEC+
Sbjct: 140 SSVWKDAGALPSHPKHNKGCHCKKSGCLKKYCECF 174
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 54/91 (59%), Gaps = 10/91 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE----- 55
YC+CFA G++C CNC C N +E+E R AI RNPNAF+PKI +
Sbjct: 86 YCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPSSVWKD 145
Query: 56 -----GERRHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 146 AGALPSHPKHNKGCHCKKSGCLKKYCECFQA 176
>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1040
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 71/140 (50%), Gaps = 37/140 (26%)
Query: 84 RTLEPNGIRPRK-PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
+TL P +K CNC KS CLKLYCDCF G+FC +CNC C N ++ +R AI+
Sbjct: 700 QTLRPQTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERESAIK 759
Query: 142 QCLERNPNAFRPKIG--------------------KCLVGEGERR--------------- 166
Q ERNP+AF+PKI L+ + + +
Sbjct: 760 QLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKNQIQYLIPKIIKDDEEILI 819
Query: 167 HTKGCNCKRSGCLKNYCECY 186
H KGCNCK+SGC K YCECY
Sbjct: 820 HNKGCNCKKSGCEKKYCECY 839
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 74/184 (40%), Gaps = 57/184 (30%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---------- 49
YCDCF G+FC +CNC C N ++ +R AI+Q ERNP+AF+PKI
Sbjct: 725 YCDCFNQGQFCNSECNCTECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYL 784
Query: 50 ----------KCLVGEGERR---------------HTKGCNCKRSGCLKNYCECYEAGA- 83
L+ + + + H KGCNCK+SGC K YCECY G
Sbjct: 785 SKLQIFMLIISSLLSKNQIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVK 844
Query: 84 ----------RTLEPNGI-----RPRKPCNCTKSMCLKL-----YCDCFANGEFCYQCNC 123
R +P+ I P N S + D + N E YQ N
Sbjct: 845 CSDQCKCEGCRNRDPSEIVKINNAQNNPSNGNSSFSNQYSGNNKQVDQYLNNEELYQHNS 904
Query: 124 NSCF 127
S
Sbjct: 905 QSML 908
>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
Length = 116
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 61/101 (60%), Gaps = 20/101 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 50
YC+CFA+G +C CNC +C+NN E+E R A+ LERNPNAFRPKIG
Sbjct: 1 YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60
Query: 51 ---CLVGEGER-------RHTKGCNCKRSGCLKNYCECYEA 81
L+ E R RH KGC CK+SGCLK YCEC++A
Sbjct: 61 CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQA 101
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 59/99 (59%), Gaps = 20/99 (20%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 157
YC+CFA+G +C CNC +C+NN E+E R A+ LERNPNAFRPKIG
Sbjct: 1 YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60
Query: 158 ---CLVGEGER-------RHTKGCNCKRSGCLKNYCECY 186
L+ E R RH KGC CK+SGCLK YCEC+
Sbjct: 61 CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECF 99
>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
[Cucumis sativus]
Length = 581
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/102 (50%), Positives = 59/102 (57%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C + C+C CFN HE RQ RNP AF PK+
Sbjct: 289 CNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKV 348
Query: 156 GK--CLVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
L+ E RH +GCNCK+SGCLK YCECY
Sbjct: 349 IMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECY 390
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 52/95 (54%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--CLVGE-- 55
YC+CFA G +C + C+C CFN HE RQ RNP AF PK+ L+ E
Sbjct: 300 YCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELR 359
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+SGCLK YCECY+ G
Sbjct: 360 DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGV 394
>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
Length = 602
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/102 (50%), Positives = 59/102 (57%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C + C+C CFN HE RQ RNP AF PK+
Sbjct: 310 CNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKV 369
Query: 156 GK--CLVGE---------GERRHTKGCNCKRSGCLKNYCECY 186
L+ E RH +GCNCK+SGCLK YCECY
Sbjct: 370 IMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECY 411
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 52/95 (54%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--CLVGE-- 55
YC+CFA G +C + C+C CFN HE RQ RNP AF PK+ L+ E
Sbjct: 321 YCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELR 380
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+SGCLK YCECY+ G
Sbjct: 381 DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGV 415
>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 546
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 62/99 (62%), Gaps = 7/99 (7%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC +S C+KLYC+C+A G +C C+C SC NN + E R+ AI+ L +NP AF PK
Sbjct: 69 KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128
Query: 155 I-------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
I K + +H KGC CKRSGC+K YCEC+
Sbjct: 129 ISQKQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECF 167
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 55/90 (61%), Gaps = 7/90 (7%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-------GKCLV 53
YC+C+A G +C C+C SC NN + E R+ AI+ L +NP AF PKI K +
Sbjct: 82 YCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPKISQKQETASKVVD 141
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+H KGC CKRSGC+K YCEC++AG
Sbjct: 142 PLNLGKHNKGCACKRSGCMKKYCECFQAGV 171
>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
Length = 485
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 69/115 (60%), Gaps = 13/115 (11%)
Query: 85 TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
+ E + P+K CNC S CLK+YC+CF ++C CNC++C N + +E+ R+ AI
Sbjct: 58 STEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 117
Query: 143 CLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
+RNP+AF+PKIG R +H KGC+CK+SGCLK YCECY
Sbjct: 118 IRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECY 172
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 57/92 (61%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CF ++C CNC++C N + +E+ R+ AI +RNP+AF+PKIG R
Sbjct: 83 YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKD 142
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCECY+A
Sbjct: 143 NSGAVPLVAKHHKGCHCKKSGCLKKYCECYQA 174
>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
Length = 774
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 60/102 (58%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C + C+C CFN HED +Q RNP AF PK+
Sbjct: 488 CNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKV 547
Query: 156 GK-----CLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
+ +G+ RH +GCNCK+S CLK YCECY
Sbjct: 548 IRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECY 589
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + +G
Sbjct: 499 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 558
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 559 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 593
>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
Length = 623
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC KS CLKLYC+CFA G +C + C+C CFN HED +Q RNP AF P
Sbjct: 339 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAP 398
Query: 154 KI---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
K+ ++ GE RH +GCNCK+S CLK YCECY
Sbjct: 399 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 442
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ ++ G
Sbjct: 352 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 411
Query: 57 E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 412 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 446
>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
gi|223943031|gb|ACN25599.1| unknown [Zea mays]
gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
Length = 530
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 70/119 (58%), Gaps = 21/119 (17%)
Query: 85 TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
+ E + P+K CNC S CLK+YC+CF ++C CNC++C N + +E+ R+ AI
Sbjct: 58 STEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 117
Query: 143 CLERNPNAFRPKIGK---------------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+RNP+AF+PKIG LV +H KGC+CK+SGCLK YCECY
Sbjct: 118 IRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVA----KHHKGCHCKKSGCLKKYCECY 172
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 58/96 (60%), Gaps = 19/96 (19%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK---------- 50
YC+CF ++C CNC++C N + +E+ R+ AI +RNP+AF+PKIG
Sbjct: 83 YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKD 142
Query: 51 -----CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
LV +H KGC+CK+SGCLK YCECY+A
Sbjct: 143 NSGAVPLVA----KHHKGCHCKKSGCLKKYCECYQA 174
>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
Length = 665
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 59/106 (55%), Gaps = 14/106 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC K+ CLKLYCDCFA G +C + C C CFN E+ED +Q RNP AF P
Sbjct: 487 KRCNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAP 546
Query: 154 KIGKCLVGE-------------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + + RH +GCNCK+S CLK YCECY
Sbjct: 547 KVMQTVTASPTTGKVVEKLSTPSSARHKRGCNCKKSMCLKKYCECY 592
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 51/95 (53%), Gaps = 14/95 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
YCDCFA G +C + C C CFN E+ED +Q RNP AF PK+ + +
Sbjct: 500 YCDCFAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKVMQTVTASPTTG 559
Query: 56 ---------GERRHTKGCNCKRSGCLKNYCECYEA 81
RH +GCNCK+S CLK YCECY+A
Sbjct: 560 KVVEKLSTPSSARHKRGCNCKKSMCLKKYCECYQA 594
>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 510
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 73/147 (49%), Gaps = 14/147 (9%)
Query: 44 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSM 103
+P++G +V + + + Y + T+ +G+ P CNC +S
Sbjct: 176 LKPEVGLPVVPSQVASLPPSTSSQVTPSQPPTLPAYGSNVTTM--DGVAPG--CNCKRSR 231
Query: 104 CLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCL 159
CLKLYC CFA FC CNC +C N R AIRQ +ERNPNAF+ K G+
Sbjct: 232 CLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDA 291
Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
V H GC C++S CLK YCEC+
Sbjct: 292 V------HKMGCKCRKSACLKKYCECF 312
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 50/88 (56%), Gaps = 10/88 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC CFA FC CNC +C N R AIRQ +ERNPNAF+ K G+ V
Sbjct: 236 YCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDAV--- 292
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
H GC C++S CLK YCEC+ AGAR
Sbjct: 293 ---HKMGCKCRKSACLKKYCECFNAGAR 317
>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 6/86 (6%)
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLV 160
LKLYC+CF GE+C+ C+C+ C N ++ D R AI+ L+RN +AF+PK+ GK +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVN 264
Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
+G +H +GCNCKRSGCLK YCECY
Sbjct: 265 NKG--KHLRGCNCKRSGCLKKYCECY 288
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 61/87 (70%), Gaps = 6/87 (6%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CF GE+C+ C+C+ C N ++ D R AI+ L+RN +AF+PK+ GK + +G
Sbjct: 208 YCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVNNKG 267
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGA 83
+H +GCNCKRSGCLK YCECY+AG
Sbjct: 268 --KHLRGCNCKRSGCLKKYCECYQAGV 292
>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 511
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 73/147 (49%), Gaps = 14/147 (9%)
Query: 44 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSM 103
+P++G +V + + + Y + T+ +G+ P CNC +S
Sbjct: 176 LKPEVGLPVVPSQVASLPPSTSSQVTPSQPPTLPAYGSNVTTM--DGVAPG--CNCKRSR 231
Query: 104 CLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCL 159
CLKLYC CFA FC CNC +C N R AIRQ +ERNPNAF+ K G+
Sbjct: 232 CLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDA 291
Query: 160 VGEGERRHTKGCNCKRSGCLKNYCECY 186
V H GC C++S CLK YCEC+
Sbjct: 292 V------HKMGCKCRKSACLKKYCECF 312
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 50/88 (56%), Gaps = 10/88 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC CFA FC CNC +C N R AIRQ +ERNPNAF+ K G+ V
Sbjct: 236 YCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKFRPNGEDAV--- 292
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
H GC C++S CLK YCEC+ AGAR
Sbjct: 293 ---HKMGCKCRKSACLKKYCECFNAGAR 317
>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/86 (52%), Positives = 61/86 (70%), Gaps = 6/86 (6%)
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLV 160
LKLYC+CF GE+C+ C+C+ C N ++ D R AI+ L+RN +AF+PK+ GK +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVN 264
Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
+G +H +GCNCKRSGCLK YCECY
Sbjct: 265 NKG--KHLRGCNCKRSGCLKKYCECY 288
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 61/87 (70%), Gaps = 6/87 (6%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CF GE+C+ C+C+ C N ++ D R AI+ L+RN +AF+PK+ GK + +G
Sbjct: 208 YCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKSGKQVNNKG 267
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGA 83
+H +GCNCKRSGCLK YCECY+AG
Sbjct: 268 --KHLRGCNCKRSGCLKKYCECYQAGV 292
>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 4/98 (4%)
Query: 93 PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
P K CNC S C+KLYC+C+ N FC C+C++CFNN + R A++ LE+NP+AF
Sbjct: 14 PSKKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAF 73
Query: 152 RPKIGKCLVGEGER---RHTKGCNCKRSGCLKNYCECY 186
+PKI +H KGC CK+SGC+K YCEC+
Sbjct: 74 QPKITTSDSKPDPLNFGKHNKGCQCKKSGCMKKYCECF 111
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+C+ N FC C+C++CFNN + R A++ LE+NP+AF+PKI
Sbjct: 29 YCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQPKITTSDSKPDPLN 88
Query: 59 --RHTKGCNCKRSGCLKNYCECYEAGA 83
+H KGC CK+SGC+K YCEC++A
Sbjct: 89 FGKHNKGCQCKKSGCMKKYCECFQAKV 115
>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
Length = 729
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 60/104 (57%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC +S CLKLYCDCFA G +C + C+C CFN HED +Q RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509
Query: 154 KIGKCL--VGE---------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + V + RH +GCNCK+S CLK YCEC+
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECF 553
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 52/95 (54%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL--VGE-- 55
YCDCFA G +C + C+C CFN HED +Q RNP AF PK+ + V +
Sbjct: 463 YCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAPKVIRSTDYVSDFG 522
Query: 56 -------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCEC++ G
Sbjct: 523 DETNKTPASARHKRGCNCKKSNCLKKYCECFQGGV 557
>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 421
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/110 (45%), Positives = 63/110 (57%), Gaps = 8/110 (7%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
T + GI RK C C +S CLKLYCDCFA+G C C+C C NN ++ D R A+ L
Sbjct: 54 TTDQEGITWRKRCRCKQSKCLKLYCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVL 113
Query: 145 ERNPNAFRPKI------GKCLVGEGER--RHTKGCNCKRSGCLKNYCECY 186
RNPNAF K+ +C R ++GC CKR+ CLK YCEC+
Sbjct: 114 GRNPNAFNEKLFSSINDKQCKAAPDTRPGLLSRGCKCKRTKCLKKYCECF 163
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 51/91 (56%), Gaps = 8/91 (8%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLVG 54
YCDCFA+G C C+C C NN ++ D R A+ L RNPNAF K+ +C
Sbjct: 77 YCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSSINDKQCKAA 136
Query: 55 EGER--RHTKGCNCKRSGCLKNYCECYEAGA 83
R ++GC CKR+ CLK YCEC++A
Sbjct: 137 PDTRPGLLSRGCKCKRTKCLKKYCECFQANV 167
>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
Length = 641
Score = 87.8 bits (216), Expect = 2e-15, Method: Composition-based stats.
Identities = 46/115 (40%), Positives = 62/115 (53%), Gaps = 12/115 (10%)
Query: 84 RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQ 142
R L+ + + + C+C KS CLKLYC CFA +C + C+C C NN HE+ H +Q
Sbjct: 346 RKLQNDDGKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQ 405
Query: 143 CLERNPNAFRPKIGKCLVGEGE-----------RRHTKGCNCKRSGCLKNYCECY 186
RNP AF P + + E RH +GCNC++S CLK YCEC+
Sbjct: 406 TESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKKYCECF 460
Score = 70.5 bits (171), Expect = 3e-10, Method: Composition-based stats.
Identities = 36/92 (39%), Positives = 48/92 (52%), Gaps = 12/92 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-- 57
YC CFA +C + C+C C NN HE+ H +Q RNP AF P + + E
Sbjct: 370 YCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELG 429
Query: 58 ---------RRHTKGCNCKRSGCLKNYCECYE 80
RH +GCNC++S CLK YCEC++
Sbjct: 430 DDSNNTPASARHKRGCNCRKSSCLKKYCECFQ 461
>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
distachyon]
Length = 487
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/104 (50%), Positives = 63/104 (60%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC S CLKLYC+CFA C CNC++C NN E+E R AI L RNP AF+P
Sbjct: 67 RKHCNCKNSRCLKLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQP 126
Query: 154 KI--GKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
KI G V + +H KGC+CK+SGCLK YCEC+
Sbjct: 127 KIENGPSTVNVRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCECF 170
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 55/92 (59%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--GKCLVGEGER 58
YC+CFA C CNC++C NN E+E R AI L RNP AF+PKI G V +
Sbjct: 81 YCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVNVRKD 140
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 141 NSGAVPLRPKHNKGCHCKKSGCLKKYCECFQA 172
>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/97 (50%), Positives = 63/97 (64%), Gaps = 4/97 (4%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
I KPCNC KS CLKLYCDCFA + C +CNC CFNN + +R+ + +ERNP
Sbjct: 103 IEEVKPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPE 162
Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF K+ + + + H+KGCNC++SGC K YCECY
Sbjct: 163 AFNQKVKEV---DQKMAHSKGCNCRKSGCKKKYCECY 196
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 53/84 (63%), Gaps = 4/84 (4%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA + C +CNC CFNN + +R+ + +ERNP AF K+ + + +
Sbjct: 120 YCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKVKEV---DQKMA 176
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H+KGCNC++SGC K YCECY+ G
Sbjct: 177 HSKGCNCRKSGCKKKYCECYQMGI 200
>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
Length = 260
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 65/105 (61%), Gaps = 4/105 (3%)
Query: 83 ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR 141
A ++ ++ CNC KS CLKLYCDCFA GE+C C+C C NN +++ R A++
Sbjct: 50 ANNIQKLKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREMALK 109
Query: 142 QCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
Q ERNPNAF +I + H +GCNCK+SGC K YC+CY
Sbjct: 110 QNKERNPNAFTSRI---IQQSQLNNHQRGCNCKKSGCQKKYCQCY 151
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 52/84 (61%), Gaps = 4/84 (4%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA GE+C C+C C NN +++ R A++Q ERNPNAF +I +
Sbjct: 75 YCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAFTSRI---IQQSQLNN 131
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H +GCNCK+SGC K YC+CY G
Sbjct: 132 HQRGCNCKKSGCQKKYCQCYLDGM 155
>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 444
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 8/99 (8%)
Query: 96 PCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
PCNC +S CLKLYC+CFA+G +C +C CN C NN ++ + A + LERNP AF PK
Sbjct: 317 PCNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPK 376
Query: 155 IGKCLVGEGER-------RHTKGCNCKRSGCLKNYCECY 186
I + E H +GCNCKRS C K YCEC+
Sbjct: 377 ISESTAAVTEEGQVLRVAAHHRGCNCKRSNCRKKYCECF 415
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 53/91 (58%), Gaps = 8/91 (8%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+CFA+G +C +C CN C NN ++ + A + LERNP AF PKI + E
Sbjct: 329 YCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKISESTAAVTEEG 388
Query: 59 ------RHTKGCNCKRSGCLKNYCECYEAGA 83
H +GCNCKRS C K YCEC++AG
Sbjct: 389 QVLRVAAHHRGCNCKRSNCRKKYCECFQAGV 419
Score = 35.8 bits (81), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 39/104 (37%), Gaps = 49/104 (47%)
Query: 64 CNCKRSGCLKNYCECYEAGA--------------------------RTLE--PNGIRPR- 94
CNCKRS CLK YCEC+ +G+ RTLE P P+
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377
Query: 95 --------------------KPCNCTKSMCLKLYCDCFANGEFC 118
+ CNC +S C K YC+CF G C
Sbjct: 378 SESTAAVTEEGQVLRVAAHHRGCNCKRSNCRKKYCECFQAGVPC 421
>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 394
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 6/93 (6%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+C A C ++CNC C N E +R A+ L+RNP AF+P
Sbjct: 49 KGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPLAFQP 108
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K V G +H +GCNC++SGC+KNYCEC+
Sbjct: 109 K-----VASGSSQHLRGCNCRKSGCMKNYCECH 136
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 53/84 (63%), Gaps = 6/84 (7%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+C A C ++CNC C N E +R A+ L+RNP AF+PK V G +
Sbjct: 62 YCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPLAFQPK-----VASGSSQ 116
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H +GCNC++SGC+KNYCEC++AG
Sbjct: 117 HLRGCNCRKSGCMKNYCECHQAGV 140
>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
Length = 780
Score = 86.7 bits (213), Expect = 4e-15, Method: Composition-based stats.
Identities = 45/103 (43%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CF G FC + C+C C N + + Q RNP AF P
Sbjct: 477 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAP 536
Query: 154 KIGKCLVGE----------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + G+ RH +GCNCK+S CLK YCECY
Sbjct: 537 KVIRTEPGQELADDSNKTPASSRHKRGCNCKKSSCLKKYCECY 579
Score = 72.0 bits (175), Expect = 1e-10, Method: Composition-based stats.
Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 11/93 (11%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
YC+CF G FC + C+C C N + + Q RNP AF PK+ + G+
Sbjct: 490 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQELAD 549
Query: 56 ------GERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCECY+ G
Sbjct: 550 DSNKTPASSRHKRGCNCKKSSCLKKYCECYQGG 582
>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
Length = 407
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 58/93 (62%), Gaps = 6/93 (6%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+C A C + CNC C N E +R A+ LERNP AF P
Sbjct: 49 KGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPLAFLP 108
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K V G +H +GCNC++SGC+KNYCEC+
Sbjct: 109 K-----VASGSSQHLRGCNCRKSGCMKNYCECH 136
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 51/84 (60%), Gaps = 6/84 (7%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+C A C + CNC C N E +R A+ LERNP AF PK V G +
Sbjct: 62 YCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPLAFLPK-----VASGSSQ 116
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H +GCNC++SGC+KNYCEC++AG
Sbjct: 117 HLRGCNCRKSGCMKNYCECHQAGV 140
>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1084
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/120 (46%), Positives = 68/120 (56%), Gaps = 12/120 (10%)
Query: 79 YEAGARTLEP-NGIRPRKPC-NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDR 136
Y A +++P N RK C NC ++ CLKLYC+CFA C C C CFN E E R
Sbjct: 513 YTAQNSSIKPANHAEDRKICCNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMR 572
Query: 137 HHAIRQCLERNPNAFRPKIG-----KCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
A + LERN +AF PKI K L V ++ H+KGCNCK+S CLK YCECY
Sbjct: 573 KEARQAILERNNSAFDPKIDSQSKLKQLCNEEGVDNEQKVHSKGCNCKKSNCLKKYCECY 632
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 53/94 (56%), Gaps = 10/94 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCL--- 52
YC+CFA C C C CFN E E R A + LERN +AF PKI K L
Sbjct: 544 YCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKIDSQSKLKQLCNE 603
Query: 53 --VGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
V ++ H+KGCNCK+S CLK YCECY+ G +
Sbjct: 604 EGVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVK 637
>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
Length = 448
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 11/104 (10%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA G +C C+C C N + +E R AI +RNP AF+P
Sbjct: 29 KKHCNCKNSQCLKLYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQP 88
Query: 154 KI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI +H KGC+CK+SGCLK YCEC+
Sbjct: 89 KIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECF 132
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
YC+CFA G +C C+C C N + +E R AI +RNP AF+PKI
Sbjct: 43 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 102
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 103 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 134
>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
Length = 873
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/105 (49%), Positives = 58/105 (55%), Gaps = 13/105 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC K+ CLKLYCDCFA G +C C C CFN E+ED +Q RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512
Query: 154 KI---GKCLVGEGE---------RRHTKGCNCKRSGCLKNYCECY 186
KI K E RH +GCNCK+S CLK YCECY
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECY 557
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 50/94 (53%), Gaps = 13/94 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YCDCFA G +C C C CFN E+ED +Q RNP AF PKI K
Sbjct: 466 YCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHAKEFAASR 525
Query: 57 E---------RRHTKGCNCKRSGCLKNYCECYEA 81
E RH +GCNCK+S CLK YCECY+A
Sbjct: 526 EDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQA 559
>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
Length = 727
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 53/143 (37%), Positives = 64/143 (44%), Gaps = 43/143 (30%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFAN 114
EGE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 428 EGEGEACKRCNCKKSKCLKLYCECFAAGVYCIEP-------------------------- 461
Query: 115 GEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------G 163
C+C CFN HED +Q RNP AF PK+ + +G+
Sbjct: 462 ------CSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIGDEFSKTPA 515
Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S CLK YCECY
Sbjct: 516 SARHKRGCNCKKSNCLKKYCECY 538
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 53/94 (56%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + +G
Sbjct: 448 YCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIG 507
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 508 DEFSKTPASARHKRGCNCKKSNCLKKYCECYQGG 541
>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
Length = 348
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 57/104 (54%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC +S CLKLYC+CFA G +C C C CFN E+ED +Q RNP AF P
Sbjct: 20 KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79
Query: 154 KIGKC-----------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
K+ K RH +GCNCK+S CLK YCECY
Sbjct: 80 KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECY 123
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
YC+CFA G +C C C CFN E+ED +Q RNP AF PK+ K
Sbjct: 33 YCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAPKVVKQATNSPANE 92
Query: 52 ---LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCECY+AG
Sbjct: 93 EGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGV 127
>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
Length = 916
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 35/60 (58%), Positives = 42/60 (70%)
Query: 81 AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 140
A + +L+ G RK CNC+KS CLKLYCDCFANGEFC C C CFNN+++E R AI
Sbjct: 741 AASASLDVLGGSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800
Score = 59.3 bits (142), Expect = 7e-07, Method: Composition-based stats.
Identities = 21/33 (63%), Positives = 24/33 (72%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAI 33
YCDCFANGEFC C C CFNN+++E R AI
Sbjct: 768 YCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800
>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 707
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 38/64 (59%), Positives = 46/64 (71%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
RPRKPCNC S CLKLYCDCFA G +C C+C +C N ++ R +AIR LERNP+AF
Sbjct: 99 RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158
Query: 152 RPKI 155
R K+
Sbjct: 159 RSKV 162
Score = 76.6 bits (187), Expect = 5e-12, Method: Composition-based stats.
Identities = 49/152 (32%), Positives = 70/152 (46%), Gaps = 24/152 (15%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA G +C C+C +C N ++ R +AIR LERNP+AFR K+ +
Sbjct: 115 YCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAFRSKVVTLEYDTSSKDT 174
Query: 61 TKGCNCKR---------------SGCLKNYCECYEAGARTLEPNGIRPR--------KPC 97
NC S ++ EA + + +P K C
Sbjct: 175 ASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPRTTGTTTQRHAQHIKGC 234
Query: 98 NCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
+C +S+CLK YC+CF NG +C C C++C N
Sbjct: 235 SCLRSLCLKKYCECFQNGVYCSASCRCSNCRN 266
Score = 46.6 bits (109), Expect = 0.004, Method: Composition-based stats.
Identities = 47/159 (29%), Positives = 62/159 (38%), Gaps = 25/159 (15%)
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA----------------RTLEPNGIRP-- 93
L G R K CNCK S CLK YC+C+ AG TL N IR
Sbjct: 92 LEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSL 151
Query: 94 -RKPCNCTKSMCLKLYCDC----FANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL-ERN 147
R P + +S + L D A+ + C + ++ ED Q E
Sbjct: 152 ERNP-HAFRSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEA 210
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + +H KGC+C RS CLK YCEC+
Sbjct: 211 SSTSTQPRTTGTTTQRHAQHIKGCSCLRSLCLKKYCECF 249
>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 609
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 61/102 (59%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C + C+C +CFN H+D +Q RNP AF PK+
Sbjct: 330 CNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKV 389
Query: 156 ---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
++ GE RH +GCNCK+S CLK YCECY
Sbjct: 390 IRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C +CFN H+D +Q RNP AF PK+ ++ G
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400
Query: 57 E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 435
>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
Length = 792
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 46/95 (48%), Positives = 53/95 (55%), Gaps = 10/95 (10%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC +S CLKLYC CFA C CNC+ C+NN E R AI LERNP AF K
Sbjct: 26 CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKF 85
Query: 156 GKCLVGEGERR----HTKGCNCKRSGCLKNYCECY 186
E E+ H GC C++S CLK YCEC+
Sbjct: 86 -----RETEKHDAVVHKTGCKCRKSACLKKYCECF 115
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 43/104 (41%), Positives = 49/104 (47%), Gaps = 18/104 (17%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA C CNC+ C+NN E R AI LERNP AF K E E+
Sbjct: 37 YCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKFR-----ETEKH 91
Query: 60 ----HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
H GC C++S CLK YCEC+ G E CNC
Sbjct: 92 DAVVHKTGCKCRKSACLKKYCECFNKGVACSE--------KCNC 127
>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
Length = 609
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 61/102 (59%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C + C+C +CFN H+D +Q RNP AF PK+
Sbjct: 330 CNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKV 389
Query: 156 ---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
++ GE RH +GCNCK+S CLK YCECY
Sbjct: 390 IRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECY 431
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 54/95 (56%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C +CFN H+D +Q RNP AF PK+ ++ G
Sbjct: 341 YCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVG 400
Query: 57 E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 401 EDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 435
>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1114
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 44/108 (40%), Positives = 57/108 (52%), Gaps = 18/108 (16%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC K+ CLKLYC CF G C QC C C+N I + ++R AI+ +R P+AF K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773
Query: 156 G-----KCLVGEGER------------RHTKGCNCKRSGCLKNYCECY 186
+ + ER H KGCNCK+S C K YCEC+
Sbjct: 774 NFLPELARKIEDAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECF 821
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 51/102 (50%), Gaps = 18/102 (17%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCLVG 54
YC CF G C QC C C+N I + ++R AI+ +R P+AF K+ +
Sbjct: 725 YCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKVNFLPELARKIE 784
Query: 55 EGER------------RHTKGCNCKRSGCLKNYCECYEAGAR 84
+ ER H KGCNCK+S C K YCEC+ AG +
Sbjct: 785 DAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECFLAGVK 826
>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
Length = 176
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 49/89 (55%), Positives = 56/89 (62%), Gaps = 5/89 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYCDCF G C CNC SC NN EH +R I+Q +ERNP+AFRPK+
Sbjct: 85 CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKV 144
Query: 156 GKCLVGEGER----RHTKGCNCKRSGCLK 180
E E+ RH KGCNCK+S K
Sbjct: 145 ESKSNSEDEQDHKPRHFKGCNCKKSNWSK 173
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YCDCF G C CNC SC NN EH +R I+Q +ERNP+AFRPK+ E E+
Sbjct: 96 YCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKVESKSNSEDEQD 155
Query: 59 ---RHTKGCNCKRSGCLK 73
RH KGCNCK+S K
Sbjct: 156 HKPRHFKGCNCKKSNWSK 173
>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 760
Score = 85.9 bits (211), Expect = 6e-15, Method: Composition-based stats.
Identities = 45/103 (43%), Positives = 56/103 (54%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CF G FC + C+C C N + + Q RNP AF P
Sbjct: 457 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAP 516
Query: 154 KIGKCLVGE----------GERRHTKGCNCKRSGCLKNYCECY 186
K+ + G+ RH +GCNCK+S CLK YCECY
Sbjct: 517 KVIRTEPGQELADDSNKTPASSRHKRGCNCKKSSCLKKYCECY 559
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 11/93 (11%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE---- 55
YC+CF G FC + C+C C N + + Q RNP AF PK+ + G+
Sbjct: 470 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQELAD 529
Query: 56 ------GERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCECY+ G
Sbjct: 530 DSNKTPASSRHKRGCNCKKSSCLKKYCECYQGG 562
>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
Length = 767
Score = 85.9 bits (211), Expect = 7e-15, Method: Composition-based stats.
Identities = 46/104 (44%), Positives = 59/104 (56%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA G +C + C+C C NN H + +Q RNP AF P
Sbjct: 457 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAP 516
Query: 154 KI-----GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
K+ +G+ RH +GCNCK+S CLK YCEC+
Sbjct: 517 KVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 560
Score = 71.2 bits (173), Expect = 2e-10, Method: Composition-based stats.
Identities = 38/94 (40%), Positives = 51/94 (54%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVG 54
YC+CFA G +C + C+C C NN H + +Q RNP AF PK+ +G
Sbjct: 470 YCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSESGLELG 529
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCEC++ G
Sbjct: 530 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 563
>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
Length = 497
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 19/118 (16%)
Query: 85 TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
+ E + P+K CNC S CLK+YC+CF ++C CNC++C N + +E+ R+ AI
Sbjct: 56 STEVKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEA 115
Query: 143 CLERNPNAFRPKI--------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+RNP+AF+PKI G V +H KGC+CK+SGCLK YCECY
Sbjct: 116 IRQRNPSAFQPKIENGPNTLNVRKDNSGAVPVVA---KHHKGCHCKKSGCLKKYCECY 170
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 17/95 (17%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------------ 48
YC+CF ++C CNC++C N + +E+ R+ AI +RNP+AF+PKI
Sbjct: 81 YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIENGPNTLNVRKD 140
Query: 49 --GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
G V +H KGC+CK+SGCLK YCECY+A
Sbjct: 141 NSGAVPVVA---KHHKGCHCKKSGCLKKYCECYQA 172
>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
Length = 710
Score = 85.5 bits (210), Expect = 8e-15, Method: Composition-based stats.
Identities = 50/122 (40%), Positives = 57/122 (46%), Gaps = 30/122 (24%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA G +C C C CFN E D +Q RNP AF P
Sbjct: 390 KRCHCKKSKCLKLYCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAP 449
Query: 154 KIGK-----------------------------CLVGEGERRHTKGCNCKRSGCLKNYCE 184
KI + C G RH +GCNCK+S CLK YCE
Sbjct: 450 KIVQTSDNTPTEGVGIWFSGHAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCE 509
Query: 185 CY 186
CY
Sbjct: 510 CY 511
Score = 72.4 bits (176), Expect = 7e-11, Method: Composition-based stats.
Identities = 50/136 (36%), Positives = 61/136 (44%), Gaps = 18/136 (13%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G +C C C CFN E D +Q RNP AF PKI V +
Sbjct: 403 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKI----VQTSDNT 458
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPN------GIRPRKPCNCTKSMCLKLYCDCFA 113
T+G SG L+PN R ++ CNC KS+CLK YC+C+
Sbjct: 459 PTEGVGIWFSG------HAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQ 512
Query: 114 NGEFCYQ-CNCNSCFN 128
G C C CN C N
Sbjct: 513 AGVGCSDGCKCNGCKN 528
>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
Length = 620
Score = 85.5 bits (210), Expect = 9e-15, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G FC++C+C +C N ++ +Q RNPNAF KI G+G +H
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKI--VASGDGGGQH 335
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
KGC+CK+S CLK YCEC++AG
Sbjct: 336 KKGCHCKKSACLKKYCECFQAG 357
Score = 82.4 bits (202), Expect = 7e-14, Method: Composition-based stats.
Identities = 38/80 (47%), Positives = 51/80 (63%), Gaps = 2/80 (2%)
Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
LYC+CFA G FC++C+C +C N ++ +Q RNPNAF KI G+G +
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKI--VASGDGGGQ 334
Query: 167 HTKGCNCKRSGCLKNYCECY 186
H KGC+CK+S CLK YCEC+
Sbjct: 335 HKKGCHCKKSACLKKYCECF 354
Score = 40.4 bits (93), Expect = 0.36, Method: Composition-based stats.
Identities = 28/94 (29%), Positives = 34/94 (36%), Gaps = 40/94 (42%)
Query: 75 YCECYEAGARTLE---------------------------PN------------GIRPRK 95
YCEC+ AG E PN G + +K
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGGGQHKK 337
Query: 96 PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFN 128
C+C KS CLK YC+CF G C C C+ C N
Sbjct: 338 GCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKN 371
>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas
reinhardtii]
gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
Length = 91
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 53/91 (58%), Gaps = 10/91 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG----------K 50
YC+CFA+ +C CNC CFNN E+E R A+ +ERNPNAF+PKI
Sbjct: 1 YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60
Query: 51 CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
RH KGCNCK+S CLK YCEC++
Sbjct: 61 VAAAGASGRHLKGCNCKKSFCLKKYCECFQV 91
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/89 (48%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG----------K 157
YC+CFA+ +C CNC CFNN E+E R A+ +ERNPNAF+PKI
Sbjct: 1 YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60
Query: 158 CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RH KGCNCK+S CLK YCEC+
Sbjct: 61 VAAAGASGRHLKGCNCKKSFCLKKYCECF 89
>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
Length = 772
Score = 85.1 bits (209), Expect = 1e-14, Method: Composition-based stats.
Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA G +C + C+C C NN H + +Q RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAP 521
Query: 154 KI------GKCLVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
K+ G L + RH +GCNCK+S CLK YCEC+
Sbjct: 522 KVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECF 565
Score = 70.5 bits (171), Expect = 3e-10, Method: Composition-based stats.
Identities = 39/94 (41%), Positives = 51/94 (54%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
YC+CFA G +C + C+C C NN H + +Q RNP AF PK+ G L
Sbjct: 475 YCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELR 534
Query: 54 GEGER-----RHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCEC++ G
Sbjct: 535 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 568
>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
Length = 782
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF PK+
Sbjct: 458 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 517
Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
G+ + GE RH +GCNCK+S CLK YCECY
Sbjct: 518 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 560
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ G+ +
Sbjct: 469 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 528
Query: 54 GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
GE RH +GCNCK+S CLK YCECY+ G
Sbjct: 529 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 564
>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
Length = 782
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF PK+
Sbjct: 470 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 529
Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
G+ + GE RH +GCNCK+S CLK YCECY
Sbjct: 530 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 572
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ G+ +
Sbjct: 481 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 540
Query: 54 GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
GE RH +GCNCK+S CLK YCECY+ G
Sbjct: 541 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 576
>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
Length = 794
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 60/103 (58%), Gaps = 13/103 (12%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF PK+
Sbjct: 470 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 529
Query: 156 ------GKCLVGE------GERRHTKGCNCKRSGCLKNYCECY 186
G+ + GE RH +GCNCK+S CLK YCECY
Sbjct: 530 IRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 572
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 13/96 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ G+ +
Sbjct: 481 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDIT 540
Query: 54 GE------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
GE RH +GCNCK+S CLK YCECY+ G
Sbjct: 541 GEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 576
>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 455
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 86/193 (44%), Gaps = 50/193 (25%)
Query: 1 YCDCFANGEFC--YQCNCNSCFNNIEH---EDDRHHAIRQCLERNPNAFRPKIGKCLVGE 55
YCDCF G C CNC +C N I + R +AIR L RNP AF
Sbjct: 271 YCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAFV---------- 320
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
T G A A P G + CNC +S CLKLYC CF G
Sbjct: 321 -----TAGV----------------ASAVQKLPAG---QVACNCIRSRCLKLYCTCFQAG 356
Query: 116 EFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNC 173
+ C C C +C N++E +R AI+ L++ P+AF+ K +R GC C
Sbjct: 357 KMCQPGICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK---------DRPVGLGCAC 407
Query: 174 KRSGCLKNYCECY 186
K + C++ YCEC+
Sbjct: 408 KNNKCIRKYCECF 420
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 48/95 (50%), Gaps = 6/95 (6%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEH---EDDRHHAIRQCLERNPNAF 151
C+C KS C+ LYCDCF G C CNC +C N I + R +AIR L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
G + CNC RS CLK YC C+
Sbjct: 320 V-TAGVASAVQKLPAGQVACNCIRSRCLKLYCTCF 353
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 11/84 (13%)
Query: 1 YCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
YC CF G+ C C C +C N++E +R AI+ L++ P+AF+ K +R
Sbjct: 349 YCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK---------DR 399
Query: 59 RHTKGCNCKRSGCLKNYCECYEAG 82
GC CK + C++ YCEC+ G
Sbjct: 400 PVGLGCACKNNKCIRKYCECFRNG 423
>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
Length = 596
Score = 84.7 bits (208), Expect = 2e-14, Method: Composition-based stats.
Identities = 57/130 (43%), Positives = 65/130 (50%), Gaps = 37/130 (28%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+GE+C CNCN C NNIE+E R I LERNPNAFRP
Sbjct: 33 KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92
Query: 154 KIGKCLV-------------------------------------GEGERRHTKGCNCKRS 176
KI G +H KGC CK+S
Sbjct: 93 KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152
Query: 177 GCLKNYCECY 186
GCLK YCEC+
Sbjct: 153 GCLKKYCECF 162
Score = 70.1 bits (170), Expect = 4e-10, Method: Composition-based stats.
Identities = 50/119 (42%), Positives = 58/119 (48%), Gaps = 37/119 (31%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
YC+CFA+GE+C CNCN C NNIE+E R I LERNPNAFRPKI
Sbjct: 47 YCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRPKIASLPPQSPQITQ 106
Query: 54 ------------------------------GEGERRHTKGCNCKRSGCLKNYCECYEAG 82
G +H KGC CK+SGCLK YCEC++AG
Sbjct: 107 KVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAG 165
>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 58/100 (58%), Gaps = 14/100 (14%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
P I K CNC KS CLK YCDCFANG+ C + CNC CFNN + + R A Q + R
Sbjct: 82 PKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINR 141
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+P AF ++ KGCNCK+SGC K YCEC+
Sbjct: 142 DPGAF-------------KQSFKGCNCKKSGCQKKYCECF 168
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 49/83 (59%), Gaps = 14/83 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANG+ C + CNC CFNN + + R A Q + R+P AF+ +
Sbjct: 102 YCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFK-------------Q 148
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
KGCNCK+SGC K YCEC+ +G
Sbjct: 149 SFKGCNCKKSGCQKKYCECFLSG 171
>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
Length = 561
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 58/106 (54%), Gaps = 14/106 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K CNC K+ CLK YCDCFA G +C + C C CFN +E+ED A +Q NP AF P
Sbjct: 426 KRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAP 485
Query: 154 KIGK-------------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI + L RH GCNCKRS C+K YCECY
Sbjct: 486 KIVQHVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMCVKKYCECY 531
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 51/95 (53%), Gaps = 14/95 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YCDCFA G +C + C C CFN +E+ED A +Q NP AF PKI +
Sbjct: 439 YCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAPKIVQHVTEFHAIN 498
Query: 51 ----CLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
L RH GCNCKRS C+K YCECY+A
Sbjct: 499 VEDASLFTPSSGRHKTGCNCKRSMCVKKYCECYQA 533
>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
Length = 611
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 64/104 (61%), Gaps = 14/104 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFR 152
K C C KS CLKLYC+CFA G +C + C+C +CFN HED + R+ +E RNP AF
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKS-REVIEARNPLAFA 489
Query: 153 PKIGKC------LVGEGER-----RHTKGCNCKRSGCLKNYCEC 185
PK+ L E + RHT+GCNC++SGCLK YCEC
Sbjct: 490 PKVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCEC 533
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 57/97 (58%), Gaps = 14/97 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIGKC------L 52
YC+CFA G +C + C+C +CFN HED + R+ +E RNP AF PK+ L
Sbjct: 444 YCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKS-REVIEARNPLAFAPKVVSTSDTAIDL 502
Query: 53 VGEGER-----RHTKGCNCKRSGCLKNYCECYEAGAR 84
E + RHT+GCNC++SGCLK YCEC G R
Sbjct: 503 WVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVR 539
>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
Length = 553
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/83 (49%), Positives = 54/83 (65%), Gaps = 2/83 (2%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG
Sbjct: 162 CNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIG 221
Query: 157 KCLVGEGERRHTKGCNCKRSGCL 179
R + + N +G +
Sbjct: 222 SS--PHANRNNMQAANAAVNGAI 242
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 2/72 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA+G +C CNC++CFNN+++E R AI LERNP+AFRPKIG R +
Sbjct: 173 YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSS--PHANRNN 230
Query: 61 TKGCNCKRSGCL 72
+ N +G +
Sbjct: 231 MQAANAAVNGAI 242
>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
Length = 1430
Score = 83.6 bits (205), Expect = 3e-14, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 57/101 (56%), Gaps = 11/101 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH------HAIRQCLERN- 147
KPCNC KS CLKLYC+C+A G FC CNC++C N E + + H HA+ + +
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPE-DGEAHPPYPATHALGRLRHGDV 1070
Query: 148 --PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P A K + ++R GC CK+S CLK YCEC+
Sbjct: 1071 AVPTAVLTSAQK-RKPQSKQRQESGCFCKKSKCLKKYCECF 1110
Score = 64.7 bits (156), Expect = 2e-08, Method: Composition-based stats.
Identities = 35/91 (38%), Positives = 49/91 (53%), Gaps = 11/91 (12%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRH------HAIRQCLERN---PNAFRPKIGKC 51
YC+C+A G FC CNC++C N E + + H HA+ + + P A K
Sbjct: 1025 YCECYAAGVFCRDCNCSNCLNKPE-DGEAHPPYPATHALGRLRHGDVAVPTAVLTSAQK- 1082
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
+ ++R GC CK+S CLK YCEC+EAG
Sbjct: 1083 RKPQSKQRQESGCFCKKSKCLKKYCECFEAG 1113
Score = 55.5 bits (132), Expect = 9e-06, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 42/116 (36%), Gaps = 49/116 (42%)
Query: 62 KGCNCKRSGCLKNYCECYEAGARTLEPN-------------------------------- 89
K CNCK+S CLK YCECY AG + N
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHALGRLRHGDVA 1071
Query: 90 -------GIRPRKP---------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
+ RKP C C KS CLK YC+CF G C C C +C N
Sbjct: 1072 VPTAVLTSAQKRKPQSKQRQESGCFCKKSKCLKKYCECFEAGVHCEASCKCENCEN 1127
>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
Length = 929
Score = 83.2 bits (204), Expect = 4e-14, Method: Composition-based stats.
Identities = 53/140 (37%), Positives = 66/140 (47%), Gaps = 41/140 (29%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
YCDCFANG +C C+C+SC N +E+ + AI+Q RNPNAF KI G
Sbjct: 541 YCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQ---RNPNAFVQKIEADAALGG 597
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGE 116
+ H +GCNCK+S C+K YCEC++AG PC GE
Sbjct: 598 Q--HRRGCNCKKSHCMKKYCECFQAGV------------PC-----------------GE 626
Query: 117 FCYQCNCNSCFNNIEHEDDR 136
C C SC N H R
Sbjct: 627 ---HCKCESCHNTAGHSGRR 643
Score = 79.0 bits (193), Expect = 9e-13, Method: Composition-based stats.
Identities = 41/84 (48%), Positives = 53/84 (63%), Gaps = 9/84 (10%)
Query: 107 LYCDCFANGEFC-YQCNCNSCFNNIEHE---DDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
LYCDCFANG +C C+C+SC N +E+ + AI+Q RNPNAF KI
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQ---RNPNAFVQKIEADAALG 596
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
G+ H +GCNCK+S C+K YCEC+
Sbjct: 597 GQ--HRRGCNCKKSHCMKKYCECF 618
>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
Length = 159
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 58/100 (58%), Gaps = 14/100 (14%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
P I K CNC KS CLK YCDCFANG+ C + CNC CFNN + + R A Q + R
Sbjct: 53 PKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNR 112
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+P AF ++ KGCNCK+SGC K YCEC+
Sbjct: 113 DPGAF-------------KQSFKGCNCKKSGCQKKYCECF 139
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 14/83 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANG+ C + CNC CFNN + + R A Q + R+P AF+ +
Sbjct: 73 YCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFK-------------Q 119
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
KGCNCK+SGC K YCEC+ G
Sbjct: 120 SFKGCNCKKSGCQKKYCECFLNG 142
>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1022
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 56/95 (58%), Gaps = 5/95 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
C C K+ CLKLYC+CF ++C NC C N E+ED R HA+ +RN +AF PK
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825
Query: 157 KC----LVGEGERR-HTKGCNCKRSGCLKNYCECY 186
K G + H+KGCNCK+S C K YCECY
Sbjct: 826 KSDQYYKSDSGTKAIHSKGCNCKKSDCRKKYCECY 860
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 52/89 (58%), Gaps = 5/89 (5%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC----LVGEG 56
YC+CF ++C NC C N E+ED R HA+ +RN +AF PK K G
Sbjct: 777 YCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKADKSDQYYKSDSG 836
Query: 57 ERR-HTKGCNCKRSGCLKNYCECYEAGAR 84
+ H+KGCNCK+S C K YCECY+ GA+
Sbjct: 837 TKAIHSKGCNCKKSDCRKKYCECYQMGAK 865
>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
Length = 223
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/52 (71%), Positives = 43/52 (82%)
Query: 30 HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+ + CL+RNP AF+PKIGK GE +RRH+KGCNCKRSGCLKNYCECYEA
Sbjct: 39 QYVTQACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECYEA 90
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/50 (70%), Positives = 41/50 (82%)
Query: 137 HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + CL+RNP AF+PKIGK GE +RRH+KGCNCKRSGCLKNYCECY
Sbjct: 39 QYVTQACLDRNPEAFKPKIGKGKEGESDRRHSKGCNCKRSGCLKNYCECY 88
>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
Length = 384
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 61/111 (54%), Gaps = 25/111 (22%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFAN C Q CNC C N I++ +R AI + L RNP+AF
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAF---- 247
Query: 156 GKCLVGEGERRHT--------------------KGCNCKRSGCLKNYCECY 186
+C +G+ + + KGCNCK+SGC K YCECY
Sbjct: 248 AQCFQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECY 298
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 67/146 (45%), Gaps = 19/146 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFAN C Q CNC C N I++ +R AI + L RNP+AF +C +G+
Sbjct: 203 YCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAF----AQCFQQKGQ-- 256
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
T+ ++ LK + N RK CNC KS C K YC+C++ C
Sbjct: 257 -TQFSVQQQDKPLK----------EPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCS 305
Query: 120 Q-CNCNSCFNNIEHEDDRHHAIRQCL 144
C C C N + E I Q L
Sbjct: 306 DLCKCEQCLNRTDAEIQAQQDIAQVL 331
>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
distachyon]
Length = 771
Score = 82.8 bits (203), Expect = 6e-14, Method: Composition-based stats.
Identities = 45/104 (43%), Positives = 58/104 (55%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CF G FC + C+C C N + + Q RNP AF P
Sbjct: 465 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIESRNPLAFAP 524
Query: 154 KI------GKCLVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
K+ G+ L + + RH +GCNCK+S CLK YCEC+
Sbjct: 525 KVIRTSEPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECF 568
Score = 68.2 bits (165), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/94 (39%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLV 53
YC+CF G FC + C+C C N + + Q RNP AF PK+ G+ L
Sbjct: 478 YCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIESRNPLAFAPKVIRTSEPGQELG 537
Query: 54 GEGER-----RHTKGCNCKRSGCLKNYCECYEAG 82
+ + RH +GCNCK+S CLK YCEC++ G
Sbjct: 538 EQSNKTPASARHKRGCNCKKSSCLKKYCECFQGG 571
>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
Length = 551
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/107 (51%), Positives = 69/107 (64%), Gaps = 17/107 (15%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRP
Sbjct: 94 QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRP 153
Query: 154 KI--------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KI G+ L+ +H KGC+CK+SGCLK YCEC+
Sbjct: 154 KIASSPHRTRDIREDAGEILI---LGKHNKGCHCKKSGCLKKYCECF 197
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/95 (49%), Positives = 61/95 (64%), Gaps = 17/95 (17%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------------ 48
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI
Sbjct: 108 YCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHRTRDIRE 167
Query: 49 --GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
G+ L+ +H KGC+CK+SGCLK YCEC++A
Sbjct: 168 DAGEILI---LGKHNKGCHCKKSGCLKKYCECFQA 199
>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 710
Score = 82.4 bits (202), Expect = 8e-14, Method: Composition-based stats.
Identities = 42/93 (45%), Positives = 55/93 (59%), Gaps = 2/93 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C+KS CLKLYC+CFA G C C C C N + D A + ++R+P AF
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEI 584
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K+ + G + +H KGC CK+SGC K YCEC+
Sbjct: 585 KVSQN-SGTDKLQHRKGCTCKKSGCQKGYCECF 616
Score = 66.2 bits (160), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G C C C C N + D A + ++R+P AF K+ + G + +
Sbjct: 538 YCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVSQN-SGTDKLQ 596
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
H KGC CK+SGC K YCEC++ G
Sbjct: 597 HRKGCTCKKSGCQKGYCECFQLG 619
Score = 45.4 bits (106), Expect = 0.010, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 48/130 (36%), Gaps = 44/130 (33%)
Query: 61 TKGCNCKRSGCLKNYCECYEAG-------------------------------------- 82
TK C C +S CLK YCEC+ AG
Sbjct: 524 TKKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFE 583
Query: 83 ---ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
++ + ++ RK C C KS C K YC+CF G C C C+ C N + D
Sbjct: 584 IKVSQNSGTDKLQHRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSD--QP 641
Query: 139 AIRQCLERNP 148
++ LE P
Sbjct: 642 CLQTSLESKP 651
>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
Length = 586
Score = 82.4 bits (202), Expect = 8e-14, Method: Composition-based stats.
Identities = 50/99 (50%), Positives = 61/99 (61%), Gaps = 9/99 (9%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
C+C S CLKLYC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVS 233
Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K G+ +HTKGC CKR+ CLK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTECLKKYCECF 272
Score = 69.7 bits (169), Expect = 5e-10, Method: Composition-based stats.
Identities = 44/90 (48%), Positives = 55/90 (61%), Gaps = 9/90 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
YC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+ K
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNSTAHNCESKA 244
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
G+ +HTKGC CKR+ CLK YCEC++A
Sbjct: 245 AEGDIVGKHTKGCKCKRTECLKKYCECFKA 274
>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 60/102 (58%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
CNC KS CLKLYC+CFAN C Q CNC C N I++ ++R AI + L RNP AF
Sbjct: 164 CNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTIL 223
Query: 152 -----RPKIGKCLVGEGE--RRHTKGCNCKRSGCLKNYCECY 186
+P+I + E + KGCNCK+S C K YCECY
Sbjct: 224 TNNGQQPQIIPEPKSQKEQSKETKKGCNCKKSECKKKYCECY 265
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 52/96 (54%), Gaps = 12/96 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---------RPKIGK 50
YC+CFAN C Q CNC C N I++ ++R AI + L RNP AF +P+I
Sbjct: 175 YCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTILTNNGQQPQIIP 234
Query: 51 CLVGEGE--RRHTKGCNCKRSGCLKNYCECYEAGAR 84
+ E + KGCNCK+S C K YCECY +
Sbjct: 235 EPKSQKEQSKETKKGCNCKKSECKKKYCECYSINQK 270
>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
Length = 800
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 58/102 (56%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF PK+
Sbjct: 484 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKV 543
Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
+ GE RH +GCNCK+S CLK YCECY
Sbjct: 544 IRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 585
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ + G
Sbjct: 495 YCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKVIRMSDAGLETG 554
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 555 EDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 589
>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
Length = 175
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 56/100 (56%), Gaps = 14/100 (14%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
P + K CNC KS CLK YCDCFANG+ C + CNC CFNN + R A Q R
Sbjct: 69 PKIVSSTKTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTR 128
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+P AF ++ KGCNCK+SGC K YCEC+
Sbjct: 129 DPRAF-------------KQAFKGCNCKKSGCQKKYCECF 155
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 14/81 (17%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANG+ C + CNC CFNN + R A Q R+P AF+ +
Sbjct: 89 YCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFK-------------Q 135
Query: 60 HTKGCNCKRSGCLKNYCECYE 80
KGCNCK+SGC K YCEC++
Sbjct: 136 AFKGCNCKKSGCQKKYCECFQ 156
>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
Length = 190
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 56/100 (56%), Gaps = 14/100 (14%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
P + K CNC KS CLK YCDCFANG+ C + CNC CFNN + R A Q R
Sbjct: 84 PKIVTSTKTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQAR 143
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+P AF ++ KGCNCK+SGC K YCEC+
Sbjct: 144 DPGAF-------------KQAFKGCNCKKSGCQKKYCECF 170
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 14/81 (17%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANG+ C + CNC CFNN + R A Q R+P AF+ +
Sbjct: 104 YCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFK-------------Q 150
Query: 60 HTKGCNCKRSGCLKNYCECYE 80
KGCNCK+SGC K YCEC++
Sbjct: 151 AFKGCNCKKSGCQKKYCECFQ 171
>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 974
Score = 81.3 bits (199), Expect = 1e-13, Method: Composition-based stats.
Identities = 46/100 (46%), Positives = 56/100 (56%), Gaps = 10/100 (10%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC K+ CLKLYCDCFA GE C Q C C C NN + R+ AI L+RNP AF K
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVKP 695
Query: 156 GKCLVGEGERRHT---------KGCNCKRSGCLKNYCECY 186
+ + E ++ GCNC++S CLK YC CY
Sbjct: 696 EEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCY 735
Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats.
Identities = 43/111 (38%), Positives = 55/111 (49%), Gaps = 18/111 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA GE C Q C C C NN + R+ AI L+RNP AF K + + E
Sbjct: 647 YCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVKPEEAKKSKKELN 706
Query: 60 HTK---------GCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTK 101
++ GCNC++S CLK YC CY +G + E C CT+
Sbjct: 707 NSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSE--------ACKCTE 749
>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 81.3 bits (199), Expect = 1e-13, Method: Composition-based stats.
Identities = 42/94 (44%), Positives = 53/94 (56%), Gaps = 12/94 (12%)
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL---- 159
L+LYC+CFA G +C + C C CFN HED +Q RNP AF PK+ +
Sbjct: 467 LQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPAP 526
Query: 160 -VGE------GERRHTKGCNCKRSGCLKNYCECY 186
+G+ RH +GCNCK+S CLK YCECY
Sbjct: 527 EIGDESSKTPASARHKRGCNCKKSSCLKKYCECY 560
Score = 80.9 bits (198), Expect = 2e-13, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C C CFN HED +Q RNP AF PK+ + +G
Sbjct: 470 YCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPAPEIG 529
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 530 DESSKTPASARHKRGCNCKKSSCLKKYCECYQGG 563
>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
Length = 196
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 55/93 (59%), Gaps = 14/93 (15%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+ CNC KS CLK YCDCFANG+ C + CNC CFNN + + R A Q + R+P AF+
Sbjct: 97 RSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQ 156
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
KGCNCK+SGC K YCEC+
Sbjct: 157 SF-------------KGCNCKKSGCQKKYCECF 176
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 14/83 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFANG+ C + CNC CFNN + + R A Q + R+P AF+
Sbjct: 110 YCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQSF----------- 158
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
KGCNCK+SGC K YCEC+ +G
Sbjct: 159 --KGCNCKKSGCQKKYCECFLSG 179
>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 405
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/102 (54%), Positives = 67/102 (65%), Gaps = 10/102 (9%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K CNC S CLKLYC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPK
Sbjct: 26 KQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPK 85
Query: 155 IGKCL--------VGEGER--RHTKGCNCKRSGCLKNYCECY 186
I VGE +H KGC+CK+SGCLK YCEC+
Sbjct: 86 IFSSPGIRDSGEDVGEHPLAGKHNKGCHCKKSGCLKKYCECF 127
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 60/91 (65%), Gaps = 10/91 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-------- 52
YC+CFA+G +C CNC +C NN+E+E R A+ LERNPNAFRPKI
Sbjct: 39 YCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPKIFSSPGIRDSGED 98
Query: 53 VGEGER--RHTKGCNCKRSGCLKNYCECYEA 81
VGE +H KGC+CK+SGCLK YCEC++A
Sbjct: 99 VGEHPLAGKHNKGCHCKKSGCLKKYCECFQA 129
>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
Length = 348
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 59/102 (57%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CF N C Q CNC C N I++ ++R AI + L RNP AF P +
Sbjct: 168 CNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPIL 227
Query: 156 G-----KCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
++ E ++ KGCNCK+S C K YCECY
Sbjct: 228 TNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECY 269
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 51/96 (53%), Gaps = 12/96 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG-----KCLVG 54
YC+CF N C Q CNC C N I++ ++R AI + L RNP AF P + ++
Sbjct: 179 YCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPILTNNGQQPQVIQ 238
Query: 55 EG------ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
E ++ KGCNCK+S C K YCECY +
Sbjct: 239 EQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQK 274
>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
Length = 776
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 58/102 (56%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF PK+
Sbjct: 466 CSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKV 525
Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
+ GE RH +GCNCK+S CLK YCECY
Sbjct: 526 IRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECY 567
Score = 69.3 bits (168), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ + G
Sbjct: 477 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGLETG 536
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 537 EDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 571
>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
Length = 695
Score = 80.5 bits (197), Expect = 2e-13, Method: Composition-based stats.
Identities = 49/142 (34%), Positives = 60/142 (42%), Gaps = 43/142 (30%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
GE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 395 GEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 427
Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVGEGE 164
C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 428 -----CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPAS 482
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S C+K YCECY
Sbjct: 483 ARHKRGCNCKKSNCMKKYCECY 504
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 38/94 (40%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YC+CFA G +C + C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 414 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 473
Query: 51 --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S C+K YCECY+ G
Sbjct: 474 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGG 507
>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 695
Score = 80.5 bits (197), Expect = 2e-13, Method: Composition-based stats.
Identities = 49/142 (34%), Positives = 60/142 (42%), Gaps = 43/142 (30%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
GE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 395 GEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 427
Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVGEGE 164
C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 428 -----CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPAS 482
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S C+K YCECY
Sbjct: 483 ARHKRGCNCKKSNCMKKYCECY 504
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 38/94 (40%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YC+CFA G +C + C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 414 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 473
Query: 51 --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S C+K YCECY+ G
Sbjct: 474 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGG 507
>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
Length = 386
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/107 (45%), Positives = 58/107 (54%), Gaps = 17/107 (15%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
CNC KS CLKLYC+CFAN C Q CNC C N I++ +R AI + L RNP+AF
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQCF 251
Query: 152 ------------RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ K K + KGCNCK+SGC K YCECY
Sbjct: 252 QQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECY 298
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 50/101 (49%), Gaps = 17/101 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--------------- 44
YC+CFAN C Q CNC C N I++ +R AI + L RNP+AF
Sbjct: 203 YCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKSQTQFSVQ 262
Query: 45 -RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
+ K K + KGCNCK+SGC K YCECY +
Sbjct: 263 QQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLK 303
>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
Length = 674
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 75/187 (40%), Gaps = 44/187 (23%)
Query: 11 CYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSG 70
Q + + N I + D+ + L+ N PK + + GE K CNCK+S
Sbjct: 325 TLQDSLDQTENEIREDADQDVPVEPALQEL-NLSSPKKKRVKLDSGEGESCKRCNCKKSK 383
Query: 71 CLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNI 130
CLK YCEC+ AG +EP C+C CFN
Sbjct: 384 CLKLYCECFAAGVYCIEP--------------------------------CSCIDCFNKP 411
Query: 131 EHEDDRHHAIRQCLERNPNAFRPKI-----------GKCLVGEGERRHTKGCNCKRSGCL 179
HED +Q RNP AF PK+ RH +GCNCK+S CL
Sbjct: 412 IHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCL 471
Query: 180 KNYCECY 186
K YCECY
Sbjct: 472 KKYCECY 478
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 49/95 (51%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+
Sbjct: 388 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 447
Query: 49 GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCECY+ G
Sbjct: 448 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 482
>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
Length = 613
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 45/108 (41%), Positives = 57/108 (52%), Gaps = 15/108 (13%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
RK CNC ++ CLKLYCDC A G +C C C++C N E+E + R+P AF
Sbjct: 286 RKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFA 345
Query: 153 PKI--------GKCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
P+I L +G RH +GCNCK+S C K YCECY
Sbjct: 346 PRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECY 393
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 38/97 (39%), Positives = 49/97 (50%), Gaps = 15/97 (15%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
YCDC A G +C C C++C N E+E + R+P AF P+I
Sbjct: 300 YCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNV 359
Query: 52 LVGEG------ERRHTKGCNCKRSGCLKNYCECYEAG 82
L +G RH +GCNCK+S C K YCECY+AG
Sbjct: 360 LQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAG 396
>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 235
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/60 (65%), Positives = 44/60 (73%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC S CLKLYC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 157 CNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/49 (63%), Positives = 36/49 (73%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
YC+CFA+G C CNC +CFNN E+E R AI LERNP+AFRPKIG
Sbjct: 168 YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIG 216
>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
Length = 369
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 61/99 (61%), Gaps = 9/99 (9%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
C+C S CLKLYC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVL 233
Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K G+ +HTKGC CKR+ CLK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTECLKKYCECF 272
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 55/92 (59%), Gaps = 9/92 (9%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
YC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+ K
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNSTAHNCESKA 244
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
G+ +HTKGC CKR+ CLK YCEC++A
Sbjct: 245 AEGDIVGKHTKGCKCKRTECLKKYCECFKASV 276
>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
Length = 369
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 62/117 (52%), Gaps = 11/117 (9%)
Query: 80 EAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH 137
EA +T E GI RK C C +S CLKLYCDCFA+G C C+C C NN E D R
Sbjct: 50 EAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCDARE 108
Query: 138 HAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
A+ L RNPNAF K L ++GC CKR+ CLK YCEC+
Sbjct: 109 AAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
YCDCFA+G C C+C C NN E D R A+ L RNPNAF K L
Sbjct: 79 YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138
Query: 54 -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167
>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 368
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 62/117 (52%), Gaps = 11/117 (9%)
Query: 80 EAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRH 137
EA +T E GI RK C C +S CLKLYCDCFA+G C C+C C NN E D R
Sbjct: 50 EAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCDARE 108
Query: 138 HAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
A+ L RNPNAF K L ++GC CKR+ CLK YCEC+
Sbjct: 109 AAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
YCDCFA+G C C+C C NN E D R A+ L RNPNAF K L
Sbjct: 79 YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138
Query: 54 -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167
>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
Length = 356
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 63/120 (52%), Gaps = 11/120 (9%)
Query: 77 ECYEAGARTLEPNGIRPRK--PCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHED 134
E EA +T E GI RK C C +S CLKLYCDCFA+G C C+C C NN E D
Sbjct: 47 EHSEAKDKTDE-EGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKCD 105
Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLV--------GEGERRHTKGCNCKRSGCLKNYCECY 186
R A+ L RNPNAF K L ++GC CKR+ CLK YCEC+
Sbjct: 106 AREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKYCECF 165
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 47/89 (52%), Gaps = 8/89 (8%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLV------- 53
YCDCFA+G C C+C C NN E D R A+ L RNPNAF K L
Sbjct: 79 YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAA 138
Query: 54 -GEGERRHTKGCNCKRSGCLKNYCECYEA 81
++GC CKR+ CLK YCEC++A
Sbjct: 139 PDTKPGLLSRGCKCKRTRCLKKYCECFQA 167
>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 510
Score = 79.0 bits (193), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 41/85 (48%), Positives = 53/85 (62%), Gaps = 4/85 (4%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RR 59
YC+CFA G FC C+C C N E+E +Q +RNP AF KI + G+ R
Sbjct: 208 YCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKI---MADAGDDAR 264
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
HTKGC+CK+S CLK YCEC++AG +
Sbjct: 265 HTKGCHCKKSACLKKYCECFQAGVK 289
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 50/81 (61%), Gaps = 4/81 (4%)
Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG-ER 165
LYC+CFA G FC C+C C N E+E +Q +RNP AF KI + G +
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKI---MADAGDDA 263
Query: 166 RHTKGCNCKRSGCLKNYCECY 186
RHTKGC+CK+S CLK YCEC+
Sbjct: 264 RHTKGCHCKKSACLKKYCECF 284
>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
Length = 651
Score = 79.0 bits (193), Expect = 8e-13, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 54/94 (57%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + ++ G
Sbjct: 365 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLETG 424
Query: 57 E--------RRHTKGCNCKRSGCLKNYCECYEAG 82
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 425 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 458
Score = 76.6 bits (187), Expect = 4e-12, Method: Composition-based stats.
Identities = 41/93 (44%), Positives = 53/93 (56%), Gaps = 12/93 (12%)
Query: 106 KLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVG 161
K YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + ++
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLE 422
Query: 162 EGE--------RRHTKGCNCKRSGCLKNYCECY 186
G+ RH +GCNCK+S CLK YCECY
Sbjct: 423 TGDDASKTPASARHKRGCNCKKSNCLKKYCECY 455
>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
Length = 305
Score = 79.0 bits (193), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 56/90 (62%), Gaps = 2/90 (2%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 156
CNC KS CLKLYC+CFA+ FC CNC +C NN EH +R + ++R+P+ F P +
Sbjct: 189 CNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV- 247
Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
R + +GC C+++ C+K YC+C+
Sbjct: 248 -VWDTPATRTNLRGCRCQKTRCVKKYCDCF 276
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA+ FC CNC +C NN EH +R + ++R+P+ F P + R +
Sbjct: 200 YCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV--VWDTPATRTN 257
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
+GC C+++ C+K YC+C++ G
Sbjct: 258 LRGCRCQKTRCVKKYCDCFQNG 279
>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
distachyon]
Length = 787
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 60/105 (57%), Gaps = 14/105 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA G +C + C+C C N HE+ +Q RNP AF P
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 534
Query: 154 KIGKCLVGEGER------------RHTKGCNCKRSGCLKNYCECY 186
K+ + L G+ RH +GCNCK+S CLK YCECY
Sbjct: 535 KVIR-LSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYCECY 578
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 52/96 (54%), Gaps = 14/96 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+CFA G +C + C+C C N HE+ +Q RNP AF PK+ + L G+
Sbjct: 488 YCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIR-LSDAGQET 546
Query: 59 -----------RHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCECY+ G
Sbjct: 547 QEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGV 582
>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
Length = 374
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 62/104 (59%), Gaps = 14/104 (13%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
CNC KS CLKLYC+CFAN C Q CNC C N I++ ++R AI + L RNP+AF
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228
Query: 152 -----RPKIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
+P+I + + E + KGCNCK+S C K YCECY
Sbjct: 229 TNNGQQPQIIQEEKSQKESQKDITTRKGCNCKKSECKKKYCECY 272
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 76/161 (47%), Gaps = 26/161 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFAN C Q CNC C N I++ ++R AI + L RNP+AF L G++
Sbjct: 180 YCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAF----AAILTNNGQQP 235
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
E ++ I RK CNC KS C K YC+C++ + C
Sbjct: 236 QI----------------IQEEKSQKESQKDITTRKGCNCKKSECKKKYCECYSINQRCT 279
Query: 120 Q-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL 159
C C +C N ++ ++D AI Q L++ N + + + L
Sbjct: 280 DLCKCENCLNKVQPQED---AIEQ-LQKQENPKKESLNQNL 316
>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
Length = 671
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 39/84 (46%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA G FC C+C SC N + Q RNPNAF KI + +H
Sbjct: 359 YCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVD--DDSVDAKH 416
Query: 61 TKGCNCKRSGCLKNYCECYEAGAR 84
KGC+CK+S CLK YCEC++A R
Sbjct: 417 AKGCHCKKSACLKKYCECFQANVR 440
Score = 75.9 bits (185), Expect = 7e-12, Method: Composition-based stats.
Identities = 38/80 (47%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
LYCDCFA G FC C+C SC N + Q RNPNAF KI + +
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVD--DDSVDAK 415
Query: 167 HTKGCNCKRSGCLKNYCECY 186
H KGC+CK+S CLK YCEC+
Sbjct: 416 HAKGCHCKKSACLKKYCECF 435
>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
Length = 612
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 49/99 (49%), Positives = 60/99 (60%), Gaps = 9/99 (9%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI- 155
C+C S CLKLYC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+
Sbjct: 174 CSCKYSKCLKLYCECFEKGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVL 233
Query: 156 --------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K G+ +HTKGC CKR+ LK YCEC+
Sbjct: 234 NSTAHNCESKAAEGDIVGKHTKGCKCKRTEYLKKYCECF 272
Score = 65.5 bits (158), Expect = 9e-09, Method: Composition-based stats.
Identities = 43/90 (47%), Positives = 54/90 (60%), Gaps = 9/90 (10%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
YC+CF G +C CNC +C NN+ HE+ R AI LERNP AF PK+ K
Sbjct: 185 YCECFEKGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNSTAHNCESKA 244
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
G+ +HTKGC CKR+ LK YCEC++A
Sbjct: 245 AEGDIVGKHTKGCKCKRTEYLKKYCECFKA 274
>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 639
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 59/104 (56%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C KS CLKLYC+CF+ G FC + C+C +CFN HED + RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512
Query: 154 KIGKC------LVGEGER-----RHTKGCNCKRSGCLKNYCECY 186
K+ L E + RH +GCNC++SGC K YCEC+
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECF 556
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 53
YC+CF+ G FC + C+C +CFN HED + RNP AF PK+ L
Sbjct: 466 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 525
Query: 54 GEGER-----RHTKGCNCKRSGCLKNYCECYEAGA 83
E + RH +GCNC++SGC K YCEC+ G
Sbjct: 526 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGV 560
>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 766
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA+ +C + C+C CFN+ HE+ + RNP AF P
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 530
Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
K+ G L GE RH +GCNCKRS C+K YCEC+
Sbjct: 531 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 574
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
YC+CFA+ +C + C+C CFN+ HE+ + RNP AF PK+ G L G
Sbjct: 484 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 543
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCKRS C+K YCEC+++G
Sbjct: 544 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 578
>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 757
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA+ +C + C+C CFN+ HE+ + RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 521
Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
K+ G L GE RH +GCNCKRS C+K YCEC+
Sbjct: 522 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 565
Score = 66.6 bits (161), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
YC+CFA+ +C + C+C CFN+ HE+ + RNP AF PK+ G L G
Sbjct: 475 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 534
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCKRS C+K YCEC+++G
Sbjct: 535 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 569
>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
Length = 644
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/104 (46%), Positives = 61/104 (58%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA+ +C + C+C CFN+ HE+ + RNP AF P
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 408
Query: 154 KI----GKCL-VGE------GERRHTKGCNCKRSGCLKNYCECY 186
K+ G L GE RH +GCNCKRS C+K YCEC+
Sbjct: 409 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECF 452
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 54/95 (56%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCL-VG 54
YC+CFA+ +C + C+C CFN+ HE+ + RNP AF PK+ G L G
Sbjct: 362 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFG 421
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCKRS C+K YCEC+++G
Sbjct: 422 EDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGV 456
>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
Length = 646
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 49/71 (69%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
L+ N ++ +K CNC S CLKLYC+C+A G +C CNC +C NN+ +E R AI LE
Sbjct: 139 LKDNTLKKQKRCNCKNSKCLKLYCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLE 198
Query: 146 RNPNAFRPKIG 156
+NPNAFRPKI
Sbjct: 199 KNPNAFRPKIA 209
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
YC+C+A G +C CNC +C NN+ +E R AI LE+NPNAFRPKI
Sbjct: 161 YCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIA 209
>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 886
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/238 (29%), Positives = 102/238 (42%), Gaps = 65/238 (27%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
YCDCF + FC + C C C N +E+ +R A+ LER P AF + G
Sbjct: 22 YCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAFGRESGAF----A 77
Query: 57 ERRHTK--GCNCKRSGCLKNYCECYEAGAR-------------------------TLEPN 89
ERR+ K GC C ++ CLK YC C+ A + T+E
Sbjct: 78 ERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGTIRIEDGVTCTIEGG 137
Query: 90 GIRPRK------------------PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNI 130
+ K C+CT++ CLKLYCDCF C C+C C N+
Sbjct: 138 KVNALKGAAFVPIRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSE 197
Query: 131 EHE---DDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
E+ +R I+ LE+ P+ F+ +R+ GC+C+++ C YC+C
Sbjct: 198 EYSGPMGERTLVIKDILEKRPHIFQKT--------AKRKAGGGCSCQKNKCRAGYCDC 247
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 53/96 (55%), Gaps = 10/96 (10%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHE---DDRHHAIRQCLERNPNAFR 152
C CTK+ CL LYCDCF + FC + C C C N +E+ +R A+ LER P AF
Sbjct: 11 CKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAFG 70
Query: 153 PKIGKCLVGEGERRHTK--GCNCKRSGCLKNYCECY 186
+ G ERR+ K GC C ++ CLK YC C+
Sbjct: 71 RESGAF----AERRNKKDGGCTCAKNSCLKKYCVCF 102
Score = 38.9 bits (89), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 12/88 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHED---DRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
YCDCF C C+C C N+ E+ +R I+ LE+ P+ F+
Sbjct: 174 YCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKRPHIFQKT--------A 225
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
+R+ GC+C+++ C YC+C +G +
Sbjct: 226 KRKAGGGCSCQKNKCRAGYCDCKGSGGK 253
>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 782
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 57/104 (54%), Gaps = 12/104 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA G +C + C+C C N HE+ + RNP AF P
Sbjct: 473 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAP 532
Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
K+ + E RH +GCNCK+S CLK YCECY
Sbjct: 533 KVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYCECY 576
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 49/95 (51%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+CFA G +C + C+C C N HE+ + RNP AF PK+ + E
Sbjct: 486 YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQ 545
Query: 59 ----------RHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCECY+ G
Sbjct: 546 EDPKNTPASARHKRGCNCKKSSCLKKYCECYQGGV 580
>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 766
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 57/102 (55%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CF G FC + C+C C N + + +Q RNP AF PK+
Sbjct: 461 CSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIESRNPLAFAPKV 520
Query: 156 GKCL-----VGE------GERRHTKGCNCKRSGCLKNYCECY 186
+ +GE RH +GCNCK+S CLK YCECY
Sbjct: 521 IRTSEPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECY 562
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CF G FC + C+C C N + + +Q RNP AF PK+ + +G
Sbjct: 472 YCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIESRNPLAFAPKVIRTSEPGQELG 531
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCECY+ G
Sbjct: 532 EYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGV 566
>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
Length = 662
Score = 77.4 bits (189), Expect = 3e-12, Method: Composition-based stats.
Identities = 44/115 (38%), Positives = 58/115 (50%), Gaps = 12/115 (10%)
Query: 84 RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQ 142
R L+ + + CNC +S CLKLYC CFA +C C+C C NN HE+ ++
Sbjct: 354 RKLQKDDGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKR 413
Query: 143 CLERNPNAFRPKI----GKCL-------VGEGERRHTKGCNCKRSGCLKNYCECY 186
NP AF P + G L RH +GCNC++S CLK YCEC+
Sbjct: 414 TESHNPLAFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECF 468
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 44/131 (33%), Positives = 57/131 (43%), Gaps = 24/131 (18%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA +C C+C C NN HE+ ++ NP AF P +
Sbjct: 378 YCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNPLAFAPTV----------T 427
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG-EFC 118
HT G E + +T P R ++ CNC KS CLK YC+CF +G C
Sbjct: 428 HTYG----------TVLEFGDDSNKT--PASARHKRGCNCRKSSCLKKYCECFQSGVGCC 475
Query: 119 YQCNCNSCFNN 129
C C SC N+
Sbjct: 476 ISCRCMSCKNS 486
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 44/112 (39%), Gaps = 36/112 (32%)
Query: 7 NGEFCYQCN-------------------------CNSCFNNIEHEDDRHHAIRQCLERNP 41
+GE C CN C C NN HE+ ++ NP
Sbjct: 360 DGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNP 419
Query: 42 NAFRPKI----GKCL-------VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
AF P + G L RH +GCNC++S CLK YCEC+++G
Sbjct: 420 LAFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSG 471
>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
Length = 808
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 59/123 (47%), Gaps = 31/123 (25%)
Query: 95 KPCNCTKSMCLKL-------------------YCDCFANGEFCYQ-CNCNSCFNNIEHED 134
K CNC KS CLKL YC+CFA G +C + C+C CFN HED
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHED 547
Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYC 183
+Q RNP AF PK+ + E RH +GCNCK+S CLK YC
Sbjct: 548 TVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYC 607
Query: 184 ECY 186
ECY
Sbjct: 608 ECY 610
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 52/98 (53%), Gaps = 12/98 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + E
Sbjct: 520 YCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRSADSVPETG 579
Query: 59 ----------RHTKGCNCKRSGCLKNYCECYEAGARTL 86
RH +GCNCK+S CLK YCECY+ T+
Sbjct: 580 IDPNKTPASARHKRGCNCKKSNCLKKYCECYQVLILTI 617
>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
Length = 587
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 57/102 (55%), Gaps = 12/102 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA G +C + C+C C NN H + +Q RNP AF PK+
Sbjct: 462 CSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKV 521
Query: 156 ------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
G L RH +GCNCK+S CLK YCEC+
Sbjct: 522 IHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 563
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL- 52
YC+CFA G +C + C+C C NN H + +Q RNP AF PK+ G L
Sbjct: 473 YCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKVIHTSEPGMDLG 532
Query: 53 ----VGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S CLK YCEC++ G
Sbjct: 533 DFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGV 567
>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 978
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 64/98 (65%), Gaps = 12/98 (12%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC K+ CLKLYC+CFA +C + C+C+ C N E+ED+R +AI++ RN +AF+PK
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464
Query: 156 GKCLVGEGERRH-------TKGCNCKRSGCLKNYCECY 186
E ER++ KGCNC+++ CLK YCEC+
Sbjct: 465 E----SEQERKNKANNEQMKKGCNCRKTKCLKKYCECF 498
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 57/91 (62%), Gaps = 12/91 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA +C + C+C+ C N E+ED+R +AI++ RN +AF+PK E ER+
Sbjct: 416 YCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKTE----SEQERK 471
Query: 60 HT-------KGCNCKRSGCLKNYCECYEAGA 83
+ KGCNC+++ CLK YCEC+ AG
Sbjct: 472 NKANNEQMKKGCNCRKTKCLKKYCECFNAGT 502
>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
gaditana CCMP526]
Length = 365
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 71/171 (41%), Gaps = 42/171 (24%)
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
RR +RS L G T R K CNC KS CLKLYC+CFA G +
Sbjct: 123 RRQNPSQEARRSPPLNGSGGGTPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAY 182
Query: 118 CY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------------------- 155
C C C C N + E R A+ L RNP+AF KI
Sbjct: 183 CRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSL 242
Query: 156 --GKCLVGE-GERR-----------------HTKGCNCKRSGCLKNYCECY 186
G+C VG+ GE H +GCNCK+S C+K YCEC+
Sbjct: 243 PGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIKKYCECF 293
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 65/151 (43%), Gaps = 18/151 (11%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
YC+CFA G +C C C C N + E R A+ L RNP+AF KI G
Sbjct: 173 YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPV 232
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPR----KPCNCTKSMCLKL 107
+G R G C + + E G G P + CNC KS+C+K
Sbjct: 233 AMGTHRRTSLPGGQCS----VGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIKK 288
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
YC+CF G +C + C C C NN E H
Sbjct: 289 YCECFFAGVYCGENCQCEGCQNNPEASAGLH 319
>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
Length = 658
Score = 76.6 bits (187), Expect = 4e-12, Method: Composition-based stats.
Identities = 41/94 (43%), Positives = 51/94 (54%), Gaps = 12/94 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER- 58
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + E
Sbjct: 372 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 431
Query: 59 ----------RHTKGCNCKRSGCLKNYCECYEAG 82
RH +GCNCK+S CLK YCECY+ G
Sbjct: 432 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 465
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 41/93 (44%), Positives = 50/93 (53%), Gaps = 12/93 (12%)
Query: 106 KLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE 164
K YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + E
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQE 429
Query: 165 R-----------RHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S CLK YCECY
Sbjct: 430 TGDDASKTPASARHKRGCNCKKSNCLKKYCECY 462
>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
Length = 265
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 62/112 (55%), Gaps = 22/112 (19%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---- 151
CNC KS CLKLYC+CFAN C Q CNC C N I++ ++R A + L RNP+AF
Sbjct: 52 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAIL 111
Query: 152 -----RPKI--GKCLVGEGERRHT----------KGCNCKRSGCLKNYCECY 186
+P+I K L+ E + KGCNCK+S C K YCECY
Sbjct: 112 TNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECY 163
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 22/106 (20%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF---------RPKI-- 48
YC+CFAN C Q CNC C N I++ ++R A + L RNP+AF +P+I
Sbjct: 63 YCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAILTNNGQQPQIIQ 122
Query: 49 GKCLVGEGERRHTK----------GCNCKRSGCLKNYCECYEAGAR 84
K L+ E + K GCNCK+S C K YCECY +
Sbjct: 123 DKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECYSINQK 168
>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
Length = 717
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 63/142 (44%), Gaps = 43/142 (30%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
GE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 417 GETEGCKRCNCKKSKCLKLYCECFAAGVYCVEP--------------------------- 449
Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------GE 164
C+C CFN HED +Q RNP AF PK+ + VG+
Sbjct: 450 -----CSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPAS 504
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S CLK YCECY
Sbjct: 505 ARHKRGCNCKKSSCLKKYCECY 526
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + VG
Sbjct: 436 YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVG 495
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 496 DEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 530
>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
Length = 738
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 77/186 (41%), Gaps = 45/186 (24%)
Query: 14 CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG--KCLVGEGERRHTKGCNCKRSGC 71
N + N+ + E D Q LE A I + L GE K CNCK+S C
Sbjct: 394 VNKSLALNSSDRETDPAENGFQALEDASQASAYAISEERRLEHGGETEGCKRCNCKKSKC 453
Query: 72 LKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIE 131
LK YCEC+ AG +EP C+C CFN
Sbjct: 454 LKLYCECFAAGVYCVEP--------------------------------CSCQECFNKPI 481
Query: 132 HEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE------GERRHTKGCNCKRSGCLK 180
HED +Q RNP AF PK+ + VG+ RH +GCNCK+S CLK
Sbjct: 482 HEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLK 541
Query: 181 NYCECY 186
YCECY
Sbjct: 542 KYCECY 547
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + VG
Sbjct: 457 YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVG 516
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 517 DEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 551
>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
Length = 373
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 49/105 (46%), Positives = 58/105 (55%), Gaps = 15/105 (14%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNA----F 151
CNC KS CLKLYC+CFAN C Q CNC C N I++ +R AI + L RN +A F
Sbjct: 188 CNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQCF 247
Query: 152 RPKIGKCLVGEGERRHT----------KGCNCKRSGCLKNYCECY 186
K G V + + KGCNCK+SGC K YCECY
Sbjct: 248 STKGGAQFVQQDKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECY 292
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 50/99 (50%), Gaps = 15/99 (15%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF----RPKIGKCLVGE 55
YC+CFAN C Q CNC C N I++ +R AI + L RN +AF K G V +
Sbjct: 199 YCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQCFSTKGGAQFVQQ 258
Query: 56 GERRHT----------KGCNCKRSGCLKNYCECYEAGAR 84
+ KGCNCK+SGC K YCECY R
Sbjct: 259 DKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIR 297
>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
Length = 774
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 63/142 (44%), Gaps = 43/142 (30%)
Query: 56 GERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANG 115
GE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 480 GETEGCKRCNCKKSKCLKLYCECFAAGVYCIEP--------------------------- 512
Query: 116 EFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVGE------GE 164
C+C CFN HED +Q RNP AF PK+ + +G+
Sbjct: 513 -----CSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPAS 567
Query: 165 RRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S CLK YCECY
Sbjct: 568 ARHKRGCNCKKSSCLKKYCECY 589
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA G +C + C+C CFN HED +Q RNP AF PK+ + +G
Sbjct: 499 YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIG 558
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 559 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 593
>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 365
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 71/171 (41%), Gaps = 42/171 (24%)
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
RR +RS L G T R K CNC KS CLKLYC+CFA G +
Sbjct: 123 RRQNPSQEARRSPPLNGSGGGTPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAY 182
Query: 118 CY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------------------- 155
C C C C N + E R A+ L RNP+AF KI
Sbjct: 183 CRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSL 242
Query: 156 --GKCLVGE-GERR-----------------HTKGCNCKRSGCLKNYCECY 186
G+C VG+ GE H +GCNCK+S C++ YCEC+
Sbjct: 243 PGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIQKYCECF 293
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 65/151 (43%), Gaps = 18/151 (11%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKC 51
YC+CFA G +C C C C N + E R A+ L RNP+AF KI G
Sbjct: 173 YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEKIQFIETGATGPV 232
Query: 52 LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPR----KPCNCTKSMCLKL 107
+G R G C + + E G G P + CNC KS+C++
Sbjct: 233 AMGTHRRTSLPGGQCS----VGDNGETTMMGGLQGTAGGALPTAAHVRGCNCKKSLCIQK 288
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRH 137
YC+CF G +C + C C C NN E H
Sbjct: 289 YCECFFAGVYCGENCQCEGCQNNPEASAGLH 319
>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
Length = 473
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 59/122 (48%), Gaps = 29/122 (23%)
Query: 94 RKPCNCTKSMCLKL------------------YCDCFANGEFCYQCNCNSCFNNIEHEDD 135
+K CNC S CLKL YC+CFA G +C C+C C N + +E
Sbjct: 80 KKHCNCKNSQCLKLACSSILDLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESA 139
Query: 136 RHHAIRQCLERNPNAFRPKI-----------GKCLVGEGERRHTKGCNCKRSGCLKNYCE 184
R AI +RNP AF+PKI +H KGC+CK+SGCLK YCE
Sbjct: 140 RQEAINSTKQRNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCE 199
Query: 185 CY 186
C+
Sbjct: 200 CF 201
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
YC+CFA G +C C+C C N + +E R AI +RNP AF+PKI
Sbjct: 112 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 171
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 172 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 203
>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 84
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 50/83 (60%), Gaps = 1/83 (1%)
Query: 104 CLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 163
CLKLYCDCFA G+FC C+C SC N E+ D R+P AF KI G G
Sbjct: 1 CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMD-APGGG 59
Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
+H +GCNCKRS CLK YCEC+
Sbjct: 60 GGKHKRGCNCKRSHCLKKYCECF 82
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YCDCFA G+FC C+C SC N E+ D R+P AF KI G G +H
Sbjct: 5 YCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMD-APGGGGGKH 63
Query: 61 TKGCNCKRSGCLKNYCECYEA 81
+GCNCKRS CLK YCEC++
Sbjct: 64 KRGCNCKRSHCLKKYCECFQV 84
>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
Length = 387
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 58/116 (50%), Gaps = 30/116 (25%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFAN C Q CNC C N I++ +R AI + L RN +AF
Sbjct: 195 CNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAF---- 250
Query: 156 GKCLVGEGERRHT-------------------------KGCNCKRSGCLKNYCECY 186
+C +G + KGCNCK+SGC K YCECY
Sbjct: 251 AQCFTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECY 306
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 50/110 (45%), Gaps = 30/110 (27%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFAN C Q CNC C N I++ +R AI + L RN +AF +C +G +
Sbjct: 206 YCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAF----AQCFTTKGAPQ 261
Query: 60 HT-------------------------KGCNCKRSGCLKNYCECYEAGAR 84
KGCNCK+SGC K YCECY +
Sbjct: 262 FVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECYSQNLK 311
>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
subellipsoidea C-169]
Length = 90
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/90 (53%), Positives = 58/90 (64%), Gaps = 10/90 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 51
YC+CFA+G +C CNC +C NN E E R A+ LERNPNAFRPKI G
Sbjct: 1 YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60
Query: 52 LVGEG-ERRHTKGCNCKRSGCLKNYCECYE 80
+ +G RH KGCNCK+SGCLK YCEC++
Sbjct: 61 VAIQGAAARHNKGCNCKKSGCLKKYCECFQ 90
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/89 (53%), Positives = 57/89 (64%), Gaps = 10/89 (11%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---------GKC 158
YC+CFA+G +C CNC +C NN E E R A+ LERNPNAFRPKI G
Sbjct: 1 YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60
Query: 159 LVGEG-ERRHTKGCNCKRSGCLKNYCECY 186
+ +G RH KGCNCK+SGCLK YCEC+
Sbjct: 61 VAIQGAAARHNKGCNCKKSGCLKKYCECF 89
>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
Length = 902
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 58/100 (58%), Gaps = 12/100 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K C+C S CLK+YC+CFA C C C C NN + + A Q LERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419
Query: 155 I--------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
I + ++ +H+KGC+C++S CLK YCEC+
Sbjct: 420 IKPSNSKINNQIIID----KHSKGCHCRKSSCLKKYCECF 455
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 51/90 (56%), Gaps = 12/90 (13%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI--------GKCL 52
YC+CFA C C C C NN + + A Q LERNP+AF PKI + +
Sbjct: 373 YCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPKIKPSNSKINNQII 432
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
+ +H+KGC+C++S CLK YCEC++A
Sbjct: 433 ID----KHSKGCHCRKSSCLKKYCECFQAS 458
>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
Length = 684
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 75/173 (43%), Gaps = 46/173 (26%)
Query: 26 EDDRHHAIRQCLERNPNAFR-PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGAR 84
++ A + +E P A P+ GEGE K CNCK+S CLK YCEC+ AG
Sbjct: 355 QETSDQAENEPVEEIPKALAFPERKSEQAGEGE--SCKRCNCKKSKCLKLYCECFAAGVY 412
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCL 144
+EP C+C CFN HE+ +Q
Sbjct: 413 CIEP--------------------------------CSCIDCFNKPIHEETVLATRKQIE 440
Query: 145 ERNPNAFRPKI---GKCLVGEGE--------RRHTKGCNCKRSGCLKNYCECY 186
RNP AF PK+ ++ G+ RH +GCNCK+S CLK YCECY
Sbjct: 441 SRNPLAFAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECY 493
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI---GKCLVGEG 56
YC+CFA G +C + C+C CFN HE+ +Q RNP AF PK+ ++ G
Sbjct: 403 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRSADSIMEAG 462
Query: 57 E--------RRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 463 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGV 497
>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
Length = 414
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 52/95 (54%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----CLVG 54
YC+CFA G +C + C+C C NN HED RQ RNP AF PK+ + G
Sbjct: 141 YCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVSEFG 200
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
E RH +GCNCK+S CLK YCEC++ G
Sbjct: 201 EETNKTPASARHKRGCNCKKSSCLKKYCECFQGGV 235
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 52/94 (55%), Gaps = 12/94 (12%)
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----C 158
+K YC+CFA G +C + C+C C NN HED RQ RNP AF PK+ +
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197
Query: 159 LVGE------GERRHTKGCNCKRSGCLKNYCECY 186
GE RH +GCNCK+S CLK YCEC+
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECF 231
>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 61/109 (55%), Gaps = 17/109 (15%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAF 151
RK CNC ++ CLKLYCDC A G +C C C++C N E+E IR+ +E R+P AF
Sbjct: 14 RKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAF 72
Query: 152 RPKI--------GKCLVGEG------ERRHTKGCNCKRSGCLKNYCECY 186
P+I L +G RH +GCNCK+S C K YCECY
Sbjct: 73 APRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECY 121
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 51/97 (52%), Gaps = 17/97 (17%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKI--------GK 50
YCDC A G +C C C++C N E+E IR+ +E R+P AF P+I
Sbjct: 28 YCDCLAAGVYCTDSCACSNCLNKSENEG-VVQIIREKIESRDPLAFAPRIVNPDTDTTDN 86
Query: 51 CLVGEG------ERRHTKGCNCKRSGCLKNYCECYEA 81
L +G RH +GCNCK+S C K YCECY+
Sbjct: 87 VLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQV 123
>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
Length = 396
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 59/91 (64%), Gaps = 6/91 (6%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CF + E+C +CNC +C+NN +HE +R A+++ RNP+ F
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCF---- 206
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ V E + + GC C+R+ C K YCEC+
Sbjct: 207 -ENHVDEKKGVNKSGCRCRRTHCDKKYCECF 236
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 51/84 (60%), Gaps = 6/84 (7%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CF + E+C +CNC +C+NN +HE +R A+++ RNP+ F + V E +
Sbjct: 162 YCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCF-----ENHVDEKKGV 216
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ GC C+R+ C K YCEC+ G
Sbjct: 217 NKSGCRCRRTHCDKKYCECFSHGV 240
>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
Length = 437
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 51/92 (55%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 49
YC+CFA G +C C+C C N + +E R AI +RNP AF+PKI
Sbjct: 76 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQA 167
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 49/90 (54%), Gaps = 11/90 (12%)
Query: 108 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----------G 156
YC+CFA G +C C+C C N + +E R AI +RNP AF+PKI
Sbjct: 76 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135
Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+H KGC+CK+SGCLK YCEC+
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECF 165
>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 742
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 54/93 (58%), Gaps = 8/93 (8%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
KPCNC S CLK YCDCFA G +C +C C C N E+ED+R AI++ +R +AF P
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLP 559
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
I H KGC+CK S C K YC C+
Sbjct: 560 VI-------QNNSHIKGCHCKNSHCQKKYCVCH 585
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 8/84 (9%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G +C +C C C N E+ED+R AI++ +R +AF P I
Sbjct: 513 YCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLPVIQN-------NS 565
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
H KGC+CK S C K YC C++ G
Sbjct: 566 HIKGCHCKNSHCQKKYCVCHQNGV 589
>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 932
Score = 72.0 bits (175), Expect = 9e-11, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 51/85 (60%), Gaps = 8/85 (9%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE--GE 57
YCDCFA G FC +C C C N+ ++++ A + +R+ + G+ L GE
Sbjct: 822 YCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTS-----GEALNGEPSAS 876
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAG 82
R ++GC CKR+GCLK YCEC++AG
Sbjct: 877 ERSSRGCRCKRTGCLKRYCECFQAG 901
Score = 70.1 bits (170), Expect = 4e-10, Method: Composition-based stats.
Identities = 35/85 (41%), Positives = 50/85 (58%), Gaps = 8/85 (9%)
Query: 105 LKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE- 162
LK YCDCFA G FC +C C C N+ ++++ A + +R+ + G+ L GE
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTS-----GEALNGEP 873
Query: 163 -GERRHTKGCNCKRSGCLKNYCECY 186
R ++GC CKR+GCLK YCEC+
Sbjct: 874 SASERSSRGCRCKRTGCLKRYCECF 898
Score = 39.3 bits (90), Expect = 0.73, Method: Composition-based stats.
Identities = 27/97 (27%), Positives = 34/97 (35%), Gaps = 40/97 (41%)
Query: 72 LKNYCECYEAG--------------------------------------ARTLEPNGI-R 92
LK YC+C+ AG A EP+ R
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRSSTSGEALNGEPSASER 878
Query: 93 PRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
+ C C ++ CLK YC+CF G C QC C C N
Sbjct: 879 SSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLN 915
>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
Length = 760
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 67/153 (43%), Gaps = 44/153 (28%)
Query: 46 PKIGKC-LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMC 104
PK +C L GE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 458 PKKKRCKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEP---------------- 501
Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL----- 159
C C+ CFN H + +Q RNP AF PK+ +
Sbjct: 502 ----------------CACHDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSE 545
Query: 160 VGE------GERRHTKGCNCKRSGCLKNYCECY 186
+G+ RH +GCNCK+S CLK YCECY
Sbjct: 546 IGDDPNKTPASARHKRGCNCKKSSCLKKYCECY 578
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C C+ CFN H + +Q RNP AF PK+ + +G
Sbjct: 488 YCECFAAGVYCIEPCACHDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIG 547
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 548 DDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 582
>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
sativa Japonica Group]
Length = 508
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 57/104 (54%), Gaps = 16/104 (15%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
RK CNC S YC+CFA +C C+C+ C N IE+E+ R AI L RNP AF+P
Sbjct: 122 RKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQP 176
Query: 154 KIGKCLVGEGER-----------RHTKGCNCKRSGCLKNYCECY 186
KI R +H KGC+C++S CLK YCEC+
Sbjct: 177 KIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECF 220
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 53/92 (57%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA +C C+C+ C N IE+E+ R AI L RNP AF+PKI R
Sbjct: 131 YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKD 190
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+C++S CLK YCEC++A
Sbjct: 191 NSEAIPPIPKHNKGCHCRKSECLKKYCECFQA 222
>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
Length = 144
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 50/102 (49%), Gaps = 19/102 (18%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YC+CFA G +C C C CFN E D +Q RNP AF PKI +
Sbjct: 1 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60
Query: 51 ---------CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
C G RH +GCNCK+S CLK YCECY+AG
Sbjct: 61 VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGV 102
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 47/98 (47%), Gaps = 19/98 (19%)
Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 157
YC+CFA G +C C C CFN E D +Q RNP AF PKI +
Sbjct: 1 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60
Query: 158 ---------CLVGEGERRHTKGCNCKRSGCLKNYCECY 186
C G RH +GCNCK+S CLK YCECY
Sbjct: 61 VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECY 98
>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
Length = 274
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 53/92 (57%), Gaps = 11/92 (11%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER-- 58
YC+CFA +C C+C+ C N IE+E+ R AI L RNP AF+PKI R
Sbjct: 88 YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKD 147
Query: 59 ---------RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+C++S CLK YCEC++A
Sbjct: 148 NSEAIPPIPKHNKGCHCRKSECLKKYCECFQA 179
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 52/95 (54%), Gaps = 11/95 (11%)
Query: 103 MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
+ YC+CFA +C C+C+ C N IE+E+ R AI L RNP AF+PKI
Sbjct: 83 LVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTV 142
Query: 163 GER-----------RHTKGCNCKRSGCLKNYCECY 186
R +H KGC+C++S CLK YCEC+
Sbjct: 143 TVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECF 177
>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
Length = 155
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 57/115 (49%), Gaps = 25/115 (21%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA+G +C C C +C N EHED + RNP AF P++
Sbjct: 2 CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV 61
Query: 156 G---------KCLVGEGE---------------RRHTKGCNCKRSGCLKNYCECY 186
+ V +H +GCNCK+S CLK YCEC+
Sbjct: 62 ISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECF 116
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA+G +C C C +C N EHED + RNP AF P++ + E
Sbjct: 13 YCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV----ISPAEV- 67
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
+ C S N A + + ++ CNC KS+CLK YC+CF + C
Sbjct: 68 -VRVCQAFVSS---NSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEVGCS 123
Query: 120 Q-CNCNSCFNNI 130
C C C N
Sbjct: 124 DACKCRGCKNTF 135
>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
Length = 141
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 57/115 (49%), Gaps = 25/115 (21%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC+CFA+G +C C C +C N EHED + RNP AF P++
Sbjct: 2 CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV 61
Query: 156 G---------KCLVGEGE---------------RRHTKGCNCKRSGCLKNYCECY 186
+ V +H +GCNCK+S CLK YCEC+
Sbjct: 62 ISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECF 116
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA+G +C C C +C N EHED + RNP AF P++ + E
Sbjct: 13 YCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPRV----ISPAE-- 66
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
+ C S N A + + ++ CNC KS+CLK YC+CF + C
Sbjct: 67 VVRVCQAFVSS---NSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEVGCS 123
Query: 120 Q-CNCNSCFNNI 130
C C C N
Sbjct: 124 DACKCRGCKNTF 135
>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 62/145 (42%), Gaps = 45/145 (31%)
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
GEGE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 225 AGEGE--SCKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 258
Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVG 161
C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 259 --------CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKT 310
Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S C+K YCECY
Sbjct: 311 PASARHKRGCNCKKSNCMKKYCECY 335
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YC+CFA G +C + C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 245 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 304
Query: 51 --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S C+K YCECY+ G
Sbjct: 305 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGGV 339
>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 62/145 (42%), Gaps = 45/145 (31%)
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
GEGE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 225 AGEGE--SCKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 258
Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK-----------CLVG 161
C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 259 --------CSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKT 310
Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S C+K YCECY
Sbjct: 311 PASARHKRGCNCKKSNCMKKYCECY 335
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK--------- 50
YC+CFA G +C + C+C CFN HE+ +Q RNP AF PK+ +
Sbjct: 245 YCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAPKVIRNADSIMEAS 304
Query: 51 --CLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
RH +GCNCK+S C+K YCECY+ G
Sbjct: 305 DDASKTPASARHKRGCNCKKSNCMKKYCECYQGGV 339
>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
Length = 524
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/48 (66%), Positives = 36/48 (75%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
++PCNCTKS CLKLYCDCFANGEFCY CNC C N + + R AIR
Sbjct: 326 KRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIR 373
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 23/34 (67%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 34
YCDCFANGEFCY CNC C N + + R AIR
Sbjct: 340 YCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIR 373
>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
Length = 774
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 63/145 (43%), Gaps = 45/145 (31%)
Query: 53 VGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCF 112
GEGE K CNCK+S CLK YCEC+ AG +EP
Sbjct: 481 AGEGES--CKRCNCKKSKCLKLYCECFAAGVYCIEP------------------------ 514
Query: 113 ANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VGE----- 162
C C CFN H + +Q RNP AF PK+ + +G+
Sbjct: 515 --------CACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDLNKT 566
Query: 163 -GERRHTKGCNCKRSGCLKNYCECY 186
RH +GCNCK+S CLK YCECY
Sbjct: 567 PASARHKRGCNCKKSSCLKKYCECY 591
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL-----VG 54
YC+CFA G +C + C C CFN H + +Q RNP AF PK+ + +G
Sbjct: 501 YCECFAAGVYCIEPCACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIG 560
Query: 55 E------GERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ RH +GCNCK+S CLK YCECY+ G
Sbjct: 561 DDLNKTPASARHKRGCNCKKSSCLKKYCECYQGGV 595
>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
Length = 142
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 57/99 (57%), Gaps = 3/99 (3%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
IR + C C KS CL+LYC+CF NG FC + C C +C N ++ A ++ RNP+
Sbjct: 9 IRKVEFCKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPD 68
Query: 150 AFRPK--IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AF K + K EG H KGCNC ++ C K YCEC+
Sbjct: 69 AFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECF 107
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 3/90 (3%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK--IGKCLVGEGE 57
YC+CF NG FC + C C +C N ++ A ++ RNP+AF K + K EG
Sbjct: 26 YCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPDAFSQKFLVVKQNQYEGI 85
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAGARTLE 87
H KGCNC ++ C K YCEC+ AG + E
Sbjct: 86 VSHKKGCNCTKTQCTKKYCECFNAGIKCTE 115
>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 490
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 50/98 (51%), Gaps = 8/98 (8%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C K+ CLKLYC CF +C C C C N ED R +A++ L RNP+AF K
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276
Query: 156 GKCLVGEGERR-------HTKGCNCKRSGCLKNYCECY 186
K V H GC C++S C+K YCECY
Sbjct: 277 QKTPVDGATVETPSKLLTHKLGCKCRKSACMKKYCECY 314
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 42/87 (48%), Gaps = 8/87 (9%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF +C C C C N ED R +A++ L RNP+AF K K V
Sbjct: 228 YCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKFQKTPVDGATVE 287
Query: 60 -------HTKGCNCKRSGCLKNYCECY 79
H GC C++S C+K YCECY
Sbjct: 288 TPSKLLTHKLGCKCRKSACMKKYCECY 314
>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
trifallax]
Length = 600
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 7/90 (7%)
Query: 104 CLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDR-HHAIRQCLERNPNAFRPKI-----G 156
CLKLYCDCFA G C C C +C N ++++ A ++ + RNP AF+PK+
Sbjct: 455 CLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGPA 514
Query: 157 KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + +++H KGCNCK SGC KNYCEC+
Sbjct: 515 QMSQNDADKQHRKGCNCKNSGCQKNYCECF 544
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 7/89 (7%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDR-HHAIRQCLERNPNAFRPKI-----GKCLV 53
YCDCFA G C C C +C N ++++ A ++ + RNP AF+PK+ +
Sbjct: 459 YCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGPAQMSQ 518
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAG 82
+ +++H KGCNCK SGC KNYCEC++ G
Sbjct: 519 NDADKQHRKGCNCKNSGCQKNYCECFQFG 547
>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 113
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 51
YC+CFA FC C C +CFN E+E + +Q R+P AF PKI +
Sbjct: 1 YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60
Query: 52 ---LVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
L RH +GCNCK+S CLK YCECY+AG
Sbjct: 61 DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGV 95
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 48/91 (52%), Gaps = 12/91 (13%)
Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------- 158
YC+CFA FC C C +CFN E+E + +Q R+P AF PKI +
Sbjct: 1 YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60
Query: 159 ---LVGEGERRHTKGCNCKRSGCLKNYCECY 186
L RH +GCNCK+S CLK YCECY
Sbjct: 61 DDALDTPASARHKRGCNCKKSLCLKKYCECY 91
>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
Length = 377
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 32/50 (64%), Positives = 38/50 (76%)
Query: 92 RPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 141
RPRK C+CTKS CLKLYCDCF++G C CNC +C NN E+E RH AI+
Sbjct: 325 RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAENESQRHEAIK 374
Score = 44.7 bits (104), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 24/34 (70%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIR 34
YCDCF++G C CNC +C NN E+E RH AI+
Sbjct: 341 YCDCFSSGMMCSSCNCINCHNNAENESQRHEAIK 374
>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
Length = 781
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 58/105 (55%), Gaps = 16/105 (15%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC KS CLKLYC+CFA G +C C+C +C N E+ED +Q RNP AF PK+
Sbjct: 443 CNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAPKV 502
Query: 156 --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
GK + RH KGCNCK+S C K YCEC+
Sbjct: 503 VNNATNSPANMMEEGKWM-KTSSSRHKKGCNCKKSKCSKKYCECF 546
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 51/98 (52%), Gaps = 16/98 (16%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC+CFA G +C C+C +C N E+ED +Q RNP AF PK+
Sbjct: 454 YCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAPKVVNNATNSPANM 513
Query: 49 ---GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
GK + RH KGCNCK+S C K YCEC++ G
Sbjct: 514 MEEGKWM-KTSSSRHKKGCNCKKSKCSKKYCECFQGGV 550
>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
+PC+CTK+ CLK+YC CF NG+ C + C C C N E+ + RH A I++ RN
Sbjct: 80 QPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEYIKKKAHRNKKV 139
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ K+ E + GCNCK++ CLK YCEC+
Sbjct: 140 TQEKL-------FETKEVWGCNCKKTRCLKRYCECF 168
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 11/83 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
YC CF NG+ C + C C C N E+ + RH A I++ RN + K+
Sbjct: 93 YCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEYIKKKAHRNKKVTQEKLF------- 145
Query: 57 ERRHTKGCNCKRSGCLKNYCECY 79
E + GCNCK++ CLK YCEC+
Sbjct: 146 ETKEVWGCNCKKTRCLKRYCECF 168
>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
subellipsoidea C-169]
Length = 81
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 6/84 (7%)
Query: 104 CLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE 162
CLKLYCDCFA G FC C C C N + + A + + ++P AF+PK V +
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPK-----VLD 55
Query: 163 GERRHTKGCNCKRSGCLKNYCECY 186
H KGC C++S CLK YCEC+
Sbjct: 56 ASGGHVKGCACRKSRCLKKYCECF 79
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA G FC C C C N + + A + + ++P AF+PK V +
Sbjct: 5 YCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPK-----VLDASGG 59
Query: 60 HTKGCNCKRSGCLKNYCECY 79
H KGC C++S CLK YCEC+
Sbjct: 60 HVKGCACRKSRCLKKYCECF 79
>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 120
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 51/102 (50%), Gaps = 19/102 (18%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC+CFA G +C C C CFN E+ + + +Q RNP AF PKI
Sbjct: 1 YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60
Query: 49 -------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ L RH +GCNCK+S CLK YCECY+AG
Sbjct: 61 VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGV 102
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 48/98 (48%), Gaps = 19/98 (19%)
Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 155
YC+CFA G +C C C CFN E+ + + +Q RNP AF PKI
Sbjct: 1 YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60
Query: 156 -------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ L RH +GCNCK+S CLK YCECY
Sbjct: 61 VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECY 98
>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 495
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 63/121 (52%), Gaps = 10/121 (8%)
Query: 76 CECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
C+ E + + + + R+ C C S CLKLYC+CFA G +C C C C N E E+
Sbjct: 235 CQNGENDSDNINDSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECEN 294
Query: 135 DRHHAIRQCLERNPNAFRPK---IGKCLVGE------GERRHTKGCNCKRSGCLKNYCEC 185
+ A + L RNP+AF K +G + E G + KGC CKR+ C K YCEC
Sbjct: 295 EIKEARKVTLTRNPDAFTSKLEVVGTIVQDEDIEANRGMLGYKKGCKCKRTYCKKKYCEC 354
Query: 186 Y 186
Y
Sbjct: 355 Y 355
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 50/94 (53%), Gaps = 10/94 (10%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK---IGKCLVGE- 55
YC+CFA G +C C C C N E E++ A + L RNP+AF K +G + E
Sbjct: 267 YCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTLTRNPDAFTSKLEVVGTIVQDED 326
Query: 56 -----GERRHTKGCNCKRSGCLKNYCECYEAGAR 84
G + KGC CKR+ C K YCECY AG +
Sbjct: 327 IEANRGMLGYKKGCKCKRTYCKKKYCECYNAGVK 360
>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
Length = 713
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 12/101 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR--- 152
C C K+ CLK+YC CF++ FC+ C C+ C N +H+ +R AI+ L +P AF
Sbjct: 536 CGC-KTGCLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFAS 594
Query: 153 -PKIGKC------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+ L + +GC CK+S CLK YCEC+
Sbjct: 595 LPQSASTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECF 635
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 19/110 (17%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR----PKIGKC---- 51
YC CF++ FC+ C C+ C N +H+ +R AI+ L +P AF P+
Sbjct: 546 YCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFASLPQSASTTGFL 605
Query: 52 --LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
L + +GC CK+S CLK YCEC++ NGI C C
Sbjct: 606 HLLPQKSSAVVMRGCRCKKSKCLKKYCECFQ--------NGIACTSHCRC 647
>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
+PC+C+K+ CLK+YC CF NG+ C QC C C N E+ + RH+A IR+ RN
Sbjct: 80 QPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKKAHRNKKV 139
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ K+ E + GCNCK++ C K YCEC+
Sbjct: 140 TQEKL-------FETKDVWGCNCKKTRCQKRYCECF 168
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 44/83 (53%), Gaps = 11/83 (13%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
YC CF NG+ C QC C C N E+ + RH+A IR+ RN + K+
Sbjct: 93 YCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKKAHRNKKVTQEKLF------- 145
Query: 57 ERRHTKGCNCKRSGCLKNYCECY 79
E + GCNCK++ C K YCEC+
Sbjct: 146 ETKDVWGCNCKKTRCQKRYCECF 168
>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 712
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 54/130 (41%), Gaps = 40/130 (30%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C KS CLKLYC CF+ C C C C N E R A++ L+RNPNAFR K
Sbjct: 338 CTCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKF 397
Query: 156 GKCLVGE--------GER-------------------------------RHTKGCNCKRS 176
K GE ER H GCNC++S
Sbjct: 398 VKVTGGEEFAKEWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKS 457
Query: 177 GCLKNYCECY 186
CLK YCEC+
Sbjct: 458 FCLKRYCECF 467
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 66/163 (40%), Gaps = 18/163 (11%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG-ER 58
YC CF+ C C C C N E R A++ L+RNPNAFR K K GE +
Sbjct: 349 YCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAK 408
Query: 59 RHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKP-----CNCTKSMCLKLYCDCFA 113
K K+ L + +E + P CNC KS CLK YC+CF
Sbjct: 409 EWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFG 468
Query: 114 NGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C C C N + D H + +P +RP +
Sbjct: 469 AQTPCGLNCKCLGCQN----QPDYVHVL------DPRGYRPPL 501
>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
Length = 141
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C+KS C+KLYCDCF G C CNC C N E ++R +++ L+ NP AF
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSEDS 94
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
K GC C S CLK YC C+
Sbjct: 95 DKFNTKRQRMSRGNGCACPSSHCLKKYCSCF 125
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCF G C CNC C N E ++R +++ L+ NP AF K
Sbjct: 46 YCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSEDSDKFNTKRQRMS 105
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
GC C S CLK YC C+ A A
Sbjct: 106 RGNGCACPSSHCLKKYCSCFGADA 129
>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
Length = 259
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 53/100 (53%), Gaps = 6/100 (6%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLER 146
P KPC+C+K+ CL+LYC CF N C Q C C+ CFN+ +H D+ AI Q +
Sbjct: 134 PTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCSECFNDGQHSDEVLKAIEQIKIK 193
Query: 147 NPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
A + + R GC CK++ C+K YCEC+
Sbjct: 194 EQRASHHDL-----DSFDTRQVWGCKCKKTQCIKGYCECF 228
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 6/80 (7%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF N C Q C C+ CFN+ +H D+ AI Q + A + + R
Sbjct: 154 YCSCFHNRRQCTQECKCSECFNDGQHSDEVLKAIEQIKIKEQRASHHDLD-----SFDTR 208
Query: 60 HTKGCNCKRSGCLKNYCECY 79
GC CK++ C+K YCEC+
Sbjct: 209 QVWGCKCKKTQCIKGYCECF 228
>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
niloticus]
Length = 310
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/48 (64%), Positives = 33/48 (68%)
Query: 34 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
+ CL NP+AFRPKI G H KGCNCKRSGCLK YCECYEA
Sbjct: 169 KWCLGHNPDAFRPKIAGVKSGAVRGWHNKGCNCKRSGCLKRYCECYEA 216
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/46 (63%), Positives = 31/46 (67%)
Query: 141 RQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ CL NP+AFRPKI G H KGCNCKRSGCLK YCECY
Sbjct: 169 KWCLGHNPDAFRPKIAGVKSGAVRGWHNKGCNCKRSGCLKRYCECY 214
>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
Length = 214
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
R C+C +S C +LYC CF + FC CNC+ C+N +ED +NPNAF
Sbjct: 7 RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66
Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+I + G +R+H KGC+C++S C K YCEC+
Sbjct: 67 PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC CF + FC CNC+ C+N +ED +NPNAF P+I
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 49 --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
+ G +R+H KGC+C++S C K YCEC++
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126
>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 18/103 (17%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQC 143
T++ + ++ C C KS CLK+YCDC A GE+C +C+C +C N+ +H + R + Q
Sbjct: 30 TIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQM 89
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+E+N +A CNC++S C K YCECY
Sbjct: 90 VEKNSSA-----------------QLKCNCRKSNCQKKYCECY 115
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 18/85 (21%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDC A GE+C +C+C +C N+ +H + R + Q +E+N +A
Sbjct: 53 YCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQMVEKNSSA---------------- 96
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
CNC++S C K YCECY AG +
Sbjct: 97 -QLKCNCRKSNCQKKYCECYNAGVK 120
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 28/111 (25%)
Query: 48 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA------------------------ 83
I + + + ++ + C CK+S CLK YC+C G
Sbjct: 26 ITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKV 85
Query: 84 --RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
+ +E N K CNC KS C K YC+C+ G C + C C+ C N+++
Sbjct: 86 LEQMVEKNSSAQLK-CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKNSVK 135
>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
Length = 601
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 53
YC+CF+ G FC + C+C +CFN HED + RNP AF PK+ L
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487
Query: 54 GEGER-----RHTKGCNCKRSGCLKNYCECYEAGA 83
E + RH +GCNC++SGC K YCEC+ G
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGV 522
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 50/91 (54%), Gaps = 12/91 (13%)
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC------LV 160
YC+CF+ G FC + C+C +CFN HED + RNP AF PK+ L
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487
Query: 161 GEGER-----RHTKGCNCKRSGCLKNYCECY 186
E + RH +GCNC++SGC K YCEC+
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECF 518
>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
[Oryza sativa Japonica Group]
Length = 194
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
R C+C +S C +LYC CF + FC CNC+ C+N +ED +NPNAF
Sbjct: 7 RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66
Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+I + G +R+H KGC+C++S C K YCEC+
Sbjct: 67 PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC CF + FC CNC+ C+N +ED +NPNAF P+I
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 49 --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
+ G +R+H KGC+C++S C K YCEC++
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126
>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
Length = 223
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 26/119 (21%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR 152
R C+C +S C +LYC CF + FC CNC+ C+N +ED +NPNAF
Sbjct: 7 RPKCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFD 66
Query: 153 PKI-------------------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+I + G +R+H KGC+C++S C K YCEC+
Sbjct: 67 PRIIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECF 125
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 26/106 (24%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 48
YC CF + FC CNC+ C+N +ED +NPNAF P+I
Sbjct: 21 YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80
Query: 49 --------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
+ G +R+H KGC+C++S C K YCEC++
Sbjct: 81 PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFK 126
>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 11/102 (10%)
Query: 89 NGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQC----NCNSCFNNIEHEDDRHHAIRQCL 144
N I +PC+CTK+ CLK+YC CF NG+ C + +C + NN+ D I++
Sbjct: 74 NEIAKWQPCSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKA 133
Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RN + + E + GCNCK++ CLK YCECY
Sbjct: 134 HRNKKVSQETL-------FETKDIWGCNCKKTRCLKRYCECY 168
Score = 45.4 bits (106), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 11/83 (13%)
Query: 1 YCDCFANGEFCYQC----NCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEG 56
YC CF NG+ C + +C + NN+ D I++ RN + +
Sbjct: 93 YCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKAHRNKKVSQETLF------- 145
Query: 57 ERRHTKGCNCKRSGCLKNYCECY 79
E + GCNCK++ CLK YCECY
Sbjct: 146 ETKDIWGCNCKKTRCLKRYCECY 168
Score = 37.0 bits (84), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 68/174 (39%), Gaps = 48/174 (27%)
Query: 23 IEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG 82
IE+ D H I + ++ + K + E + C+C ++ CLK YC C+ G
Sbjct: 48 IEYNDQLHMVINKQIDIDE-------KKVESPKNEIAKWQPCSCTKTNCLKMYCSCFHNG 100
Query: 83 ARTLEP----------NGIRPRKP----------------------------CNCTKSMC 104
+E N + R CNC K+ C
Sbjct: 101 QTCVESCKCEDCKNTNNNVPQRDKAVEYIKKKAHRNKKVSQETLFETKDIWGCNCKKTRC 160
Query: 105 LKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 157
LK YC+C+ + C +CNC C N ++D ++ IR+ E+ + R + G+
Sbjct: 161 LKRYCECYIRQKACTVECNCTHCENG--KDEDLYNEIRRQNEQPKQSKRQRSGR 212
>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 56/103 (54%), Gaps = 18/103 (17%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQC 143
T++ + ++ C C KS CLK+YCDC A GE+C +C+C +C N+ +H + R+ + Q
Sbjct: 30 TIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQM 89
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+E+N A CNC++S C K YCECY
Sbjct: 90 VEKNTPA-----------------QIKCNCRKSNCQKKYCECY 115
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 18/85 (21%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDC A GE+C +C+C +C N+ +H + R+ + Q +E+N A
Sbjct: 53 YCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQMVEKNTPA---------------- 96
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
CNC++S C K YCECY AG +
Sbjct: 97 -QIKCNCRKSNCQKKYCECYNAGVK 120
Score = 37.4 bits (85), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 28/111 (25%)
Query: 48 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA------------------------ 83
I + + + ++ + C CK+S CLK YC+C G
Sbjct: 26 ITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKV 85
Query: 84 --RTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIE 131
+ +E N K CNC KS C K YC+C+ G C + C C+ C N+++
Sbjct: 86 LEQMVEKNTPAQIK-CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKNSVK 135
>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
CCMP2712]
Length = 125
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C ++ CLK YC CF N C C C+ C N+ +HE+ R AIR NP+AF+
Sbjct: 13 CRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTA 72
Query: 156 -----GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ +G ++ +GC CKRS C K YCEC+
Sbjct: 73 LELEDQEVTTPKGGKKTVRGCRCKRSKCQKKYCECF 108
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI-----GKCLVG 54
YC CF N C C C+ C N+ +HE+ R AIR NP+AF+ +
Sbjct: 24 YCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTALELEDQEVTTP 83
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAG 82
+G ++ +GC CKRS C K YCEC+ AG
Sbjct: 84 KGGKKTVRGCRCKRSKCQKKYCECFSAG 111
>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
Length = 163
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/62 (51%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
CNC +S CLKLYC+CFA G +C C+C +CFN E+ D H +Q RNP AF PK+
Sbjct: 83 CNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAPKV 142
Query: 156 GK 157
K
Sbjct: 143 VK 144
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGK 50
YC+CFA G +C C+C +CFN E+ D H +Q RNP AF PK+ K
Sbjct: 94 YCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAPKVVK 144
>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 53/96 (55%), Gaps = 11/96 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
+PC+CTK+ CLK+YC CF NG+ C + C C C N H R A I++ RN
Sbjct: 80 QPCSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKKAHRNKKV 139
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ K+ E + GCNCK++ CLK YCECY
Sbjct: 140 PQEKL-------FETKDIWGCNCKKTRCLKRYCECY 168
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 41/83 (49%), Gaps = 11/83 (13%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHA---IRQCLERNPNAFRPKIGKCLVGEG 56
YC CF NG+ C + C C C N H R A I++ RN + K+
Sbjct: 93 YCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKKAHRNKKVPQEKLF------- 145
Query: 57 ERRHTKGCNCKRSGCLKNYCECY 79
E + GCNCK++ CLK YCECY
Sbjct: 146 ETKDIWGCNCKKTRCLKRYCECY 168
>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
Length = 649
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/74 (45%), Positives = 45/74 (60%), Gaps = 3/74 (4%)
Query: 86 LEPNGIRPRKPCNCTKS---MCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQ 142
L+ N ++ +K CNC S YC+C+A G +C CNC +C NN+ +E R AI
Sbjct: 139 LKDNTLKKQKRCNCKNSKSDFATARYCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGM 198
Query: 143 CLERNPNAFRPKIG 156
LE+NPNAFRPKI
Sbjct: 199 TLEKNPNAFRPKIA 212
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG 49
YC+C+A G +C CNC +C NN+ +E R AI LE+NPNAFRPKI
Sbjct: 164 YCECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIA 212
>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
Length = 777
Score = 63.5 bits (153), Expect = 4e-08, Method: Composition-based stats.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 12/101 (11%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR--- 152
C C K+ CLK+YC CF++ FC+ C C+ C N + +R AI+ L +P AF
Sbjct: 540 CGC-KTGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAFSFAS 598
Query: 153 -PKIGKC------LVGEGERRHTKGCNCKRSGCLKNYCECY 186
P+ L + +GC CK+S CLK YCEC+
Sbjct: 599 LPQDTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECF 639
Score = 55.8 bits (133), Expect = 8e-06, Method: Composition-based stats.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 19/110 (17%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFR----PKIGKC---- 51
YC CF++ FC+ C C+ C N + +R AI+ L +P AF P+
Sbjct: 550 YCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAFSFASLPQDTNTTGFL 609
Query: 52 --LVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNC 99
L + +GC CK+S CLK YCEC++ NGI C C
Sbjct: 610 HLLPQKSSAVVMRGCRCKKSKCLKKYCECFQ--------NGIACTSHCRC 651
>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 617
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CF G +C QC C C NN ++ ++R AI + RN +AF K +
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQ--ITSNSTVT 414
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
H KGC+CKR+ CLK YCEC+ AG +
Sbjct: 415 HKKGCHCKRTHCLKKYCECFNAGLK 439
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
YC+CF G +C QC C C NN ++ ++R AI + RN +AF K +
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQ--ITSNSTVT 414
Query: 167 HTKGCNCKRSGCLKNYCECY 186
H KGC+CKR+ CLK YCEC+
Sbjct: 415 HKKGCHCKRTHCLKKYCECF 434
>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 625
Score = 63.2 bits (152), Expect = 5e-08, Method: Composition-based stats.
Identities = 38/82 (46%), Positives = 50/82 (60%), Gaps = 3/82 (3%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRH 60
YC+CFA G FC C+C +C N ++ Q RNP AF KI + +GE +H
Sbjct: 281 YCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKI---VATDGEEKH 337
Query: 61 TKGCNCKRSGCLKNYCECYEAG 82
KGC+CK+S CLK YCEC++AG
Sbjct: 338 KKGCHCKKSACLKKYCECFQAG 359
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/80 (46%), Positives = 48/80 (60%), Gaps = 3/80 (3%)
Query: 107 LYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
LYC+CFA G FC C+C +C N ++ Q RNP AF KI + +GE +
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKI---VATDGEEK 336
Query: 167 HTKGCNCKRSGCLKNYCECY 186
H KGC+CK+S CLK YCEC+
Sbjct: 337 HKKGCHCKKSACLKKYCECF 356
>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
Length = 496
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCLVG 54
YC CF C C+C C+N+ +HE +R AI +P AF ++ V
Sbjct: 300 YCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASVSVE 359
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
++H +GC CK S CLK YCEC+E G
Sbjct: 360 SKPKQHVRGCKCKNSKCLKKYCECFEFGV 388
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%), Gaps = 6/88 (6%)
Query: 105 LKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKC 158
++ YC CF C C+C C+N+ +HE +R AI +P AF ++
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASV 356
Query: 159 LVGEGERRHTKGCNCKRSGCLKNYCECY 186
V ++H +GC CK S CLK YCEC+
Sbjct: 357 SVESKPKQHVRGCKCKNSKCLKKYCECF 384
>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 11/132 (8%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G +C C C CFN E+ + + +Q RNP AF PKI V E
Sbjct: 1 YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKI----VQGAEPS 56
Query: 60 HTKGCNCKRS-GCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC 118
G + C++ + + T P R ++ CNC KS+CLK YC+C+ G C
Sbjct: 57 PVPGVTLPAAEACVQLVVQ--DEALDT--PASARHKRGCNCKKSLCLKKYCECYQAGVGC 112
Query: 119 YQ-CNCNSCFNN 129
+ C C C N
Sbjct: 113 SEGCRCEGCMNK 124
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 48/106 (45%), Gaps = 27/106 (25%)
Query: 108 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----------- 155
YC+CFA G +C C C CFN E+ + + +Q RNP AF PKI
Sbjct: 1 YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAEPSPVPG 60
Query: 156 ---------------GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ L RH +GCNCK+S CLK YCECY
Sbjct: 61 VTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECY 106
>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 313
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 45/92 (48%), Gaps = 3/92 (3%)
Query: 96 PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
PC C K CLKLYCDCFA FC C C C N + A R LERNP+AF +
Sbjct: 200 PCTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAFH-R 257
Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
G GC C RS CLK YC C+
Sbjct: 258 AHSLETRPGSVTSGAGCGCTRSHCLKKYCVCF 289
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 48/134 (35%), Gaps = 37/134 (27%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YCDCFA FC C C C N + A R LERNP+AF
Sbjct: 212 YCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAF--------------- 255
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC- 118
+ A + P + C CT+S CLK YC CF C
Sbjct: 256 -------------------HRAHSLETRPGSVTSGAGCGCTRSHCLKKYCVCFQKSLLCG 296
Query: 119 YQCNCNSCFNNIEH 132
C C C N +E+
Sbjct: 297 AGCKCEGCLNTVEN 310
>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
Length = 482
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 56/106 (52%), Gaps = 14/106 (13%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C +S C+KLYC+CF + FC C CN C N E+E++ + L RNP AF
Sbjct: 357 KGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEK 416
Query: 154 KI----GKCLVGEGER---------RHTKGCNCKRSGCLKNYCECY 186
K+ +G G + RH KGC CKRS C NYCEC+
Sbjct: 417 KLQIEEASQPLGMGPKAASAPSSNLRHIKGCTCKRSKCSNNYCECH 462
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 51/98 (52%), Gaps = 14/98 (14%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI----GKCLVGE 55
YC+CF + FC C CN C N E+E++ + L RNP AF K+ +G
Sbjct: 370 YCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEKKLQIEEASQPLGM 429
Query: 56 GER---------RHTKGCNCKRSGCLKNYCECYEAGAR 84
G + RH KGC CKRS C NYCEC++ GA+
Sbjct: 430 GPKAASAPSSNLRHIKGCTCKRSKCSNNYCECHQHGAK 467
>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|194699308|gb|ACF83738.1| unknown [Zea mays]
gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|224028923|gb|ACN33537.1| unknown [Zea mays]
gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 359
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 53/123 (43%), Gaps = 21/123 (17%)
Query: 85 TLEPNGIRPR----------KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHE 133
+ P+ RPR K C C KS CLKLYC CF+ G +C +C C CFN
Sbjct: 47 AIAPDPKRPRVEEAADGNGCKHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFA 106
Query: 134 DDRHHAIRQCLERN----------PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYC 183
+ H + L R P A + +GCNCK+S CLK YC
Sbjct: 107 ETVHTTRKVLLSRQKRMSMKINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYC 166
Query: 184 ECY 186
+CY
Sbjct: 167 DCY 169
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERN----------PNAFRPKIG 49
YC CF+ G +C +C C CFN + H + L R P A +
Sbjct: 80 YCPCFSGGGYCSDKCGCQPCFNKEAFAETVHTTRKVLLSRQKRMSMKINRRPEANTEPME 139
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+GCNCK+S CLK YC+CY+ G
Sbjct: 140 DAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGT 173
>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 60.1 bits (144), Expect = 4e-07, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 38/76 (50%), Gaps = 13/76 (17%)
Query: 19 CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------------LVGEGERRHTKGCN 65
CFN E+ED +Q RNP AF PKI + L RH GCN
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHVTEFQAIDVEDVDLFTPYSGRHKTGCN 503
Query: 66 CKRSGCLKNYCECYEA 81
CKRS C+K YCECY+A
Sbjct: 504 CKRSMCVKKYCECYQA 519
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 36/74 (48%), Gaps = 13/74 (17%)
Query: 126 CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC-------------LVGEGERRHTKGCN 172
CFN E+ED +Q RNP AF PKI + L RH GCN
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHVTEFQAIDVEDVDLFTPYSGRHKTGCN 503
Query: 173 CKRSGCLKNYCECY 186
CKRS C+K YCECY
Sbjct: 504 CKRSMCVKKYCECY 517
>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 375
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 49/104 (47%), Gaps = 13/104 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C KS CLKLYC CFA G +C + C C CFN + + + L R
Sbjct: 80 KHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSRQ-KRMSL 138
Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
KI + L E +GCNCK+S CLK YC+CY
Sbjct: 139 KINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCY 182
Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 69/181 (38%), Gaps = 59/181 (32%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA G +C + C C CFN + + + L R ++R
Sbjct: 93 YCPCFAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSR-----------------QKR 135
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLE-----PNGIRPRKPCNCTKSMCLKLYCDCFAN 114
+ N + EA A T+E + PR+ CNC KS CLK YCDC+ +
Sbjct: 136 MSLKINRR-----------LEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184
Query: 115 GEFCYQ-CNCNSCFN---------------------NIEHEDDRHHAIRQCLERNPNAFR 152
G C C C C N +++H +D H I +ER+P
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNEGIMADESKRFLYTGADLDHSEDEHDFI---VERSPRLQS 241
Query: 153 P 153
P
Sbjct: 242 P 242
>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 103
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/80 (46%), Positives = 42/80 (52%), Gaps = 13/80 (16%)
Query: 3 DCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGE-GERRHT 61
D A G +C+ F I L ++P +PKIG E RRHT
Sbjct: 15 DVSALGRLLPAGDCSELFGRITAG----------LGQSP--VQPKIGATSAAEDATRRHT 62
Query: 62 KGCNCKRSGCLKNYCECYEA 81
KGCNCKRSGCLKNYCECYEA
Sbjct: 63 KGCNCKRSGCLKNYCECYEA 82
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 43/88 (48%), Gaps = 13/88 (14%)
Query: 100 TKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCL 159
T S + D A G +C+ F I L ++P +PKIG
Sbjct: 5 TSSATITGITDVSALGRLLPAGDCSELFGRITAG----------LGQSP--VQPKIGATS 52
Query: 160 VGE-GERRHTKGCNCKRSGCLKNYCECY 186
E RRHTKGCNCKRSGCLKNYCECY
Sbjct: 53 AAEDATRRHTKGCNCKRSGCLKNYCECY 80
>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 319
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 35/54 (64%)
Query: 28 DRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEA 81
DR AI LERNP AF+PK+ H +GCNC++S CLKNYCEC++A
Sbjct: 19 DRSQAIASILERNPLAFQPKVVANGSETNNLHHLRGCNCRKSNCLKNYCECHQA 72
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 33/52 (63%)
Query: 135 DRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
DR AI LERNP AF+PK+ H +GCNC++S CLKNYCEC+
Sbjct: 19 DRSQAIASILERNPLAFQPKVVANGSETNNLHHLRGCNCRKSNCLKNYCECH 70
>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
Length = 286
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 52/94 (55%), Gaps = 8/94 (8%)
Query: 98 NCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--RPKI 155
NC+ + C+K YC CF ++C CN C NN E+E++R AI L +NP AF + K
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203
Query: 156 GKCLVGEGERR----HTKGCNCKRSGCLKNYCEC 185
K + + ++ HT CNC +SGC YC C
Sbjct: 204 RKNFIKKSKKDSNLIHT--CNCSKSGCNNKYCVC 235
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 8/88 (9%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF--RPKIGKCLVGEGER 58
YC CF ++C CN C NN E+E++R AI L +NP AF + K K + + ++
Sbjct: 154 YCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKTRKNFIKKSKK 213
Query: 59 R----HTKGCNCKRSGCLKNYCECYEAG 82
HT CNC +SGC YC C + G
Sbjct: 214 DSNLIHT--CNCSKSGCNNKYCVCLKEG 239
>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 356
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 47/103 (45%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
K C C KS CLKLYC CFA G +C +C C CFN + R +
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 121
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ R P A + +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
YC CFA G +C +C C CFN + R + + R P A +
Sbjct: 75 YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+GCNCK+S CLK YC+CY+ G
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 168
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 53/151 (35%), Gaps = 57/151 (37%)
Query: 37 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
L P+ R ++ + G G K C CK+S CLK YC C+
Sbjct: 41 LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFN 96
Query: 80 ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
EA A +E + PR+ CNC KS C
Sbjct: 97 KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 156
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
LK YCDC+ +G C C C+ C N D
Sbjct: 157 LKKYCDCYQDGTGCSLFCRCDDCQNPFGKND 187
>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
Length = 341
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 47/103 (45%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
K C C KS CLKLYC CFA G +C +C C CFN + R +
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 121
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ R P A + +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 39/91 (42%), Gaps = 11/91 (12%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
YC CFA G +C +C C CFN + R + + R P A +
Sbjct: 75 YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYE 80
+GCNCK+S CLK YC+CY+
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165
>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
Length = 434
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 51/101 (50%), Gaps = 12/101 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C ++ CLK YC CF N C C C C N+ +HE R A+R +P AF+
Sbjct: 15 KGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFK- 73
Query: 154 KIGKCLVGEGERRHT--------KGCNCKRSGCLKNYCECY 186
G L E + HT +GC C+RS C K YCEC+
Sbjct: 74 --GTSLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECF 112
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 46/93 (49%), Gaps = 12/93 (12%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF N C C C C N+ +HE R A+R +P AF+ G L E +
Sbjct: 28 YCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFK---GTSLELENQEV 84
Query: 60 HT--------KGCNCKRSGCLKNYCECYEAGAR 84
HT +GC C+RS C K YCEC+ AG +
Sbjct: 85 HTPNGTLKTVRGCRCRRSKCQKKYCECFGAGLK 117
>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 11/96 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA---IRQCLERNPNA 150
+PC+C K+ CLK+YC CF NG FC C C C N E + R A +++ RN
Sbjct: 87 QPCSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKV 146
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ KI + + GCNC ++ C+K YCEC+
Sbjct: 147 PKEKIFETV-------EIWGCNCSKTRCVKKYCECF 175
Score = 43.9 bits (102), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 11/88 (12%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAI---RQCLERNPNAFRPKIGKCLVGEG 56
YC CF NG FC C C C N E + R A+ ++ RN + KI + +
Sbjct: 100 YCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKVPKEKIFETV---- 155
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEAGAR 84
GCNC ++ C+K YCEC+ G +
Sbjct: 156 ---EIWGCNCSKTRCVKKYCECFIRGKK 180
>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 12/97 (12%)
Query: 91 IRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPN 149
+ +PCNC+KS C K+YC+C A G C C C +C N ++ H I++ L+ N N
Sbjct: 66 LSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNK-KVLHVIKE-LDLNKN 123
Query: 150 AFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ K +R GC CK+S CLK YCEC+
Sbjct: 124 LKKIK---------SKRFKDGCTCKKSMCLKKYCECF 151
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 12/83 (14%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+C A G C C C +C N ++ H I++ L+ N N + K +R
Sbjct: 83 YCECLAKGRLCSSACRCENCQNRTSNKK-VLHVIKE-LDLNKNLKKIK---------SKR 131
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
GC CK+S CLK YCEC+ +G
Sbjct: 132 FKDGCTCKKSMCLKKYCECFHSG 154
Score = 43.5 bits (101), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 44/102 (43%), Gaps = 32/102 (31%)
Query: 64 CNCKRSGCLKNYCECYEAG------------------------ARTLEPN-------GIR 92
CNC +S C K YCEC G + L+ N R
Sbjct: 72 CNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNKKVLHVIKELDLNKNLKKIKSKR 131
Query: 93 PRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHE 133
+ C C KSMCLK YC+CF +G+ C Y CNC +C N + E
Sbjct: 132 FKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDE 173
>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
distachyon]
Length = 377
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 49/104 (47%), Gaps = 13/104 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C KS CLKLYC CFA G +C + C C CFN ++ + + L R
Sbjct: 89 KHCACKKSKCLKLYCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSRQ-KRMSL 147
Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
KI + E +GCNCK+S CLK YC+CY
Sbjct: 148 KINRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCY 191
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 57/135 (42%), Gaps = 35/135 (25%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA G +C + C C CFN ++ + + L R ++R
Sbjct: 102 YCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSR-----------------QKR 144
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLE-----PNGIRPRKPCNCTKSMCLKLYCDCFAN 114
+ N +RS EA A +E + PR+ CNC KS CLK YCDC+ +
Sbjct: 145 MSLKIN-RRS----------EANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 193
Query: 115 GEFCYQ-CNCNSCFN 128
G C C C C N
Sbjct: 194 GTGCSLFCRCEDCKN 208
>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
Length = 359
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 47/103 (45%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
K C C KS CLKLYC CF+ G +C +C C CFN + R +
Sbjct: 67 KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMK 126
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ R P A + +GCNCK+S CLK YC+CY
Sbjct: 127 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 169
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 40/94 (42%), Gaps = 11/94 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
YC CF+ G +C +C C CFN + R + + R P A +
Sbjct: 80 YCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 139
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+GCNCK+S CLK YC+CY+ G
Sbjct: 140 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 173
Score = 37.4 bits (85), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 52/145 (35%), Gaps = 57/145 (39%)
Query: 37 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
L P+ R ++ + G G K C CK+S CLK YC C+
Sbjct: 46 LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFN 101
Query: 80 ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
EA A +E + PR+ CNC KS C
Sbjct: 102 KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 161
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFN 128
LK YCDC+ +G C C C+ C N
Sbjct: 162 LKKYCDCYQDGTGCSLFCRCDDCQN 186
>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
Length = 356
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 47/103 (45%), Gaps = 11/103 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQC 143
K C C KS CLKLYC CFA G +C +C C CF+ + R +
Sbjct: 62 KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQKRMSMK 121
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ R P A + +GCNCK+S CLK YC+CY
Sbjct: 122 INRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 164
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 40/94 (42%), Gaps = 11/94 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
YC CFA G +C +C C CF+ + R + + R P A +
Sbjct: 75 YCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 134
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+GCNCK+S CLK YC+CY+ G
Sbjct: 135 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 168
Score = 37.0 bits (84), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 53/151 (35%), Gaps = 57/151 (37%)
Query: 37 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY----------------- 79
L P+ R ++ + G G K C CK+S CLK YC C+
Sbjct: 41 LAIAPDPKRQRVEEAADGNG----CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFS 96
Query: 80 ------------------------------EAGARTLE-----PNGIRPRKPCNCTKSMC 104
EA A +E + PR+ CNC KS C
Sbjct: 97 KEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSC 156
Query: 105 LKLYCDCFANGEFCYQ-CNCNSCFNNIEHED 134
LK YCDC+ +G C C C+ C N D
Sbjct: 157 LKKYCDCYQDGTGCSLFCRCDDCQNPFGKND 187
>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
Length = 374
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 48/104 (46%), Gaps = 13/104 (12%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C C KS CLKLYC CFA G +C + C C CFN + + + L R
Sbjct: 86 KHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSRQ-KRMSL 144
Query: 154 KIGKCLVGEGERRHT-----------KGCNCKRSGCLKNYCECY 186
KI + E +GCNCK+S CLK YC+CY
Sbjct: 145 KINRRSEANAEAVEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 188
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 25/130 (19%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA G +C + C C CFN + + + L R ++R
Sbjct: 99 YCPCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSR-----------------QKR 141
Query: 60 HTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY 119
+ N + + + + + + T PR+ CNC KS CLK YCDC+ +G C
Sbjct: 142 MSLKINRRSEANAEAVEDAHHSSSST------PPRRGCNCKKSSCLKKYCDCYQDGTGCS 195
Query: 120 Q-CNCNSCFN 128
C C C N
Sbjct: 196 LFCRCEDCRN 205
>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
Length = 184
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/25 (100%), Positives = 25/25 (100%)
Query: 57 ERRHTKGCNCKRSGCLKNYCECYEA 81
ERRHTKGCNCKRSGCLKNYCECYEA
Sbjct: 1 ERRHTKGCNCKRSGCLKNYCECYEA 25
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/23 (100%), Positives = 23/23 (100%)
Query: 164 ERRHTKGCNCKRSGCLKNYCECY 186
ERRHTKGCNCKRSGCLKNYCECY
Sbjct: 1 ERRHTKGCNCKRSGCLKNYCECY 23
>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
Length = 466
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 40/67 (59%), Gaps = 5/67 (7%)
Query: 19 CFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCE 77
C N +E+ Q RNPNAF KI +G+ RHTKGC+CK+S CLK YCE
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKIED----DGDDARHTKGCHCKKSACLKKYCE 226
Query: 78 CYEAGAR 84
C++AG +
Sbjct: 227 CFQAGVK 233
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 67/154 (43%), Gaps = 27/154 (17%)
Query: 56 GERRHTKGCNCKRSG----CLKNY-CECYEAGARTLEPNGIRPRKPCNCTK------SMC 104
G+R T G + + + LKN+ + G R+ + PR P + S+
Sbjct: 79 GKRSSTTGASTEWTAGFTKTLKNFDVDSPRVGTRSARRTVVPPRTPSKAVRRPRSLRSIP 138
Query: 105 -LKLYC-------DCFANGEFCYQCNCNS---CFNNIEHEDDRHHAIRQCLERNPNAFRP 153
++L + A C N+ C N +E+ Q RNPNAF
Sbjct: 139 RIRLLAVNQLRARNATAKKASVSSCTANASRRCQNTVENVSVVEKTRSQIEARNPNAFLS 198
Query: 154 KIGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECY 186
KI +G+ RHTKGC+CK+S CLK YCEC+
Sbjct: 199 KIED----DGDDARHTKGCHCKKSACLKKYCECF 228
>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
Length = 478
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 48/95 (50%), Gaps = 26/95 (27%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
K C+C S CLK+YC+CFAN C C C C NN A K
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNN-------------------EANIEK 143
Query: 155 IG---KCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
IG L+ +H++GC+C++SGCLK YCEC+
Sbjct: 144 IGNDQSVLID----KHSRGCHCRKSGCLKKYCECF 174
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 42/86 (48%), Gaps = 26/86 (30%)
Query: 1 YCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIG---KCLVGEGE 57
YC+CFAN C C C C NN A KIG L+
Sbjct: 116 YCECFANKILCNNCQCFGCQNN-------------------EANIEKIGNDQSVLID--- 153
Query: 58 RRHTKGCNCKRSGCLKNYCECYEAGA 83
+H++GC+C++SGCLK YCEC++AG
Sbjct: 154 -KHSRGCHCRKSGCLKKYCECFQAGI 178
>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
Length = 113
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC+CFA+ +C + C+C CFN+ HE+ + RNP AF P
Sbjct: 19 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 78
Query: 154 KI 155
K+
Sbjct: 79 KV 80
Score = 39.7 bits (91), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 48
YC+CFA+ +C + C+C CFN+ HE+ + RNP AF PK+
Sbjct: 32 YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKV 80
>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+PCNC++S CLK YC CF +G C +C C C N E ++R AI ++ +
Sbjct: 149 QPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKKCHRDKKV 208
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + L + + GC CK +GC K YCEC+
Sbjct: 209 PVNELL----SLQMSYGCKCKSTGCQKKYCECF 237
Score = 40.4 bits (93), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF +G C +C C C N E ++R AI ++ + + + L +
Sbjct: 162 YCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKKCHRDKKVPVNELL----SLQ 217
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
+ GC CK +GC K YCEC++ G
Sbjct: 218 MSYGCKCKSTGCQKKYCECFKRG 240
>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|219884557|gb|ACL52653.1| unknown [Zea mays]
gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
Length = 394
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC CFA+G C + C C C N + + A L + F
Sbjct: 87 KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL-KPVQTF-- 143
Query: 154 KIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
+ G+ +V + R + C CK+SGCLK YC+CY
Sbjct: 144 EAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCY 180
Score = 43.5 bits (101), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA+G C + C C C N + + A L + F + G+ +V + R
Sbjct: 100 YCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL-KPVQTF--EAGQDIVEQVIRS 156
Query: 60 HT----KGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 157 PMDLIRRKCTCKKSGCLKKYCDCYQGGA 184
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 41/111 (36%), Gaps = 44/111 (39%)
Query: 62 KGCNCKRSGCLKNYCECYEAGAR------------------TLEPNGIRPRKP------- 96
K C+CK+S CLK YC C+ +G TL + P KP
Sbjct: 87 KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPLKPVQTFEAG 146
Query: 97 ------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
C C KS CLK YCDC+ G C C C+ C N
Sbjct: 147 QDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 197
>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 47/91 (51%), Gaps = 5/91 (5%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C+KS CL++YC CF NG C C C C N ++ R A+ ++ +
Sbjct: 89 CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPE 148
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
GK E GCNC ++ C+KNYCEC+
Sbjct: 149 GKIF----ETTEIWGCNCSKTRCVKNYCECF 175
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF NG C C C C N ++ R A+ ++ + GK E
Sbjct: 100 YCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPEGKIF----ETT 155
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
GCNC ++ C+KNYCEC+ G +
Sbjct: 156 EIWGCNCSKTRCVKNYCECFIRGKK 180
>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 622
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 1/59 (1%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
+ PC C KS+CLKLYC+CFA GE C + C C C N+ H + R+ AI+ +NP+AF
Sbjct: 326 KTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384
>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 11/108 (10%)
Query: 80 EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
+ ++ EPN PC CTKS CLKLYC CF C + C C +C N +H R
Sbjct: 188 DLHSQQQEPNTT----PCKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQT 243
Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
A+ + + + K +R GC C++S C KNYCECY
Sbjct: 244 ALEKI------KIKSQRQKHDDDLFDRSRVWGCKCQKSQCQKNYCECY 285
>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
Length = 204
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC C+A+G C + C C C N +
Sbjct: 78 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+I + L+ + C CK+SGCLK YC+CY
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 170
Score = 42.4 bits (98), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC C+A+G C + C C C N + +I + L+
Sbjct: 91 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 150
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 151 IRRKCTCKKSGCLKKYCDCYQGGA 174
Score = 41.2 bits (95), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 53/134 (39%), Gaps = 47/134 (35%)
Query: 38 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLE---------- 87
E N N+ R + K G+G K C+CK+S CLK YC CY +G+ E
Sbjct: 58 EANGNSDRKRRRKADDGDG----CKACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNK 113
Query: 88 -PNG------IRPRKP-------------------------CNCTKSMCLKLYCDCFANG 115
P G + P KP C C KS CLK YCDC+ G
Sbjct: 114 PPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGG 173
Query: 116 EFC-YQCNCNSCFN 128
C C C+ C N
Sbjct: 174 AGCSINCKCDDCRN 187
>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
Length = 409
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS C+KLYC CFA+G C + C C C N + A+ L+ +
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTS--- 158
Query: 154 KIGKCLVGEGERRHT----KGCNCKRSGCLKNYCECY 186
+ G+ +V + R + C CK+SGCLK YC+CY
Sbjct: 159 EAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCY 195
Score = 43.1 bits (100), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 8/88 (9%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA+G C + C C C N + A+ L+ + + G+ +V + R
Sbjct: 115 YCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTS---EAGQDIVEQVIRS 171
Query: 60 HT----KGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 172 PMDLIRRKCTCKKSGCLKKYCDCYQGGA 199
Score = 39.3 bits (90), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 44/111 (39%), Gaps = 44/111 (39%)
Query: 62 KGCNCKRSGCLKNYCECYEAGARTLE-----------PNGIRPR-------KP------- 96
K C+CK+S C+K YC C+ +G+R E P + PR KP
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLKPVQTSEAG 161
Query: 97 ------------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
C C KS CLK YCDC+ G C C C+ C N
Sbjct: 162 QDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 212
>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
Length = 151
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 54/99 (54%), Gaps = 9/99 (9%)
Query: 89 NGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
N ++ C C K+ CLKLYC CF G+ C QC+C+ C N++ +R+ AI + + +
Sbjct: 19 NSEETKEGCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKY 78
Query: 148 PNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + K G++ GC CK+S C NYCEC+
Sbjct: 79 KDESKLKFS------GDK--GSGCCCKKSKCQLNYCECF 109
Score = 39.7 bits (91), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF G+ C QC+C+ C N++ +R+ AI + + + + + K G++
Sbjct: 38 YCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKYKDESKLKFS------GDK- 90
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
GC CK+S C NYCEC+ G
Sbjct: 91 -GSGCCCKKSKCQLNYCECFIKG 112
>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
Length = 123
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 52/103 (50%), Gaps = 11/103 (10%)
Query: 85 TLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQC 143
TL P IR C CTK+ CLKLYC+C G C C C C N + +R A++
Sbjct: 7 TLPPPSIR----CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFP-ERKKALKHI 61
Query: 144 LERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ NP AF ++ L G + R C C++S C K YC CY
Sbjct: 62 KKTNPKAFL-RVSN-LKGHEDSR---CCACQKSLCNKKYCSCY 99
Score = 37.4 bits (85), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+C G C C C C N + + R A++ + NP AF ++ L G + R
Sbjct: 26 YCECLKAGRTCTDLCACVDCLNRAKFPE-RKKALKHIKKTNPKAFL-RVSN-LKGHEDSR 82
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
C C++S C K YC CY++G
Sbjct: 83 ---CCACQKSLCNKKYCSCYQSG 102
>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+PCNC++S CLK YC CF +G C +C C C N + ++R AI ++ +
Sbjct: 149 QPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKKCHRDKKV 208
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + L + + GC CK +GC K YCEC+
Sbjct: 209 PVNELL----SLQMSYGCKCKTTGCQKKYCECF 237
Score = 38.9 bits (89), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF +G C +C C C N + ++R AI ++ + + + L +
Sbjct: 162 YCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKKCHRDKKVPVNELL----SLQ 217
Query: 60 HTKGCNCKRSGCLKNYCECYEAG 82
+ GC CK +GC K YCEC++ G
Sbjct: 218 MSYGCKCKTTGCQKKYCECFKRG 240
>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
Length = 391
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC C+A+G C + C C C N +
Sbjct: 76 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 135
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+I + L+ + C CK+SGCLK YC+CY
Sbjct: 136 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 168
Score = 41.6 bits (96), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC C+A+G C + C C C N + +I + L+
Sbjct: 89 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 148
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 149 IRRKCTCKKSGCLKKYCDCYQGGA 172
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 43/110 (39%), Gaps = 43/110 (39%)
Query: 62 KGCNCKRSGCLKNYCECYEAGARTLE-----------PNG------IRPRKP-------- 96
K C+CK+S CLK YC CY +G+ E P G + P KP
Sbjct: 76 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 135
Query: 97 -----------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
C C KS CLK YCDC+ G C C C+ C N
Sbjct: 136 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 185
>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
gi|194698372|gb|ACF83270.1| unknown [Zea mays]
gi|194708474|gb|ACF88321.1| unknown [Zea mays]
gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
Length = 394
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
K C+C KS CLKLYC C+A+G C + C C C N +
Sbjct: 78 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+I + L+ + C CK+SGCLK YC+CY
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCY 170
Score = 41.6 bits (96), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC C+A+G C + C C C N + +I + L+
Sbjct: 91 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 150
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 151 IRRKCTCKKSGCLKKYCDCYQGGA 174
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 43/110 (39%), Gaps = 43/110 (39%)
Query: 62 KGCNCKRSGCLKNYCECYEAGARTLE-----------PNG------IRPRKP-------- 96
K C+CK+S CLK YC CY +G+ E P G + P KP
Sbjct: 78 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQ 137
Query: 97 -----------------CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
C C KS CLK YCDC+ G C C C+ C N
Sbjct: 138 EIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 187
>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 405
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C+C KS CLKLYC CFA+G C + C C+ C+N H ++ L+ A +
Sbjct: 97 CSCKKSRCLKLYCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQN 156
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ C CK+SGCLK YC+CY
Sbjct: 157 SAEFARYQMDFFRRKCTCKKSGCLKKYCDCY 187
Score = 42.0 bits (97), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CFA+G C + C C+ C+N H ++ L+ A +
Sbjct: 108 YCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQNSAEFARYQMDF 167
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 168 FRRKCTCKKSGCLKKYCDCYQGGA 191
Score = 35.8 bits (81), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 54/140 (38%), Gaps = 47/140 (33%)
Query: 32 AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLE---- 87
A+ E N ++ R K K GEG R C+CK+S CLK YC C+ +G+ E
Sbjct: 69 AVSPPAEANGSSDRRKKRKAEDGEGCRT----CSCKKSRCLKLYCVCFASGSHCSELCGC 124
Query: 88 ---------------PNGIRP-----------------------RKPCNCTKSMCLKLYC 109
P G+ R+ C C KS CLK YC
Sbjct: 125 DPCYNKPIHGVQQNTPPGLPLQVVRTAEAGQNSAEFARYQMDFFRRKCTCKKSGCLKKYC 184
Query: 110 DCFANGEFC-YQCNCNSCFN 128
DC+ G C C C+ C N
Sbjct: 185 DCYQGGAGCSINCKCDDCKN 204
>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
Length = 255
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 48/91 (52%), Gaps = 7/91 (7%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C CTKS CLKLYC CF + C C C C N EH + R +A+ + ++ R K
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQK---LHRQKN 192
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
L +R GC C++S C KNYCEC+
Sbjct: 193 DDDLF---DRSKIWGCKCQKSQCQKNYCECF 220
Score = 36.6 bits (83), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF + C C C C N EH + R +A+ + ++ R K L +R
Sbjct: 147 YCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQK---LHRQKNDDDLF---DRS 200
Query: 60 HTKGCNCKRSGCLKNYCECY 79
GC C++S C KNYCEC+
Sbjct: 201 KIWGCKCQKSQCQKNYCECF 220
>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
Length = 289
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
KPC+CTK+ CL+LYC CF N C +C CN C+N+ +HE+D A+ Q
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYNDGQHEEDVLKAVEQI 205
>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
Length = 280
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
KPC+CTK+ CL+LYC CF N C +C CN C+N+ +HE++ A+ Q
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEEEVQKAVEQI 198
>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 29/45 (64%), Gaps = 4/45 (8%)
Query: 44 FRPKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECYEAGAR 84
FRPK+ E E+ RH KGCNCK+S CLK YCECY+ G +
Sbjct: 244 FRPKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVK 288
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 26/40 (65%), Gaps = 4/40 (10%)
Query: 151 FRPKIGKCLVGEGER----RHTKGCNCKRSGCLKNYCECY 186
FRPK+ E E+ RH KGCNCK+S CLK YCECY
Sbjct: 244 FRPKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECY 283
>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 45/107 (42%), Gaps = 29/107 (27%)
Query: 81 AGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHA 139
+ +E + C C KS C K YC+CFA G+ C CNC C N I + +
Sbjct: 62 TKIKQIEKPILNTITKCKCMKSKCQKSYCECFAAGKACNMDCNCQGCNNTICSQSIK--- 118
Query: 140 IRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ T GCNCK++GCLK YCECY
Sbjct: 119 -------------------------KVQTGGCNCKKTGCLKKYCECY 140
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 58/130 (44%), Gaps = 24/130 (18%)
Query: 37 LERNPNAFRP--------KIGKCLVGEGERRHT-KGCNCKRSGCLKNYCECYEAGAR--- 84
++ NP F P KI K E +T C C +S C K+YCEC+ AG
Sbjct: 42 IDDNPKFFSPIRPFPIQQKITKIKQIEKPILNTITKCKCMKSKCQKSYCECFAAGKACNM 101
Query: 85 ---------TLEPNGIRPRKP--CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEH 132
T+ I+ + CNC K+ CLK YC+C+ + C + CNC C N +
Sbjct: 102 DCNCQGCNNTICSQSIKKVQTGGCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDS 161
Query: 133 EDDRHHAIRQ 142
E ++ + Q
Sbjct: 162 ESEKENEQLQ 171
Score = 42.0 bits (97), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 36/85 (42%), Gaps = 29/85 (34%)
Query: 1 YCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G+ C CNC C N I + + +
Sbjct: 89 YCECFAAGKACNMDCNCQGCNNTICSQSIK----------------------------KV 120
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
T GCNCK++GCLK YCECY R
Sbjct: 121 QTGGCNCKKTGCLKKYCECYLKKQR 145
>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 78
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
Query: 103 MCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
MCLKLYC CFA C C C +C+N+ EH +R A+R+ L R+P+AF K
Sbjct: 1 MCLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKF 54
Score = 40.0 bits (92), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 48
YC CFA C C C +C+N+ EH +R A+R+ L R+P+AF K
Sbjct: 6 YCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKF 54
>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 100 TKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKC 158
T+S L YC+CFA G +C + C+C CFNN HE Q RNP AF PK+ +
Sbjct: 3 TRSCYLYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRN 62
Query: 159 LVGEGERRHTKGCN 172
L E R GC+
Sbjct: 63 LDSASEFRGGVGCS 76
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CFA G +C + C+C CFNN HE Q RNP AF PK+ + L E R
Sbjct: 11 YCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSASEFR 70
Query: 60 HTKGCN 65
GC+
Sbjct: 71 GGVGCS 76
>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
gi|194696492|gb|ACF82330.1| unknown [Zea mays]
Length = 278
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 20 FNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKR 68
NN H + +Q RNP AF PK+ G L RH +GCNCK+
Sbjct: 1 MNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKK 60
Query: 69 SGCLKNYCECYEAGA 83
S CLK YCEC++ G
Sbjct: 61 SSCLKKYCECFQGGV 75
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 34/71 (47%), Gaps = 11/71 (15%)
Query: 127 FNNIEHEDDRHHAIRQCLERNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKR 175
NN H + +Q RNP AF PK+ G L RH +GCNCK+
Sbjct: 1 MNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKK 60
Query: 176 SGCLKNYCECY 186
S CLK YCEC+
Sbjct: 61 SSCLKKYCECF 71
>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
Length = 285
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 51/98 (52%), Gaps = 12/98 (12%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNC-NSCFNNIEHEDDRHHAIRQCLE---RNP 148
++PC C S CLK YC CF +G C + C C C NN +H + R+ AI+ E RN
Sbjct: 157 QQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNK 216
Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
N + + K + GCNCK++ C YCEC+
Sbjct: 217 NVPKDALFKL-------KDCFGCNCKKTRCQTGYCECF 247
Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 42/84 (50%), Gaps = 12/84 (14%)
Query: 1 YCDCFANGEFCYQ-CNC-NSCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
YC CF +G C + C C C NN +H + R+ AI+ E RN N + + K
Sbjct: 171 YCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNKNVPKDALFKL---- 226
Query: 56 GERRHTKGCNCKRSGCLKNYCECY 79
+ GCNCK++ C YCEC+
Sbjct: 227 ---KDCFGCNCKKTRCQTGYCECF 247
>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
Length = 278
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 52/98 (53%), Gaps = 12/98 (12%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCY-QCNC-NSCFNNIEHEDDRHHAIRQCLE---RNP 148
++PC C S CLK YC CF +G C +C C C NN +H ++R+ AI+ E RN
Sbjct: 150 QQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNK 209
Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
N + + K + GC+CK+S C YCEC+
Sbjct: 210 NIPKDALFKL-------KDCFGCSCKKSRCQTGYCECF 240
Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 12/84 (14%)
Query: 1 YCDCFANGEFCY-QCNC-NSCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
YC CF +G C +C C C NN +H ++R+ AI+ E RN N + + K
Sbjct: 164 YCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNKNIPKDALFKL---- 219
Query: 56 GERRHTKGCNCKRSGCLKNYCECY 79
+ GC+CK+S C YCEC+
Sbjct: 220 ---KDCFGCSCKKSRCQTGYCECF 240
>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 46/101 (45%), Gaps = 25/101 (24%)
Query: 96 PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
PC C+KS CLKLYC CF C + C C+ C N ++ R A+
Sbjct: 224 PCKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE------------- 270
Query: 155 IGKCLVGEGERRH---------TKGCNCKRSGCLKNYCECY 186
K V + R+H GC C++S C KNYCEC+
Sbjct: 271 --KVKVKQQRRKHDDDLFDKDTVWGCQCRKSQCKKNYCECF 309
>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 70/161 (43%), Gaps = 44/161 (27%)
Query: 31 HAIRQCLERNPNAFRP-KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPN 89
H +++ E AF+ +I + E ++ + CNCK SGCLK YC C+ +G
Sbjct: 114 HTLQKKQENEEQAFKYFQILQNDPSEIQKLQQQPCNCKNSGCLKRYCRCFHSGR------ 167
Query: 90 GIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCN-SCFNNIEHEDDRHHAIRQCLE--- 145
MCLK +C C+ C NN +H++ R++AI +
Sbjct: 168 -------------MCLK-------------ECQCSEDCQNNEQHQEQRNNAIIHVDQKCY 201
Query: 146 RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RN R + K V GC+C +S C K YCECY
Sbjct: 202 RNRRMPRDALFKLDV-------IYGCSCTKSKCRKRYCECY 235
Score = 38.9 bits (89), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 12/84 (14%)
Query: 1 YCDCFANGEFCY-QCNCN-SCFNNIEHEDDRHHAIRQCLE---RNPNAFRPKIGKCLVGE 55
YC CF +G C +C C+ C NN +H++ R++AI + RN R + K V
Sbjct: 159 YCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAIIHVDQKCYRNRRMPRDALFKLDV-- 216
Query: 56 GERRHTKGCNCKRSGCLKNYCECY 79
GC+C +S C K YCECY
Sbjct: 217 -----IYGCSCTKSKCRKRYCECY 235
>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERN 147
C C KSMCLK YCDCFANG C QC C C N +++++R AI + +N
Sbjct: 58 CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQN 109
>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
Length = 303
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
Query: 96 PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
PCNC+KS CLKLYC CF + C + C C C N H R A+ + ++ R K
Sbjct: 182 PCNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALEKIKMKSQ---RQK 238
Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
L + GC C++S C KNYCEC+
Sbjct: 239 HDNDLFDLSK---VWGCKCQKSQCQKNYCECF 267
>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 87 EPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLE 145
+ I C C KSMCLK YCDCFANG C QC C C N ++ ++R AI +
Sbjct: 48 QSQNIPKSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKM 107
Query: 146 RN 147
+N
Sbjct: 108 QN 109
>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
Length = 590
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 20/92 (21%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
C+C K+ C + YCDC NG+ C C+C SC N IE+++ R +
Sbjct: 470 CSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQN----------------LRLE 513
Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
I + +G+ + +GC+CK S C K YCEC+
Sbjct: 514 IQEKKQKQGQAK--EGCSCKNSQCQKRYCECF 543
Score = 39.3 bits (90), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 31/119 (26%)
Query: 64 CNCKRSGCLKNYCECYEAGA-------------RTLEPNGIR------------PRKPCN 98
C+C+++ C + YC+C + G T+E +R ++ C+
Sbjct: 470 CSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQNLRLEIQEKKQKQGQAKEGCS 529
Query: 99 CTKSMCLKLYCDCFANGEFC--YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C S C K YC+CF NG C +C C C NN+ H H++ Q F+ KI
Sbjct: 530 CKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNLPHS----HSVHQMRPIVQQLFQSKI 584
>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
Length = 275
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 11/95 (11%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE---RNPNA 150
+PCNC++S CLK YC CF +G C +C C C N ++R AI + R+ N
Sbjct: 151 QPCNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREAAINYVFKKCNRDKNV 210
Query: 151 FRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
++ + + GC CK +GC K YCEC
Sbjct: 211 PANELLSLQI-------SYGCKCKSTGCQKKYCEC 238
>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
Length = 421
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 11/89 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 54
YC CFA+G C + C C+ C+N H R Q LE ++ ++ + +
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 186
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
G R+ C CK+SGCLK YC+CY+ GA
Sbjct: 187 FGRRK----CTCKKSGCLKKYCDCYQGGA 211
Score = 43.5 bits (101), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 161
YC CFA+G C + C C+ C+N H R Q LE ++ ++ + +
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 186
Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
G R+ C CK+SGCLK YC+CY
Sbjct: 187 FGRRK----CTCKKSGCLKKYCDCY 207
>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
Length = 407
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 11/89 (12%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 54
YC CFA+G C + C C+ C+N H R Q LE ++ ++ + +
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 172
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
G R+ C CK+SGCLK YC+CY+ GA
Sbjct: 173 FGRRK----CTCKKSGCLKKYCDCYQGGA 197
Score = 43.5 bits (101), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIR-----QCLERNPNAFRPKIGKCLVG 161
YC CFA+G C + C C+ C+N H R Q LE ++ ++ + +
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTPALPLKAVQTLETGQDSVE-QLIRSPMD 172
Query: 162 EGERRHTKGCNCKRSGCLKNYCECY 186
G R+ C CK+SGCLK YC+CY
Sbjct: 173 FGRRK----CTCKKSGCLKKYCDCY 193
>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
Length = 710
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 38/135 (28%), Positives = 51/135 (37%), Gaps = 35/135 (25%)
Query: 46 PKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCL 105
PK + G+ K C+CK+S CLK YCEC+ AG EP
Sbjct: 423 PKKKRHKFDNGDGTSCKRCSCKKSKCLKLYCECFHAGVFCSEP----------------- 465
Query: 106 KLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 165
C+C C N + + Q RNP AF PK+ + G+ E
Sbjct: 466 ---------------CSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQ-EL 509
Query: 166 RHTKGC--NCKRSGC 178
GC +C+ GC
Sbjct: 510 AGGVGCSVSCRCEGC 524
Score = 41.2 bits (95), Expect = 0.19, Method: Composition-based stats.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 4/74 (5%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC+CF G FC + C+C C N + + Q RNP AF PK+ + G+ E
Sbjct: 452 YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIESRNPLAFAPKVIRTEPGQ-ELA 510
Query: 60 HTKGC--NCKRSGC 71
GC +C+ GC
Sbjct: 511 GGVGCSVSCRCEGC 524
>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
Length = 246
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 12/112 (10%)
Query: 80 EAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED-DRH 137
+ + L ++ ++PCNC S C+K YC CF +G+ C +C C N EH + +R
Sbjct: 119 QKKTKILNITSLKCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERS 178
Query: 138 HAIRQCLE---RNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
AI+ E RN + + K V GC+C +S C K YCEC+
Sbjct: 179 EAIKHVNEKCYRNRKIPKEALFKLDV-------IYGCSCTKSKCRKRYCECF 223
>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
Length = 298
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 11/94 (11%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNPNAFRPKIG 49
YC CFA G +C +C C CFN + R + + R P A +
Sbjct: 17 YCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRPEANAEPME 76
Query: 50 KCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+GCNCK+S CLK YC+CY+ G
Sbjct: 77 DAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGT 110
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 11/98 (11%)
Query: 100 TKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHED----------DRHHAIRQCLERNP 148
T ++C+ YC CFA G +C +C C CFN + R + + R P
Sbjct: 9 TITVCICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKRMSMKINRRP 68
Query: 149 NAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
A + +GCNCK+S CLK YC+CY
Sbjct: 69 EANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 106
>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
Length = 96
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 32 AIRQCLERNPNAFRPKIGKCLVGEGERR--HTKGCNCKRSGCLKNYCECYEAGA 83
A+ RNPNAF K + ++ H KGC CK+S CLK YCEC+ AG
Sbjct: 2 AVNDAKNRNPNAFNIKFQIIEEKQDDKIVIHKKGCTCKKSNCLKKYCECFNAGV 55
Score = 42.7 bits (99), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERR--HTKGCNCKRSGCLKNYCECY 186
A+ RNPNAF K + ++ H KGC CK+S CLK YCEC+
Sbjct: 2 AVNDAKNRNPNAFNIKFQIIEEKQDDKIVIHKKGCTCKKSNCLKKYCECF 51
Score = 36.6 bits (83), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHH 138
+K C C KS CLK YC+CF G C Q C C+SC N + E+++
Sbjct: 33 KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKNMQKEENNQQQ 78
>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
+PC+C K+ CL+LYC CF N C +C CN C+N+ +HED+ A+ Q
Sbjct: 147 RPCHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAVEQI 196
>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQC 143
K C+C+K+ CL+LYC CF N C +C CN CFN+ +HED+ A+ Q
Sbjct: 124 KACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFNDGKHEDEVIKAVEQI 173
>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
gi|238005872|gb|ACR33971.1| unknown [Zea mays]
Length = 272
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 35/64 (54%), Gaps = 12/64 (18%)
Query: 32 AIRQCLE-RNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 79
+ RQ +E RNP AF PK+ G L RH +GCNCK+S CLK YCEC+
Sbjct: 6 STRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 65
Query: 80 EAGA 83
+ G
Sbjct: 66 QGGV 69
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 12/60 (20%)
Query: 139 AIRQCLE-RNPNAFRPKI------GKCL-----VGEGERRHTKGCNCKRSGCLKNYCECY 186
+ RQ +E RNP AF PK+ G L RH +GCNCK+S CLK YCEC+
Sbjct: 6 STRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECF 65
>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
Length = 330
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 19/95 (20%)
Query: 93 PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
++ C C K+ CLK YC CF G+FC + C C C NN ++++ I Q
Sbjct: 212 SKQGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE----LIEQT-------- 259
Query: 152 RPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
R +I + L E + C CK+S CLK YCECY
Sbjct: 260 RSQINRNLNDEVK------CFCKKSKCLKKYCECY 288
Score = 40.0 bits (92), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 45/105 (42%), Gaps = 28/105 (26%)
Query: 62 KGCNCKRSGCLKNYCECYEAG------------------------ARTLEPNGIRPRKPC 97
+GC CK++ CLK YC+C++ G R+ + C
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKELIEQTRSQINRNLNDEVKC 273
Query: 98 NCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
C KS CLK YC+C+ G+ C C C C N +E + H A R
Sbjct: 274 FCKKSKCLKKYCECYNAGKKCGVDCKCEGCMN---YEINLHEAQR 315
Score = 38.9 bits (89), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 43/85 (50%), Gaps = 19/85 (22%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC CF G+FC + C C C NN ++++ I Q R +I + L E +
Sbjct: 227 YCQCFQQGKFCGETCECLDCCNNEQNKE----LIEQT--------RSQINRNLNDEVK-- 272
Query: 60 HTKGCNCKRSGCLKNYCECYEAGAR 84
C CK+S CLK YCECY AG +
Sbjct: 273 ----CFCKKSKCLKKYCECYNAGKK 293
>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
Length = 573
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 15/23 (65%), Positives = 20/23 (86%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC CK+SGC+K YCEC++A
Sbjct: 452 KHNKGCQCKKSGCMKKYCECFQA 474
Score = 44.3 bits (103), Expect = 0.024, Method: Composition-based stats.
Identities = 14/21 (66%), Positives = 18/21 (85%)
Query: 166 RHTKGCNCKRSGCLKNYCECY 186
+H KGC CK+SGC+K YCEC+
Sbjct: 452 KHNKGCQCKKSGCMKKYCECF 472
Score = 38.1 bits (87), Expect = 1.6, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 19/35 (54%), Gaps = 1/35 (2%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFN 128
K C C KS C+K YC+CF C QC C C N
Sbjct: 455 KGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489
>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 381
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 45/91 (49%), Gaps = 17/91 (18%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDD---RHHAIRQCLERNPN 149
KPC C + CLKLYC CF G FC C C C+N E + R A+ + L R +
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETGGKRQEAVSEILLRRID 336
Query: 150 AF--RPKIGKCLVGEGERRHTKGCNCKRSGC 178
AF RPK ++ +GC CK++ C
Sbjct: 337 AFESRPK----------KKTGEGCACKKNRC 357
>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
Length = 97
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C KS C+KLYC CF+ +C Q C C +C N +ED + +NP AF PKI
Sbjct: 9 CRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKI 68
Query: 156 GKC 158
+
Sbjct: 69 VRV 71
>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
Length = 176
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/24 (79%), Positives = 21/24 (87%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEF 117
RKPCNCTKS CLKLYC+CFA G+
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQL 167
>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
Length = 628
Score = 45.8 bits (107), Expect = 0.009, Method: Composition-based stats.
Identities = 16/23 (69%), Positives = 21/23 (91%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQA 247
Score = 43.5 bits (101), Expect = 0.040, Method: Composition-based stats.
Identities = 15/21 (71%), Positives = 19/21 (90%)
Query: 166 RHTKGCNCKRSGCLKNYCECY 186
+H KGC+CK+SGCLK YCEC+
Sbjct: 225 KHNKGCHCKKSGCLKKYCECF 245
Score = 38.9 bits (89), Expect = 1.00, Method: Composition-based stats.
Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDR 136
K C+C KS CLK YC+CF C + C C C N E ++R
Sbjct: 228 KGCHCKKSGCLKKYCECFQANILCSENCKCMDC-KNFEGSEER 269
>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 39/64 (60%), Gaps = 1/64 (1%)
Query: 93 PRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAF 151
++ C C K+ CLKLYC CF +G+ C Q C CN C N++ +R+ AI + + + + +
Sbjct: 18 TKEGCQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNKAIGKIIHKYKDDY 77
Query: 152 RPKI 155
+ K+
Sbjct: 78 KQKL 81
>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
Length = 264
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 51/93 (54%), Gaps = 6/93 (6%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
KPC+C+K+ CL+LYC CF N C Q C C C+N+ +H D+ A+ Q + A
Sbjct: 146 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEVLKAVEQIKIKEQRASHH 205
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ + R GC CK++ C+K YCEC+
Sbjct: 206 DLDSF-----DTRQVWGCKCKKTKCVKGYCECF 233
>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 43.9 bits (102), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 7/97 (7%)
Query: 97 CNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
CNC + C+K YC C+AN FC C C SC N + + + L + +
Sbjct: 247 CNCQTTQCIKKYCSCYANNNFCVPSICKCQSCMNTFQQSYNSTSQKKPKLISSQKSLSQV 306
Query: 155 IGKCLVGEGERR-----HTKGCNCKRSGCLKNYCECY 186
I + + + T C C++S C K YC+CY
Sbjct: 307 ILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCY 343
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 7/91 (7%)
Query: 1 YCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGER 58
YC C+AN FC C C SC N + + + L + + I + +
Sbjct: 258 YCSCYANNNFCVPSICKCQSCMNTFQQSYNSTSQKKPKLISSQKSLSQVILYQIQNINQL 317
Query: 59 R-----HTKGCNCKRSGCLKNYCECYEAGAR 84
+ T C C++S C K YC+CY++G +
Sbjct: 318 KIQKENATISCTCQKSFCKKQYCDCYKSGKK 348
>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 43.9 bits (102), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 43/92 (46%), Gaps = 12/92 (13%)
Query: 96 PCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPK 154
PCNC++S C K+YC+C A G C C C +C H + + +E K
Sbjct: 71 PCNCSRSGCRKMYCECLAKGRLCSSFCRCQNC-----HNKTSNKLVLNVIEELDQNRNRK 125
Query: 155 IGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
RR GC C++S CLK YCEC+
Sbjct: 126 ------RFRSRRFKDGCTCQKSMCLKKYCECF 151
Score = 40.4 bits (93), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 54/139 (38%), Gaps = 38/139 (27%)
Query: 30 HHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAG------- 82
H I + +P + K G L+ + CNC RSGC K YCEC G
Sbjct: 44 QHTIPDNILISPEPSKKK-GSILLPSDD-----PCNCSRSGCRKMYCECLAKGRLCSSFC 97
Query: 83 ---------ARTLEPNGIRPRKP---------------CNCTKSMCLKLYCDCFANGEFC 118
+ L N I C C KSMCLK YC+CF +G+ C
Sbjct: 98 RCQNCHNKTSNKLVLNVIEELDQNRNRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSC 157
Query: 119 YQ-CNCNSCFNNIEHEDDR 136
C C +C N + E ++
Sbjct: 158 GSGCKCVNCENYVLDEVEK 176
>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
Length = 754
Score = 43.5 bits (101), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 37/67 (55%), Gaps = 7/67 (10%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFCYQ--CNCNSCFNNIEHEDDRHHAI----RQCLERNP 148
K C+C+K+ CLKLYC CFAN C + C C+ C N E E AI R L +NP
Sbjct: 232 KGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDC-KNCEAESTEDGAILKARRSILRKNP 290
Query: 149 NAFRPKI 155
AF+ K
Sbjct: 291 KAFKSKF 297
>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
Length = 244
Score = 43.5 bits (101), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 26/110 (23%)
Query: 86 LEPNGIRPRKP---CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIR 141
+EP+ + KP C C+KS C + YC+CFA G+ C C C +C N
Sbjct: 83 IEPSEVMSPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQN------------- 129
Query: 142 QCLERNPNAFRPKIGKCLVGEG-----ERRHTKGCNCKRSGCLKNYCECY 186
N KI K LV + R+ KGC C +S C N+C C+
Sbjct: 130 ----MQMNKKLGKIQKRLVKQKRIKQRTPRYLKGCTCGKSKCQNNFCSCH 175
>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
Length = 261
Score = 43.5 bits (101), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 36/61 (59%), Gaps = 5/61 (8%)
Query: 88 PNGIRPRKPCNCTKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLER 146
P +R C C KSMCLK YCDCFANG C QC C C N+ E+ ++R AI + +
Sbjct: 85 PKSLR----CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHNSEEYMEEREEAISKLKIQ 140
Query: 147 N 147
N
Sbjct: 141 N 141
>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
Length = 677
Score = 42.7 bits (99), Expect = 0.063, Method: Composition-based stats.
Identities = 15/17 (88%), Positives = 17/17 (100%)
Query: 92 RPRKPCNCTKSMCLKLY 108
RPRKPCNCTKS+CLKL+
Sbjct: 520 RPRKPCNCTKSLCLKLW 536
>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
Length = 192
Score = 42.4 bits (98), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 50/108 (46%), Gaps = 35/108 (32%)
Query: 86 LEPNGIRPRK------PCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHH 138
++ N ++P++ C C KS C K YC+CFA G C CNC C N+
Sbjct: 57 IQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVGRSCSIDCNCLGCNNS--------- 107
Query: 139 AIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
C + KI K G GCNCK++GC+K YCECY
Sbjct: 108 ---NCSQ--------KIKKSQCG--------GCNCKKTGCVKKYCECY 136
Score = 35.8 bits (81), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 20/111 (18%)
Query: 38 ERNPNAFRPK----IGKCLVGEGE-RRHTKGCNCKRSGCLKNYCECYEAGAR-TLEPNGI 91
+ +PN K I K +V E + C C +S C K+YCEC+ G +++ N +
Sbjct: 43 DESPNILSAKKPFPIQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVGRSCSIDCNCL 102
Query: 92 ---------RPRKP----CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFN 128
+ +K CNC K+ C+K YC+C+ + C + CNC C N
Sbjct: 103 GCNNSNCSQKIKKSQCGGCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCN 153
>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
Length = 262
Score = 42.0 bits (97), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 54/137 (39%), Gaps = 21/137 (15%)
Query: 18 SCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTK-GCNCKRSGCLKNYC 76
S N + I + + +PK L + + + K GC+C + C YC
Sbjct: 84 SLVNQMYESQVFSQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTKRKCSSKYC 143
Query: 77 ECYEAG-------------------ARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEF 117
C + G ++ ++ + +K C C K+ CLK YCDCFA G
Sbjct: 144 FCAKVGQSCSNLCKCVDCSNNKSFMSQQIQVTNAQEQKGCKCKKNFCLKGYCDCFAKGMQ 203
Query: 118 CYQ-CNCNSCFNNIEHE 133
C C C SC N + E
Sbjct: 204 CSSNCKCISCKNMLTIE 220
>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 42.0 bits (97), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
KPC+C+K+ CL+LYC CF + C +C C CFN+ H+ + A+++ +
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKVEMVKALQKVKLK-----EY 199
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
++ + + + R GC CK++ C K YCEC+
Sbjct: 200 RVSQSDLDSFDTRQVLGCKCKKTHCKKGYCECF 232
>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
Length = 372
Score = 41.2 bits (95), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 1 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 59
YC C+A+G C + C C C N + +I + L+
Sbjct: 69 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 128
Query: 60 HTKGCNCKRSGCLKNYCECYEAGA 83
+ C CK+SGCLK YC+CY+ GA
Sbjct: 129 IRRKCTCKKSGCLKKYCDCYQGGA 152
Score = 37.0 bits (84), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
Query: 108 YCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERR 166
YC C+A+G C + C C C N + +I + L+
Sbjct: 69 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTAPVLPLKPVQTLEAGQEIVEQLIRSPMDL 128
Query: 167 HTKGCNCKRSGCLKNYCECY 186
+ C CK+SGCLK YC+CY
Sbjct: 129 IRRKCTCKKSGCLKKYCDCY 148
>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 40.8 bits (94), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 13/102 (12%)
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCNSCFNNIEHEDDRHHAIRQCL 144
+E ++P+ C C KS C + YC+CFA G+ C + C C +C N +Q
Sbjct: 86 IEVKDLQPQIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQN------------KQMN 133
Query: 145 ERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
++ + I + + + R+ KGC C +S C +C C+
Sbjct: 134 KKLEKIQKKLIKQKRIKQRTSRYLKGCTCGKSKCQNKFCSCH 175
>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
Length = 250
Score = 40.8 bits (94), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 24/30 (80%)
Query: 54 GEGERRHTKGCNCKRSGCLKNYCECYEAGA 83
G+ ++ KGC+CKRS C+KNYC+CY++ A
Sbjct: 16 GQAQQPTIKGCSCKRSQCIKNYCDCYQSMA 45
Score = 38.9 bits (89), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 161 GEGERRHTKGCNCKRSGCLKNYCECY 186
G+ ++ KGC+CKRS C+KNYC+CY
Sbjct: 16 GQAQQPTIKGCSCKRSQCIKNYCDCY 41
>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
Length = 426
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 16/23 (69%), Positives = 21/23 (91%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 134 KHNKGCHCKKSGCLKKYCECFQA 156
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 39/93 (41%), Gaps = 18/93 (19%)
Query: 94 RKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRP 153
+K CNC S CLKL Q N I+ + +E NA
Sbjct: 80 KKHCNCKNSQCLKL------------QEAITVRSNGIQKPS------KLKIENGSNALNL 121
Query: 154 KIGKCLVGEGERRHTKGCNCKRSGCLKNYCECY 186
+ +H KGC+CK+SGCLK YCEC+
Sbjct: 122 RKDDAGAPASLPKHNKGCHCKKSGCLKKYCECF 154
>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
Length = 154
Score = 40.4 bits (93), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 16/23 (69%), Positives = 21/23 (91%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+CK+SGCLK YCEC++A
Sbjct: 11 KHNKGCHCKKSGCLKKYCECFQA 33
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/21 (71%), Positives = 19/21 (90%)
Query: 166 RHTKGCNCKRSGCLKNYCECY 186
+H KGC+CK+SGCLK YCEC+
Sbjct: 11 KHNKGCHCKKSGCLKKYCECF 31
>gi|357450559|ref|XP_003595556.1| Pentatricopeptide repeat protein [Medicago truncatula]
gi|355484604|gb|AES65807.1| Pentatricopeptide repeat protein [Medicago truncatula]
Length = 1040
Score = 39.3 bits (90), Expect = 0.71, Method: Composition-based stats.
Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 7/91 (7%)
Query: 66 CKRSGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQ-CNCN 124
C+ + K C +A R+ E G + C C S YCD FA G +C + C C
Sbjct: 750 CQTAPKRKRKCLTLQAQVRSTEERG--SCRNCKCKISN----YCDAFATGVYCTEPCLCL 803
Query: 125 SCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C N ++D R+ N + F PK+
Sbjct: 804 ECLNKPINDDTVLQTRRKIESCNSHVFFPKV 834
>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
Length = 514
Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 100 TKSMCLKLYCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIG 156
T L YCDC A G +C C C++C N E+E IR+ +E R+P AF P+I
Sbjct: 239 TLVFVLSSYCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAFAPRIA 296
Score = 36.2 bits (82), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 1 YCDCFANGEFCY-QCNCNSCFNNIEHEDDRHHAIRQCLE-RNPNAFRPKIG 49
YCDC A G +C C C++C N E+E IR+ +E R+P AF P+I
Sbjct: 247 YCDCLAAGVYCTDSCACSNCLNKSENE-GVVQIIREKIESRDPLAFAPRIA 296
>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
Length = 459
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 62 KGCNCKRSGCLKNYCECYEAGAR 84
KGCNCK++GCLK YCEC++ R
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKR 218
Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 14/18 (77%), Positives = 17/18 (94%)
Query: 169 KGCNCKRSGCLKNYCECY 186
KGCNCK++GCLK YCEC+
Sbjct: 196 KGCNCKKTGCLKRYCECF 213
>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
Length = 240
Score = 38.1 bits (87), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 15/23 (65%), Positives = 20/23 (86%)
Query: 61 TKGCNCKRSGCLKNYCECYEAGA 83
KGC+CKRS C+KNYC+CY++ A
Sbjct: 22 VKGCSCKRSQCIKNYCDCYQSMA 44
Score = 35.8 bits (81), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 14/19 (73%), Positives = 17/19 (89%)
Query: 168 TKGCNCKRSGCLKNYCECY 186
KGC+CKRS C+KNYC+CY
Sbjct: 22 VKGCSCKRSQCIKNYCDCY 40
>gi|345314646|ref|XP_001520178.2| PREDICTED: protein lin-54 homolog, partial [Ornithorhynchus
anatinus]
Length = 145
Score = 37.4 bits (85), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/16 (93%), Positives = 16/16 (100%)
Query: 92 RPRKPCNCTKSMCLKL 107
RPRKPCNCTKS+CLKL
Sbjct: 130 RPRKPCNCTKSLCLKL 145
>gi|330945164|ref|XP_003306504.1| hypothetical protein PTT_19658 [Pyrenophora teres f. teres 0-1]
gi|311315956|gb|EFQ85390.1| hypothetical protein PTT_19658 [Pyrenophora teres f. teres 0-1]
Length = 1707
Score = 37.0 bits (84), Expect = 3.8, Method: Composition-based stats.
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 14/142 (9%)
Query: 26 EDDRHHAIRQCLERNPNAFRPKIGKCLVGEGERRHTKGCNCKRSGCLKNYCECYEAGART 85
E +RH +RQCLER+ ++ KC E + H K + ++N E +
Sbjct: 888 EMERHKKLRQCLERDGARYK----KCF--ENAKAHLKKMAAEHRSKMQNAAEMRTCLVKD 941
Query: 86 LEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFCYQCNCNSCFNNIEHEDDRHHAIRQCLE 145
E I K C+ + CLK CD ++ E C N +E D+ +++ L
Sbjct: 942 CE-GKINELKRCSEAEVSCLK--CD-YSEKE----TECKELSNKLEASADQLSQMKKRLA 993
Query: 146 RNPNAFRPKIGKCLVGEGERRH 167
++ + LVGE + +H
Sbjct: 994 GLQSSLEDMRKQQLVGETKIKH 1015
>gi|395855486|ref|XP_003800190.1| PREDICTED: usherin [Otolemur garnettii]
Length = 5202
Score = 36.6 bits (83), Expect = 4.8, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 55/132 (41%), Gaps = 22/132 (16%)
Query: 64 CNCKR--SGCLKNYCECYEAGARTLEPNGIRPRKPCNCTKSMCLKLYCDCFANGEFC--- 118
CNCKR SG N C+ + L+P+G RP CNC S + C N C
Sbjct: 662 CNCKRHVSGRQCNQCQNGFYNLQELDPDGCRP---CNCNTSGTVDGDITCHQNSGQCKCK 718
Query: 119 -----YQCN-CNSCFNNIEHEDDRHHAIRQC-LERNPNAF-RPKIGKCLVGEGERRHTKG 170
+C+ CN F ++ +D QC L + N F P G+C R+ KG
Sbjct: 719 ANVIGLRCDHCNFGFKFLQSFNDDGCEPCQCNLHGSVNKFCNPLSGQCEC----RKEAKG 774
Query: 171 CNCKRSGCLKNY 182
C C +N+
Sbjct: 775 LQCDT--CRENF 784
>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
Length = 296
Score = 36.6 bits (83), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 13/20 (65%), Positives = 18/20 (90%)
Query: 62 KGCNCKRSGCLKNYCECYEA 81
KGC CKR+ C+KNYC+CY++
Sbjct: 38 KGCACKRTSCIKNYCDCYQS 57
>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
Length = 296
Score = 36.6 bits (83), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 13/20 (65%), Positives = 18/20 (90%)
Query: 62 KGCNCKRSGCLKNYCECYEA 81
KGC CKR+ C+KNYC+CY++
Sbjct: 38 KGCACKRTSCIKNYCDCYQS 57
>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
Length = 140
Score = 35.8 bits (81), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 14/23 (60%), Positives = 20/23 (86%)
Query: 59 RHTKGCNCKRSGCLKNYCECYEA 81
+H KGC+C++S CLK YCEC++A
Sbjct: 23 KHNKGCHCRKSECLKKYCECFQA 45
>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
Length = 225
Score = 35.8 bits (81), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 23/29 (79%)
Query: 55 EGERRHTKGCNCKRSGCLKNYCECYEAGA 83
+ +++ KGC CKRS C+KNYC+CY++ A
Sbjct: 18 QPKQQAPKGCCCKRSQCIKNYCDCYQSMA 46
>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 957
Score = 35.4 bits (80), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 12/69 (17%)
Query: 95 KPCNCTKSMCLKLYCDCFANGEFC--YQCNCNSCFNN---IEHEDDRHHAI-------RQ 142
K C C K+ CLKLYC CF++ C C+C C N + + D A+ +
Sbjct: 316 KGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARDGDNGMAMSAVVTARKS 375
Query: 143 CLERNPNAF 151
LERNP AF
Sbjct: 376 ILERNPRAF 384
>gi|145529912|ref|XP_001450739.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418361|emb|CAK83342.1| unnamed protein product [Paramecium tetraurelia]
Length = 226
Score = 35.4 bits (80), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 37/90 (41%), Gaps = 26/90 (28%)
Query: 97 CNCTKSMCLKLYCDCFANGEFC-YQCNCNSCFNNIEHEDDRHHAIRQCLERNPNAFRPKI 155
C C KS C+K YC C AN + C ++C C +C N K
Sbjct: 129 CRCRKSKCIKNYCVCSANNQECTFKCECYNCSN-------------------------KQ 163
Query: 156 GKCLVGEGERRHTKGCNCKRSGCLKNYCEC 185
K V + GCNC+R C K YCEC
Sbjct: 164 PKSAVSKQSGSEYLGCNCRRQDCSKGYCEC 193
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.142 0.516
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,176,436,644
Number of Sequences: 23463169
Number of extensions: 132990450
Number of successful extensions: 318072
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 541
Number of HSP's successfully gapped in prelim test: 618
Number of HSP's that attempted gapping in prelim test: 311890
Number of HSP's gapped (non-prelim): 4654
length of query: 186
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 52
effective length of database: 9,215,130,721
effective search space: 479186797492
effective search space used: 479186797492
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 72 (32.3 bits)