BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007469
(602 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
Length = 592
Score = 848 bits (2190), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/612 (71%), Positives = 487/612 (79%), Gaps = 30/612 (4%)
Query: 1 MGEGEGGEFPPKKSQSESGG------DIPAKKLARQLDFT--GGFVGSVILPEHPQSQAV 52
M +GEGG+FPPKK QS++ G DIP KKLARQLDFT GG +V+LPEHPQSQ
Sbjct: 1 MEQGEGGDFPPKKVQSDTAGTAAVTSDIPLKKLARQLDFTAFGGASAAVVLPEHPQSQPQ 60
Query: 53 TANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK 112
HP V PVPP Q T P VRV KPESP+SR R N++VK+ TPKK
Sbjct: 61 AQAQHP--------VTMPVPP-------QTTHPSVRVVKPESPRSRPRPNIDVKDGTPKK 105
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE TLERNPNAFRP
Sbjct: 106 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 165
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KIASSPHG RD++EE+GE L+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCK
Sbjct: 166 KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 225
Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
NFEGSEERQALFHGDHAN+MAYIQQAANAAITGAIGSSG+ SPPVSKKRKGQELF+G T+
Sbjct: 226 NFEGSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTS 285
Query: 293 KDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDL 351
KDPS+HRL FQQ + + +S+LSS PV+R ++AA G+SKFTYRSLLADIIQPQDL
Sbjct: 286 KDPSLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDL 345
Query: 352 KELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDD 411
KELCSVLVV+S EAA++ A +RN EK AE++ TS ASS Q RLQS KE D+++ V DD
Sbjct: 346 KELCSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADD 405
Query: 412 CSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT 471
CSS NQADK+GPDDS SDG D K RPMSPGTLALMCDEQDT+FMAA+SPN L GH CNT
Sbjct: 406 CSSRNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNT 465
Query: 472 SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSN 531
SSQLPYGQG+TE YAEQERI+LTKFRDCLNRLITFGEIKE K SSLARTE G Q + SN
Sbjct: 466 SSQLPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSN 525
Query: 532 GTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE 591
G A++RT TGNQ VSNGVSK P PA KMSQ VTA NN L P+ E
Sbjct: 526 GPANARTDTGNQQVPVSNGVSKPPFP-----PATKMSQTVTAAAASPNNTLLSKPSTHTE 580
Query: 592 -GDVKLKTENKM 602
GD+ LKTE ++
Sbjct: 581 NGDINLKTEKEL 592
>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
Length = 601
Score = 828 bits (2138), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/614 (70%), Positives = 473/614 (77%), Gaps = 29/614 (4%)
Query: 1 MGEGEGGEFPPKK---SQSESGGDIPAKKLARQLDFT-----GGFVGSVILPEHPQSQ-- 50
MGEGEG EFPPKK SQSES D P KKLARQLDFT +V+LPEHPQSQ
Sbjct: 1 MGEGEGNEFPPKKASSSQSESL-DFPTKKLARQLDFTTQGNNNNNNNNVVLPEHPQSQQQ 59
Query: 51 --AVTANSHPP---PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEV 105
A A P PQ QP+ V P P PP QPPVR KPESPKS+ R N E+
Sbjct: 60 PPATVALPQEPVVAPQ-QPQAVAIPEAPLLPP------QPPVRAIKPESPKSKPRPNAEL 112
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLER 165
K+ TPK+QKQCNCKHSRCLKLYCECFASG YCDGCNCVNC+NNVENEAARREAVEATLER
Sbjct: 113 KDGTPKRQKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLER 172
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
NPNAFRPKIASSPHGTRD++EE GE L+LGKHNKGCHCKKSGCLKKYCECFQANILCSEN
Sbjct: 173 NPNAFRPKIASSPHGTRDSREENGEGLILGKHNKGCHCKKSGCLKKYCECFQANILCSEN 232
Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
C+CMDCKNFEGSEERQALFHGDH NNMAYIQQAANAAITGAIGSSGY SPP+SKKRKGQE
Sbjct: 233 CKCMDCKNFEGSEERQALFHGDHTNNMAYIQQAANAAITGAIGSSGYVSPPISKKRKGQE 292
Query: 286 LFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSI-PVARASSTAAVGSSKFTYRSLLAD 344
L FG T KDPS HRLGHFQQ HIR S SS+ S P+ARA S+A +G SKFTYRSLLAD
Sbjct: 293 LLFGPTTKDPSFHRLGHFQQAVHIRPSTSSSSLSSNPIARAGSSATLGPSKFTYRSLLAD 352
Query: 345 IIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDV 404
IIQPQDLKELCSVLVVLS EAAK++A QRN TE ED+ S+ASS Q RLQS K
Sbjct: 353 IIQPQDLKELCSVLVVLSGEAAKTLAGQRNATENWVEDQTEPSLASSPQERLQSQKGAGA 412
Query: 405 DRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRL 464
++ DCSSANQADK G S SDG D K RPMSPGTLALMCDE+DT+FM A+SPN L
Sbjct: 413 GNSIAHDCSSANQADKTGHGSSSSDGVDVPKGRPMSPGTLALMCDEEDTMFMTAASPNGL 472
Query: 465 TGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGS 524
TG GC+T+SQ P GQGMTE+Y EQERIVLTKFRDCLNRLITFGEIKE K SS ARTE+G+
Sbjct: 473 TGRGCSTTSQFPCGQGMTEIYTEQERIVLTKFRDCLNRLITFGEIKETKCSSFARTELGN 532
Query: 525 QKDALSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRH 584
KD +SNGT ++R GNQ SNGV + P P +K SQMV +TV S+NDL
Sbjct: 533 PKDPVSNGTTNARAEVGNQQVPYSNGVGRHVIP-----PTSKTSQMVGSTVATSSNDLSR 587
Query: 585 NPALPGEGDVKLKT 598
P LP GD K KT
Sbjct: 588 IPTLPQNGDNKPKT 601
>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
Length = 524
Score = 792 bits (2046), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/528 (75%), Positives = 440/528 (83%), Gaps = 7/528 (1%)
Query: 77 PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
P+P Q T P VRV KPESP+SR R N++VK+ TPKKQKQCNCKHSRCLKLYCECFASGIY
Sbjct: 2 PVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCECFASGIY 61
Query: 137 CDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGK 196
CDGCNCVNCHNNVENEAARREAVE TLERNPNAFRPKIASSPHG RD++EE+GE L+LGK
Sbjct: 62 CDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGEALVLGK 121
Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQ 256
HNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGDHAN+MAYIQ
Sbjct: 122 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQ 181
Query: 257 QAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSS 315
QAANAAITGAIGSSG+ SPPVSKKRKGQELF+G T+KDPS+HRL FQQ + + +S
Sbjct: 182 QAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTS 241
Query: 316 TLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNT 375
+LSS PV+R ++AA G+SKFTYRSLLADIIQPQDLKELCSVLVV+S EAA++ A +RN
Sbjct: 242 SLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTFADKRNL 301
Query: 376 TEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK 435
EK AE++ TS ASS Q RLQS KE D+++ V DDCSS NQADK+GPDDS SDG D K
Sbjct: 302 VEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADDCSSRNQADKLGPDDSSSDGGDVPK 361
Query: 436 ARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTK 495
RPMSPGTLALMCDEQDT+FMAA+SPN L GH CNTSSQLPYGQG+TE YAEQERI+LTK
Sbjct: 362 GRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNTSSQLPYGQGITEAYAEQERIILTK 421
Query: 496 FRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSKAY 555
FRDCLNRLITFGEIKE K SSLARTE G Q + SNG A++RT TGNQ VSNGVSK
Sbjct: 422 FRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSNGPANARTDTGNQQVPVSNGVSKPP 481
Query: 556 GPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE-GDVKLKTENKM 602
P PA KMSQ VTA NN L P+ E GD+ LKTE ++
Sbjct: 482 FP-----PATKMSQTVTAAAASPNNTLLSKPSTHTENGDINLKTEKEL 524
>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
Length = 559
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/532 (71%), Positives = 418/532 (78%), Gaps = 29/532 (5%)
Query: 23 PAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA 82
PAKKLARQLDFTG PEHPQ S PP P LP Q
Sbjct: 24 PAKKLARQLDFTGA-------PEHPQL------SQPPQLPVAV------------LPLQP 58
Query: 83 TQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNC 142
P RV KPESPKSRSR N E+K+ TPKKQKQCNCKHS+CLKLYCECFASGIYCDGCNC
Sbjct: 59 QAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCNCKHSKCLKLYCECFASGIYCDGCNC 118
Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
VNC NNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD++EE GEVL+LGKHNKGCH
Sbjct: 119 VNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREEAGEVLILGKHNKGCH 178
Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAA 262
CKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD NNMAYIQQAANAA
Sbjct: 179 CKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNMAYIQQAANAA 238
Query: 263 ITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIP 321
ITGAIGSSGY+SPPVSKKRKGQELFF T KDPSI +LG QQ NH+R +PSS+LS +
Sbjct: 239 ITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSISKLG--QQVNHVRGPAPSSSLSPVS 296
Query: 322 VARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE 381
AR TA +G SKF YRSLLADIIQPQ LKELCSVLV++S +AAK++ Q+ EKQAE
Sbjct: 297 GARV-GTATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAE 355
Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSP 441
D+ TS+ASS+Q +L S KE V++TV DDCSSANQ DK+ PD+S SDG+D K RPMSP
Sbjct: 356 DQTETSLASSSQEQLPSQKEGRVEKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSP 415
Query: 442 GTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLN 501
GTLALMCDEQDT+FM A+SP H CNTSSQ PYGQ MTEVYAEQERIVLTKFRD LN
Sbjct: 416 GTLALMCDEQDTMFMTAASPIAPMAHACNTSSQFPYGQEMTEVYAEQERIVLTKFRDFLN 475
Query: 502 RLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
R+IT GEI E K SSLAR E+ SQKD + NG ++ + Q T SNGV+K
Sbjct: 476 RVITMGEINETKCSSLARNELESQKDPIINGIGNTSSEIVLQHGTTSNGVTK 527
>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
Length = 559
Score = 743 bits (1918), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/608 (64%), Positives = 446/608 (73%), Gaps = 59/608 (9%)
Query: 1 MGEGEGGEFPPKKSQSES--------GGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAV 52
MGEGEGG +K+ S S ++PAKKLARQLDFTG PEHP
Sbjct: 1 MGEGEGGGDCVQKNASLSEVVAPSAASLEVPAKKLARQLDFTGA-------PEHP----- 48
Query: 53 TANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK 112
P +P PLP QA PP RV KPESPKSRSR N E+K+ TPKK
Sbjct: 49 ----------------PQLPVAVLPLPLQA--PPARVGKPESPKSRSRPNFEMKDATPKK 90
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
QKQCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNVENEAARREAVEATLERNPNAFRP
Sbjct: 91 QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRP 150
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KIASSPHGTRD++E+ GEVL+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCK
Sbjct: 151 KIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 210
Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
NFEGSEERQALFHGD NNMAYIQQAANAAITGAIGSSGY+SPPVSKKRKGQELFF T
Sbjct: 211 NFEGSEERQALFHGDQ-NNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTT 269
Query: 293 KDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDL 351
KDPSI +LG QQ NH+R +PSS+LS + AR TA +G SKF YRSLLADIIQPQ L
Sbjct: 270 KDPSISKLG--QQVNHVRGPAPSSSLSPVSGARV-GTATLGPSKFMYRSLLADIIQPQHL 326
Query: 352 KELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDD 411
KELCSVLV++S +AAK++ Q+ EKQAED+ TS+ASS+Q +L S KE +++TV DD
Sbjct: 327 KELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETSLASSSQEQLPSQKEGHIEKTVADD 386
Query: 412 CSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT 471
CSSANQ DK+ PD+S SDG+D K RPMSPGTLALMCDEQDT+FM +SP H CNT
Sbjct: 387 CSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALMCDEQDTMFMTTASPIAPMAHACNT 446
Query: 472 SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSN 531
SSQ PYGQGMTEVY EQERIVLTKFRD LNR+IT GEI E K SSL R E+ SQKD + N
Sbjct: 447 SSQFPYGQGMTEVYPEQERIVLTKFRDFLNRVITMGEINETKCSSLTRNELESQKDPIIN 506
Query: 532 GTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE 591
G ++ + Q SNGV+K P +VT + + ++ N GE
Sbjct: 507 GIGNASSEIVLQQGATSNGVTKPGSP------------LVTTSTSMIPGNIAEN----GE 550
Query: 592 GDVKLKTE 599
VK++ E
Sbjct: 551 TKVKIEKE 558
>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
Length = 519
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/515 (74%), Positives = 414/515 (80%), Gaps = 12/515 (2%)
Query: 1 MGEGEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPE-HPQSQAVT---ANS 56
MGE EG EFPPKK Q D PAKKLARQLDFT G +LPE P VT A
Sbjct: 1 MGESEGSEFPPKKLQQSETADFPAKKLARQLDFTQG-----VLPELQPSPVVVTPASAQC 55
Query: 57 HPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC 116
P PQP P+P V P P QP R KPESPKSR R N E+K+ TPKKQKQC
Sbjct: 56 QPQPQPHPQPQPQVVSIPVAPQQQTQQQPAARAVKPESPKSRPRLNAELKDGTPKKQKQC 115
Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
NCKHSRCLKLYCECFASG YCDGCNCVNC+NNVENEAARREAVEATLERNPNAFRPKIAS
Sbjct: 116 NCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIAS 175
Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
SPHGTRD++EETGE L+ KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEG
Sbjct: 176 SPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEG 235
Query: 237 SEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS 296
SEERQALFHGDH NNMAYIQQAANAAITGAIGSSGYAS PVS+KRKGQELFFG T KDPS
Sbjct: 236 SEERQALFHGDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPS 295
Query: 297 IHRLGHFQQGNHIR-TSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELC 355
HRLGHFQQ +HIR +PSS L S PVARA +T +GSSK TYRSLLADIIQ QDLKELC
Sbjct: 296 FHRLGHFQQASHIRPAAPSSPLPSNPVARAGNTTTLGSSKITYRSLLADIIQSQDLKELC 355
Query: 356 SVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSA 415
SVLVVLS EAAK+++ QRN+ +K+ ED+ T +ASS Q RLQS KE D D+ V++DCSSA
Sbjct: 356 SVLVVLSGEAAKTLSDQRNSMKKRVEDQRDTFLASSTQERLQSQKEIDADKIVSNDCSSA 415
Query: 416 NQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQL 475
N G DDS DG D K RPMSPGT+ALMCDEQDTIFMAA+SPN L GHGC++SSQL
Sbjct: 416 NHEG--GLDDSSLDGVDMPKGRPMSPGTMALMCDEQDTIFMAAASPNGLMGHGCDSSSQL 473
Query: 476 PYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIK 510
P GQGMTE +AEQERIVL KFRDCLNRLITFGEIK
Sbjct: 474 PCGQGMTEAHAEQERIVLMKFRDCLNRLITFGEIK 508
>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
Length = 545
Score = 726 bits (1874), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/546 (69%), Positives = 416/546 (76%), Gaps = 49/546 (8%)
Query: 1 MGEGEGGEFPPKKSQSES-GGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP 59
MGE EG EFPPKK Q +S KKLARQLDFT G +LP+HPQSQ P
Sbjct: 1 MGESEGSEFPPKKQQQQSENAYFQTKKLARQLDFTQG-----VLPDHPQSQ-------PS 48
Query: 60 PQPQPKPVVPPVPPPPPPLPGQATQ--------------------------------PPV 87
P +P +V P P P +Q PP+
Sbjct: 49 PLQKPPLLVSPAAAQSQPQPHPQSQLQLQPQVAEIQVVPQQQTQQQQQPARAVLRLLPPM 108
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHN 147
KPESPKS N E+K+ TPKKQ+QCNCKHSRCLKLYCECFASG YCDGCNCVNC+N
Sbjct: 109 -FRKPESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYN 167
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
NVENEAARREAVEATLERNPNAFRPKIASSPHGTRD++EETG+ L+ KHNKGCHCKKSG
Sbjct: 168 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSG 227
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAI 267
CLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGDH NNMAYIQQAANAAITGAI
Sbjct: 228 CLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAI 287
Query: 268 GSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ--GNHIR-TSPSSTLSSIPVAR 324
GSSGYASPPVS+KRKGQELFFG T KD S RLGHFQQ G+H R +PSS+L S P+AR
Sbjct: 288 GSSGYASPPVSRKRKGQELFFGHTVKDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIAR 347
Query: 325 ASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRA 384
A + +G SK TYRSLLADIIQPQDLKELCSVLVVLS EAAK+ + QRN+ EK+ ED+
Sbjct: 348 AGNAITLGPSKITYRSLLADIIQPQDLKELCSVLVVLSGEAAKTFSDQRNSMEKRVEDQR 407
Query: 385 GTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTL 444
T +ASS Q RLQSHKE D D+ V +DCSSAN ADKVGPDDS SDG+D K RPMSPGTL
Sbjct: 408 ETLLASSTQERLQSHKESDADKIVANDCSSANHADKVGPDDSSSDGADMPKGRPMSPGTL 467
Query: 445 ALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLI 504
LMCDEQDT+ MAA+SP+ L GHGCNTSSQLP GQGM E +AEQERIVLTKFRDCLNRLI
Sbjct: 468 ELMCDEQDTMLMAAASPSGLMGHGCNTSSQLPCGQGMAEAHAEQERIVLTKFRDCLNRLI 527
Query: 505 TFGEIK 510
TFGEIK
Sbjct: 528 TFGEIK 533
>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
Length = 561
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/567 (66%), Positives = 422/567 (74%), Gaps = 42/567 (7%)
Query: 2 GEGEGGEFPPKKSQSESG-------GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTA 54
GE EG + PK S D+PAKKLARQLDFT F G +P
Sbjct: 4 GEAEGSDGAPKNIVSNDVVANALLPFDVPAKKLARQLDFTD-FGG---MPA--------- 50
Query: 55 NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
PPQP PV+ P+PP VRV KPESPKSR R E+KE TPKKQK
Sbjct: 51 ----PPQP---PVILPLPP----------SASVRVGKPESPKSRPRPGFEIKEATPKKQK 93
Query: 115 QCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
QCNCKHS+CLKLYCECFASGIYCDGCNCVNC+NNVENEAARREAVEATLERNPNAFRPKI
Sbjct: 94 QCNCKHSKCLKLYCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKI 153
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
ASSPHGTRD++E+ GEVL+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNF
Sbjct: 154 ASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNF 213
Query: 235 EGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
EGSEERQALFHGD NNM YIQQAANAAITGAIGSSGY+SPP+SKKRKGQEL FG T KD
Sbjct: 214 EGSEERQALFHGDQNNNMVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKD 273
Query: 295 PSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
PS+ R G QQ N++R +PSS+LS IP AR A +G SK YRSLLADIIQPQ LKE
Sbjct: 274 PSVGRQG--QQANNVRVPAPSSSLSPIPGARVGP-ATLGPSKLMYRSLLADIIQPQHLKE 330
Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT-DDC 412
LCSVLV++S +AAK +N +K AED+ TS ASS Q +L S KE +V++ + DDC
Sbjct: 331 LCSVLVLVSGQAAKMFTDHKNFMDKHAEDQTETSRASSTQEQLPSQKEANVEKAMADDDC 390
Query: 413 SSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
SSANQ D + PD+S SDG+D K RPMSPGTLALMCDEQDT+FM A+S H CNTS
Sbjct: 391 SSANQTDNISPDNSCSDGADVPKGRPMSPGTLALMCDEQDTMFMTAASTVGSRAHACNTS 450
Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
Q PYGQ MTEVYAEQERIVLTKFRD LNR+IT GEI E K SSLAR+E+ SQKD N
Sbjct: 451 LQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQKDPNINC 510
Query: 533 TASSRTGTGNQPATVSNGVSKAYGPSA 559
T ++ T T +Q +NGV+K G S+
Sbjct: 511 TGNASTETQHQQGATNNGVAKTAGNSS 537
>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
Length = 629
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/647 (56%), Positives = 431/647 (66%), Gaps = 65/647 (10%)
Query: 1 MGEGEGGEFPPKK---------------------SQSESGGDIPAKKLARQLDFTGGFVG 39
M EGEGG+ PPK + + + D+P+KKLARQLDF +
Sbjct: 1 MAEGEGGDCPPKNVVHPEFVTVTAAAATAAAAANATATAWLDVPSKKLARQLDFNAMLME 60
Query: 40 SVILPEHPQSQAVTANSHPPPQPQPKPV-VPPVPPPPPPLPGQATQPPVRVPKPESPKSR 98
PQ Q VT S KPV V +P P P VRV K ESPK R
Sbjct: 61 Q----SKPQQQVVTQGSVMVQ----KPVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPR 112
Query: 99 SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
SR N EVKE TPKKQ+QCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNV+NEAARREA
Sbjct: 113 SRPNFEVKEGTPKKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREA 172
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNK---------EETGEVLMLGKHNKGCHCKKSGCL 209
VEATLERNPNAFRPKIASSP G RD++ EE GE L+L KH+KGCHCKKSGCL
Sbjct: 173 VEATLERNPNAFRPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCL 232
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGS 269
KKYCECFQAN+LCSENCRCMDCKNFEGSEERQALF GD NN+ Y+QQAANAAITGAIGS
Sbjct: 233 KKYCECFQANVLCSENCRCMDCKNFEGSEERQALFRGDQNNNV-YLQQAANAAITGAIGS 291
Query: 270 SGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASS-- 327
G++SPP SKKRKGQELF TAKDPSI + G Q S T + + + + +
Sbjct: 292 YGFSSPPASKKRKGQELFLWPTAKDPSISKPG---QQVKFAYCTSRTGTGVNLVKGPAPS 348
Query: 328 --------------TAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQR 373
T SK YRSLL+D++QP LKELCSVLV++S +AAK++A Q+
Sbjct: 349 SSASPVSSARGTNPTLGQSPSKLKYRSLLSDVVQPHHLKELCSVLVLVSGQAAKTLADQK 408
Query: 374 NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDG 433
T EK+ ED+ TS+ASS Q +L S KE DV++ + DDCSSANQ DK+ P +S SDG+D
Sbjct: 409 KTVEKRTEDQTETSLASSTQEQLLSQKEVDVEKAMDDDCSSANQTDKISPGNSCSDGADV 468
Query: 434 QKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVL 493
K RPMSPGTLALMCDEQD++FM A+SP T H CNTSSQ P GQG+TEVYAEQERIVL
Sbjct: 469 PK-RPMSPGTLALMCDEQDSMFMTAASPIGQTTHACNTSSQFPDGQGVTEVYAEQERIVL 527
Query: 494 TKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
T+FRD LNR+IT GEI E K SSLAR+E+ ++KD ++N T ++ T T +Q SNG +K
Sbjct: 528 TQFRDFLNRVITMGEINETKCSSLARSELENKKDLINNETGNASTETVHQQEATSNGDAK 587
Query: 554 AYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGEGDVKLKTEN 600
A P A + M T + P+ +D + G+ KLK EN
Sbjct: 588 AAIPPMAATSTPAVPPMATTSTPVVPSD-----TVAENGESKLKMEN 629
>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
Length = 551
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/576 (63%), Positives = 417/576 (72%), Gaps = 43/576 (7%)
Query: 2 GEGEGGEFPPKKSQSESG-------GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTA 54
GE EG + PK + D+PAKKLARQLDFTG F G + T
Sbjct: 4 GEAEGSDGAPKNVVANDVVANALLPFDVPAKKLARQLDFTG-FGG----------MSTTP 52
Query: 55 NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
PP +P VRV KPESPKSR R E+KE TPKKQK
Sbjct: 53 QPQPPVVLPLRPSTS-----------------VRVGKPESPKSRPRPGFEIKEATPKKQK 95
Query: 115 QCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
QCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNVENEAARREAVEATLERNPNAFRPKI
Sbjct: 96 QCNCKHSKCLKLYCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKI 155
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
ASSPH TRD +E+ GE+L+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNF
Sbjct: 156 ASSPHRTRDIREDAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNF 215
Query: 235 EGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
EGSEERQALFHGD NN+ YIQQAANAAITGAIGSSGY+SPP+SKKRKGQEL FG T KD
Sbjct: 216 EGSEERQALFHGDQNNNLVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKD 275
Query: 295 PSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
PS+ R G QQ N++R +PSS+LS IP AR A +G SK YRSLLADIIQPQ LKE
Sbjct: 276 PSVGRQG--QQANNVRVPAPSSSLSPIPGARVGP-ATLGPSKLMYRSLLADIIQPQHLKE 332
Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT-DDC 412
LCSVLV++S +AAK+ Q+N +K AED+ TS ASS Q +L S KE +VD+ + DDC
Sbjct: 333 LCSVLVLVSGQAAKTFTDQKNLMDKHAEDQTETSRASSTQEQLPSQKEANVDKAMADDDC 392
Query: 413 SSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
SSANQ D + D+S SDG+D K RPMSPGTLALMCDEQDT+FM A+SP H CNTS
Sbjct: 393 SSANQTDNISLDNSCSDGADAPKGRPMSPGTLALMCDEQDTMFMTAASPVGSRAHACNTS 452
Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
Q PYGQ MTEVYAEQERIVLTKFRD LNR+IT GEI E K SSLAR+E+ SQK N
Sbjct: 453 LQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQKGPNMNC 512
Query: 533 TASSRTGTGNQPATVS---NGVSKAYGPSALAAPAA 565
+ + T T +Q A + NGV+KA +++ P +
Sbjct: 513 SGNDSTETAHQQAATNNNINGVAKAAAGNSILFPGS 548
>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
Length = 602
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 345/558 (61%), Positives = 400/558 (71%), Gaps = 31/558 (5%)
Query: 21 DIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPP---- 76
D+PAKKLARQLDFTG G +LPEHP Q S Q P V V
Sbjct: 31 DVPAKKLARQLDFTG--TGGAVLPEHPHFQ-----SRSRCQESESPAVMVVQSQSQPQSP 83
Query: 77 ------PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC---NCKHSRCLKLY 127
P+ +A PP R KP+SPKSRSRSNVE K+ TPKKQKQC NCKHSRCLKLY
Sbjct: 84 QQLVVLPIGTKAPLPPAR--KPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLY 141
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG+YCDGCNC NCHNNVE+EA+RREAVE TLERNPNAFRPKIA+SPHGTR++++E
Sbjct: 142 CECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRDE 201
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
GE++MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD
Sbjct: 202 IGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGD 261
Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
HANN+AYIQQAANAAITGAIGSSGYA PVSKKRKG EL FG KD ++ + Q N
Sbjct: 262 HANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQAN 321
Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
+I +S ST S P A S A G SKF++RSLLAD+IQP DLKELCSVLVVL+SE AK
Sbjct: 322 NIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381
Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
A QR + EKQ D TS ASS+ +Q K ++ +C S+NQ+D+ D S
Sbjct: 382 IQAEQR-SDEKQINDPQQTSRASSSGDGVQHQKAE--EKAADGECGSSNQSDRSVHDSSN 438
Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAE 487
S+ SD +ARPMSPGTLALMCDEQDT+FM A + H CNTSS +P + ++EVY E
Sbjct: 439 SNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADGSAAHDCNTSSHVP-DRSLSEVYIE 497
Query: 488 QERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATV 547
QERIVLTKFRDCLN+LIT GEIKE K +S + +EVG+Q N + + + +G Q ++
Sbjct: 498 QERIVLTKFRDCLNKLITLGEIKETKLTSRSESEVGNQ-----NQSNNFTSNSGCQQRSI 552
Query: 548 SNGVSKAYGPSALAAPAA 565
SNGV K SA P A
Sbjct: 553 SNGVVKNVALSAPRIPPA 570
>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
[Cucumis sativus]
Length = 556
Score = 613 bits (1580), Expect = e-172, Method: Compositional matrix adjust.
Identities = 326/504 (64%), Positives = 371/504 (73%), Gaps = 26/504 (5%)
Query: 21 DIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPP---- 76
D+PAKKLARQLDFTG G +LPEHP Q S Q P V V
Sbjct: 31 DVPAKKLARQLDFTG--TGGAVLPEHPHFQ-----SRSRCQESESPAVMVVQSQSQPQSP 83
Query: 77 ------PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC---NCKHSRCLKLY 127
P+ +A PP R KP+SPKSRSRSNVE K+ TPKKQKQC NCKHSRCLKLY
Sbjct: 84 QQLVVLPIGTKAPLPPAR--KPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLY 141
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG+YCDGCNC NCHNNVE+EA+RREAVE TLERNPNAFRPK A+SPHGTR++++E
Sbjct: 142 CECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRDE 201
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
GE++MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD
Sbjct: 202 IGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGD 261
Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
HANN+AYIQQAANAAITGAIGSSGYA PVSKKRKG EL FG KD ++ + Q N
Sbjct: 262 HANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQAN 321
Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
+I +S ST S P A S A G SKF++RSLLAD+IQP DLKELCSVLVVL+SE AK
Sbjct: 322 NIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381
Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
A QR + EKQ D TS ASS+ +Q K ++ +C S+NQ+D+ D S
Sbjct: 382 IQAEQR-SDEKQINDPQQTSRASSSGDGVQHQKAE--EKAADGECGSSNQSDRSVHDSSN 438
Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAE 487
S+ SD +ARPMSPGTLALMCDEQDT+FM A + H CNTSS +P + ++EVY E
Sbjct: 439 SNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADGSAAHDCNTSSHVP-DRSLSEVYIE 497
Query: 488 QERIVLTKFRDCLNRLITFGEIKE 511
QERIVLTKFRDCLN+LIT GEIKE
Sbjct: 498 QERIVLTKFRDCLNKLITLGEIKE 521
>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 339/579 (58%), Positives = 397/579 (68%), Gaps = 46/579 (7%)
Query: 1 MGE-GEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP 59
MGE G GGEFPPKK E G P KK ARQLDFTGG + P + V A S P
Sbjct: 1 MGEDGGGGEFPPKKDGVEEG--FPTKKPARQLDFTGGSDEHSL--SKPAAPTVVATSVKP 56
Query: 60 PQPQPKP----------------VVPPVPPPP----PPLPGQATQPPVRVPKPESPKSRS 99
P V P +P PP Q P+R P PESPK+R
Sbjct: 57 IISSSVPSTIRPGMTIAIGQVTQVRPTLPMATTMSNPPSQSQIVNAPIRHPIPESPKARG 116
Query: 100 -RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
R NVE ++ TP+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNV+NE ARREA
Sbjct: 117 PRPNVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREA 176
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
VEATLERNP AFRPKIASSPHG RD +E+ GEV++LGKHNKGCHCKKSGCLKKYCECFQA
Sbjct: 177 VEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 236
Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVS 278
NILCSENC+C+DCKNFEGSEERQALFHG+H+N+MAY+QQAANAAITGA+GSSG+A P
Sbjct: 237 NILCSENCKCLDCKNFEGSEERQALFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAP 296
Query: 279 KKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFT 337
K+RKGQE+ F KD S RL HF Q N+ RT P+S S PV+RA A+ SKF
Sbjct: 297 KRRKGQEILFNQAIKDSS--RLSHFPQVNNGRTGGPTSGTSPSPVSRAGGNASSVPSKFV 354
Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQ 397
YRSLLADIIQP D++ LCSVLV ++ EAAK+ +RN E + +D+ TS+ASSAQ + Q
Sbjct: 355 YRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIENRVDDQTETSLASSAQDQPQ 414
Query: 398 -SHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM 456
++ DV+ TD NQADK GP++S SDG D K P+SP TLALMCDEQDTIFM
Sbjct: 415 GNNNAADVEMVATDH----NQADKSGPEESNSDGVDASKVTPLSPATLALMCDEQDTIFM 470
Query: 457 -AASSPN-RLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKY 514
AA SPN + +GC P QG +E+YAEQER+VLTKFRDCLNRLI++ EIKE K
Sbjct: 471 VAAPSPNGSVDPNGCR-----PNSQGQSEIYAEQERLVLTKFRDCLNRLISYAEIKESKC 525
Query: 515 SSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
SLAR + Q A++ T + G Q + NG S+
Sbjct: 526 LSLARMHI--QPPAIA--TVKTENGIQQQ-VPIVNGASR 559
>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
lyrata]
Length = 601
Score = 595 bits (1534), Expect = e-167, Method: Compositional matrix adjust.
Identities = 334/583 (57%), Positives = 395/583 (67%), Gaps = 38/583 (6%)
Query: 1 MGE-GEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQ-------AV 52
MGE G GGEFPPKK E G P KK ARQLDFTGG + + V
Sbjct: 1 MGEDGGGGEFPPKKDGVEEG--FPTKKPARQLDFTGGSDEQSLSKAAAPTVVATAVKPVV 58
Query: 53 TANSHPPPQP-------QPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS-RSNVE 104
T++ +P Q +P +P PP Q P+R PKPESPK+R R VE
Sbjct: 59 TSSIPSTIRPGVTIAIGQVRPTLPMATTSNPPSQSQILNAPIRHPKPESPKARGPRPIVE 118
Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLE 164
++ TP+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNV+NE ARREAVEATLE
Sbjct: 119 GRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLE 178
Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
RNP AFRPKIASSPHG RD +E+ GEV++LGKHNKGCHCKKSGCLKKYCECFQANILCSE
Sbjct: 179 RNPFAFRPKIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSE 238
Query: 225 NCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQ 284
NC+C+DCKNFEGSEERQALFHG+HAN+MAY+QQAANAAITGA+GSSG+A P K+RKGQ
Sbjct: 239 NCKCLDCKNFEGSEERQALFHGEHANHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQ 298
Query: 285 ELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLA 343
E+ F KD S RLG F Q N R P+S S PV+RA A+ SKF YRSLLA
Sbjct: 299 EILFNQATKDSS--RLGQFPQVNSGRASGPTSGSSPSPVSRAGGNASSAPSKFVYRSLLA 356
Query: 344 DIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKE-P 402
DIIQP D++ LCSVLV ++ EAAK+ +RN E + ED+ TS+ASSAQ + Q +
Sbjct: 357 DIIQPHDVRALCSVLVAVAGEAAKTSTDKRNEIENRVEDQTETSLASSAQDQPQGDNDAA 416
Query: 403 DVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM-AASSP 461
D++ TD NQADK G ++S SDG+D K P+SP TLALMCDEQDTIFM AA SP
Sbjct: 417 DMEMVATD----GNQADKSGAEESNSDGADASKGNPLSPATLALMCDEQDTIFMVAAPSP 472
Query: 462 NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTE 521
N G ++ QG +E+YAEQER+VLTKFRDCL+RLI++ EIKE K SLAR
Sbjct: 473 NGAVDPGGRRTN----SQGQSEIYAEQERLVLTKFRDCLSRLISYAEIKESKCLSLARMH 528
Query: 522 VGSQKDALSNGTASSRTGTG-NQPATVSNGVSKAYGPSALAAP 563
+ + TA+ +T G Q + NG S+ L P
Sbjct: 529 I------QPSATATVKTENGVQQQVPIVNGASRTNSQPTLNKP 565
>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 571
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 316/574 (55%), Positives = 383/574 (66%), Gaps = 69/574 (12%)
Query: 1 MGEGE-GGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHP-QSQAVTANSHP 58
MGEGE G +FPPK + + K ARQLDFTGG S + H Q+ ++ A S P
Sbjct: 1 MGEGEEGDKFPPKTDEVTQ----ESMKSARQLDFTGG--SSDVEHSHSNQASSMAAASIP 54
Query: 59 PP---------QPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
P Q P V PPPP G P+R P+PESP S R E ++ T
Sbjct: 55 SPIVTVTRPIITSQAPPTVATPIPPPPQSQGIILHVPIRHPRPESPNSMPRPAGETRDGT 114
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
P+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNA
Sbjct: 115 PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNA 174
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
FRPKIA+SPHG RDN+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC+C+
Sbjct: 175 FRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCL 234
Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
DCKNFEGSE RQ+LFHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF
Sbjct: 235 DCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFN 293
Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
KD S HRLG Q N+ RT+ S T S RA A++G SK Y+SLLA+II+P
Sbjct: 294 QGTKDSSTHRLG---QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPM 345
Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
D+K LCSVLV ++ EAAK++ TEK+ ++ TSVASS Q
Sbjct: 346 DVKALCSVLVAVAGEAAKTL------TEKRLANQKETSVASSVQ---------------- 383
Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
D N+A+K G +DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C
Sbjct: 384 DQGHVNNKAEKSGLEDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------C 434
Query: 470 NT--SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKD 527
+ +SQLP GQ +VYAEQE++VLTKFRDCLNR+I+ GE+KE S S+ D
Sbjct: 435 SVEPTSQLPNGQD--QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMD 485
Query: 528 ALSNGTASSRTGTGNQPATVSNGVSK-AYGPSAL 560
+ + R Q A V+NGVS+ A PS L
Sbjct: 486 LDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQL 519
>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 578
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 314/574 (54%), Positives = 381/574 (66%), Gaps = 62/574 (10%)
Query: 1 MGEGE-GGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHP-QSQAVTANSHP 58
MGEGE G +FPPK + + K ARQLDFTGG S + H Q+ ++ A S P
Sbjct: 1 MGEGEEGDKFPPKTDEVTQ----ESMKSARQLDFTGG--SSDVEHSHSNQASSMAAASIP 54
Query: 59 PP---------QPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
P Q P V PPPP G P+R P+PESP S R E ++ T
Sbjct: 55 SPIVTVTRPIITSQAPPTVATPIPPPPQSQGIILHVPIRHPRPESPNSMPRPAGETRDGT 114
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
P+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNA
Sbjct: 115 PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNA 174
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
FRPKIA+SPHG RDN+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC+C+
Sbjct: 175 FRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCL 234
Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
DCKNFEGSE RQ+LFHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF
Sbjct: 235 DCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFN 293
Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
KD S HRLG Q N+ RT+ S T S RA A++G SK Y+SLLA+II+P
Sbjct: 294 QGTKDSSTHRLG---QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPM 345
Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
D+K LCSVLV ++ EAAK++ TEK+ ++ TSVASS Q
Sbjct: 346 DVKALCSVLVAVAGEAAKTL------TEKRLANQKETSVASSVQ---------------- 383
Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
D N+A+K G +DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C
Sbjct: 384 DQGHVNNKAEKSGLEDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------C 434
Query: 470 NT--SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKD 527
+ +SQLP GQ +VYAEQE++VLTKFRDCLNR+I+ GE+K S+ D
Sbjct: 435 SVEPTSQLPNGQD--QVYAEQEKVVLTKFRDCLNRIISCGEVKVFCLFVAESNCSMSRMD 492
Query: 528 ALSNGTASSRTGTGNQPATVSNGVSK-AYGPSAL 560
+ + R Q A V+NGVS+ A PS L
Sbjct: 493 LDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQL 526
>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
Length = 536
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 290/500 (58%), Positives = 350/500 (70%), Gaps = 53/500 (10%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
P V P+PPPP G P+R P+PESP S R E ++ TP+K+KQCNCKHSRC
Sbjct: 35 PPTVATPIPPPPQS-QGIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94 LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213
Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
FHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF KD S HRLG
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270
Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
Q N+ RT+ S T S RA A++G SK Y+SLLA+II+P D+K LCSVLV ++
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAG 324
Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
EAAK++ TEK+ ++ TSVASS Q Q H N+A+K G
Sbjct: 325 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 362
Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
+DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C+ +SQLP GQ
Sbjct: 363 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 412
Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
+VYAEQE++VLTKFRDCLNR+I+ GE+KE S S+ D + + R
Sbjct: 413 -QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMDLDTPVQTTVRIDPV 464
Query: 542 NQPATVSNGVSK-AYGPSAL 560
Q A V+NGVS+ A PS L
Sbjct: 465 VQQAPVANGVSQTAKQPSQL 484
>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
Length = 543
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 288/500 (57%), Positives = 348/500 (69%), Gaps = 46/500 (9%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
P V P+PPPP G P+R P+PESP S R E ++ TP+K+KQCNCKHSRC
Sbjct: 35 PPTVATPIPPPPQS-QGIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94 LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213
Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
FHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF KD S HRLG
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270
Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
Q N+ RT+ S T S RA A++G SK Y+SLLA+II+P D+K LCSVLV ++
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAG 324
Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
EAAK++ TEK+ ++ TSVASS Q Q H N+A+K G
Sbjct: 325 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 362
Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
+DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C+ +SQLP GQ
Sbjct: 363 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 412
Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
+VYAEQE++VLTKFRDCLNR+I+ GE+K S+ D + + R
Sbjct: 413 -QVYAEQEKVVLTKFRDCLNRIISCGEVKVFCLFVAESNCSMSRMDLDTPVQTTVRIDPV 471
Query: 542 NQPATVSNGVSK-AYGPSAL 560
Q A V+NGVS+ A PS L
Sbjct: 472 VQQAPVANGVSQTAKQPSQL 491
>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 568
Score = 527 bits (1357), Expect = e-147, Method: Compositional matrix adjust.
Identities = 297/520 (57%), Positives = 357/520 (68%), Gaps = 48/520 (9%)
Query: 1 MGEGEGGE-FPPKKSQSESGGDIPAKKLARQLDFTG-GFVGSVILPEHPQSQAVTANSHP 58
MGEGE GE FPPK + + + K ARQLDFTG + S P S A + P
Sbjct: 1 MGEGEEGEKFPPKTDEV----TLESMKSARQLDFTGVEYSQSNQPPVASSSTAAASIPSP 56
Query: 59 PPQP---QPKPVVPPV--PPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ 113
+P Q +PV V P PPPP P+R P S R E ++ TP+K+
Sbjct: 57 VTRPITSQARPVTATVGTPIPPPPQSQGILHVPIRHPVDTCFFSFVRPVGEARDGTPQKK 116
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
KQCNCKHSRCLKLYCECFASG YCDGCNCVNCHNNVENE RREA+ +TLERNPNAFRPK
Sbjct: 117 KQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPK 176
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
IA+SPHG RDN+EE GEV++LG+HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKN
Sbjct: 177 IAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKN 236
Query: 234 FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK 293
FEGSE RQ+LFHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF K
Sbjct: 237 FEGSEVRQSLFHGEHSHNVAYLQH-ANAAITGAIGSSGFASAPAPKRRKGQEIFFNQGPK 295
Query: 294 DPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
D S HRLG Q N RT+ T S A A++ SK YRSLLADII+P+D+K
Sbjct: 296 DSSTHRLG---QANSGRTTSPKTGS-----HAGGNASLRPSKVVYRSLLADIIKPKDVKA 347
Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
LCSVLV ++ EAAK++ TE++ D+ TSVASS Q + S+ + T + +
Sbjct: 348 LCSVLVSVAGEAAKTL------TEERLADQKETSVASSVQDQGHSNNK----GTEGEKAA 397
Query: 414 SANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS- 472
S NQA+K GP DG K R +SP TLALMCDE+DT+ M A+SPN C+
Sbjct: 398 SGNQAEKSGP--------DGSKGRSLSPETLALMCDERDTMLMVAASPN------CSVEP 443
Query: 473 -SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKE 511
SQLP GQ +VYAEQE++VLTKFRDCLNR+I+ GE+KE
Sbjct: 444 PSQLPNGQ--DQVYAEQEKVVLTKFRDCLNRIISCGEVKE 481
>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
Length = 676
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 317/579 (54%), Positives = 384/579 (66%), Gaps = 59/579 (10%)
Query: 21 DIPAKKLARQLDFTGGFVGS--VILPEHPQ-------------------SQAVTANSHPP 59
D PAK RQLDFT S VILPEHPQ SQ+ P
Sbjct: 89 DFPAK---RQLDFTAMCRASANVILPEHPQLPLQTQPQRSQSQSQWQLQSQSPPQLLPPQ 145
Query: 60 PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCK 119
P+PQ PVVP + P ++P ++ K ESP+S RSN EVK+ TPKK+K CNCK
Sbjct: 146 PRPQLVPVVPRITNP--------SRPTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCK 197
Query: 120 HSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
+SRCLKLYCECFASG+YC+ C+C NC NNV NEAAR+EAV +TL+RNPNAFRPKIA+SPH
Sbjct: 198 NSRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPH 257
Query: 180 GTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
G+ D EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEE
Sbjct: 258 GSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 317
Query: 240 RQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHR 299
R+ALFH D+ N MAYIQQAANAAI+GAIGSSGY + SKKRK Q++ F +TA + S HR
Sbjct: 318 RRALFH-DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHR 375
Query: 300 LGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLV 359
FQQ NH+R S S SI V ++ A +GSSKFTYRS LADI+Q QD+K+LCS+LV
Sbjct: 376 NTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLV 435
Query: 360 VLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQAD 419
V+S EA K++ TE + D TS+ SS Q R + + DV + + D C S NQ +
Sbjct: 436 VVSQEAGKTL------TEAE-RDHIETSIISSTQER-EHLQTDDVQKPLPDACLSENQVE 487
Query: 420 KVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP--- 476
+V D+ D +D Q R MSPGTLALMCDEQD+IFM SP+ + G SSQ+P
Sbjct: 488 RVQTDE--LDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSPDGVVG-----SSQIPTVK 540
Query: 477 --YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTA 534
+GQ TEVY EQER+VLTKF D LNRLIT G IKE S LA++ G+Q++ L +
Sbjct: 541 SSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSHTGNQQEPLEDDAT 600
Query: 535 SSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTA 573
TG GN N + K PS A +MSQ + A
Sbjct: 601 KVGTGKGNHKEPNGNSIVK---PS--FAATTQMSQTIAA 634
>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
Length = 532
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 289/500 (57%), Positives = 348/500 (69%), Gaps = 57/500 (11%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
P V P+PPPP G P+R P+PESP S R E ++ TP+K+KQCNCKHSRC
Sbjct: 35 PPTVATPIPPPPQSQ-GIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94 LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213
Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
FHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF KD S HRLG
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270
Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
Q N+ RT+ S T S RA A++G SK SLLA+II+P D+K LCSVLV ++
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSK----SLLANIIKPMDVKALCSVLVAVAG 320
Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
EAAK++ TEK+ ++ TSVASS Q Q H N+A+K G
Sbjct: 321 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 358
Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
+DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C+ +SQLP GQ
Sbjct: 359 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 408
Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
+VYAEQE++VLTKFRDCLNR+I+ GE+KE S S+ D + + R
Sbjct: 409 -QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMDLDTPVQTTVRIDPV 460
Query: 542 NQPATVSNGVSK-AYGPSAL 560
Q A V+NGVS+ A PS L
Sbjct: 461 VQQAPVANGVSQTAKQPSQL 480
>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
Length = 579
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 315/585 (53%), Positives = 382/585 (65%), Gaps = 76/585 (12%)
Query: 15 QSESGGDIPAKKLARQLDFTGGFVGS--VILPEHPQ-------------------SQAVT 53
QSE+ D PAK RQLDFT S VILPEHPQ SQ+
Sbjct: 3 QSEAS-DFPAK---RQLDFTAMCRASANVILPEHPQLPLQTQPQRSQSQSQWQLQSQSPP 58
Query: 54 ANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ 113
P P+PQ PVVP + P ++P ++ K ESP+S RSN EVK+ TPKK+
Sbjct: 59 QLLPPQPRPQLVPVVPRITNP--------SRPTLQAVKQESPRSLPRSNSEVKDGTPKKK 110
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K CNCK+SRCLKLYCECFASG+YC+ C+C NC NNV NEAAR+EAV +TL+RNPNAFRPK
Sbjct: 111 KHCNCKNSRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPK 170
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
IA+SPHG+ D EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKN
Sbjct: 171 IANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 230
Query: 234 FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK 293
FEGSEER+ALFH D+ N MAYIQQAANAAI+GAIGSSGY + SKKRK Q++ F +TA
Sbjct: 231 FEGSEERRALFH-DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTAN 288
Query: 294 DPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
+ S HR FQQ NH+R S S SI V ++ A +GSSKFTYRS LADI+Q QD+K+
Sbjct: 289 NQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKD 348
Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
LCS+LVV+S EA K++ TE++ LQ+ DV + + D C
Sbjct: 349 LCSLLVVVSQEAGKTL------TERE---------------HLQTD---DVQKPLPDACL 384
Query: 414 SANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSS 473
S NQ ++V D+ D +D Q R MSPGTLALMCDEQD+IFM SP+ + G SS
Sbjct: 385 SENQVERVQTDE--LDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSPDGVVG-----SS 437
Query: 474 QLP-----YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDA 528
Q+P +GQ TEVY EQER+VLTKF D LNRLIT G IKE S LA++ G+Q++
Sbjct: 438 QIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSHTGNQQEP 497
Query: 529 LSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTA 573
L + TG GN N + K PS A +MSQ + A
Sbjct: 498 LEDDATKVGTGKGNHKEPNGNSIVK---PS--FAATTQMSQTIAA 537
>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
Length = 629
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 286/634 (45%), Positives = 375/634 (59%), Gaps = 75/634 (11%)
Query: 12 KKSQSESGGDIPAKKLARQLDFTGGFVGSV--ILPEHPQSQAVTANS------------- 56
++SQSE+ D KKLARQLDFT SV L EH
Sbjct: 2 EQSQSEAISDFAPKKLARQLDFTAVCRASVNVALSEHSSRLQQQQPPPLPPPPPPPMSQP 61
Query: 57 ------------------HPPPQPQPKP--------VVPPVPPPPPPLPGQATQPPVRVP 90
PP+ QP+ +VPP P + + +++
Sbjct: 62 QLHLKLQQQPQPVLQLKLETPPKQQPQATQARPQHVMVPPAMRRIPHPVQKLSLHTLQLI 121
Query: 91 KPESPKSRSRSNVEVKEN--TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNN 148
K +SP SR R+N+E K+N TPKK K CNCK+SRCLKLYCECFA+G++C GCNC NCHNN
Sbjct: 122 KQQSPSSRPRNNIEAKDNSGTPKKAKHCNCKNSRCLKLYCECFAAGVHCTGCNCTNCHNN 181
Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
VENEA+R+EAV A LERNP AF+PKIASSPHG+RD KE+ EV ++GKHNKGCHCKKSGC
Sbjct: 182 VENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGKHNKGCHCKKSGC 241
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG 268
LKKYCECFQANILCSENC+CMDCKNFEGSEER+ALFH +H N + Y+QQAANAAI GAIG
Sbjct: 242 LKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNH-NGIVYMQQAANAAICGAIG 300
Query: 269 SSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASST 328
SSGY +P S KRK +EL FGST +D R+ QQ NH R +P+++ S + V +
Sbjct: 301 SSGYGTPLTSNKRKSEEL-FGSTNRDQLGRRVAKTQQENHQR-NPAASASPLSVPVPHNA 358
Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTT-----EKQAEDR 383
A+GSSKFTY+S LA I+QPQD+K++CS+LVVLS EA K+ A + + ++ E
Sbjct: 359 TALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARKAYAGKMDMQPGGDDNRKLESS 418
Query: 384 AGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGT 443
+ +S+ G + + +V +TV +D + N A+K DSG DG + RP+SP
Sbjct: 419 SSSSIREREDGLMGN----NVHKTVPNDHVNGNTAEK-DSSDSGKDGGGLENGRPLSP-E 472
Query: 444 LALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRL 503
+ LMC EQ+ +F S + G N + Q +E YAEQE+++LT FRD LNRL
Sbjct: 473 IDLMCHEQEMVFREGESATGMAGLSQNEIQKSSSAQECSEAYAEQEKLILTGFRDFLNRL 532
Query: 504 ITFGEIK---------------EMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVS 548
IT G IK E S LAR++ GS ++ + T N +
Sbjct: 533 ITCGSIKGKLLWSETFLCSNFHETTCSPLARSKTGSHQEPADGEMIKAGIETLNDKNAYA 592
Query: 549 NGVSKAYGPSALAAPAAKMSQMVTATVPISNNDL 582
+G+ K+ LA A+ + + A +PI N ++
Sbjct: 593 SGIVKS---PVLATSASNVDLPIKALLPIGNGEV 623
>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
Length = 576
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 238/444 (53%), Positives = 307/444 (69%), Gaps = 22/444 (4%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 89 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 148
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE ARREA++ATLERNP+AFRPKI SSPH R+N E G++ ++GKHNKGCHCKKSGCLK
Sbjct: 149 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 208
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 209 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 267
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
++SP S+KRK + +AK+ HR H Q + + S +P+ +
Sbjct: 268 AFSSPSTSRKRKHIDPSLDHSAKENGAHRTNHVPQVFYQKNGMPSD-GPLPINQPVHPPT 326
Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK----------QA 380
+G K TYR LLADI+QP D+KELC +LVV+S AAK+ A +++ +K +
Sbjct: 327 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQEDKVVEKEDEMGGKR 386
Query: 381 EDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RP 438
E G S+AS+ R +++K+P+ + DD S+ +A K ++S D +D QK+ RP
Sbjct: 387 EGEIGGSLASTNHDREENNKDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRP 446
Query: 439 MSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRD 498
MSPGTLALMCDEQDT+FM T P Q +E+YAEQER VLT+FRD
Sbjct: 447 MSPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRD 498
Query: 499 CLNRLITFGEIKEMKYSSLARTEV 522
CL++L+TFG +KE K+S ++EV
Sbjct: 499 CLHKLVTFGRMKEEKFSMAIKSEV 522
>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 619
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 241/443 (54%), Positives = 305/443 (68%), Gaps = 25/443 (5%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 137 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 196
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE ARREA++ATLERNP+AFRPKI SSPH R+N E G++ ++GKHNKGCHCKKSGCLK
Sbjct: 197 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 256
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 257 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 315
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
++SP S+KRK + +AK+ HR H Q N + PS +P+ +
Sbjct: 316 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQKNGM---PSD--GPLPINQPVHPPT 370
Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVA---VQRNTTEKQ------AE 381
+G K TYR LLADI+QP D+KELC +LVV+S AAK+ A Q EK+ E
Sbjct: 371 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQDKVVEKEDEMGGKRE 430
Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RPM 439
G S+AS+ R +++ +P+ + DD S+ +A K ++S D +D QK+ RPM
Sbjct: 431 GEIGGSLASTNHDREENNNDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRPM 490
Query: 440 SPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDC 499
SPGTLALMCDEQDT+FM T P Q +E+YAEQER VLT+FRDC
Sbjct: 491 SPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRDC 542
Query: 500 LNRLITFGEIKEMKYSSLARTEV 522
L++L+TFG +KE K+S ++EV
Sbjct: 543 LHKLVTFGRMKEEKFSMAIKSEV 565
>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 615
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 265/548 (48%), Positives = 330/548 (60%), Gaps = 67/548 (12%)
Query: 23 PAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP-------PQPQ------------ 63
P KKL RQLDF G P + A + S PP PQP
Sbjct: 32 PIKKLVRQLDFNYGN------PAMAAAAAAVSRSVPPRALQVGYPQPHQLRAALPMGVPH 85
Query: 64 --------------------PKPVVPPVPPPPPP-----LPGQATQPPVRVP-KPESPKS 97
P+P +P P P P L +PPV VP KPESPK
Sbjct: 86 QLQPRGLPVMRPHQMVHAQLPRPALPMAMPVPVPQARSALSQPVQRPPVAVPLKPESPKP 145
Query: 98 RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARRE 157
R R + K+ TP K+K CNC+HS+CLKLYCECFASGIYCDGCNC NC NNVENE ARRE
Sbjct: 146 RGRL-YDGKDGTPTKKKCCNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENEVARRE 204
Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
A+EATLERNP+AFRPKI SSPH R+N E G++ ++GKHNKGCHCKKSGCLKKYCECFQ
Sbjct: 205 AIEATLERNPDAFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKYCECFQ 264
Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
ANILCSENC+C DCKNF+GSE+R+ L DH N+ ++QQAANAA+ GAIG++ ++SP
Sbjct: 265 ANILCSENCKCRDCKNFDGSEDRKHL-SLDH-KNVVHMQQAANAAVNGAIGATAFSSPST 322
Query: 278 SKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFT 337
SKKRK + + K+ R G Q + + + +S S+PV ++ +G K T
Sbjct: 323 SKKRKHIDSSLDLSTKEHLAMRNGQLSQVIYQKNAMASD-GSLPVGQSVHPHMMGPFKVT 381
Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGT--SVASSAQGR 395
YR LLADIIQP D+KELC +LV +S +AAK+ ++ EK AE GT S+AS+ R
Sbjct: 382 YRPLLADIIQPGDVKELCKLLVTVSGQAAKAYTGRKIQEEKVAEKEDGTGGSLASTNHDR 441
Query: 396 LQSHKEPDVDRTVTDDCSSA-NQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
++ + +T DD SS K DDS D +D QK +RPMSPGTLALMCDEQD
Sbjct: 442 EGKTQDQNHRKTSIDDRSSVGTDMGKASLDDSRPDCNDDQKSSRPMSPGTLALMCDEQDA 501
Query: 454 IFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMK 513
+F T + Q +E+YAEQER VLT+FRDCL +L+ +G +KE K
Sbjct: 502 MFA--------TSQNAVAPPAVAADQNRSELYAEQERCVLTEFRDCLRKLVQYGRMKEEK 553
Query: 514 YSSLARTE 521
YS R+E
Sbjct: 554 YSMAIRSE 561
>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
Length = 515
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 263/539 (48%), Positives = 334/539 (61%), Gaps = 70/539 (12%)
Query: 15 QSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQA------------------VTANS 56
QSE+ D+ +KLARQLDF + P H + + +
Sbjct: 3 QSETASDLAPRKLARQLDFAAVYGD----PSHQKLPPPLPPPQSPLRPPSHSPLRLQLDL 58
Query: 57 HPPPQ-----------------PQPKPVVPP--VPPPPPPLPGQATQPPVRVPKPESPKS 97
PP Q P+PK V P +P P P + Q V KPESP+S
Sbjct: 59 QPPGQRPWLYSEAQEQQPGLYSPRPKLVSPVRRIPHPVVKFPVKVLQ----VVKPESPRS 114
Query: 98 RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARRE 157
+ R NVE+K+ TPKKQKQCNCK+SRCLKLYCECFA+GIYCDGCNCVNCHNNV+NEAAR+E
Sbjct: 115 QPRCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQE 174
Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
AV TLERNPNAFRPKIASSP RD+KE E+ ++GKHNKGCHCKKSGCLKKYCECFQ
Sbjct: 175 AVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHNKGCHCKKSGCLKKYCECFQ 232
Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
ANILCSENC+CMDCKNFEGS+ER A+FH D+ N+ +++QAANA I+GA+GSSGY +
Sbjct: 233 ANILCSENCKCMDCKNFEGSDERIAIFHKDY--NLVHMKQAANATISGAVGSSGYGTHIT 290
Query: 278 SKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFT 337
KKRK QE+F G +A D +++ + Q S S+LS ++ S+T GSS+ T
Sbjct: 291 PKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSDSFISDPSNTRISGSSRST 350
Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED-RAGTSVASSAQGRL 396
YRS+LAD+++P ++K LCS+ VVLS AAK+ A R ++Q + SVASSA+ L
Sbjct: 351 YRSVLADVLKPNNVKNLCSLFVVLSRVAAKTNAEMRGKVDQQTKTGNFEASVASSAKS-L 409
Query: 397 QSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM 456
Q + DV + DD + ++A+ V RP+SP TLALMCDEQD +
Sbjct: 410 QETR--DVHQPACDDHVNKDEAEAVDV---------AHYNRPLSPETLALMCDEQDDMLF 458
Query: 457 AASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLITFGEIK 510
S + G CN++ Q G T+VY EQER++LTKF D L L+T G IK
Sbjct: 459 GNCSAD---GVACNSTFQNMIQKSFNSDGCTDVYREQERLILTKFLDILRGLVTHGSIK 514
>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
Length = 529
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 239/443 (53%), Positives = 304/443 (68%), Gaps = 21/443 (4%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 43 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 102
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE ARREA++ATLERNP+AFRPKI SSPH R+N E G++ ++GKHNKGCHCKKSGCLK
Sbjct: 103 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 162
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 163 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 221
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
++SP S+KRK + +AK+ HR H Q + + S +P+ +
Sbjct: 222 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQVFYQKNGMPSD-GPLPINQPVHPPT 280
Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVA---VQRNTTEKQ------AE 381
+G K TYR LLADI+QP D+KELC +LVV+S AAK+ A Q EK+ E
Sbjct: 281 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQDKVVEKEDEMGGKRE 340
Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RPM 439
G S+AS+ R +++ +P+ + DD S+ +A K ++S D +D QK+ RPM
Sbjct: 341 GEIGGSLASTNHDREENNNDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRPM 400
Query: 440 SPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDC 499
SPGTLALMCDEQDT+FM T P Q +E+YAEQER VLT+FRDC
Sbjct: 401 SPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRDC 452
Query: 500 LNRLITFGEIKEMKYSSLARTEV 522
L++L+TFG +KE K+S ++EV
Sbjct: 453 LHKLVTFGRMKEEKFSMAIKSEV 475
>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
Length = 656
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 264/558 (47%), Positives = 333/558 (59%), Gaps = 33/558 (5%)
Query: 35 GGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQ----PPVRVP 90
G G VI S A + NS P + + V + A + +++
Sbjct: 119 AGTTGVVIFTAAEASFAGSTNSAPREEASGEGVASGEYSDRAQVYHCAFRMLNFSAIKIV 178
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KP+SP+S+ + NVE+K+ TPKKQKQCNCK+SRCLKLYCECFA+GIYCDGCNCVNCHNNV+
Sbjct: 179 KPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVD 238
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NEAAR+EAV TLERNPNAFRPKIASSP RD+KE E+ ++GKHNKGCHCKKSGCLK
Sbjct: 239 NEAARQEAVGITLERNPNAFRPKIASSPLERRDSKE--CEIQVIGKHNKGCHCKKSGCLK 296
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGS+ER+A+FH D+ N+ +I+QAANA I+ GSS
Sbjct: 297 KYCECFQANILCSENCKCMDCKNFEGSDERRAIFHKDY--NLVHIKQAANATISRVAGSS 354
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
GY + KKRK E+F G +A D + + +QQ S S LS V+ S+T
Sbjct: 355 GYGTHLTPKKRKIHEMFPGKSAMDQTDNITAQYQQEIDPMASSPSFLSDSFVSDPSNTRI 414
Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVAS 390
SS+ TYRS+LAD QPQ++K LCS+ VVLS AAK+ A ++Q + G AS
Sbjct: 415 SASSRSTYRSVLADFFQPQNVKNLCSLFVVLSQVAAKTNAEMIGKVDQQT--KTGNFEAS 472
Query: 391 SAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDE 450
A + DV + V DD + ++AD V + RP+SP TLALMCDE
Sbjct: 473 VASSSKSLQETRDVHQLVCDDHVNKDEADAVDIANYN---------RPLSPETLALMCDE 523
Query: 451 QDTIFMAASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLIT 505
QD + SS N G CN++ Q G T+V EQER++LTKF D L L+T
Sbjct: 524 QDDMIFGNSSAN---GVACNSTLQNMIQKSSNSDGCTDVCREQERLILTKFLDVLRGLVT 580
Query: 506 FGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAA 565
G IKE SS + S+K+ N + T G + +N PS
Sbjct: 581 HGSIKETMCSSSTKKGERSKKEPADNANIGAETNVGREKGIHNNSFGNYLIPS-----VT 635
Query: 566 KMSQMVTA-TVPISNNDL 582
K+SQ A T NNDL
Sbjct: 636 KISQTNYAVTNGHGNNDL 653
>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
Length = 550
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 241/492 (48%), Positives = 313/492 (63%), Gaps = 47/492 (9%)
Query: 60 PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSRSNVEV 105
P +P VV VP P +P P +R VP KPESPK R+R E
Sbjct: 45 PVMRPHQVVGHVPLPRHAVPMAVPVPQLRPMPPQPVQRPPVAVPLKPESPKPRARL-YEG 103
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLER 165
K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+EATLER
Sbjct: 104 KDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLER 163
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
NP+AFRPKI SSPH R+N E + ++ ++GKHNKGCHCKKSGCLKKYCECFQANILCSEN
Sbjct: 164 NPDAFRPKIGSSPHTNRNN-EVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSEN 222
Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
C+CMDCKNFEGSEER+ LF GDH N + +QQA NAA+ GAIG++G+ SP S+KRK +
Sbjct: 223 CKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQATNAAVNGAIGATGFPSPSTSRKRKHID 281
Query: 286 LFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADI 345
++ K+ HR H Q + + + IP+++++ +G K TYR LLADI
Sbjct: 282 PSLDNSNKEHVAHRNYHISQSVYQKNAVPD--GPIPISQSAHPPNLGPFKVTYRPLLADI 339
Query: 346 IQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-----------DRAGTSVASSAQG 394
+Q +D+KELC +LVV+S EAAK+ A ++ E+ E DRAG S+A +
Sbjct: 340 VQEEDIKELCKLLVVVSGEAAKTYAGRKTMEERVGEKEDERGGQKEDDRAG-SLALTNHD 398
Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKA-RPMSPGTLAL------- 446
R ++++PD ++ D S K ++S + +D QK+ RPMSPGTLAL
Sbjct: 399 REGNNQDPDQKASIDDHSSRGTHTGKAILEESRPNCADDQKSNRPMSPGTLALLCDEEVK 458
Query: 447 MCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITF 506
MCDEQDT+F T + Q + +YAEQER+VLT+FRDCL + +
Sbjct: 459 MCDEQDTVFT--------TSQNAVPQQTITVNQNQSALYAEQERVVLTEFRDCLRKFVER 510
Query: 507 GEIKEMKYSSLA 518
G KE +YSS+A
Sbjct: 511 GRNKERRYSSMA 522
>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
Length = 682
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 227/424 (53%), Positives = 295/424 (69%), Gaps = 18/424 (4%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
K +SP SR + TPKK KQCNCK+S CLKLYCECFA+GI+C+GCNC+NC NNVE
Sbjct: 91 KQDSPGSRL-----ANDGTPKKPKQCNCKNSMCLKLYCECFAAGIHCNGCNCLNCRNNVE 145
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE R+EAVE TL+RNPNAFRPKIASSPHG+RD E+ EV MLGKHNKGC+CK+SGCLK
Sbjct: 146 NEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGCLK 205
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD-HANNMAYIQQAANAAITGAIGS 269
KYCECFQ NI CSENC+C++CKNFEGS+ER+ALFHG HA AY QQA NAAI+GAIGS
Sbjct: 206 KYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHA--TAYAQQATNAAISGAIGS 263
Query: 270 SGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPS-STLSSIPVARASST 328
SGY + SKKRK +EL FG KD H F QGN +R S + S+ S+PV+R ++
Sbjct: 264 SGYGTTLASKKRKSEEL-FGIATKDQPAHSTAKFSQGNTLRNSAAFSSPLSVPVSRTANG 322
Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTS 387
A +GS+K TYRS LA +++PQ +KE+C++LV +S +A +++A + + Q+E + T
Sbjct: 323 AVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEALAGKVGKMDTQSERENIETY 382
Query: 388 VASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALM 447
AS Q + S K + + DDC + N+A+K G DSG D + ARP SP + LM
Sbjct: 383 SASPTQEKESSLKGWNAIEHMPDDCLNGNKAEKDGSIDSG----DVENARPSSPD-IDLM 437
Query: 448 CDEQDTIFMAASSPNRLTGHGCNTSSQLP-YGQGMTEVYAEQERIVLTKFRDCLNRLITF 506
C E++ +FM SP + C + +Q P G ++++AEQE+I+LT F D LN + T+
Sbjct: 438 CHEEEMMFMEMGSPIGV-ARLCQSKTQKPDDGYECSQLHAEQEKIILTSFLDILNTVSTY 496
Query: 507 GEIK 510
G IK
Sbjct: 497 GSIK 500
>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
gi|224030123|gb|ACN34137.1| unknown [Zea mays]
gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
Length = 607
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/451 (48%), Positives = 284/451 (62%), Gaps = 39/451 (8%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK R+R E K++TP K+K NC++S+CLKLYCECFASG YCDGCNC NC NN +
Sbjct: 128 KPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPD 186
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE ARREAVEA L+RNP+AFRPKI SSPH R+N E ++ ++GKHNKGCHCKKSGCLK
Sbjct: 187 NEVARREAVEAILDRNPDAFRPKIGSSPHMHRNN-EVPSDLPLVGKHNKGCHCKKSGCLK 245
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGSEER+ LF GDH N + +QQ NAA+ GAIG++
Sbjct: 246 KYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQVTNAAVNGAIGAT 304
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
G+ SP S+KRK + K+ H+ H Q N + SIP+++++
Sbjct: 305 GFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQKNAVPD------GSIPISQSAHPPT 358
Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK----------QA 380
+G FTY+ LLAD +Q +D+KELC +LVV+S EAAK+ A Q+ E+ Q
Sbjct: 359 LGP--FTYKPLLADTVQAEDIKELCKLLVVVSGEAAKTYAGQKTKEERVGEKEDEIGGQK 416
Query: 381 EDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKA-RPM 439
ED S+A + R ++++ D + D S K ++S + D K RPM
Sbjct: 417 EDDRVGSLALTNHDREGNNQDQDQKAPIDDHSSRGTHTGKAIIEESRPNCVDDHKTHRPM 476
Query: 440 SPGTLAL---------MCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQER 490
SPGTLAL MCDEQDT+F T + + +YAEQE
Sbjct: 477 SPGTLALMCDEEAFEKMCDEQDTMFT--------TSQNAAPQQTVIVNHNQSALYAEQEN 528
Query: 491 IVLTKFRDCLNRLITFGEIKEMKYSSLARTE 521
+VLT+FRDCL + + +G +KE +YS ++E
Sbjct: 529 VVLTEFRDCLRKFVRYGRMKEERYSMAIKSE 559
>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
Length = 628
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 228/426 (53%), Positives = 278/426 (65%), Gaps = 37/426 (8%)
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEER+ALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274
Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
D+ N MAYIQQAANAAI+GAIGSSGY + SKKRK QE+ F +TA + S HR FQQ
Sbjct: 275 -DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQ 333
Query: 306 GNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEA 365
NH+R S S SI V ++ A +GSSKFTYRS LADI+QPQD+K+LCS+LVV+S EA
Sbjct: 334 DNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEA 393
Query: 366 AKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDD 425
K++ TE + D TS+ SS R + + DV + + D C S NQ ++V D+
Sbjct: 394 GKTL------TEAE-RDHIETSIISSTHER-EHLQTDDVQKPLPDACLSENQVERVQTDE 445
Query: 426 SGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP-----YGQG 480
D +D Q R MSPGTLALMCDEQD+IFMA SP+ + G SSQ+P +GQ
Sbjct: 446 --LDQADVQNGRLMSPGTLALMCDEQDSIFMARGSPDGVVG-----SSQIPTVKSSHGQA 498
Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGT 540
TEVY EQER+VLTKF D LNRLIT G IKE S LA++E G+Q++ L + RTG
Sbjct: 499 CTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSETGNQQEPLEDDATKVRTGK 558
Query: 541 GNQPATVSNGVSKAYGPSALAAPAAKMSQMVTAT-----------VPISNNDLRHNPALP 589
GN N + K PS A +MSQ + A P + + NPA+
Sbjct: 559 GNHKEPNGNSIVK---PS--FAATTQMSQTIAAAKGNGIAKQKPPFPSTTRISQTNPAVH 613
Query: 590 GEGDVK 595
G G K
Sbjct: 614 GNGTAK 619
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 9/79 (11%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE------ATL 163
K K C+CK S CLK YCECF + I C + C C++C N E RR A +
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDC-KNFEGSEERRALFHDNGNYMAYI 283
Query: 164 ERNPN-AFRPKIASSPHGT 181
++ N A I SS +GT
Sbjct: 284 QQAANAAISGAIGSSGYGT 302
>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
Length = 646
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 250/630 (39%), Positives = 333/630 (52%), Gaps = 129/630 (20%)
Query: 15 QSESGGDIPAKKLARQLDFT---GGFVGSVILPEHPQSQAVT-------------ANSH- 57
+SE+ D+ +K ARQLDF GG + P PQS + T A H
Sbjct: 3 KSETTLDLANRKFARQLDFAAACGGPSYLTLSPPTPQSLSKTIPRPPTPSYLPPPAQMHL 62
Query: 58 ------------PPPQPQPK------PVVPPVPPPPPPLPGQATQPPVRVPK-------- 91
PPQ Q + PV+P P P L + P PK
Sbjct: 63 DFQSPVRRPWKQSPPQWQQQRTSQSPPVMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQA 122
Query: 92 ---PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNN 148
SP+S+S++ +K+NT KKQK+CNCK+S+CLKLYCEC+A+GIYCDGCNC NCHNN
Sbjct: 123 LNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKCLKLYCECYAAGIYCDGCNCQNCHNN 182
Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPH----------------------------- 179
+ NEAAR+EA+ TLE+NPNAFRPKIASSP
Sbjct: 183 LNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFELK 242
Query: 180 -GTRDN---------------------KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
G +N +EE E+ ++G+HNKGCHCKK GCLKKYCECF
Sbjct: 243 FGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCECFH 301
Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----SSGY 272
AN+LCSENC+C+DCKNFEGS+ + + + + + I+QA NAAI GA+G S +
Sbjct: 302 ANVLCSENCKCIDCKNFEGSDVWRIVLQEECS--LVQIRQATNAAINGAVGFGPSISGTH 359
Query: 273 ASPPVSKKRKGQELFFGSTAKD-PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAV 331
+P KKRK QE F G + D P H ++ + I +SP S +S S TA
Sbjct: 360 ITP---KKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASF----VSDTAYK 412
Query: 332 GSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTSVAS 390
S+ +RS+LAD++Q Q++K LCS+LVVLS EAAK+ A R ++ + ++ S+AS
Sbjct: 413 RISRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIAS 472
Query: 391 SAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDE 450
S+Q LQ + +V R +S N A+K D D + RP+SP TL LMCDE
Sbjct: 473 SSQ-LLQDSR--NVVR------ASENHANKDVADAVDIDIHN----RPLSPETLKLMCDE 519
Query: 451 QDTIFMAASSPNRLTGHGC--NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
D +F S N + N + G T VYAEQER++LTKFRD L LI G
Sbjct: 520 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 579
Query: 509 IKEMKYSSLARTEVGSQKDALSNGTASSRT 538
IKE +SS + +V +K +NG + + T
Sbjct: 580 IKETMHSSSVKKDVSIEKTPKNNGDSGAET 609
>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
Length = 220
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 179/253 (70%), Positives = 192/253 (75%), Gaps = 35/253 (13%)
Query: 1 MGEGEGGEFPPKKSQSESGGDIPAKKLARQLDFT--GGFVGSVILPEHPQSQAVTANSHP 58
MGEGEG + PPK + +S KKLARQLDF GG + LPEHPQ
Sbjct: 1 MGEGEGSDIPPKNAPLDS-----VKKLARQLDFNAFGGTPVTAPLPEHPQ---------- 45
Query: 59 PPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNC 118
P P +PP+P ++ KPESPKS+SR N E K+ TPKKQKQCNC
Sbjct: 46 -----PSPTLPPLPA-------------TKLGKPESPKSKSRPNFETKDATPKKQKQCNC 87
Query: 119 KHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
KHSRCLKLYCECFASGIYCDGCNCVNC NNV+NEAARREAVEATLERNPNAFRPKIASSP
Sbjct: 88 KHSRCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSP 147
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
HGTRDNKEETGEV +L KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSE
Sbjct: 148 HGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSE 207
Query: 239 ERQALFHGDHANN 251
ERQALFHGD NN
Sbjct: 208 ERQALFHGDQNNN 220
>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 459
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 263/446 (58%), Gaps = 54/446 (12%)
Query: 71 VPPPPPPLP--GQATQPPVRVPKPESPKSRSRSNVEVKENTP-KKQKQCNCKHSRCLKLY 127
PP PPL G RV + SR + VE KENTP K+QK CNCK+S+CLKLY
Sbjct: 49 TPPQKPPLHQFGHNQVQKNRVSNQDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLY 108
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG YC+GCNCVNCHN +ENE++R+ A+ LERNP+AF+PKIA SPHG +D +E
Sbjct: 109 CECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQEN 168
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
+VL++GKH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+AL HG
Sbjct: 169 VQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERKALLHGS 228
Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLG-HFQQG 306
++ YIQQ NAA+ AI S Y PP S+KRK +++ D + G +Q+
Sbjct: 229 QVSD-TYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDI------SDSVVSSYGVQYQRA 281
Query: 307 NHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAA 366
NH+R + ++L S+P +A S GS+ YRS ++ QP ++ELCS+LV +
Sbjct: 282 NHVRRNGENSLFSLPNNKAVS----GSTTSAYRSSWSNTFQPHHVRELCSLLV------S 331
Query: 367 KSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTD--DCS-SANQADKVGP 423
SV V ++K ++ G S AQ V + D DC AN+ D+
Sbjct: 332 NSVDVANKLSDKGRKNDKGPSSLDGAQR---------VANEINDSPDCVLDANRMDE--- 379
Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTE 483
+P+SP T ALMCDE+ I + R+ TS + +
Sbjct: 380 -------------KPISPATRALMCDEEHEIGSEKETSARVK-----TSQEKEDTDTSSG 421
Query: 484 VYAEQERIVLTKFRDCLNRLITFGEI 509
+Y EQER +L+ FRD L +L I
Sbjct: 422 IYLEQERQILSIFRDYLIQLSNRARI 447
>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 412
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 199/390 (51%), Positives = 244/390 (62%), Gaps = 53/390 (13%)
Query: 23 PAKKLARQLDFTGGFVGSVI-----------------LP---EHPQSQAVTANSHPP--- 59
P KKL RQLDF + LP +HPQ P
Sbjct: 25 PIKKLVRQLDFNSAALAGNPAMAAAAAAVSRALQPRALPAGFQHPQHVRAAVPMVVPQQL 84
Query: 60 -----PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSR 100
P +P PVV VP PL P +R VP KP+SPK +R
Sbjct: 85 HQRALPVMRPHPVVGHVPRHAVPL--AVPVPQLRPMPPQPVQRPPVAVPLKPDSPKPWAR 142
Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
E K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+E
Sbjct: 143 L-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIE 201
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
ATLERNP+AFRPKI SSPH R+N E + ++ ++GKHNKGCHCKKSGCLKKYCECFQANI
Sbjct: 202 ATLERNPDAFRPKIGSSPHTNRNN-EVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANI 260
Query: 221 LCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKK 280
LCSENC+CMDCKNFEGSEER+ LF GDH N + IQQA NAA+ GAIG++G+ SP S+K
Sbjct: 261 LCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-IQQATNAAVNGAIGTTGFPSPSTSRK 319
Query: 281 RKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRS 340
RK + + K+ + HR H Q N + SIP+++++ +G K TYR
Sbjct: 320 RKHIDPSLDHSNKEHAAHRNCHIPQKNAVPD------GSIPISQSAHPPTLGPFKVTYRP 373
Query: 341 LLADIIQPQDLKELCSVLVVLSSEAAKSVA 370
LLADI+Q +D+KELC +LVV+S EAAK+ A
Sbjct: 374 LLADIVQAEDVKELCKLLVVVSGEAAKTYA 403
>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
Length = 649
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 246/633 (38%), Positives = 329/633 (51%), Gaps = 132/633 (20%)
Query: 15 QSESGGDIPAKKLARQLDFT---GGFVGSVILPEHPQSQAVT-------------ANSH- 57
+SE+ D+ +K ARQLDF GG + P PQS + T A H
Sbjct: 3 KSETTLDLANRKFARQLDFAAACGGPSYLTLSPPTPQSLSKTIPRPPTPSYLPPPAQMHL 62
Query: 58 ------------PPPQPQPK------PVVPPVPPPPPPLPGQATQPPVRVPK-------- 91
PPQ Q + PV+P P P L + P PK
Sbjct: 63 DFQSPVRRPWKQSPPQWQQQRTSQSPPVMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQA 122
Query: 92 ---PESPKSRSRSNVEVKENTPKKQKQCNCKHSR---CLKLYCECFASGIYCDGCNCVNC 145
SP+S+S++ +K+NT KKQK+CNCK+S+ YCEC+A+GIYCDGCNC NC
Sbjct: 123 LNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKSDFATARYCECYAAGIYCDGCNCQNC 182
Query: 146 HNNVENEAARREAVEATLERNPNAFRPKIASSPH-------------------------- 179
HNN+ NEAAR+EA+ TLE+NPNAFRPKIASSP
Sbjct: 183 HNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSF 242
Query: 180 ----GTRDN---------------------KEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
G +N +EE E+ ++G+HNKGCHCKK GCLKKYCE
Sbjct: 243 ELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCE 301
Query: 215 CFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----S 269
CF AN+LCSENC+C+DCKNFEGS+ + + + + + I+QA NAAI GA+G S
Sbjct: 302 CFHANVLCSENCKCIDCKNFEGSDVWRIVLQEECS--LVQIRQATNAAINGAVGFGPSIS 359
Query: 270 SGYASPPVSKKRKGQELFFGSTAKD-PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASST 328
+ +P KKRK QE F G + D P H ++ + I +SP S +S S T
Sbjct: 360 GTHITP---KKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASF----VSDT 412
Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTS 387
A S+ +RS+LAD++Q Q++K LCS+LVVLS EAAK+ A R ++ + ++ S
Sbjct: 413 AYKRISRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEAS 472
Query: 388 VASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALM 447
+ASS+Q LQ + +V R +S N A+K D D + RP+SP TL LM
Sbjct: 473 IASSSQ-LLQDSR--NVVR------ASENHANKDVADAVDIDIHN----RPLSPETLKLM 519
Query: 448 CDEQDTIFMAASSPNRLTGHGC--NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLIT 505
CDE D +F S N + N + G T VYAEQER++LTKFRD L LI
Sbjct: 520 CDELDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELII 579
Query: 506 FGEIKEMKYSSLARTEVGSQKDALSNGTASSRT 538
G IKE +SS + +V +K +NG + + T
Sbjct: 580 LGSIKETMHSSSVKKDVSIEKTPKNNGDSGAET 612
>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
Length = 497
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 196/432 (45%), Positives = 256/432 (59%), Gaps = 30/432 (6%)
Query: 86 PVRVPKPESPK-SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 144
P+ K ESP+ S ++ ++ TPK +QC+CK S+CLKLYCECFASG YCD C+C N
Sbjct: 64 PLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYCDECSCAN 123
Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
CHNNVENE RREA E LERNPNAF+PKI SP D+ + +VL++ KH KGCHCK
Sbjct: 124 CHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCK 183
Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
++GCLKKYCECFQANILCSENC+C+ CKN EG E + +H AYIQQ + +
Sbjct: 184 RTGCLKKYCECFQANILCSENCKCVSCKNLEGRELGMVVTSENHCKTKAYIQQENANSSS 243
Query: 265 GAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVAR 324
+ SS S S+KR +EL F S +D IH H +P SIP R
Sbjct: 244 AVVSSSHIFSQE-SRKRTFREL-FDSNMRDTEIHEF-----TKHHAVNPVGVHVSIPTLR 296
Query: 325 A------SSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK 378
A +S+AA+GSSK TYRSLLAD I P D +ELCS+LVV+ SEAAK +A + +
Sbjct: 297 ADCVCQIASSAALGSSKLTYRSLLADAIHPLDTRELCSLLVVV-SEAAKFLADEH--CKA 353
Query: 379 QAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARP 438
Q ++ +A + + KEPDV + V + ++ D D+S S D Q P
Sbjct: 354 QIKEVKEYQMACDLEKDCK--KEPDVHQVVPHNQFPGSRTDITVMDNSRSCDIDMQDGGP 411
Query: 439 MSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRD 498
S LMCD++D +F ++S ++ HGCN + T+ E ER+VLT RD
Sbjct: 412 FSSEEDYLMCDKKDKLFTRSASLDQNLNHGCN--------ENFTK---EHERLVLTCLRD 460
Query: 499 CLNRLITFGEIK 510
CL +LI FG +K
Sbjct: 461 CLGKLIAFGTMK 472
>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
distachyon]
Length = 487
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 204/476 (42%), Positives = 258/476 (54%), Gaps = 71/476 (14%)
Query: 66 PVVPP---VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSR 122
PVV P P P P P P+ KP + EVK TPKK+K CNCK+SR
Sbjct: 29 PVVAPQQQKPAAPVPRPWPVAFTPM---KPAA---------EVKNVTPKKRKHCNCKNSR 76
Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
CLKLYCECFA+ + CDGCNC NC NN ENE R+ A+EATL RNP AF+PKI + P
Sbjct: 77 CLKLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVN 136
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
K+ +G V + KHNKGCHCKKSGCLKKYCECFQANILCS+NCRCMDCKNFEGSEER+A
Sbjct: 137 VRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCECFQANILCSKNCRCMDCKNFEGSEERRA 196
Query: 243 LFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGH 302
L GD+ ++ IQQAAN A+ GAIGSSGY P +K +++ S+ +
Sbjct: 197 LVQGDNTSDRNNIQQAANTALNGAIGSSGYKCSPARRK----------ISQEDSLSEM-Q 245
Query: 303 FQQGNHIRTSPSSTLSSIPVARASSTAAVG---------SSKFTYRSLLADIIQPQDLKE 353
FQQ N LS + S T G SSK YRS LA+ I D+ +
Sbjct: 246 FQQVN---------LSDVSQLTPSCTGFGGHNGCYSQSNSSKMIYRSPLANTIHLSDVND 296
Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
L +V+ +AA+ A ++A + ++ KE + + ++ S
Sbjct: 297 LVKHVVIACRKAAE----------------AFPTIADNKGVEVEVEKEFHANYGLNNNRS 340
Query: 414 SANQADKVGPDDSGSDGSDGQKA---RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCN 470
+ P D S Q RP SPGT ALMCDEQDT F + C+
Sbjct: 341 KVQDLKEASPMDIHSKACTDQHNVDPRPASPGTQALMCDEQDTSFGNDIYRSSFAATSCD 400
Query: 471 TSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQK 526
+ ++E+ A QE +VLT FRD L LIT G+I E K SS A E+ +Q+
Sbjct: 401 --------KNISEINALQENLVLTGFRDYLRVLITRGKINEAKSSSEAAMELNAQR 448
>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 405
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 212/429 (49%), Positives = 269/429 (62%), Gaps = 39/429 (9%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
K ESP+ RSRS + K+ TPKK KQCNCK+SRCLKLYCECFASG YC+GCNCVNC NNVE
Sbjct: 4 KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVE 62
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE R+EAVEATLERNPNAFRPKI SSP G RD+ E+ GE + GKHNKGCHCKKSGCLK
Sbjct: 63 NEIVRQEAVEATLERNPNAFRPKIFSSP-GIRDSGEDVGEHPLAGKHNKGCHCKKSGCLK 121
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+C+DCKNF+GSEER+ALF DH ++ ++Q + + S
Sbjct: 122 KYCECFQANILCSENCKCVDCKNFDGSEERRALFQADHNMSVNFMQPLGGSGVLS--NSP 179
Query: 271 GYASP-PVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTA 329
Y SP P+ KKR+ EL F + G + + SST++S P ++ S
Sbjct: 180 AYLSPSPLLKKRRTHELVFSGPGLKEQLTSGG--GTAPSLSLALSSTIASTPSSQGVS-P 236
Query: 330 AVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVA 389
AV +K ++SLLA ++Q + + ELC +LVV+SSE T+K+ + VA
Sbjct: 237 AVPPAKPMHKSLLAGVVQIEAIHELCKLLVVVSSE-----------TQKEILGMLLSDVA 285
Query: 390 SSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCD 449
+S+ R S + DR VT GPDD +D + RPMSPGTLALMCD
Sbjct: 286 NSSTLRAFS-QSFQCDRAVT----------TTGPDDEDAD----KPQRPMSPGTLALMCD 330
Query: 450 EQDTIFMAASSP-NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
E++ +F A SP L +TS + ++AEQER +L +FRDCL R+ + G
Sbjct: 331 EKEPLFTAPPSPIGGLASEELSTSGS----PQIALLHAEQERAILLEFRDCLRRITSVGN 386
Query: 509 IKEMKYSSL 517
+ M S
Sbjct: 387 RRGMVLGSF 395
>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
Length = 448
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 185/473 (39%), Positives = 253/473 (53%), Gaps = 72/473 (15%)
Query: 58 PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKEN-TPKKQKQC 116
P P+P P +PP P E+K + K++K C
Sbjct: 2 PTPRPWPVAFIPPKPV-----------------------------AEIKSTPSTKRKKHC 32
Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
NCK+S+CLKLYCECFA+G+YCDGC+C C N V NE+AR+EA+ +T +RNP AF+PKI +
Sbjct: 33 NCKNSQCLKLYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIEN 92
Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
+ K++ G L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEG
Sbjct: 93 GSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEG 152
Query: 237 SEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS 296
SEE + + GD++++ IQ AAN A+ GAIGSSGY PV +KR ++ + + S
Sbjct: 153 SEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGS 212
Query: 297 IHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCS 356
+ FQ+ NH+ SS ++S + S YRS LA+ I P D L
Sbjct: 213 TMQT-QFQEANHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPTDATGLAK 268
Query: 357 VLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSAN 416
LV++ +AA + AE++ E +V+R + D A
Sbjct: 269 HLVIVCRKAADAFLT-------TAENKG----------------EMEVEREIHTDSDGAT 305
Query: 417 QADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP 476
D+ D G + + +RP SPGT ALMCDEQD+ F G + P
Sbjct: 306 NMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF------------GTDYRISFP 353
Query: 477 ---YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQK 526
+ Q +E+ A+QE+ VLT FRD L +IT G+I E SS A ++ ++K
Sbjct: 354 VALHDQDTSELNAQQEKAVLTGFRDYLRTVITRGKINEANRSSEAAMQLDTRK 406
>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 912
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 199/466 (42%), Positives = 266/466 (57%), Gaps = 58/466 (12%)
Query: 80 GQATQPPVRVPK-----PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASG 134
G +Q P R P P P SRSRS ++ K+ TPKK KQCNCK+SRCLKLYCECFASG
Sbjct: 430 GSPSQSPHRSPSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKLYCECFASG 489
Query: 135 IYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL-M 193
YCDGCNCVNC NN E+E R+EAVEATLERNPNAFRPKIA+SP D KE+ +
Sbjct: 490 TYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKEDVVDRNGS 549
Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMA 253
KHNKGCHCKKSGCLKKYCEC+QANILCSENC+C+DC+NF S+ER+ALF GD A ++
Sbjct: 550 AAKHNKGCHCKKSGCLKKYCECYQANILCSENCKCVDCRNFITSDERRALFPGDPALGLS 609
Query: 254 YIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSP 313
++Q+ + A SG+ S P+S+KRK Q++ + + L + NH T+P
Sbjct: 610 FVQRPSGCA-------SGHMS-PLSRKRKTQDVEYIMPGLQTNKSNLVKVE--NHAATTP 659
Query: 314 SSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV---- 369
++ + K RSLLAD++ +++LC +LV++S++ V
Sbjct: 660 KMSIYTASAEHIPEPMV----KVVNRSLLADVLHTDVIQDLCKLLVIVSADVQHEVSGIP 715
Query: 370 AVQRNTTEKQAEDRAGTS--------VASSAQGRLQSHKE-PDVDRTVTDDCSSA--NQA 418
AV T D T A +A RLQ+ + V +D +S A
Sbjct: 716 AVDNGTNNASTVDPTNTEDCPSGKEPSAPAAPDRLQAESTGTTMAEYVRNDPTSTPPVAA 775
Query: 419 DKVGPD------DSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
+ P D + + +K RPMSPGTLALMCDE+D + A SSP
Sbjct: 776 VRTAPSPSNQALDVVAKREETRKERPMSPGTLALMCDEKDPLLTAPSSPAH--------- 826
Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLA 518
+++ QER++L++FRDCL R+I+ G + + S+ A
Sbjct: 827 --------ENQLHVVQERVILSEFRDCLRRIISVGNRRAYQLSTEA 864
>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
Length = 509
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 199/468 (42%), Positives = 258/468 (55%), Gaps = 46/468 (9%)
Query: 59 PPQPQPKPVVPPVPPPP---PPLPGQATQP--PVRVPKPESPKSRSRSNVEVKENTP-KK 112
PP P+ V P P P P QP PV P P + VE+K TP KK
Sbjct: 10 PPALVPEAVAEPRAAPQLQQQPKPAVQVQPSMPVLRPWPMEITLSMKPVVEMKSGTPAKK 69
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNCK+S+CLKLYCECFA+G YCDGCNC C N VENE R++A+ T RNPNAF+P
Sbjct: 70 KKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQP 129
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI + P K+ G + L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+CMDCK
Sbjct: 130 KIENGPIPPSVRKD-AGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCK 188
Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
N+EGSEE ++ GD++ + +QQAAN A+ GAIGSSGY PV +KR A
Sbjct: 189 NYEGSEELRSTIQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKR---------PA 239
Query: 293 KDPSIHRLG--------HFQQGNHIRTSPSSTLSSIPVARASSTAAV-GSSKFTYRSLLA 343
+DP RL FQ+ N++ S ++ R ST G SK YRS LA
Sbjct: 240 EDPHYQRLNVEGSMMQTQFQEANNVDVSQITS----STGRDGSTGNFQGRSKLVYRSPLA 295
Query: 344 DIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPD 403
+ I D+ +L + LV++ +AA+ + +AE + ++ + +K D
Sbjct: 296 NTIHVTDVNDLANHLVIVCRKAAERFTT---IVDNKAEMEIDREICTNTYQKFDENK--D 350
Query: 404 VDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDTIF-MAASSP 461
V + D+ DDS SD Q+ ARP SPGT AL+CDEQD F A SP
Sbjct: 351 VQKDALSQLGKVTNIDQQIVDDSILYCSDTQEDARPASPGTQALLCDEQDLTFGTAHRSP 410
Query: 462 NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEI 509
+ H Q ++E++A QE VL +FR+ L +I G+I
Sbjct: 411 IPIALH----------DQDISELHAAQENAVLKEFRNYLRLIIARGKI 448
>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
Length = 461
Score = 294 bits (752), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 186/447 (41%), Positives = 252/447 (56%), Gaps = 98/447 (21%)
Query: 75 PPPLPGQATQPPVR-------VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
PP PG+ + PVR +P+P E+KE KK KQCNCK++RCLKLY
Sbjct: 60 PPAQPGEPGRKPVRQLNFSSILPRPPV--------YELKEVAAKKCKQCNCKNTRCLKLY 111
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG YCDGCNC+NC NNVENEA R+EAVE TLERNPNAFRPKIA++ H RD+ ++
Sbjct: 112 CECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADD 171
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
+ LGKHNKGC+CKKSGCLKKYCECFQANILCS+NC+C+DCKN++ S+ER+AL+ D
Sbjct: 172 ----MRLGKHNKGCNCKKSGCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPD 227
Query: 248 HANNMAY-IQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK-DPSIHRLGHFQQ 305
++ + + + TG++ + P K+R +F G + K I+R Q
Sbjct: 228 YSQSTYHGGPPPYTTSATGSLACVSSSPMPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQ 287
Query: 306 GNHIRTS---PSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLS 362
+ S P+S+L S+K SLLA ++QP+ + +LC++LV+ S
Sbjct: 288 DAKLSGSMLPPNSSL---------------SAKTIQTSLLAGVVQPETVSDLCNLLVMAS 332
Query: 363 SEAAKSVAVQRNTTEKQAED--RAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
+EAAK++ +++ AE+ AG
Sbjct: 333 AEAAKAMIASGPVSKRPAENGQEAG----------------------------------- 357
Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
SG D ++ RPMSPGTLALMCDEQD +F A+ +G
Sbjct: 358 -----SGQGDEDIREKRPMSPGTLALMCDEQDPLFTPAT-----------------HGDY 395
Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFG 507
E+ AEQER++L +FRDCL ++IT G
Sbjct: 396 DKELAAEQERLILEEFRDCLRKIITVG 422
>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
Length = 460
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 186/447 (41%), Positives = 252/447 (56%), Gaps = 98/447 (21%)
Query: 75 PPPLPGQATQPPVR-------VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
PP PG+ + PVR +P+P E+KE KK KQCNCK++RCLKLY
Sbjct: 60 PPAQPGEPGRKPVRQLNFSSILPRPPV--------YELKEVAAKKCKQCNCKNTRCLKLY 111
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG YCDGCNC+NC NNVENEA R+EAVE TLERNPNAFRPKIA++ H RD+ ++
Sbjct: 112 CECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADD 171
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
+ LGKHNKGC+CKKSGCLKKYCECFQANILCS+NC+C+DCKN++ S+ER+AL+ D
Sbjct: 172 ----MRLGKHNKGCNCKKSGCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPD 227
Query: 248 HANNMAY-IQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK-DPSIHRLGHFQQ 305
++ + + + TG++ + P K+R +F G + K I+R Q
Sbjct: 228 YSQSTYHGGPPPYTTSATGSLACVSSSPMPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQ 287
Query: 306 GNHIRTS---PSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLS 362
+ S P+S+L S+K SLLA ++QP+ + +LC++LV+ S
Sbjct: 288 DAKLSGSMLPPNSSL---------------SAKTIQTSLLAGVVQPETVSDLCNLLVMAS 332
Query: 363 SEAAKSVAVQRNTTEKQAED--RAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
+EAAK++ +++ AE+ AG
Sbjct: 333 AEAAKAMIASGPVSKRPAENGQEAG----------------------------------- 357
Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
SG D ++ RPMSPGTLALMCDEQD +F A+ +G
Sbjct: 358 -----SGQGDEDIREKRPMSPGTLALMCDEQDPLFTPAT-----------------HGDY 395
Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFG 507
E+ AEQER++L +FRDCL ++IT G
Sbjct: 396 DKELAAEQERLILEEFRDCLRKIITVG 422
>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
Length = 473
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 183/471 (38%), Positives = 249/471 (52%), Gaps = 73/471 (15%)
Query: 68 VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL- 126
V P P P P P A PP V + +S S K++K CNCK+S+CLKL
Sbjct: 48 VQPQMPTPRPWP-VAFIPPKPVAEIKSTPS------------TKRKKHCNCKNSQCLKLA 94
Query: 127 -----------------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
YCECFA+G+YCDGC+C C N V NE+AR+EA+ +T +RNP A
Sbjct: 95 CSSILDLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKA 154
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F+PKI + + K++ G L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C
Sbjct: 155 FQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQ 214
Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
DCKNFEGSEE + + GD++++ IQ AAN A+ GAIGSSGY PV +KR ++ +
Sbjct: 215 DCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQ 274
Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
+ S + FQ+ NH+ SS ++S + S YRS LA+ I P
Sbjct: 275 KLNGEGSTMQT-QFQEANHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPT 330
Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
D L LV++ +AA + AE++ E +V+R +
Sbjct: 331 DATGLAKHLVIVCRKAADAFLT-------TAENKG----------------EMEVEREIH 367
Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
D A D+ D G + + +RP SPGT ALMCDEQD+ F G
Sbjct: 368 TDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF------------GT 415
Query: 470 NTSSQLP---YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
+ P + Q +E+ A+QE+ VLT FRD L +IT G+I +++ +
Sbjct: 416 DYRISFPVALHDQDTSELNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 466
>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
gi|223943031|gb|ACN25599.1| unknown [Zea mays]
gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
Length = 530
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 195/505 (38%), Positives = 275/505 (54%), Gaps = 49/505 (9%)
Query: 45 EHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS--RSN 102
+HPQ S PP + P P P P P + VP+ + P + + +S+
Sbjct: 8 QHPQQL-----SSPPVLAAASEIKAPAVPSQQPKPAA---PVMSVPR-QWPIAFNPLKSS 58
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
E K TPKK+K CNC++S+CLK+YCECF YCDGCNC NC N V NE AR EA+EA
Sbjct: 59 TEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAI 118
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
+RNP+AF+PKI + + K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ C
Sbjct: 119 RQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQANVFC 178
Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
S+NCRCM CKN EG+++ + DHA++ +I+QAAN A G++GSSGY +S+KR
Sbjct: 179 SKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGYIC-SLSRKRS 237
Query: 283 GQE---LFFGSTAKDPSIHRLGHFQQGNHIRTS---PSSTLSSIPVARASSTAAVGSSKF 336
++ + S P +QQGNH S P ST A++T +
Sbjct: 238 HEDTQCVRINSEGSMPET----QYQQGNHADVSMLPPCST--GFDGHNAANTKPYNPN-- 289
Query: 337 TYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQG-- 394
YRS LA+ I ++ +L LV++ AA ++A + T+ AE++ S + G
Sbjct: 290 -YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIA--DDETDLIAEEKGSQSNNGLSNGNC 346
Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
+ Q KE +++ CS D+ D++GS S +RP SP T ALMCDEQD
Sbjct: 347 KQQDFKEASSMVILSEGCS-----DQPNIDETGSHRSKTLDFSRPASPTTQALMCDEQDA 401
Query: 454 IF---------MAASSPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLN 501
F A + T G + S P Q ++E+ A QE +VLT R+ L
Sbjct: 402 TFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEINAAQENLVLTGLREYLR 461
Query: 502 RLITFGEIKEMKYSSLARTEVGSQK 526
+IT G+I E K S A E+ +Q+
Sbjct: 462 VIITRGKINEHKSSLEAAMELDAQQ 486
>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
Length = 356
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 139/215 (64%), Positives = 166/215 (77%), Gaps = 3/215 (1%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK R+R E K++TP K+K CNC++S+CLKLYCECFASG YCDGCNC NC NN +
Sbjct: 128 KPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPD 186
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
NE ARREAVEA L+RNP+AFRPKI SSPH R+N E ++ ++GKHNKGCHCKKSGCLK
Sbjct: 187 NEVARREAVEAILDRNPDAFRPKIGSSPHMHRNN-EVPSDLPLVGKHNKGCHCKKSGCLK 245
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQANILCSENC+CMDCKNFEGSEER+ LF GDH N + +QQ NAA+ GAIG++
Sbjct: 246 KYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQVTNAAVNGAIGAT 304
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
G+ SP S+KRK + K+ H+ H Q
Sbjct: 305 GFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQ 339
>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
Length = 485
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 189/488 (38%), Positives = 266/488 (54%), Gaps = 49/488 (10%)
Query: 45 EHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS--RSN 102
+HPQ S PP + P P P P P + VP+ + P + + +S+
Sbjct: 8 QHPQQL-----SSPPVLAAASEIKAPAVPSQQPKPAA---PVMSVPR-QWPIAFNPLKSS 58
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
E K TPKK+K CNC++S+CLK+YCECF YCDGCNC NC N V NE AR EA+EA
Sbjct: 59 TEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAI 118
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
+RNP+AF+PKI + + K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ C
Sbjct: 119 RQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQANVFC 178
Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
S+NCRCM CKN EG+++ + DHA++ +I+QAAN A G++GSSGY +S+KR
Sbjct: 179 SKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGYIC-SLSRKRS 237
Query: 283 GQE---LFFGSTAKDPSIHRLGHFQQGNHIRTS---PSSTLSSIPVARASSTAAVGSSKF 336
++ + S P +QQGNH S P ST A++T +
Sbjct: 238 HEDTQCVRINSEGSMPET----QYQQGNHADVSMLPPCST--GFDGHNAANTKPYNPN-- 289
Query: 337 TYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQG-- 394
YRS LA+ I ++ +L LV++ AA ++A + T+ AE++ S + G
Sbjct: 290 -YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIA--DDETDLIAEEKGSQSNNGLSNGNC 346
Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
+ Q KE +++ CS D+ D++GS S +RP SP T ALMCDEQD
Sbjct: 347 KQQDFKEASSMVILSEGCS-----DQPNIDETGSHRSKTLDFSRPASPTTQALMCDEQDA 401
Query: 454 IF---------MAASSPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLN 501
F A + T G + S P Q ++E+ A QE +VLT R+ L
Sbjct: 402 TFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEINAAQENLVLTGLREYLR 461
Query: 502 RLITFGEI 509
+IT G+I
Sbjct: 462 VIITRGKI 469
>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
sativa Japonica Group]
Length = 508
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 173/421 (41%), Positives = 234/421 (55%), Gaps = 49/421 (11%)
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
VEVK +PKK+K CNCK+S YCECFA+ +YCDGC+C C N +ENE R+EA+E
Sbjct: 112 VEVKNASPKKRKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETI 166
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
L RNP AF+PKI +SP+ K+ + + + KHNKGCHC+KS CLKKYCECFQANILC
Sbjct: 167 LLRNPLAFQPKIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILC 226
Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
S+NCRC DCKNFEGSEER+AL +A++ +IQ+AAN A+ GA GS GY + PV +++K
Sbjct: 227 SKNCRCQDCKNFEGSEERKALVQMKNASDRNHIQEAANFALNGATGSLGYKNSPV-RRKK 285
Query: 283 GQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLL 342
QE G I FQ +H S ++ + + SS+ YRS L
Sbjct: 286 YQENSLGE-----QILSEAQFQPTDHADVSQLASSCTGFGGDIAINYQSKSSEMIYRSPL 340
Query: 343 ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEP 402
A+ I ++ +L +V +A ++ +AG V A+ + Q++ E
Sbjct: 341 ANTIPLIEVNDLVKNALVACRKAPEAFLT-----------KAGNKVEMQAEKQYQTNDEI 389
Query: 403 DVDRTVTDDCSSANQAD---KVGPDDSGSDGSDGQKA------RPMSPGTLALMCDEQDT 453
+ D+T + A+ D K D +G++ A RP SPGT ALMCDE DT
Sbjct: 390 NNDKTKEQNLKEASLKDIQNKACIDRQNINGTEPHLANSFKDSRPASPGTQALMCDEMDT 449
Query: 454 IFMAASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
F N + P+ Q ++E+ A+QERIVLT RD L LIT G
Sbjct: 450 TFG-------------NEDYRSPFVVPSRDQDISELNADQERIVLTGLRDYLRVLITRGN 496
Query: 509 I 509
I
Sbjct: 497 I 497
>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
gi|223944643|gb|ACN26405.1| unknown [Zea mays]
gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
Length = 499
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 188/457 (41%), Positives = 259/457 (56%), Gaps = 31/457 (6%)
Query: 86 PVRVPKPESPKSRSRSNVEVKENTP-KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 144
PV P P + VE+K TP KK+K CNCK+S+CLKLYCECFA+G YCDGCNC
Sbjct: 44 PVLRPWPMEITLSMKPVVEMKSGTPAKKKKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQ 103
Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
C N VENE R+EA+ T RNPNAF+PKI + P + K+ G + KHNKGCHCK
Sbjct: 104 CGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPSSVWKD-AGALPSHPKHNKGCHCK 162
Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
KSGCLKKYCECFQANILCS+NC+CMDCKN+EGSEE ++ GD++ + IQQAAN A+
Sbjct: 163 KSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRSTTQGDNSCDRNNIQQAANVALN 222
Query: 265 GAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVAR 324
GAIGSSGY PV +KR ++ + + S+ + F++ N++ S ++S
Sbjct: 223 GAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT-QFRETNNVDV---SQMTSSTGRD 278
Query: 325 ASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRA 384
S+ G+SK YRS LA+ I D+ +L + LV+ +AA+ + + E
Sbjct: 279 GSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVIACRKAAERFTT---IVDNKVEMEI 335
Query: 385 GTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQ-KARPMSPGT 443
+ ++ + +K+ V + D+ DDS SD Q ARP SPGT
Sbjct: 336 DKEICTNTDQKFDENKK-GVQKAALSQLGKVTDIDQQIVDDSRLHCSDTQDDARPASPGT 394
Query: 444 LALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTEVYAEQERIVLTKFRDCL 500
AL+CDEQD F G S +P + Q ++E++A QE VL + R+ L
Sbjct: 395 QALLCDEQDLTF------------GTVHRSSIPVAMHDQVISELHAAQESAVLKELRNNL 442
Query: 501 NRLITFGEIK-EMKYSSLARTEVGSQKDALSNGTASS 536
+I G++ E K +S R E+ DAL + +S+
Sbjct: 443 RLIIARGKLNAEGKSASGTRMEL----DALRHHESST 475
>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
Length = 437
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 160/394 (40%), Positives = 220/394 (55%), Gaps = 42/394 (10%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFA+G+YCDGC+C C N V NE+AR+EA+ +T +RNP AF+PKI + + K+
Sbjct: 76 YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
+ G L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE + + G
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQG 195
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQG 306
D++++ IQ AAN A+ GAIGSSGY PV +KR ++ + + S + FQ+
Sbjct: 196 DNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEA 254
Query: 307 NHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAA 366
NH+ SS ++S + S YRS LA+ I P D L LV++ +AA
Sbjct: 255 NHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAA 311
Query: 367 KSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDS 426
+ AE++ E +V+R + D A D+ D
Sbjct: 312 DAFLT-------TAENKG----------------EMEVEREIHTDSDGATNMDQQNGGDF 348
Query: 427 GSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTE 483
G + + +RP SPGT ALMCDEQD+ F G + P + Q +E
Sbjct: 349 GPCCNSLEDSRPASPGTQALMCDEQDSTF------------GTDYRISFPVALHDQDTSE 396
Query: 484 VYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
+ A+QE+ VLT FRD L +IT G+I +++ +
Sbjct: 397 LNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 430
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 9/77 (11%)
Query: 88 RVPKPESPKSRSRSN-VEVKEN-------TPKKQKQCNCKHSRCLKLYCECFASGIYCD- 138
R PK PK + SN + ++++ PK K C+CK S CLK YCECF + I C
Sbjct: 114 RNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSK 173
Query: 139 GCNCVNCHNNVENEAAR 155
C C +C N +E R
Sbjct: 174 NCKCQDCKNFEGSEELR 190
>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
Length = 497
Score = 275 bits (702), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 181/471 (38%), Positives = 255/471 (54%), Gaps = 48/471 (10%)
Query: 63 QPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSR 122
QPKP P +P P Q P+ P P + EVK TPKK+K CNC++S+
Sbjct: 32 QPKPAAPVMPVP--------RQWPMAF-NPLKPST------EVKSVTPKKKKHCNCRNSK 76
Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
CLK+YCECF YCDGCNC NC N V NE AR EA+EA +RNP+AF+PKI + P+
Sbjct: 77 CLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIENGPNTLN 136
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ CS+NCRCM CKN EG+E+ +
Sbjct: 137 VRKDNSGAVPVVAKHHKGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNEDTET 196
Query: 243 LFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSI-HRLG 301
DHA++ ++IQQAAN A G +GSSGY +S+KR ++ + S+
Sbjct: 197 SSQRDHASDRSHIQQAANVAFNGTVGSSGYICS-LSRKRSHEDALGVRIKSEGSMPETQT 255
Query: 302 HFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVL 361
+QQGNH SP + S+ + A S+ YRS LA+ I +++ +L LV++
Sbjct: 256 QYQQGNHADVSPLAPCST--GFDRHNAANSKSNNPIYRSPLANTIHLREVNDLVKHLVMV 313
Query: 362 SSEAAKSVAVQR---------NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDC 412
A +A ++ N KQ + + +S+ ++G SH +P+++
Sbjct: 314 CRMAEAKIAEEKGFQSNNGLSNGNCKQQDFKETSSIDILSKG--CSH-QPNINEMGFHWS 370
Query: 413 SSANQADKVGPDDSGSDGSDGQKA----RPMSPGTLALMCDEQDTIF---------MAAS 459
++N + P + S D Q P SP T ALMC+EQ+T F A
Sbjct: 371 ETSNVSRPASPT-TQSLMCDEQNTTFGNEPASPTTQALMCEEQNTTFGNEPASPTTQALM 429
Query: 460 SPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLNRLITFG 507
+ T G + S P GQ ++E+ A QE +VLT R L +I G
Sbjct: 430 CEEQNTTFGNDYKSSFPLVSRGQDISEINAVQENLVLTGLRQYLCTIIRRG 480
>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
Length = 154
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 129/154 (83%), Positives = 136/154 (88%), Gaps = 1/154 (0%)
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
EETGE L+ KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFH
Sbjct: 1 EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60
Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
GDH NNMAYIQQAANAAITGAIGSSGYAS PVS+KRKGQELFFG T KDPS HRLGHFQQ
Sbjct: 61 GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120
Query: 306 GNHIR-TSPSSTLSSIPVARASSTAAVGSSKFTY 338
+HIR +PSS L S PVARA +T +GSSK TY
Sbjct: 121 ASHIRPAAPSSPLPSNPVARAGNTTTLGSSKITY 154
Score = 45.4 bits (106), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
K K C+CK S CLK YCECF + I C + C C++C N
Sbjct: 11 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 48
>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
distachyon]
Length = 540
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 184/469 (39%), Positives = 252/469 (53%), Gaps = 47/469 (10%)
Query: 66 PVVP---PVPPPPPPLPGQATQPPVRVPKPESPKSR---SRSNVEVKENTPKKQKQCNCK 119
PV+P P+ PLPG + SR S ++ E TP++ K C CK
Sbjct: 96 PVLPQDLPLSAAKLPLPGVQKVTAGILCSEMMLNSRYPWSETSSESINGTPRR-KNCKCK 154
Query: 120 HSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
+S+CLKLYC+CFASG YC D CNC NC N+V +E AR++A+ A +ERNP AF PKI + P
Sbjct: 155 NSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIP 214
Query: 179 -HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
H ++ + E +GKH KGC CK+S CLKKYCECFQ+N+LCSENC+C DCKN+E S
Sbjct: 215 RHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQSNVLCSENCKCTDCKNYESS 274
Query: 238 EERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSI 297
E+ + + + + Y+ N A+ G IG S P ++K G L S +D I
Sbjct: 275 EDMKEMRRMTQQHGV-YVHHVQNLALKGMIGPSAVL-PRAAEKFSG--LSVASLGRDQPI 330
Query: 298 HRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSV 357
+ N S L+S+P S+ +G TYR+LLADIIQ +D+ LC V
Sbjct: 331 N-------NNDSSQVLSPLLTSVPTEDTESSVRLGRHGVTYRTLLADIIQIEDVNVLCKV 383
Query: 358 LVVLSSEAAKSV---AVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS 414
LV++S +AA + + NT K+ DRA + + S+ R K+ D CS
Sbjct: 384 LVLVSRQAAGAFLDSGFKENTNIKKL-DRAESHIPSTNHDRKAVQKQHD-----EQLCSL 437
Query: 415 ANQADKVGPDDSG------SDGSDGQK--ARPMSPGTLALMCDEQDTIFMAASSPNRLTG 466
N V P G SD SD K R +SPGT ALMC+EQD +F+ SS + G
Sbjct: 438 GNSLIAV-PLSEGRAGMPRSDPSDTWKDNRRSVSPGTQALMCNEQD-MFVHTSS---VAG 492
Query: 467 HGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYS 515
+T+ + ++E+Y EQE+ VLT D L L G ++E K S
Sbjct: 493 AILSTTK-----ENLSEIYIEQEKRVLTNLLDFLRELAKSGRLQEEKRS 536
>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
Length = 612
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 174/493 (35%), Positives = 263/493 (53%), Gaps = 27/493 (5%)
Query: 76 PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
P LP AT V + S K S E K+ T ++ C+CK+S+CLKLYCECF G
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193
Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
YC GCNC NC NNV +E R++A+ LERNP AF PK+ +S ++K G++ +G
Sbjct: 194 YCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNSTAHNCESKAAEGDI--VG 251
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
KH KGC CK++ LKKYCECF+A++ CSENCRC CKNF+ +E+R + + HA+ ++
Sbjct: 252 KHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAH---HV 308
Query: 256 QQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS-IHRLGHFQQGNHIRTSPS 314
Q A++ + G +A+ S + S + S + L H Q + +
Sbjct: 309 QNPASSGMVGQSVIIFHAAENDSSLSLAASVSDHSINNNTSHVILLQHTFQISPFQALSC 368
Query: 315 STLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV--AVQ 372
S SS+ A S + TYR LADIIQ +D+ ELC++L+++S +A ++ +
Sbjct: 369 SLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLESGM 428
Query: 373 RNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSD 432
+ T+++ +R + ++S+ R KEP + S+ + + +D D
Sbjct: 429 KENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRADPFD 488
Query: 433 GQKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQER 490
QK RP+SPGT ALMCDEQD +F + +P+ ++ + + +++ EQE+
Sbjct: 489 IQKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIFKEQEK 539
Query: 491 IVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQP---ATV 547
VLT FRD L +L T G+++E+K SLA T Q A +G SS T T+
Sbjct: 540 RVLTNFRDYLCKLATCGKLQELKL-SLASTNCSEQAFA-DHGNNSSITRVAKVTRIRQTI 597
Query: 548 SNGVSKAYGPSAL 560
SKA +AL
Sbjct: 598 PQFSSKAASTTAL 610
>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
Length = 586
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 172/492 (34%), Positives = 257/492 (52%), Gaps = 51/492 (10%)
Query: 76 PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
P LP AT V + S K S E K+ T ++ C+CK+S+CLKLYCECF G
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193
Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
YC GCNC NC NNV +E AR++A+ LERNP AF PK+++S ++K G++ +G
Sbjct: 194 YCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNSTAHNCESKAAEGDI--VG 251
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + + HA+++
Sbjct: 252 KHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHHV--- 308
Query: 256 QQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSS 315
N A +G +G S + F + D S S
Sbjct: 309 ---QNPASSGMVGQSV--------------IIFHAAENDSS--------LSLAASALSCS 343
Query: 316 TLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV--AVQR 373
SS+ A S + TYR LADIIQ +D+ ELC++L+++S +A ++ + +
Sbjct: 344 LTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLESGMK 403
Query: 374 NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDG 433
T+++ +R + ++S+ R KEP + S+ + + +D D
Sbjct: 404 ENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRADPFDI 463
Query: 434 QKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERI 491
QK RP+SPGT ALMCDEQD +F + +P+ ++ + + +++ EQE+
Sbjct: 464 QKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIFKEQEKR 514
Query: 492 VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQP---ATVS 548
VLT FRD L +L T G+++E+K SLA T Q A +G SS T T+
Sbjct: 515 VLTNFRDYLCKLATCGKLQELKL-SLASTNCSEQAFA-DHGNNSSITRVAKVTRIRQTIP 572
Query: 549 NGVSKAYGPSAL 560
SKA +AL
Sbjct: 573 QFSSKAASTTAL 584
>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
Length = 426
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 155/453 (34%), Positives = 215/453 (47%), Gaps = 84/453 (18%)
Query: 68 VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
V P P P P P A PP V + +S S K++K CNCK+S+CLKL
Sbjct: 48 VQPQMPTPRPWP-VAFIPPKPVAEIKSTPS------------TKRKKHCNCKNSQCLKL- 93
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
+EA+ + KI + + K++
Sbjct: 94 ----------------------------QEAITVRSNGIQKPSKLKIENGSNALNLRKDD 125
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
G L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE + + GD
Sbjct: 126 AGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGD 185
Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
++++ IQ AAN A+ GAIGSSGY PV +KR ++ + + S + FQ+ N
Sbjct: 186 NSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEAN 244
Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
H+ SS ++S + S YRS LA+ I P D L LV++ +AA
Sbjct: 245 HV---DSSEITSSTGLEGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAAD 301
Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
+ AE++ E +V+R + D A D+ D G
Sbjct: 302 AFLT-------TAENKG----------------EMEVEREIHTDSHGATNMDQQNGGDFG 338
Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTEV 484
+ + +RP SPGT ALMCDEQ + F G + P + Q +E+
Sbjct: 339 PCCNSLEDSRPASPGTQALMCDEQGSTF------------GTDYRISFPVALHDQDTSEL 386
Query: 485 YAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
A+QE+ VLT FRD L +IT G+I +++ +
Sbjct: 387 NAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 419
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 52/131 (39%), Gaps = 35/131 (26%)
Query: 59 PPQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------------------VPKPE 93
P P P+P P PP P+ + P + + KP
Sbjct: 50 PQMPTPRP-WPVAFIPPKPVAEIKSTPSTKRKKHCNCKNSQCLKLQEAITVRSNGIQKPS 108
Query: 94 SPKSRSRSN-VEVKEN-------TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVN 144
K + SN + ++++ PK K C+CK S CLK YCECF + I C C C +
Sbjct: 109 KLKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQD 168
Query: 145 CHNNVENEAAR 155
C N +E R
Sbjct: 169 CKNFEGSEELR 179
>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
Length = 553
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 194/354 (54%), Gaps = 44/354 (12%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-----------------RQ 241
K C+CK S CLK YCECF + + C + C C +C N +E R
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218
Query: 242 ALFHGDHAN--NMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHR 299
+ HAN NM QAANAA+ GAIG++ ++SP S+KRK + +AK+ HR
Sbjct: 219 KIGSSPHANRNNM----QAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHR 274
Query: 300 LGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLV 359
H Q + + S +P+ + +G K TYR LLADI+QP D+KELC +LV
Sbjct: 275 TNHVPQVFYQKNGMPSD-GPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLV 333
Query: 360 VLSSEAAKSVA---VQRNTTEKQ------AEDRAGTSVASSAQGRLQSHKEPDVDRTVTD 410
V+S AAK+ A Q EK+ E G S+AS+ R +++ +P+ + D
Sbjct: 334 VVSGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASAD 393
Query: 411 DCSSA-NQADKVGPDDSGSDGSDGQKA-RPMSPGTLALMCDEQDTIFMAASSPNRLTGHG 468
D S+ +A K ++S D +D QK+ RPMSPGTLALMCDEQDT+FM T
Sbjct: 394 DPSTGGTRAGKASLEESKHDCNDDQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 445
Query: 469 CNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEV 522
P Q +E+YAEQER VLT+FRDCL++L+TFG +KE K+S ++EV
Sbjct: 446 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 499
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 68/96 (70%), Positives = 78/96 (81%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
KPESPK E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 137 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 196
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
NE ARREA++ATLERNP+AFRPKI SSPH R+N +
Sbjct: 197 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMQ 232
>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
Length = 702
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 92/131 (70%), Positives = 105/131 (80%), Gaps = 9/131 (6%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNCK+SRCLKLYCECFASG YCDGCNC+NC+NN E+EA R+ AVEA LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KIA DN+ G +HNKGC+CKKSGCLKKYCECFQA I+CSE C+C+DCK
Sbjct: 213 KIAPG----ADNRRAGG-----ARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCK 263
Query: 233 NFEGSEERQAL 243
N+EGSE R AL
Sbjct: 264 NYEGSEARDAL 274
>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
Length = 274
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 96/164 (58%), Positives = 122/164 (74%), Gaps = 1/164 (0%)
Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
YCECFA+ +YCDGC+C C N +ENE R+EA+E L RNP AF+PKI +SP+ K
Sbjct: 87 WYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRK 146
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ + + + KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL
Sbjct: 147 DNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQ 206
Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
+A++ +IQ+AAN A++GA GS GY S PV +++K QE G
Sbjct: 207 MKNASDRNHIQEAANFALSGATGSLGYKSSPV-RRKKYQENSLG 249
Score = 45.8 bits (107), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREA 158
+ N E PK K C+C+ S CLK YCECF + I C C C +C N E R+
Sbjct: 146 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDC-KNFEGSEERKAL 204
Query: 159 VE 160
V+
Sbjct: 205 VQ 206
>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
Length = 321
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/246 (45%), Positives = 139/246 (56%), Gaps = 27/246 (10%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
N +K CNC++SRCLKLYCECFASG YC CNCV C NNVEN R+EAVEATLERN
Sbjct: 15 NIQTDRKTCNCRNSRCLKLYCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERN 74
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
PNAFRPK+ + +G ++ KHNKGCHCKKSGCLKKYCECFQA+I CS+ C
Sbjct: 75 PNAFRPKVLPTLNGGEGDE----------KHNKGCHCKKSGCLKKYCECFQASIFCSDMC 124
Query: 227 RCMDCKNFEGSEERQALFH-------GDHANNMAY--IQQAANAAITGAIGSSGYA---S 274
RC+DCKNFEGS+ +A+ G A+ + +Q N GA+ Y S
Sbjct: 125 RCLDCKNFEGSDAHKAIASQQEFMPAGSDASLLVAPGLQMDVNKTNGGAMKQEAYVPQKS 184
Query: 275 PPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSS 334
P + G F T K P +H L + N + + LS+ VAR + V
Sbjct: 185 PMGFAAKGGASPHFLKT-KKPILHGL---IKPNAVDQLGEALLSAAEVARKRAHDQVAYD 240
Query: 335 KFTYRS 340
R+
Sbjct: 241 ALKRRT 246
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
E K K C+CK S CLK YCECF + I+C D C C++C N
Sbjct: 90 EGDEKHNKGCHCKKSGCLKKYCECFQASIFCSDMCRCLDCKN 131
>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 255
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 85/130 (65%), Positives = 101/130 (77%), Gaps = 6/130 (4%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K++K CNC++SRCLKLYCECFASG++CD CNCVNC NN+ + A R AVE+TLERNPNAF
Sbjct: 4 KRKKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAF 63
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
RPKIA G + + E +HN+GCHCKKS CLKKYCECFQANI CS+ CRCM+
Sbjct: 64 RPKIAPGV-GPLEKRREGA-----SRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCME 117
Query: 231 CKNFEGSEER 240
CKNF GS +R
Sbjct: 118 CKNFLGSSQR 127
Score = 46.6 bits (109), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+E + + C+CK S CLK YCECF + I+C D C C+ C N
Sbjct: 78 REGASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCMECKN 120
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
M K K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 1 MPSKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40
>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
Length = 695
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 77/151 (50%), Positives = 101/151 (66%), Gaps = 10/151 (6%)
Query: 93 ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
ES SN+E N + +K CNC S+CLKLYC+CFA+G +C CNC+NC NN+ENE
Sbjct: 464 ESITMTPESNLE--SNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 521
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
R+ A++ LERNPNAFRPKI + D+ + KH KGC+CK+SGCLK Y
Sbjct: 522 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 573
Query: 213 CECFQANILCSENCRCMDCKNFEGSEERQAL 243
CEC++A I CS NC+CM C+N E S E+++L
Sbjct: 574 CECYEAKIACSNNCKCMGCRNIEDSMEKKSL 604
>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
Length = 752
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 77/151 (50%), Positives = 101/151 (66%), Gaps = 10/151 (6%)
Query: 93 ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
ES SN+E N + +K CNC S+CLKLYC+CFA+G +C CNC+NC NN+ENE
Sbjct: 521 ESITMTPESNLE--SNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 578
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
R+ A++ LERNPNAFRPKI + D+ + KH KGC+CK+SGCLK Y
Sbjct: 579 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 630
Query: 213 CECFQANILCSENCRCMDCKNFEGSEERQAL 243
CEC++A I CS NC+CM C+N E S E+++L
Sbjct: 631 CECYEAKIACSNNCKCMGCRNIEDSMEKKSL 661
>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
tauri]
Length = 766
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 74/127 (58%), Positives = 90/127 (70%), Gaps = 11/127 (8%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARREAVEATLERNPNAFR 171
K CNCK+SRCLKLYCECFA+G C C C NC N VE+ R A+E L+RNPNAFR
Sbjct: 16 KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFR 75
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKIA+ +RD + +HN+GCHCK+SGCLKKYCECFQA I C+E CRC+ C
Sbjct: 76 PKIAAVASPSRD---------VALRHNRGCHCKRSGCLKKYCECFQAAIYCAETCRCVSC 126
Query: 232 KNFEGSE 238
+N+EGS+
Sbjct: 127 ENYEGSD 133
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 36/59 (61%), Gaps = 7/59 (11%)
Query: 90 PKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
PK + S SR +V ++ N + C+CK S CLK YCECF + IYC + C CV+C N
Sbjct: 76 PKIAAVASPSR-DVALRHN-----RGCHCKRSGCLKKYCECFQAAIYCAETCRCVSCEN 128
>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 421
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 140/455 (30%), Positives = 202/455 (44%), Gaps = 106/455 (23%)
Query: 84 QPPVRVPKPESPKSRSRSNVEVKENTPKK----QKQCNCKHSRCLKLYCECFASGIYCDG 139
+ PV+ P P SR S E K+ T ++ +K+C CK S+CLKLYC+CFASG+ C
Sbjct: 32 ESPVKEIPPFPPISREHS--ETKDTTDQEGITWRKRCRCKQSKCLKLYCDCFASGVLCTD 89
Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
C+CV+CHNN +N AR AV L RNPNAF K+ SS D + + G ++
Sbjct: 90 CDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSS---INDKQCKAAPDTRPGLLSR 146
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKN-------------------FEGSEER 240
GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN +EG +
Sbjct: 147 GCKCKRTKCLKKYCECFQANVLCSDNCKCINCKNVSEAFQPSVFAWGLNSRKLYEGFDNP 206
Query: 241 QALFHGD------HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
+ F D +++ + + A G S + SP V ++R+ QEL ++
Sbjct: 207 EKKFVCDLGIISPSEDSVGFNSHNTAGCMNYAPGFSAHNSPQVYRRRRHQELPEWNSCPA 266
Query: 295 PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSL-----LADIIQPQ 349
P N I+ + S +SS P K YR A + P
Sbjct: 267 PLFS-----IPDNSIQNALGSPMSSSP-------------KLPYRKKKPPLGYASTVVP- 307
Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
DL ++CS+L+ S A + AED Q R+ + VD T
Sbjct: 308 DLGDICSLLLAASETATAN-----------AED----------QNRICIKPDDKVDNVYT 346
Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
+ S + +SG+ + Q R + I + + N C
Sbjct: 347 EVLSES---------ESGNVEEEIQSFRRL--------------IELIDAQYNGEEHSKC 383
Query: 470 NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLI 504
T + + T++Y EQER VL FRDCL I
Sbjct: 384 KTETSV----HETDIYMEQERAVLETFRDCLQTFI 414
>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 151
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 81/145 (55%), Positives = 101/145 (69%), Gaps = 10/145 (6%)
Query: 94 SPKSRSRSNVEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNV 149
SP+S + + +TP+ ++K+CNCK+S+CLKLYCECFASG YCD CNC C NN
Sbjct: 12 SPRS-----IAARASTPRDSASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNND 66
Query: 150 ENEAARREAVEATLERNPNAFRPKIAS-SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
E+ R+ A+E TLERNPNAFRPKI + + +GT D +HN+GCHCK+SGC
Sbjct: 67 EHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGC 126
Query: 209 LKKYCECFQANILCSENCRCMDCKN 233
LKKYCECFQA I C E CRC +CKN
Sbjct: 127 LKKYCECFQAAIYCVERCRCAECKN 151
>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
CCMP2712]
Length = 138
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/129 (60%), Positives = 98/129 (75%), Gaps = 4/129 (3%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNP 167
TP K+K CNCK+S+CLKLYCECFASG+YC+ CNC +CHNNV E R+ AV +TLERNP
Sbjct: 10 TPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNP 69
Query: 168 NAFRPKIASS-PHGTRDNKEETGEVLML--GKHNKGCHCKKSGCLKKYCECFQANILCSE 224
+AFRPKI + G D+ + LM +H KGCHCKKSGCLKKYCECFQA ++CS+
Sbjct: 70 HAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAVMCSD 129
Query: 225 NCRCMDCKN 233
C+C++C+N
Sbjct: 130 TCKCIECRN 138
>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 546
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 71/124 (57%), Positives = 94/124 (75%), Gaps = 12/124 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K+CNCK S+C+KLYCEC+A+G+YC+GC+CV+C NN ++E R +A++ TL +NP AF PK
Sbjct: 69 KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128
Query: 174 IASSPHGTRDNKEETG----EVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
I+ K+ET + L LGKHNKGC CK+SGC+KKYCECFQA + CSENC+C
Sbjct: 129 ISQ--------KQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCT 180
Query: 230 DCKN 233
DC+N
Sbjct: 181 DCQN 184
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
G +K C+CK+S C+K YCEC+ A + C+ C C+ CKN + SE
Sbjct: 65 GYTSKKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSE 107
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 85 PPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCV 143
P +PK S K + S V N K K C CK S C+K YCECF +G+ C + C C
Sbjct: 122 PTAFLPKI-SQKQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCT 180
Query: 144 NCHN 147
+C N
Sbjct: 181 DCQN 184
>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
Length = 1133
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 68/109 (62%), Positives = 79/109 (72%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFASG YC+ CNCV C NN E+EA R+ AVEA LERNPNAFRPKI S+
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
G+H KGC+CKKS CLKKYCECFQA I CS+NC+C++CKN+E
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYE 252
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
+ K CNCK S CLK YCECF +GIYC D C CV C N E
Sbjct: 213 RHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYEE 253
>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
Length = 116
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 75/116 (64%), Positives = 84/116 (72%), Gaps = 9/116 (7%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFASG+YCDGCNC+NC NN ENE R+EAVE LERNPNAFRPKI S D
Sbjct: 1 YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60
Query: 187 ETGEVLM---------LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
++ + +HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61 CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ K C CK S CLK YCECF + I C + C CV+C N
Sbjct: 77 VARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116
>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
Length = 596
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 102/158 (64%), Gaps = 26/158 (16%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNCK+S+CLKLYCECFASG YC CNC C NN+ENE+ R+E + LERNPNAFRP
Sbjct: 33 KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92
Query: 173 KIAS----SPHGTRDNKEETG----------------------EVLMLGKHNKGCHCKKS 206
KIAS SP T+ TG + + GKH KGC CKKS
Sbjct: 93 KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152
Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
GCLKKYCECFQA I CS+NC+C DCKNF+GS +R+ L
Sbjct: 153 GCLKKYCECFQAGIFCSDNCKCCDCKNFDGSLDRRILL 190
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 10/109 (9%)
Query: 40 SVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS 99
++IL +P + S PP PQ V V P P Q + + P
Sbjct: 80 AIILERNPNAFRPKIASLPPQSPQITQKVVGVTGGPINNPSQIN---IGLSTP------I 130
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ V K K C CK S CLK YCECF +GI+C D C C +C N
Sbjct: 131 MTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKN 179
>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 72/126 (57%), Positives = 89/126 (70%), Gaps = 13/126 (10%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
+K CNCK+SRCLKLYCECFASG YCD C CV+C NN N+ AR++AV+ATLERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI S+ +H+KGCHC+KS C K+YCEC+QA I C+ +C+C+DC
Sbjct: 197 PKINSA------------ATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDC 244
Query: 232 KNFEGS 237
KN S
Sbjct: 245 KNHSHS 250
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-RQALFHGDHANNMAYIQQ 257
K C+CK S CLK YCECF + C +C+C+ C N E ++E RQ N A +
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFRP 197
Query: 258 AANAAIT 264
N+A T
Sbjct: 198 KINSAAT 204
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 98 RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNN 148
R + N E+ + K C+C+ S C K YCEC+ +GI C C CV+C N+
Sbjct: 196 RPKINSAATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDCKNH 247
>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
Length = 750
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 23/180 (12%)
Query: 74 PPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFAS 133
P P G+ + V PKS S N K +K CNC S+CLKLYC+CFA+
Sbjct: 506 PTVPKIGKPSTSTALVLSETKPKSTSDVN--------KTKKPCNCTRSQCLKLYCDCFAN 557
Query: 134 GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLM 193
G +C CNC +C+NN+E+E R +A+ + L+RNPNAFRPKI K G+
Sbjct: 558 GEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKI----------KSFVGQA-- 605
Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMA 253
+H KGC+CK+SGCLK YCECF+A I CS+NC+C+ C+N E ++ F GDH +++
Sbjct: 606 ERQHTKGCNCKRSGCLKNYCECFEARIACSQNCKCIGCRNME--DDSHTSF-GDHPGSLS 662
>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
Length = 916
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 78/153 (50%), Positives = 100/153 (65%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+E+E R++A+ A LERN
Sbjct: 657 EDNSRPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERN 716
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
PNAFRPKI G + +H KGC+CK+SGCLK YCEC++A ILCS C
Sbjct: 717 PNAFRPKIGKGKEGDHER-----------RHTKGCNCKRSGCLKNYCECYEAKILCSSMC 765
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L A + QQAA
Sbjct: 766 KCVGCKNFEDSSERKTLMQLADAAEVRVQQQAA 798
>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
Length = 757
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 66/124 (53%), Positives = 89/124 (71%), Gaps = 10/124 (8%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
K++ CNC S+CLKLYC+CFA+G +C CNC +C+NN++NE R++A+ ATLERNP+AF+
Sbjct: 488 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 547
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI G E+ L H KGC+CK+SGCLK YCEC++A I CS NC+C+ C
Sbjct: 548 PKI-----GAVSADEDA-----LRLHTKGCNCKRSGCLKNYCECYEAKIACSANCKCIGC 597
Query: 232 KNFE 235
+N E
Sbjct: 598 RNTE 601
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALFHGDHANN 251
C+C KS CLK YC+CF C NC C DC N + EERQ N
Sbjct: 492 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEERQKAIRATLERN 542
>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
Length = 605
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 100/163 (61%), Gaps = 18/163 (11%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K +K CNC S CLKLYCECFA+G +CD CNC+NCHNN+E + R +A+++ LERNP AF
Sbjct: 375 KIRKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAF 434
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
RPKI R H KGC+CK+SGCLK YCEC++A I C+ C+C+
Sbjct: 435 RPKIGRGRDANRT-------------HQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIG 481
Query: 231 CKNFE----GSEERQALFHGDHANNMAYIQQAANAA-ITGAIG 268
CKN E +E+ L + A + QQAA+ + I+ IG
Sbjct: 482 CKNLEEDSVALKEQHPLMNLADAAAVRCQQQAASKSRISTHIG 524
>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
Length = 587
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 70/159 (44%), Positives = 100/159 (62%), Gaps = 13/159 (8%)
Query: 77 PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
P+P A +P + KP S + S V+ + N ++K CNC S+CLKLYC+CFA+G +
Sbjct: 330 PVPAPANKPNFEILKPPSSAA-STPAVDAQLNLASRRKHCNCSKSQCLKLYCDCFANGEF 388
Query: 137 CDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGK 196
C C C +C NN++NE R +A+ + L+RNP+AF+PKI + G +
Sbjct: 389 CKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASGD------------MRL 436
Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
HNKGC+CK+SGCLK YCEC++A I CS C+C+ C+N E
Sbjct: 437 HNKGCNCKRSGCLKNYCECYEAKIPCSSMCKCVGCRNME 475
>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
magnipapillata]
Length = 530
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 110/174 (63%), Gaps = 12/174 (6%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
+ +S KS S++NV + +K CNC S+CLKLYCECFA+G +C+ CNC C NN+
Sbjct: 296 RIDSNKSASKANVGPITESKILKKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIA 355
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
+E R +A+++ L+RNP AF PKI G K +T +H KGC+C++SGCLK
Sbjct: 356 HEVERSKAIKSCLDRNPYAFHPKI-----GKGKIKGDTER-----RHTKGCNCRRSGCLK 405
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
YCEC++A ILCS C+C CKNFE S +R+ L A + +QQ NAAI+
Sbjct: 406 NYCECYEAKILCSTLCKCSGCKNFEESADRKTLMQLADAAEVRVLQQ--NAAIS 457
>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
Length = 448
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 69/134 (51%), Positives = 91/134 (67%), Gaps = 11/134 (8%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNC S CLKLYC+CFA+G +C CNC +CHNN++NEA R A++A+LERNPNAF+P
Sbjct: 221 KKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKP 280
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI + G D++ H KGCHCKKS CLK YCEC++A + C+E C+C C+
Sbjct: 281 KIGVASRGRVDSER---------LHQKGCHCKKSNCLKNYCECYEAKVPCTERCKCSSCR 331
Query: 233 NFEGSEERQALFHG 246
N E +R + F G
Sbjct: 332 NTES--DRASKFRG 343
>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
occidentalis]
Length = 609
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 67/131 (51%), Positives = 87/131 (66%), Gaps = 11/131 (8%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K CNC S+CLKLYC+CFA+G +C CNCV C+NN+E+E R AV+ LERNPNAF PK
Sbjct: 393 KPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHPK 452
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
I G ++ +H KGC+CK+SGCLK YCEC++A ILCS CRC+ C N
Sbjct: 453 IGKYKPGDKER-----------RHTKGCNCKRSGCLKNYCECYEARILCSGVCRCVGCHN 501
Query: 234 FEGSEERQALF 244
E + + ++L
Sbjct: 502 IEENMDAESLL 512
>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
Length = 999
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 87/155 (56%), Gaps = 33/155 (21%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K+ CNCK S+CLKLYCECFASG YCD CNC++C N E R++A+ A LE+NPNAF
Sbjct: 563 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAF 622
Query: 171 RPKIASSPHGTRDNKEE--------------------------------TGEVLMLGKHN 198
+PKI ++P T + + +L H
Sbjct: 623 KPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHK 682
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
GCHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 683 HGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 717
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
C+CKKS CLK YCECF + C E+C C+DC N +EE
Sbjct: 567 CNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEE 605
>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
Length = 944
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 110/186 (59%), Gaps = 14/186 (7%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
Q + P Q P V P +P PP T+P + K P + + + V+
Sbjct: 671 QRIRQQQLPTEQSAPTKVEPKLPTQPPGFNPVPTKPLFDILK--QPATGAAATVDALGGM 728
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+A
Sbjct: 729 TSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSA 788
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I C+ C+C+
Sbjct: 789 FQPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCV 836
Query: 230 DCKNFE 235
C+N E
Sbjct: 837 GCRNME 842
>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/153 (47%), Positives = 88/153 (57%), Gaps = 31/153 (20%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K+ CNCK S+CLKLYCECFASG YCD CNC++C N E R++A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAF 619
Query: 171 RPKIASSPH-------GTR----DNKEETGEVL-------------------MLGKHNKG 200
+PKI ++ G R D + TG L H G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHG 679
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVSCGENCRCIDCKN 712
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 191 VLMLGKHNKG-CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
V+M+ K C+CKKS CLK YCECF + C E+C C+DC N +E+
Sbjct: 553 VVMIPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAED 602
>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 972
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/153 (47%), Positives = 88/153 (57%), Gaps = 31/153 (20%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K+ CNCK S+CLKLYCECFASG YCD CNC++C N E R++A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAF 619
Query: 171 RPKIASSPH-------GTR----DNKEETGEVLM-------------------LGKHNKG 200
+PKI ++ G R D + TG L H G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHG 679
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 191 VLMLGKHNKG-CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
V+M+ K C+CKKS CLK YCECF + C E+C C+DC N +E+
Sbjct: 553 VVMIPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAED 602
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
NT + C+CK S C K YCECF +G+ C + C C++C N
Sbjct: 672 NTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKNQT 714
>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
Length = 356
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)
Query: 95 PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
P SR S + K + T +K K C CK S+CLKLYC+CFASG+ C C+CV+CHNN E
Sbjct: 43 PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
AR A+ L RNPNAF K S DN+ + G ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQAN+LCS+NC+C++CKN + + A S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
+ SP V ++R+ +EL ++ P L SIP + A
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232
Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
+ S K Y RSL+ + P DL +LCS+LV +SE+A + A +N + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290
Query: 383 RAGTSVASSAQGR 395
+ ++S ++ R
Sbjct: 291 KEAIELSSESESR 303
>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
CCMP2712]
Length = 118
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/124 (62%), Positives = 93/124 (75%), Gaps = 9/124 (7%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K+KQCNCK+S+CLKLYCECFASG YC+ CNC C NN E R+ A++ATLERNP AF
Sbjct: 2 KRKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAF 61
Query: 171 RPKIASS-PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
+PKIASS P G D+ + G+H KGCHCKKSGCLKKYCECFQA +LCS C+C
Sbjct: 62 QPKIASSAPPGIEDSP-------VQGRHTKGCHCKKSGCLKKYCECFQAGVLCSAQCKCH 114
Query: 230 DCKN 233
+C+N
Sbjct: 115 ECRN 118
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
K C+CK S CLK YCECF + C+ +C C C+N E E+++
Sbjct: 4 KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQR 46
>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
protein 54
gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
Length = 435
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 100/170 (58%), Gaps = 9/170 (5%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
S R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN+E ++ R
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 217
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
+A+ +LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC+
Sbjct: 218 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 268
Query: 217 QANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGA 266
+A + C++ C+C C+N E + G +N + NA+ T
Sbjct: 269 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASSTAT 318
>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 368
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)
Query: 95 PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
P SR S + K + T +K K C CK S+CLKLYC+CFASG+ C C+CV+CHNN E
Sbjct: 43 PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
AR A+ L RNPNAF K S DN+ + G ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQAN+LCS+NC+C++CKN + + A S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
+ SP V ++R+ +EL ++ P L SIP + A
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232
Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
+ S K Y RSL+ + P DL +LCS+LV +SE+A + A +N + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290
Query: 383 RAGTSVASSAQGR 395
+ ++S ++ R
Sbjct: 291 KEAIELSSESESR 303
>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
Length = 369
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)
Query: 95 PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
P SR S + K + T +K K C CK S+CLKLYC+CFASG+ C C+CV+CHNN E
Sbjct: 43 PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
AR A+ L RNPNAF K S DN+ + G ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQAN+LCS+NC+C++CKN + + A S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
+ SP V ++R+ +EL ++ P L SIP + A
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232
Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
+ S K Y RSL+ + P DL +LCS+LV +SE+A + A +N + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290
Query: 383 RAGTSVASSAQGR 395
+ ++S ++ R
Sbjct: 291 KEAIELSSESESR 303
>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
Length = 408
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 106/172 (61%), Gaps = 15/172 (8%)
Query: 89 VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+P P +S+ +S + N P+K CNCK+S+CLKLYC+CFA+G +C D CNC NC N
Sbjct: 156 IPPPSHQQSKPKSGANLASNKPRKP--CNCKNSQCLKLYCDCFANGEFCRDSCNCQNCKN 213
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+ ++E R AV+A LERNPNAF+PK+ HG ++ +H KGC+CKKS
Sbjct: 214 SFQHEGDRSRAVKACLERNPNAFKPKVG---HGRVGDER---------RHIKGCNCKKSS 261
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
CLK YCEC++A I CS CRC+ C+N E + L +A ++ QQ A
Sbjct: 262 CLKNYCECYEAKIPCSHLCRCVGCQNLADRPEGKGLMQLANAADLRTQQQKA 313
>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
Length = 429
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 100/170 (58%), Gaps = 9/170 (5%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
S R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN+E ++ R
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 211
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
+A+ +LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC+
Sbjct: 212 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 262
Query: 217 QANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGA 266
+A + C++ C+C C+N E + G +N + NA+ T
Sbjct: 263 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASSTAT 312
>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
Length = 705
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/147 (52%), Positives = 101/147 (68%), Gaps = 11/147 (7%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNC S+CLKLYC+CFA+G +C CNC NC NN+++E R +AV+A L+RNP+AF P
Sbjct: 482 RKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHP 541
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI HG++ N+ +HNKGC+CK+SGCLK YCEC++A ILCS C+C+ CK
Sbjct: 542 KIGKG-HGSQTNR----------RHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGCK 590
Query: 233 NFEGSEERQALFHGDHANNMAYIQQAA 259
NFE S ER+ L H A + QQ A
Sbjct: 591 NFEESPERKTLMHLADAAEVRVQQQTA 617
>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
Length = 879
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 86/124 (69%), Gaps = 12/124 (9%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R EA+ + LERNP+AF+
Sbjct: 663 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFK 722
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI + +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C+ C
Sbjct: 723 PKITA---------PNSGDIRL---HNKGCNCKRSGCLKNYCECYEAKIPCSTMCKCVGC 770
Query: 232 KNFE 235
+N E
Sbjct: 771 RNME 774
>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
Length = 437
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 9/136 (6%)
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN++ ++ R +A+
Sbjct: 161 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 220
Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
+LERNPNAF+PKI + GT D + H KGCHCKKSGCLK YCEC++A
Sbjct: 221 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 271
Query: 220 ILCSENCRCMDCKNFE 235
+ C++ C+C C+N E
Sbjct: 272 VPCTDRCKCKGCQNTE 287
>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
Length = 749
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
Length = 793
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 620 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 668
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 669 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 701
>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
Length = 821
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/157 (48%), Positives = 101/157 (64%), Gaps = 11/157 (7%)
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
V E T + +K CNC S+CLKLYC+CFA+G +C+ CNC NC NN+E+E R +A++
Sbjct: 584 VSAAETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTC 643
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
L+RNP AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+C
Sbjct: 644 LDRNPEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMC 692
Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
S C+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 693 SSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 729
>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
Length = 433
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 66/137 (48%), Positives = 91/137 (66%), Gaps = 9/137 (6%)
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN++ ++ R +A+
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 218
Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
+LERNPNAF+PKI + GT D + H KGCHCKKSGCLK YCEC++A
Sbjct: 219 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 269
Query: 220 ILCSENCRCMDCKNFEG 236
+ C++ C+C C+N E
Sbjct: 270 VPCTDRCKCKCCQNTEA 286
>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
Length = 709
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 983
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/152 (48%), Positives = 91/152 (59%), Gaps = 24/152 (15%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K+ CNCK S+CLKLYCECFA+G YCD CNC++C N +E R+ AV + LE+NPNAF
Sbjct: 583 KKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAF 642
Query: 171 RPKIA------------------SSPHGTRDNKEET--GEVLMLGK---HNKGCHCKKSG 207
+PKI S+ G + T G GK H GCHCKKS
Sbjct: 643 KPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSA 702
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
C KKYCECFQA +LC +NCRC+DC+N E +E
Sbjct: 703 CQKKYCECFQAGVLCGDNCRCIDCRNVEKKKE 734
>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
Length = 950
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
Q + P Q P V P +P PP + +P V KP + + + V+
Sbjct: 675 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 733
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 841
Query: 229 MDCKNFE 235
+ C+N E
Sbjct: 842 VGCRNME 848
>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
Length = 807
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 100/153 (65%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E P+ +K CNC S+CLKLYC+CFA+G +C+ CNC NC NN+E+E R +A++ L+RN
Sbjct: 574 ETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 633
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 634 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 682
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 683 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 715
>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
protein of 120 kDa
gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
Length = 950
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
Q + P Q P V P +P PP + +P V KP + + + V+
Sbjct: 675 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 733
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 841
Query: 229 MDCKNFE 235
+ C+N E
Sbjct: 842 VGCRNME 848
>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
Length = 952
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
Q + P Q P V P +P PP + +P V KP + + + V+
Sbjct: 677 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 735
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+
Sbjct: 736 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 795
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C
Sbjct: 796 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 843
Query: 229 MDCKNFE 235
+ C+N E
Sbjct: 844 VGCRNME 850
>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
Length = 749
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
domain-containing protein 1
gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
gorilla]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
Length = 950
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 111/187 (59%), Gaps = 15/187 (8%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
Q + P Q P V P +P P L A +P + KP P + + + V+
Sbjct: 676 QRIRQQQLPTEQNAPTKVEPKLPTLPTGLKANAPAKPLFDILKP--PATGAAAAVDALGG 733
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I C+ C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKC 841
Query: 229 MDCKNFE 235
+ C+N E
Sbjct: 842 VGCRNME 848
>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
discoideum AX4]
Length = 902
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/138 (51%), Positives = 92/138 (66%), Gaps = 14/138 (10%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K+C+CK+S+CLK+YCECFA+ + C+GC C C NN N A TLERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
I P ++ N + +++ KH+KGCHC+KS CLKKYCECFQA+I C+ENC+C DCKN
Sbjct: 420 IK--PSNSKINNQ-----IIIDKHSKGCHCRKSSCLKKYCECFQASIPCNENCKCYDCKN 472
Query: 234 FEGSEERQALFHGDHANN 251
QA H H +N
Sbjct: 473 -------QAHHHLHHPDN 483
>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
Length = 748
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 575 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 623
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 624 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656
>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
Length = 748
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 575 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 623
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 624 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656
>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
Length = 407
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 10/136 (7%)
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN++ ++ R +A+
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 193
Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
+LERNPNAF+PKI + GT D + H KGCHCKKSGCLK YCEC++A
Sbjct: 194 RQSLERNPNAFKPKIGIA-RGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 243
Query: 220 ILCSENCRCMDCKNFE 235
+ C++ C+C C+N E
Sbjct: 244 VPCTDRCKCKGCQNTE 259
>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
Length = 748
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
boliviensis]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
Length = 1772
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 87/133 (65%), Gaps = 17/133 (12%)
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLE 164
K + K+ K CNC++SRCLKLYC+CFA+G +C+ C CV+CHN+ + R +A++ATLE
Sbjct: 323 KASATKRFKPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLE 382
Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
+NP AFR K+ + E T H GCHCKK+ CLKKYCECF+A I C
Sbjct: 383 KNPKAFRAKV---------DAEAT-------THQNGCHCKKTKCLKKYCECFEAGITCGA 426
Query: 225 NCRCMDCKNFEGS 237
C+C DC+N+ GS
Sbjct: 427 KCKCADCENYPGS 439
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
R+ V+ + T Q C+CK ++CLK YCECF +GI C C C +C N
Sbjct: 389 RAKVDAEATT--HQNGCHCKKTKCLKKYCECFEAGITCGAKCKCADCEN 435
>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
Length = 749
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
Length = 749
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
Length = 980
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 88/124 (70%), Gaps = 12/124 (9%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + LERNP+AF+
Sbjct: 765 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFK 824
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C+ C
Sbjct: 825 PKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSAMCKCVGC 872
Query: 232 KNFE 235
+N E
Sbjct: 873 RNME 876
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 4/49 (8%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN---FEGSEER 240
G K C+C KS CLK YC+CF C ++C C DC N +E ER
Sbjct: 763 GSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKRER 810
>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
Length = 749
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
Length = 749
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R+ A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
Length = 749
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
Length = 749
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
Length = 775
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/172 (45%), Positives = 106/172 (61%), Gaps = 12/172 (6%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E T + +K CNC S+CLKLYC+CFA+G +C+ CNC NC NN+E+E R +A++ L+RN
Sbjct: 542 ETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 601
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 602 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 650
Query: 227 RCMDCKNFEGSEERQALFH-GDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
+C+ CKNFE S ER+ L H D A Q AA ++ I + PV
Sbjct: 651 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTAAKTKLSSQISDLLMRTTPV 702
>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
familiaris]
Length = 749
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
Length = 749
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
Length = 771
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 105/161 (65%), Gaps = 11/161 (6%)
Query: 99 SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
S + + E + +K CNC S+CLKLYC+CFA+G +C+ CNCVNC NN+++E+ R +A
Sbjct: 531 SLNGLSTSEANSRPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKA 590
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
++A L+RNP AF+PKI G D +H+KGC+CKKSGCLK YCEC++A
Sbjct: 591 IKACLDRNPVAFKPKIGKGKEGESDR-----------RHSKGCNCKKSGCLKNYCECYEA 639
Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
I+CS C+CM CKNFE S ER+ L H A + QQ A
Sbjct: 640 KIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680
>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
Length = 813
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 580 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 639
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 640 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 688
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 689 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 721
>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
Length = 741
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 508 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 567
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 568 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 616
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 617 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 649
>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
Length = 827
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 594 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 653
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 654 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 702
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 703 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 735
>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
Length = 978
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 94/143 (65%), Gaps = 12/143 (8%)
Query: 93 ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
+SP+ S + ++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 804
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
R A+ + L+RNP+AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK Y
Sbjct: 805 VERERAIRSCLDRNPSAFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNY 852
Query: 213 CECFQANILCSENCRCMDCKNFE 235
CEC++A I CS C+C+ C+N E
Sbjct: 853 CECYEAKIPCSSICKCVGCRNME 875
>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
Length = 921
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 59/124 (47%), Positives = 88/124 (70%), Gaps = 12/124 (9%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+AF+
Sbjct: 706 RRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFK 765
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C
Sbjct: 766 PKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGC 813
Query: 232 KNFE 235
+N E
Sbjct: 814 RNME 817
>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
Length = 763
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 530 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 589
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 590 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 638
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 639 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 671
>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
gorilla]
gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
Length = 660
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
boliviensis]
Length = 660
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
Length = 659
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 426 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 485
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 486 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 534
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 535 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 567
>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
Length = 660
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
Length = 660
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
Length = 660
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
Length = 515
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 282 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 341
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 342 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 390
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 391 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 423
>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
Length = 621
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 388 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 447
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 448 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 496
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 497 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 529
>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
Length = 660
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
familiaris]
Length = 660
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
Length = 660
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568
>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
Length = 725
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/137 (47%), Positives = 88/137 (64%), Gaps = 18/137 (13%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
K++ CNC S+CLKLYC+CFA+G YC CNC +C N +++ R++A+ +TLERNPNAF+
Sbjct: 454 KKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFK 513
Query: 172 PK---IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
PK I S+ GTR H KGC+CK+SGCLK YCEC++ I CS NC+C
Sbjct: 514 PKIGSIGSTDDGTR-------------LHTKGCNCKRSGCLKNYCECYEGKIACSSNCKC 560
Query: 229 MDCKNFEGSEERQALFH 245
+ +F + LFH
Sbjct: 561 VGTPSFHLLQSH--LFH 575
>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
Length = 949
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 103/178 (57%), Gaps = 12/178 (6%)
Query: 58 PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCN 117
P Q P V P +P PP P+ + + V+ ++K CN
Sbjct: 682 PTEQSSPIKVEPKLPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCN 741
Query: 118 CKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASS 177
C S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+AF+PKI ++
Sbjct: 742 CSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKI-TA 800
Query: 178 PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
P+ +G++ + HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 801 PN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 847
>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Saccoglossus kowalevskii]
Length = 946
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/158 (48%), Positives = 103/158 (65%), Gaps = 11/158 (6%)
Query: 102 NVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEA 161
+V+V+ + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+E+E+ R +A+++
Sbjct: 491 SVQVEAPGTRPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKS 550
Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
LERNP AF PKI G D +HNKGC+CK+SGCLK YCEC++A I+
Sbjct: 551 CLERNPLAFHPKIGKGKEGQADR-----------RHNKGCNCKRSGCLKNYCECYEAKIM 599
Query: 222 CSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
CS C+C+ CKNFE S +R+ L H A + QQ A
Sbjct: 600 CSSICKCVGCKNFEESPDRKTLMHLADAAEVRVQQQTA 637
>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
Length = 804
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 571 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 630
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 631 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 679
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 680 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 712
>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 560
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 327 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 386
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 387 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 435
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 436 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 468
>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
Length = 751
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 518 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 577
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 578 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 626
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 627 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 659
>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
Length = 756
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC NN+E+E R++A++A L+RN
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 582
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 583 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 631
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 632 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 664
>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
Length = 469
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 102/172 (59%), Gaps = 21/172 (12%)
Query: 69 PPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRC 123
P P PPL +A Q V+ ++R + V + PKK +K C+C S C
Sbjct: 196 PQTGPLSPPLAYRAAQASVQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMC 248
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
LKLYC+CFA+G +C+ C+C +C N +E E R A+ +LERNPNAF+PKI G
Sbjct: 249 LKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVAT 303
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
N++ E L H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E
Sbjct: 304 NRQVEPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTE 351
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
QK C+CK S CLK YCEC+ + + C D C C+ C N + AAR
Sbjct: 314 QKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 357
>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
Length = 284
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/139 (50%), Positives = 99/139 (71%), Gaps = 14/139 (10%)
Query: 105 VKENTPKKQ---KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
++EN+ K K C C S+CLK+YC+CF SG+YC C CV C+NN +++ R A++
Sbjct: 1 MEENSSTKTDVVKGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIK 60
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
+ LE+NP+AF+PK+A+ RD +G +++ KHNKGC+C+K+ CLKKYCECFQ+ +
Sbjct: 61 SVLEKNPDAFQPKVAT----FRD----SGGLIV--KHNKGCNCRKTACLKKYCECFQSGV 110
Query: 221 LCSENCRCMDCKNFEGSEE 239
LCSE C+C CKNFEGS E
Sbjct: 111 LCSELCKCSGCKNFEGSLE 129
>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
Length = 950
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 104/178 (58%), Gaps = 12/178 (6%)
Query: 58 PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCN 117
P Q P V P +P PP + P+ + + V+ ++K CN
Sbjct: 683 PTEQSSPIKVEPKLPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCN 742
Query: 118 CKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASS 177
C S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+AF+PKI ++
Sbjct: 743 CSKSQCLKLYCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKI-TA 801
Query: 178 PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
P+ +G++ + HNKGC+CK+SGCLK YCEC++A I C+ C+C+ C+N E
Sbjct: 802 PN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848
>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
Length = 752
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 519 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 578
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 579 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 627
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 628 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 660
>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
carolinensis]
Length = 749
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 516 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
Length = 756
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 582
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 583 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 631
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 632 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 664
>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
Length = 495
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 111/192 (57%), Gaps = 22/192 (11%)
Query: 73 PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRCLKLY 127
P PPL +A Q ++ ++R + V + PKK +K C+C S CLKLY
Sbjct: 226 PLSPPLAYRAAQASIQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMCLKLY 278
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
C+CFA+G +C+ C+C +C N +E E R A+ +LERNPNAF+PKI G N++
Sbjct: 279 CDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQV 333
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
E L H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E S+ L +
Sbjct: 334 EPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRNTE-SDRAARLANTK 388
Query: 248 HANNMAYIQQAA 259
A + I+ AA
Sbjct: 389 SATALTDIRSAA 400
>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
Length = 460
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 63/125 (50%), Positives = 84/125 (67%), Gaps = 11/125 (8%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K ++ CNC S+CLKLYCECFA+G +C C C NC NN E+ R +AV+A L+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
RPKI S G + G H KGC+CK+SGCLK YCEC++A I+C+ C+C+
Sbjct: 322 RPKIGSRKQGN-----------VKGCHTKGCNCKRSGCLKNYCECYEAKIMCTSTCKCVG 370
Query: 231 CKNFE 235
C+N++
Sbjct: 371 CRNYD 375
>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
Length = 528
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
Length = 757
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 524 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 583
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 584 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 632
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 633 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 665
>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
Length = 521
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 288 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 347
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 348 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 396
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 397 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 429
>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
gorilla]
Length = 528
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
Length = 528
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
Length = 527
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 294 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 353
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 354 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 402
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 403 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 435
>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
Length = 528
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
Length = 528
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
boliviensis]
Length = 528
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
Length = 528
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
Length = 452
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 111/192 (57%), Gaps = 22/192 (11%)
Query: 73 PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ-----KQCNCKHSRCLKLY 127
P PPL +A Q ++ ++R + V + PKK+ K C+C S CLKLY
Sbjct: 183 PLSPPLAYRAAQASIQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMCLKLY 235
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
C+CFA+G +C+ C+C +C N +E E R A+ +LERNPNAF+PKI G N++
Sbjct: 236 CDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQV 290
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
E L H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E S+ L +
Sbjct: 291 EPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRNTE-SDRAARLANTK 345
Query: 248 HANNMAYIQQAA 259
A + I+ AA
Sbjct: 346 SATALTDIRSAA 357
>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
familiaris]
gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
Length = 528
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
garnettii]
Length = 544
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 68/146 (46%), Positives = 92/146 (63%), Gaps = 11/146 (7%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K C C S CLKL C+CFA+G + + CNC + +N++E+E R++A++A L+RNP AF+PK
Sbjct: 345 KPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPK 404
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
T KEE + H+KGC+CK SGCLK YCEC++A I+CS C+C CKN
Sbjct: 405 -------TGKGKEEDSD----XHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453
Query: 234 FEGSEERQALFHGDHANNMAYIQQAA 259
FE ER+ L H A + QQ A
Sbjct: 454 FEEXPERKTLMHLAKAAKVRVQQQTA 479
>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
both male and female reproductiv [Ectocarpus
siliculosus]
Length = 735
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 64/122 (52%), Positives = 77/122 (63%), Gaps = 14/122 (11%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
CNCK S+CLKLYCECF YC+GCNCV C N E R+ A++ T+ERNP AF K
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFE 499
Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
R K HN GC+CKKS CLKKYCECFQA + C NC+C++CKN+E
Sbjct: 500 R-----RAGKRS---------HNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKNYE 545
Query: 236 GS 237
G+
Sbjct: 546 GA 547
>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 64/122 (52%), Positives = 91/122 (74%), Gaps = 9/122 (7%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK+S+C+KLYCEC+ + ++CD C+C NC NN N R++A++ TLE+NP+AF+P
Sbjct: 16 KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQP 75
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI +S + + L GKHNKGC CKKSGC+KKYCECFQA + CS+ C+C++C+
Sbjct: 76 KITTS--------DSKPDPLNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECR 127
Query: 233 NF 234
N+
Sbjct: 128 NY 129
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
EE+ +V G +K C+CK S C+K YCEC+Q + C NC C +C N
Sbjct: 4 EESDQVFEEGP-SKKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFN 50
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
N K K C CK S C+K YCECF + + C D C C+ C N
Sbjct: 88 NFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 128
>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
Length = 597
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/129 (52%), Positives = 85/129 (65%), Gaps = 13/129 (10%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K C+CK+S+CLKLYCECFA+ + C+GC+C CHNN N + + LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K KEET + KH KGCHC+KS CLKKYCECFQA I C E+C+C+DC+
Sbjct: 401 KFK--------QKEETKQ----HKHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448
Query: 233 NF-EGSEER 240
N E +ER
Sbjct: 449 NTGEDIKER 457
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 8/86 (9%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCH 146
R P+ +PK + + E K++ K K C+C+ S CLK YCECF +GI C + C C++C
Sbjct: 393 RNPEAFNPKFKQKE--ETKQH--KHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448
Query: 147 NNVENEAARREAVEATL---ERNPNA 169
N E+ R + E L ER P A
Sbjct: 449 NTGEDIKERSKPKEQPLKKEERKPTA 474
>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1852
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 19/162 (11%)
Query: 78 LPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC 137
P + P+ VP E+P N +TP+ + CNCK S+CLKLYCECF++ + C
Sbjct: 1481 FPSSIVKTPLAVPGKENP------NGITPMSTPRNR--CNCKKSQCLKLYCECFSAEVMC 1532
Query: 138 DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKH 197
+GCNC +C N A R +A+ L +N +AF+P+IA++P G D L H
Sbjct: 1533 EGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAP-GAAD----------LQGH 1581
Query: 198 NKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+ GC CKKS CLKKYCECFQA +LC C+C CKN GS++
Sbjct: 1582 SMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKNIAGSQK 1623
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 48/104 (46%), Gaps = 2/104 (1%)
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F I +P + G M N+ C+CKKS CLK YCECF A ++C E C C
Sbjct: 1481 FPSSIVKTPLAVPGKENPNGITPMSTPRNR-CNCKKSQCLKLYCECFSAEVMC-EGCNCT 1538
Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYA 273
DC+N +A D N A+ + AA GA G++
Sbjct: 1539 DCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAPGAADLQGHS 1582
>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
Length = 802
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 648
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 649 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 698
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 699 YEAKIPCSANCKCIGCRNVE 718
>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
Length = 829
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 592 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 651
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 652 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 701
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 702 YEAKIPCSANCKCIGCRNVE 721
>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
Length = 835
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 598 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 657
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 658 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 707
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 708 YEAKIPCSANCKCIGCRNVE 727
>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
Length = 835
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 598 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 657
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 658 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 707
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 708 YEAKIPCSANCKCIGCRNVE 727
>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
Length = 804
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 590 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 649
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 650 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 699
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 700 YEAKIPCSANCKCIGCRNVE 719
>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
Length = 667
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+++E R++A++A L+RN
Sbjct: 434 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 493
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 494 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 542
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 543 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 575
>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
Length = 826
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 648
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 649 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 698
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 699 YEAKIPCSANCKCIGCRNVE 718
>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
Length = 829
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 592 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 651
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 652 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 701
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 702 YEAKIPCSANCKCIGCRNVE 721
>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
Length = 235
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 114/214 (53%), Gaps = 45/214 (21%)
Query: 23 PAKKLARQLDFTGGFVGSVI-----------------LP---EHPQSQAVTANSHPP--- 59
P KKL RQLDF + LP +HPQ P
Sbjct: 25 PIKKLVRQLDFNSAALAGNPAMAAAAAAVSRALQPRALPAGFQHPQHVRAAVPMVVPQQL 84
Query: 60 -----PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSR 100
P +P PVV VP PL P +R VP KP+SPK +R
Sbjct: 85 HQRALPVMRPHPVVGHVPRHAVPL--AVPVPQLRPMPPQPVQRPPVAVPLKPDSPKPWAR 142
Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
E K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+E
Sbjct: 143 L-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIE 201
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLML 194
ATLERNP+AFRPKI SSPH R+N ++L +
Sbjct: 202 ATLERNPDAFRPKIGSSPHTNRNNVCSVVQILFV 235
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
K C+C+ S CLK YCECF + C + C C +C N
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFN 188
>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
Length = 827
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
KS RS V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 590 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 649
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 650 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 699
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 700 YEAKIPCSANCKCIGCRNVE 719
>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
Length = 495
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 262 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 321
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 322 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 370
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 371 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 403
>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 444
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 67/119 (56%), Positives = 81/119 (68%), Gaps = 5/119 (4%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYCECFASG YC C C C NN +N + A E TLERNP AF PKI
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ S E G+VL + H++GC+CK+S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 378 SESTAAV----TEEGQVLRVAAHHRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432
>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
Length = 494
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 261 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 320
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 321 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 369
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 370 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 402
>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
Length = 458
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 117/201 (58%), Gaps = 14/201 (6%)
Query: 95 PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAA 154
P+ + + ++ + +K CNC S+CLKLYC+CFA+G +C+ CNC NC NN+E+EA
Sbjct: 217 PRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHEAE 276
Query: 155 RREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
R ++++A LERNP AF PKI G D +HNKGC+CK+SGCLK YCE
Sbjct: 277 RAKSIKACLERNPYAFHPKIGKGKEGEGDR-----------RHNKGCNCKRSGCLKNYCE 325
Query: 215 CFQANILCSENCRCMDCKNFEGSEERQALFH-GDHANNMAYIQQAANAAITGAIGSSGYA 273
C++A I+CS C+C+ CKNFE S ER+ L H D A Q AA ++ I
Sbjct: 326 CYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTAAKTKLSSQIQDIPAR 385
Query: 274 SPPVSKKRKGQELFFGSTAKD 294
P V+ G+ L F +D
Sbjct: 386 PPAVNT--AGERLPFSFVTRD 404
>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
Length = 818
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 75/139 (53%), Positives = 96/139 (69%), Gaps = 12/139 (8%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
SRS +N + P+K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R+
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQ 640
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
A+++ LERNPNAFRPKI ETGE + +HNKGC+CK+SGCLK YCEC+
Sbjct: 641 RAIKSCLERNPNAFRPKIGKG--------RETGEDIR--RHNKGCNCKRSGCLKNYCECY 690
Query: 217 QANILCSENCRCMDCKNFE 235
+A I CSENC+C+ C+N E
Sbjct: 691 EAKIPCSENCKCIGCRNVE 709
>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
Length = 419
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 186 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 246 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 295 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 327
>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
Length = 1001
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/133 (51%), Positives = 83/133 (62%), Gaps = 2/133 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+GIYC +GC CV C N E + E + RNP AF P
Sbjct: 567 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 626
Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI +G+ N E G +H +GC+CKKS CLKKYCEC+QAN+ CS CRC C
Sbjct: 627 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGC 686
Query: 232 KNFEGSEERQALF 244
KN G +E F
Sbjct: 687 KNVYGRKEEYGAF 699
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ + + C GC C C N
Sbjct: 651 RHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKN 688
>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
Length = 614
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 98/149 (65%), Gaps = 10/149 (6%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
+ +K CNC S+CLKLYC+CFA+G +C CNCVNC NN+E+E R +++ LERNP AF
Sbjct: 391 RPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAF 450
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
PKI S G D + +HNKGC+CK+SGCLK YCEC++A ILC+ C+C+
Sbjct: 451 HPKIGKSKAGQGD----------IRRHNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVG 500
Query: 231 CKNFEGSEERQALFHGDHANNMAYIQQAA 259
CKNFE S +R+ L H A + QQ A
Sbjct: 501 CKNFEESPDRKTLMHLADAAEVRVQQQTA 529
>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
Length = 438
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 205 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 264
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 265 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 313
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 314 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 346
>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
Length = 369
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 88/195 (45%), Positives = 119/195 (61%), Gaps = 11/195 (5%)
Query: 76 PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
P LP AT V + S K S E K+ T ++ C+CK+S+CLKLYCECF G
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193
Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
YC GCNC NC NNV +E AR++A+ LERNP AF PK+ +S ++K G++ +G
Sbjct: 194 YCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNSTAHNCESKAAEGDI--VG 251
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
KH KGC CK++ CLKKYCECF+A++ CSENCRC CKN++ + +R + N +
Sbjct: 252 KHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRIS------QKNTVHA 305
Query: 256 QQAANAAITGAIGSS 270
N A +G +G S
Sbjct: 306 HHVQNPASSGMVGQS 320
>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
Length = 1094
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/133 (51%), Positives = 83/133 (62%), Gaps = 2/133 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+GIYC +GC CV C N E + E + RNP AF P
Sbjct: 574 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 633
Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI +G+ N E G +H +GC+CKKS CLKKYCEC+QAN+ CS CRC C
Sbjct: 634 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGC 693
Query: 232 KNFEGSEERQALF 244
KN G +E F
Sbjct: 694 KNVYGRKEEYGAF 706
Score = 45.4 bits (106), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ + + C GC C C N
Sbjct: 658 RHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKN 695
>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
Length = 391
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 97/163 (59%), Gaps = 16/163 (9%)
Query: 73 PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFA 132
P PPL +A Q V+ + +P V + +K C+C S CLKLYC+CFA
Sbjct: 201 PLSPPLAYRAAQASVQRNRVFTP-------VLAAKRPLGSKKPCHCTRSMCLKLYCDCFA 253
Query: 133 SGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL 192
+G +C+ C+C +C N +E E R A+ +LERNPNAF+PKI G N++ E L
Sbjct: 254 NGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQVEPERL 308
Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E
Sbjct: 309 ----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTE 347
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
QK C+CK S CLK YCEC+ + + C D C C+ C N + AAR
Sbjct: 310 QKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 353
>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
Length = 419
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 186 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 246 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 295 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 327
>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1174
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/177 (40%), Positives = 99/177 (55%), Gaps = 31/177 (17%)
Query: 76 PPLPGQATQPPVRV---PKPESPKSRSR-SNVEV----------KENTPKKQKQCNCKHS 121
P PG PPV + P P P S SR + E KEN+ + + C CK+S
Sbjct: 809 PSNPGSLAHPPVMMAVAPTPTHPSSVSRMTTFETPAVAKRGGGGKENS-ENESTCKCKNS 867
Query: 122 RCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 181
+CLKLYC+CF++ YC GC C++C N E+ R +A+ T RNPNAF+ K+ +S
Sbjct: 868 KCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNAS---- 923
Query: 182 RDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
+ +H GC CKKS CLKKYCECF ++C E+C+C +CKN+ GS+
Sbjct: 924 ------------VTEHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQ 968
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 26/39 (66%), Gaps = 1/39 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA 153
C CK S CLK YCECFA+ + C + C C NC N V ++A
Sbjct: 931 CKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQA 969
>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
Length = 528
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/153 (48%), Positives = 101/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+C A+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
Length = 209
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 67/137 (48%), Positives = 91/137 (66%), Gaps = 12/137 (8%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
S++++ ++ N K +K CNC S+CLKLYC+CFA+G +C GCNCV C NN+E+E R
Sbjct: 6 SQAKAQSVLESNGLKPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRL 65
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
A+ + L+RNP+AF+PKI G +HNKGCHCK+SGCLK YCEC+
Sbjct: 66 RAIRSCLDRNPHAFKPKIG------------VGWGPEPRRHNKGCHCKRSGCLKNYCECY 113
Query: 217 QANILCSENCRCMDCKN 233
+A I CS C+C+ CKN
Sbjct: 114 EAKIACSAICKCIGCKN 130
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 7/64 (10%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAA------RREAVEATL 163
+ K C+CK S CLK YCEC+ + I C C C+ C N V+ + +R + A L
Sbjct: 93 RHNKGCHCKRSGCLKNYCECYEAKIACSAICKCIGCKNCVDPAGSPPGPGEKRLSKMAAL 152
Query: 164 ERNP 167
+ NP
Sbjct: 153 QTNP 156
>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
Length = 822
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/131 (53%), Positives = 92/131 (70%), Gaps = 10/131 (7%)
Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLE 164
V+ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R+ A+++ LE
Sbjct: 594 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 653
Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
RNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC++A I CS
Sbjct: 654 RNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCECYEAKIPCSA 703
Query: 225 NCRCMDCKNFE 235
NC+C+ C+N E
Sbjct: 704 NCKCIGCRNIE 714
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 20/97 (20%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAV--- 159
E ++ + K CNCK S CLK YCEC+ + I C C C+ C N+E+ ++++
Sbjct: 668 ETGDDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR-NIEDPILEKKSLKDL 726
Query: 160 --------------EATLERNPNAFRPKIASSPHGTR 182
+A L+ + AFRP A S GTR
Sbjct: 727 TETTEVRTTQLSLNKAQLQLSEMAFRPP-AVSNTGTR 762
>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
Length = 348
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 115 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 174
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 175 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 223
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 224 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 256
>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
Length = 305
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 82/134 (61%), Gaps = 13/134 (9%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
P CNC+ SRCLKLYCECFAS ++C+GCNCVNC NN E+ R E +A ++R+P+
Sbjct: 183 PNHSHTCNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHV 242
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F P + TR N +GC C+K+ C+KKYC+CFQ I C NCRC+
Sbjct: 243 FEPHVVWDTPATRTNL-------------RGCRCQKTRCVKKYCDCFQNGICCGPNCRCV 289
Query: 230 DCKNFEGSEERQAL 243
DCKN E + +AL
Sbjct: 290 DCKNTEEAALNRAL 303
>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
Length = 859
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/223 (41%), Positives = 126/223 (56%), Gaps = 21/223 (9%)
Query: 15 QSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQ-SQAVTANSHPPPQPQPKPVVPPVPP 73
QS++ IPA L++ G +L +P V + Q QP V P
Sbjct: 546 QSKNAVAIPASALSQLAS------GQAVLSTNPNVGNIVVLPAQYIQQQQPIDEVKLKSP 599
Query: 74 PPPPLPG--QATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECF 131
P L G Q++ + V ++ + SNVE N + +K CNC S+CLKLYC+CF
Sbjct: 600 TTPGLLGSSQSSTATLCVIDGKNSQRGVYSNVE--PNGIRPRKPCNCTKSQCLKLYCDCF 657
Query: 132 ASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEV 191
A+G +C CNC NC NN+ NE R+ A+++ LERNPNAFRPKI ETGE
Sbjct: 658 ANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKG--------RETGED 709
Query: 192 LMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
+ +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N
Sbjct: 710 IR--RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCRNI 750
>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
Length = 825
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 95/140 (67%), Gaps = 10/140 (7%)
Query: 96 KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
K+ R ++ N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R
Sbjct: 588 KNSQRGYGNIEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 647
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+ A+++ LERNPNAFRPKI ETG+ + +HNKGC+CK+SGCLK YCEC
Sbjct: 648 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 697
Query: 216 FQANILCSENCRCMDCKNFE 235
++A I CS NC+C+ C+N E
Sbjct: 698 YEAKIPCSANCKCIGCRNIE 717
>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
Length = 834
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 93/134 (69%), Gaps = 12/134 (8%)
Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
SNVE N + +K CNC S+CLKLYC+CFA+G +C CNC NC NN+ NE R+ A++
Sbjct: 604 SNVE--PNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIK 661
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
+ LERNPNAFRPKI ETGE + +HNKGC+CK+SGCLK YCEC++A I
Sbjct: 662 SCLERNPNAFRPKIGKG--------RETGEDIR--RHNKGCNCKRSGCLKNYCECYEAKI 711
Query: 221 LCSENCRCMDCKNF 234
CS NC+C+ C+N
Sbjct: 712 PCSGNCKCIGCRNI 725
>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1223
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 90/156 (57%), Gaps = 9/156 (5%)
Query: 93 ESPKSRSRSNVEVKENTPKKQ--------KQCNCKHSRCLKLYCECFASGIYCDG-CNCV 143
+SP+S + +++T Q K+CNCK S+CLKLYCECFA+ +C G C C
Sbjct: 708 DSPQSSKNRRLSRRKSTASSQGEKPGDGCKRCNCKKSKCLKLYCECFAAREFCVGSCACR 767
Query: 144 NCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHC 203
NC N E EA + RNP AF PKI +P + +E + +H +GC+C
Sbjct: 768 NCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPGDEALDTPASARHKRGCNC 827
Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
KKS CLKKYCEC+QA + CSE CRC C+N G +E
Sbjct: 828 KKSLCLKKYCECYQAGVGCSEGCRCEGCRNMYGRKE 863
>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
sulphuraria]
Length = 542
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 118/239 (49%), Gaps = 29/239 (12%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K++K C CK+S CLKLYCECFA+G C C+C NC N+ ++E EA A L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
PK+ + P +E G + + KH KGC+CK+SGC K YCECF A + CS C+C D
Sbjct: 288 EPKMTAVP-------KEDGALAI--KHQKGCNCKRSGCQKNYCECFHAGVFCSNICKCND 338
Query: 231 CKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----SSGY-----ASPPVSKK 280
C N E + + + N+++I + I GA+ ++GY P +
Sbjct: 339 CGNREDD-------NSEKSTNLSHILSDSGIGIRGALSPLRGETNGYQHLLFMESPRKRW 391
Query: 281 RKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPS---STLSSIPVARASSTAAVGSSKF 336
R+ + + L HF + R + S TLS + V A + F
Sbjct: 392 RESSSSSYSDQSSFRKSRSLTHFDSAFYERKTWSVNADTLSLLKVVEAEKKKLMEEESF 450
>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
[Cucumis sativus]
Length = 581
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 1/132 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK SRCLKLYCECFA+G+YC + C+C +C N +EA E RNP AF P
Sbjct: 287 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 346
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + ++++ + +H +GC+CKKSGCLKKYCEC+Q + CS NCRC CK
Sbjct: 347 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 406
Query: 233 NFEGSEERQALF 244
N G ++ AL
Sbjct: 407 NAFGRKDESALL 418
>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
sativus]
Length = 602
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 1/132 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK SRCLKLYCECFA+G+YC + C+C +C N +EA E RNP AF P
Sbjct: 308 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 367
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + ++++ + +H +GC+CKKSGCLKKYCEC+Q + CS NCRC CK
Sbjct: 368 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 427
Query: 233 NFEGSEERQALF 244
N G ++ AL
Sbjct: 428 NAFGRKDESALL 439
>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
Length = 873
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 88/253 (34%), Positives = 125/253 (49%), Gaps = 25/253 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK ++CLKLYC+CFA+GIYC D C C +C N E E E + RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512
Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI ++E+ + L +H +GC+CKKS CLKKYCEC+QAN+ CS CRC C
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGC 572
Query: 232 KNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGST 291
KN G +E + ++ + + AI ++ P EL+
Sbjct: 573 KNGYGRKEEYGIIEETVSDRVGEERLEGRVDDKLAIVATDEDLLPA-------ELY---- 621
Query: 292 AKDPSIHRLG----HFQQGNHIRTSPSSTLSS---IPVARASSTAAVGSSKFTYRSLLAD 344
+H L FQ +H +++P S LSS +P + + ++K T S D
Sbjct: 622 ----DLHNLTPSTPSFQHSDHGKSTPKSPLSSSRHVPSPESDISILPSNAKSTRSSRYCD 677
Query: 345 IIQPQDLKELCSV 357
I P+ KE +
Sbjct: 678 TI-PEASKETVDI 689
>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
Length = 360
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC NN+E+E R++A++A L+RN
Sbjct: 127 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 186
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 187 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 235
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 236 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 268
>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 707
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 91/174 (52%), Gaps = 45/174 (25%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
+ +K CNCK+S+CLKLYC+CFA+G YC+GC+C NC N + R+ A+ +LERNP+AF
Sbjct: 99 RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158
Query: 171 RPKIASSPHGT-------------------------RDNKEETGEVL------------- 192
R K+ + + T D+ E E +
Sbjct: 159 RSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPR 218
Query: 193 -------MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+H KGC C +S CLKKYCECFQ + CS +CRC +C+NFEGS E
Sbjct: 219 TTGTTTQRHAQHIKGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAE 272
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN---NVENEAARREAVEATL 163
K C+C S CLK YCECF +G+YC C C NC N + E AR V A L
Sbjct: 232 KGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARERLVLAEL 285
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
+A R +SP G N E G V + K C+CK S CLK YC+CF A C+ C
Sbjct: 75 SALRKSAVASP-GLSSNALEGGSVT---RPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCH 129
Query: 228 CMDCKNF 234
C +C N
Sbjct: 130 CTNCLNL 136
>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
Length = 729
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 63/126 (50%), Positives = 81/126 (64%), Gaps = 1/126 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+G+YC + C+C +C N +E E + RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ S D +ET + +H +GC+CKKS CLKKYCECFQ + CS NCRC CK
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCK 569
Query: 233 NFEGSE 238
N GS+
Sbjct: 570 NSFGSK 575
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 26/40 (65%)
Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+G K C+CK+S CLK YC+CF A + C E C C DC N
Sbjct: 445 VGASCKRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFN 484
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 533 RHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCKN 570
>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
Length = 429
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 60/121 (49%), Positives = 78/121 (64%), Gaps = 10/121 (8%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+ CNCK+S+CLKLYCECFA+G YC CNCV+C N E R++A+E TL +NP+AFRP
Sbjct: 216 RTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRP 275
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K G +++ H KGC C +S C+K+YCECFQA I C+ C+C DC
Sbjct: 276 KF-----GANKSRQSAAP----SAHVKGCRCARSKCIKRYCECFQAGIPCTSACQCRDCH 326
Query: 233 N 233
N
Sbjct: 327 N 327
>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/121 (53%), Positives = 83/121 (68%), Gaps = 8/121 (6%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA+G+ C CNC +CHNN ++ R ++ +ERNP AFRPK+
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV 263
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
S R N E+ E+ +H KGC+CKKS CLKKYCEC+Q + CSE C+C DCKN
Sbjct: 264 DS-----RSNSED--EIDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNC 316
Query: 235 E 235
E
Sbjct: 317 E 317
>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 995
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 98/139 (70%), Gaps = 12/139 (8%)
Query: 98 RSRS-NVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
RS S N ++ N + +K CNC S+CLKLYCECFA+G +C CNC NC+NN+E+E AR+
Sbjct: 765 RSNSPNRVLEANGIRMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQ 824
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
+++++ LERNP AFRPKI G + E +HNKGCHCK+SGCLK YCEC+
Sbjct: 825 KSIKSCLERNPCAFRPKIQ---IGETEEDER--------RHNKGCHCKRSGCLKNYCECY 873
Query: 217 QANILCSENCRCMDCKNFE 235
+A I CS++C+C+ C+N E
Sbjct: 874 EAKITCSKSCKCIGCRNTE 892
>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
lyrata]
Length = 684
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 63/134 (47%), Positives = 86/134 (64%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 390 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 446
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ S + ++ + +H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 447 FAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSVNCRCE 506
Query: 230 DCKNFEGSEERQAL 243
CKN G +E L
Sbjct: 507 GCKNVFGRKEGSLL 520
>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 1042
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 76/119 (63%), Gaps = 19/119 (15%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA G+ C CNC +C N NE R++A++ ERNPNAF+PKI
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEE-RKQAMDQITERNPNAFKPKI 712
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G H+KGCHCKKSGCLKKYCEC+Q+ ++C+ C C CKN
Sbjct: 713 QEK-----------------GFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 30/41 (73%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
C+CKKS CLK YC+CF + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARRE 157
K C+CK S CLK YCEC+ SG+ C C C C N EN A+ R+
Sbjct: 720 KGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKNCDENMASSRK 764
>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 89/145 (61%), Gaps = 13/145 (8%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA+G+ C CNC +CHNN ++ R ++ +ERNP AFRPK+
Sbjct: 195 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKV 254
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
S R N E+ E+ +H KGC+CKKS CLKKYCEC+Q + CSE C+C +CKN
Sbjct: 255 ES-----RSNSED--EIDQKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNC 307
Query: 235 E-----GSEERQALFHGDHANNMAY 254
E S +R H N Y
Sbjct: 308 EMPVKKESRKRVKHTHSSFKNTEMY 332
>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
lyrata]
Length = 623
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 99/163 (60%), Gaps = 9/163 (5%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
P SPK + R + + E K+CNCK S+CLKLYCECFA+G+YC + C+C++C N
Sbjct: 318 PISPKKKRRKSEQSGEGG-SSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPI 376
Query: 151 NE---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+E A R+ +E+ RNP AF PK+ + + E+ + +H +GC+CKKS
Sbjct: 377 HEDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 433
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
CLKKYCEC+Q + CS NCRC CKN G ++ +LF D N
Sbjct: 434 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 475
>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
Length = 226
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 80/152 (52%), Positives = 99/152 (65%), Gaps = 13/152 (8%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNC S+CLKLYC+CFA G +C CNCVNC NN+E+E R +A++A LERNP+AF P
Sbjct: 4 RKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHP 63
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI G + +HNKGCHCK+SGCLK YCEC++A ILC+ C+C CK
Sbjct: 64 KIGKGKVGESER-----------RHNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCK 112
Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
NFE S ER+ L H A + QQ NAA T
Sbjct: 113 NFEESPERKTLMHLADAAEVRVKQQ--NAAKT 142
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+V E+ + K C+CK S CLK YCEC+ + I C C C C N E+
Sbjct: 69 KVGESERRHNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCKNFEES 117
>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 609
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 9/163 (5%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
P SPK + R + + E K+CNCK S+CLKLYCECFA+G YC + C+C+NC N
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+ + A R+ +E+ RNP AF PK+ + + E+ + +H +GC+CKKS
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
CLKKYCEC+Q + CS NCRC CKN G ++ +LF D N
Sbjct: 423 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 464
>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
Length = 609
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 9/163 (5%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
P SPK + R + + E K+CNCK S+CLKLYCECFA+G YC + C+C+NC N
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+ + A R+ +E+ RNP AF PK+ + + E+ + +H +GC+CKKS
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
CLKKYCEC+Q + CS NCRC CKN G ++ +LF D N
Sbjct: 423 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 464
>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
Length = 348
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/257 (36%), Positives = 131/257 (50%), Gaps = 59/257 (22%)
Query: 44 PEHPQSQAVTANSHPPPQPQPKPVVPPVPPP---------PP-----PLPGQATQPPVRV 89
P +P SQ VT + QPQ + ++P P P PP P PG +
Sbjct: 18 PINPTSQIVTTS-----QPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVL 72
Query: 90 PKPESPKSRSRSNVEVKENT---------------------------PKKQKQCNCKHSR 122
P + + S V + N+ P+K + NC S
Sbjct: 73 PAQYVTQLQQSSYVSIASNSTFTGTSGIQTQARLPFNGIIPSESASRPRKPR--NCTKSL 130
Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
CL+LYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RNP AF+PKI G
Sbjct: 131 CLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGES 190
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
D +H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+
Sbjct: 191 DR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKT 239
Query: 243 LFHGDHANNMAYIQQAA 259
L H A + QQ A
Sbjct: 240 LMHLADAAEVRVQQQTA 256
>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
Length = 806
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R + + EN + K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E
Sbjct: 485 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 542
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
+ + RNP AF PK+ S + +E+ + +H +GC+CKKS CLKKY
Sbjct: 543 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 602
Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
CECFQ + CS +CRC CKN G ++
Sbjct: 603 CECFQGGVGCSISCRCEGCKNTFGRKD 629
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
HG+ K E + K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R + + EN + K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E
Sbjct: 485 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 542
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
+ + RNP AF PK+ S + +E+ + +H +GC+CKKS CLKKY
Sbjct: 543 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 602
Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
CECFQ + CS +CRC CKN G ++
Sbjct: 603 CECFQGGVGCSISCRCEGCKNTFGRKD 629
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
HG+ K E + K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537
>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
Length = 777
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R + + EN + K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E
Sbjct: 456 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 513
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
+ + RNP AF PK+ S + +E+ + +H +GC+CKKS CLKKY
Sbjct: 514 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 573
Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
CECFQ + CS +CRC CKN G ++
Sbjct: 574 CECFQGGVGCSISCRCEGCKNTFGRKD 600
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 30/55 (54%)
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
HG+ K E + K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 454 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508
>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
Length = 140
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 59/86 (68%), Positives = 71/86 (82%)
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL +A++ +I
Sbjct: 23 KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82
Query: 256 QQAANAAITGAIGSSGYASPPVSKKR 281
Q+AAN A+ GA GS GY + PV +K+
Sbjct: 83 QEAANFALNGATGSLGYKNSPVRRKK 108
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
PK K C+C+ S CLK YCECF + I C C C +C N E R+ V+
Sbjct: 21 IPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDC-KNFEGSEERKALVQ 72
>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
Length = 613
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/132 (48%), Positives = 81/132 (61%), Gaps = 4/132 (3%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
T +K CNCK ++CLKLYC+C A+G+YC D C C NC N ENE + E R+P
Sbjct: 282 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 341
Query: 168 NAFRPKIASSPHGTRDN-KEETGEVLM--LGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
AF P+I + T DN +E G +H +GC+CKKS C KKYCEC+QA I CS+
Sbjct: 342 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAGIGCSD 401
Query: 225 NCRCMDCKNFEG 236
CRC DC+N G
Sbjct: 402 GCRCEDCRNSFG 413
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 7/83 (8%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVEVKE---NTP---KKQKQCNCKHSRCLKLYCECFASG 134
++ P P+ +P + + NV ++ TP + ++ CNCK S C K YCEC+ +G
Sbjct: 337 ESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAG 396
Query: 135 IYC-DGCNCVNCHNNVENEAARR 156
I C DGC C +C N+ +A RR
Sbjct: 397 IGCSDGCRCEDCRNSFGIKADRR 419
>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1084
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 62/118 (52%), Positives = 79/118 (66%), Gaps = 1/118 (0%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
CNCK ++CLKLYCECFA+ C+GC C C N E EA R+EA +A LERN +AF PKI
Sbjct: 533 CNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKID 592
Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S + EE G H+KGC+CKKS CLKKYCEC+Q + C++ C+C +C+N
Sbjct: 593 SQSKLKQLCNEE-GVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRN 649
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
CNCK S CLK YCEC+ G+ C C C NC NN+ N E ++ L P+A
Sbjct: 617 CNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRNNI-NHPHSHEEPDSRLNATPSA 670
>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
gaditana CCMP526]
Length = 365
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 77/210 (36%), Positives = 108/210 (51%), Gaps = 36/210 (17%)
Query: 55 NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
NSHP +P V + P + + PP+ +P+ R S + ++ K
Sbjct: 106 NSHPGGSSRPVRVEAAIRRQNPSQEARRS-PPLNGSGGGTPQGRGTSGLAATRSS----K 160
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+CNC+ S+CLKLYCECFA G YC D C C++C+N + E R +AVE+ L RNP+AF K
Sbjct: 161 KCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEK 220
Query: 174 I------ASSPHGTRDNKE-----------ETGEVLMLG-------------KHNKGCHC 203
I A+ P ++ + GE M+G H +GC+C
Sbjct: 221 IQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNC 280
Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKN 233
KKS C+KKYCECF A + C ENC+C C+N
Sbjct: 281 KKSLCIKKYCECFFAGVYCGENCQCEGCQN 310
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA 154
+ CNCK S C+K YCECF +G+YC + C C C NN E A
Sbjct: 276 RGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQNNPEASAG 317
>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
nagariensis]
Length = 1987
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 63/147 (42%), Positives = 84/147 (57%), Gaps = 25/147 (17%)
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE---AARREAVEAT 162
+ + K C CK S+CLKLYC+CFA+G YC C+C++CHN E+ RRE + A
Sbjct: 780 RRTSENSSKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAA- 838
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
R+P AF KI +P+G GKH +GC+C+KS CLKKYCEC+Q + C
Sbjct: 839 --RDPQAFTRKIQLAPNGN-------------GKHKRGCNCRKSHCLKKYCECYQGGVKC 883
Query: 223 SENCRCMDCKNF------EGSEERQAL 243
C CM+C+N EG+ R AL
Sbjct: 884 GIQCTCMECENMDVGSSQEGAGARGAL 910
>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
Length = 897
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 61/128 (47%), Positives = 79/128 (61%), Gaps = 2/128 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+G++C D C+C +C N E E + RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568
Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI S N E+ +H +GC+CK+S CLKKYCEC+Q+N+ CS CRC C
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 628
Query: 232 KNFEGSEE 239
KN G +E
Sbjct: 629 KNVYGKKE 636
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVE-VKENTP---KKQKQCNCKHSRCLKLYCECFASGIY 136
++ P PK + + SN+E V TP + + CNCK S CLK YCEC+ S +
Sbjct: 559 ESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVG 618
Query: 137 C-DGCNCVNCHN 147
C GC C C N
Sbjct: 619 CSSGCRCEGCKN 630
>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
Length = 674
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 86/130 (66%), Gaps = 7/130 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N +E A R+ +E+ RNP A
Sbjct: 375 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLA 431
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + ++ ++ + +H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 432 FAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 491
Query: 230 DCKNFEGSEE 239
CKN G ++
Sbjct: 492 GCKNAFGRKD 501
>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
Length = 821
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 61/128 (47%), Positives = 79/128 (61%), Gaps = 2/128 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+G++C D C+C +C N E E + RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492
Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI S N E+ +H +GC+CK+S CLKKYCEC+Q+N+ CS CRC C
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 552
Query: 232 KNFEGSEE 239
KN G +E
Sbjct: 553 KNVYGKKE 560
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVE-VKENTP---KKQKQCNCKHSRCLKLYCECFASGIY 136
++ P PK + + SN+E V TP + + CNCK S CLK YCEC+ S +
Sbjct: 483 ESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVG 542
Query: 137 C-DGCNCVNCHN 147
C GC C C N
Sbjct: 543 CSSGCRCEGCKN 554
>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 638
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 101/178 (56%), Gaps = 28/178 (15%)
Query: 71 VPPP--PPPLPGQATQPPVRVPKPESPKSRSRSNVE-------VKENTPKKQ-----KQC 116
+PP PP LP A +P P P S S S+S ++ +N P++ ++C
Sbjct: 107 IPPATKPPDLPTFAAKPIAPRP-PVSTISTSQSPIQHPLKPHATLQNKPQQDSVLTSRRC 165
Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
+C S CLKLYCECFA+G++C C+CV C+N ++E R++A+ + R P+AF+ KIA
Sbjct: 166 SCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSKIAH 225
Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
S E V H +GC+CK+SGCLK YCEC++A + C+ CRC C N
Sbjct: 226 S--------LENATV-----HARGCNCKRSGCLKNYCECYEARVRCTSRCRCQYCYNM 270
>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
Length = 348
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 76/124 (61%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYCECFA+G+YC D C C +C N E E + + RNP AF P
Sbjct: 20 KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + N+E +H +GC+CKKS CLKKYCEC+QA + CS CRC CK
Sbjct: 80 KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCK 139
Query: 233 NFEG 236
N G
Sbjct: 140 NSYG 143
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVEVKENTP---KKQKQCNCKHSRCLKLYCECFASGIYC 137
+A P PK + S +N E TP + ++ CNCK S+CLK YCEC+ +G+ C
Sbjct: 70 EARNPLAFAPKVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGC 129
Query: 138 -DGCNCVNCHNNVENEAA 154
GC C C N+ +A
Sbjct: 130 SSGCRCEGCKNSYGKKAV 147
>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
Length = 772
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/135 (45%), Positives = 86/135 (63%), Gaps = 7/135 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN E + R+ +E+ RNP A
Sbjct: 462 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIES---RNPLA 518
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + + +H +GC+CKKS CLKKYCECFQ + CS +CRC
Sbjct: 519 FAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 578
Query: 230 DCKNFEGSEERQALF 244
CKN G E A+
Sbjct: 579 GCKNAFGKREGAAVL 593
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 27/45 (60%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
G K C CKKS CLK YCECF A + CSE C C+ C N + E
Sbjct: 458 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTE 502
Score = 45.8 bits (107), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 545 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 582
>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
Length = 717
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 94/158 (59%), Gaps = 20/158 (12%)
Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
E+ +++PKK+ K+CNCK S+CLKLYCECFA+G+YC + C+C C N
Sbjct: 400 ELNQSSPKKKRRRLEHGGETEGCKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKP 459
Query: 150 ENE---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
+E A R+ +E+ RNP AF PK+ S + +E + +H +GC+CKKS
Sbjct: 460 IHEDTVLATRKQIES---RNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKS 516
Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
CLKKYCEC+Q + CS NCRC CKN G ++ AL
Sbjct: 517 SCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 554
>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 365
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 108/210 (51%), Gaps = 36/210 (17%)
Query: 55 NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
NSHP +P V + P + + PP+ +P+ R S + ++ K
Sbjct: 106 NSHPGGSSRPVRVEAAIRRQNPSQEARRS-PPLNGSGGGTPQGRGTSGLAATRSS----K 160
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+CNC+ S+CLKLYCECFA G YC D C C++C+N + E R +AVE+ L RNP+AF K
Sbjct: 161 KCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEK 220
Query: 174 I------ASSPHGTRDNKE-----------ETGEVLMLG-------------KHNKGCHC 203
I A+ P ++ + GE M+G H +GC+C
Sbjct: 221 IQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNC 280
Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKN 233
KKS C++KYCECF A + C ENC+C C+N
Sbjct: 281 KKSLCIQKYCECFFAGVYCGENCQCEGCQN 310
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA 154
+ CNCK S C++ YCECF +G+YC + C C C NN E A
Sbjct: 276 RGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQNNPEASAG 317
>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 695
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 457
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + +H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 458 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 517
Query: 230 DCKNFEGSEERQAL 243
C N G ++ L
Sbjct: 518 GCTNVFGRKDGSLL 531
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
+ ++ CNCK S C+K YCEC+ G+ C C C C N + + +E+ LE N
Sbjct: 484 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 543
Query: 169 AFRPKIASSPHGTRDNKE 186
+ +IA H +KE
Sbjct: 544 TYEKRIAKIQHNVEVSKE 561
>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
Length = 695
Score = 128 bits (321), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 457
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + +H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 458 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 517
Query: 230 DCKNFEGSEERQAL 243
C N G ++ L
Sbjct: 518 GCTNVFGRKDGSLL 531
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
+ ++ CNCK S C+K YCEC+ G+ C C C C N + + +E+ LE N
Sbjct: 484 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 543
Query: 169 AFRPKIASSPHGTRDNKE 186
+ +IA H +KE
Sbjct: 544 TYEKRIAKIQHNVEVSKE 561
>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
Length = 738
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/135 (47%), Positives = 85/135 (62%), Gaps = 7/135 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C C N +E A R+ +E+ RNP A
Sbjct: 444 KRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIES---RNPLA 500
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ S + +E + +H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 501 FAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCE 560
Query: 230 DCKNFEGSEERQALF 244
CKN G ++ AL
Sbjct: 561 GCKNAFGRKDGSALI 575
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ G+ C C C C N
Sbjct: 527 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKN 564
>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
Length = 844
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 98/149 (65%), Gaps = 9/149 (6%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVEN 151
P +P V + + +K CNC S+CLKLYC+CFA+G +C+ CNC NCHNN+EN
Sbjct: 607 PTTPALSETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNLEN 666
Query: 152 EAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
E R++A+ L+RNPNAFRPKI S G + ++ +HNKGC+CK+SGCLK
Sbjct: 667 EELRQKAIRGCLDRNPNAFRPKIGKSKAGGPE---------IIRRHNKGCNCKRSGCLKN 717
Query: 212 YCECFQANILCSENCRCMDCKNFEGSEER 240
YCEC++A I CS C+C+ C+N E + ER
Sbjct: 718 YCECYEAKIACSSICKCVGCRNVEETLER 746
>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
[Cucumis sativus]
Length = 788
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 95/155 (61%), Gaps = 8/155 (5%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R +E T K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E
Sbjct: 474 SPKKKRR-RLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHE 532
Query: 153 ---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
A R+ +E+ RNP AF PK+ + + +E+ + +H +GC+CKKS CL
Sbjct: 533 DTVLATRKQIES---RNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCL 589
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
KKYCEC+Q + CS +CRC CKN G ++ ++
Sbjct: 590 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624
>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
Length = 788
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 95/155 (61%), Gaps = 8/155 (5%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R +E T K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E
Sbjct: 474 SPKKKRR-RLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHE 532
Query: 153 ---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
A R+ +E+ RNP AF PK+ + + +E+ + +H +GC+CKKS CL
Sbjct: 533 DTVLATRKQIES---RNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCL 589
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
KKYCEC+Q + CS +CRC CKN G ++ ++
Sbjct: 590 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624
>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
Length = 774
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 92/153 (60%), Gaps = 20/153 (13%)
Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN-- 147
+ ++N+PKK+ K+CNCK S+CLKLYCECFA+G+YC + C C +C N
Sbjct: 465 DFQQNSPKKKRRKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACRDCFNKP 524
Query: 148 -NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
+VE R+ +E+ RNP AF PK+ S + ++ + +H +GC+CKKS
Sbjct: 525 IHVETVLQTRQQIES---RNPLAFAPKVIRSSDSVSEIGDDLNKTPASARHKRGCNCKKS 581
Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
CLKKYCEC+Q + CS +CRC CKN G ++
Sbjct: 582 SCLKKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614
>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
Length = 767
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/135 (45%), Positives = 85/135 (62%), Gaps = 7/135 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN E + R+ +E+ RNP A
Sbjct: 457 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIES---RNPLA 513
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + + + + +H +GC+CKKS CLKKYCECFQ + CS +CRC
Sbjct: 514 FAPKVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 573
Query: 230 DCKNFEGSEERQALF 244
CKN G E A+
Sbjct: 574 GCKNAFGKREGAAVL 588
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 9/95 (9%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-----RQALFHGDHA 249
G K C CKKS CLK YCECF A + CSE C C+ C N + E RQ + +
Sbjct: 453 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQI---ESR 509
Query: 250 NNMAYIQQAANAAITGA-IGSSGYASPPVSKKRKG 283
N +A+ + + + +G +G +P ++ ++G
Sbjct: 510 NPLAFAPKVIHTSESGLELGDFSNKTPASARHKRG 544
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 540 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 577
>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
Length = 587
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 7/130 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN +E + R+ +E+ RNP A
Sbjct: 460 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIES---RNPLA 516
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + D + + + +H +GC+CKKS CLKKYCECFQ + CS +CRC
Sbjct: 517 FAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCE 576
Query: 230 DCKNFEGSEE 239
CKN G E
Sbjct: 577 GCKNAFGKRE 586
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 28/45 (62%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
G K C CKKS CLK YCECF A + CSE C C+ C+N + E
Sbjct: 456 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHME 500
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 543 RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 580
>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
Length = 584
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 73/111 (65%), Gaps = 12/111 (10%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+ CNC SRCLKLYCEC+A+G YC C C NC NN+ENE R+ AV +TL++NP+AFRP
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRP 466
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
KI K+ET H +GC C KS CL++YCECFQA + C+
Sbjct: 467 KI----------KQET-TTNRKANHLRGCKCTKSKCLQRYCECFQAGVPCT 506
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
+ C+C KS CLK YCEC+ A C C+C +C+N +E R+ L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451
>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
Length = 727
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 84/133 (63%), Gaps = 7/133 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C C N +E A R+ +E+ RNP A
Sbjct: 435 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIES---RNPLA 491
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ S + +E + +H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 492 FAPKVIRSSEPVTEIGDEFSKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 551
Query: 230 DCKNFEGSEERQA 242
CKN G ++ A
Sbjct: 552 GCKNAFGRKDGSA 564
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 26/39 (66%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C+CKKS CLK YCECF A + C E C C +C N
Sbjct: 431 GEACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFN 469
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ G+ C C C C N
Sbjct: 518 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 555
>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
Length = 921
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/131 (48%), Positives = 76/131 (58%), Gaps = 7/131 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K CNCK S+CLKLYC+CF +GI+C DGC C C N VE + E + RNP AF P
Sbjct: 520 KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAP 579
Query: 173 KIASSPHGTRDNKEETGEVLML----GKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
KI N E +V M +H +GC+CK+S C KKYCECFQAN+ CS CRC
Sbjct: 580 KIVPCAADVPPNNME--DVNMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSTGCRC 637
Query: 229 MDCKNFEGSEE 239
C N G E
Sbjct: 638 DGCMNAFGKRE 648
>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
Length = 665
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 88/155 (56%), Gaps = 15/155 (9%)
Query: 96 KSRSRSNVEVKENTPKKQ----KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
K+R SN K+ + K+CNCK ++CLKLYC+CFA+G YC + C C C N E
Sbjct: 465 KTRQASNCLTKKASTSTDDDTTKRCNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPE 524
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL------MLGKHNKGCHCK 204
E E + RNP AF PK+ + + TG+V+ +H +GC+CK
Sbjct: 525 YEDTVLETRQQIESRNPLAFAPKVMQTVTAS----PTTGKVVEKLSTPSSARHKRGCNCK 580
Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
KS CLKKYCEC+QAN+ CSE CRC CKN G E
Sbjct: 581 KSMCLKKYCECYQANVGCSEGCRCEGCKNIYGVRE 615
>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
distachyon]
Length = 771
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 95/171 (55%), Gaps = 25/171 (14%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECF +G++C + C+C C N N+E + RE +E+ RNP A
Sbjct: 465 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIES---RNPLA 521
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + ++ E++ + +H +GC+CKKS CLKKYCECFQ + CS +CRC
Sbjct: 522 FAPKVIRTSEPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 581
Query: 230 DCKNFEGSEERQALFHGDHA------------------NNMAYIQQAANAA 262
CKN G E L + A +N I QAAN A
Sbjct: 582 GCKNAFGRREGVVLLSIEQAKQGSVENIACVKEETSENDNQLVIYQAANPA 632
>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 288
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + +H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 289 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 348
Query: 230 DCKNFEGSEERQAL 243
C N G ++ L
Sbjct: 349 GCTNVFGRKDGSLL 362
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
+ ++ CNCK S C+K YCEC+ G+ C C C C N + + +E+ LE N
Sbjct: 315 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 374
Query: 169 AFRPKIASSPHGTRDNKE 186
+ +IA H +KE
Sbjct: 375 TYEKRIAKIQHNVEVSKE 392
>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
Length = 764
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 86/133 (64%), Gaps = 11/133 (8%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C CNC NC NN+++E R A+++ L+RNP AF PKI G RD +
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRR- 587
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE S ER+ L H
Sbjct: 588 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFEESPERKTLMHL 637
Query: 247 DHANNMAYIQQAA 259
A + QQ A
Sbjct: 638 ADAAEVRVQQQTA 650
>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
Length = 526
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 288
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + +H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 289 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 348
Query: 230 DCKNFEGSEERQAL 243
C N G ++ L
Sbjct: 349 GCTNVFGRKDGSLL 362
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
+ ++ CNCK S C+K YCEC+ G+ C C C C N + + +E+ LE N
Sbjct: 315 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 374
Query: 169 AFRPKIASSPHGTRDNKE 186
+ +IA H +KE
Sbjct: 375 TYEKRIAKIQHNVEVSKE 392
>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 510
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 70/122 (57%), Gaps = 15/122 (12%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK SRCLKLYC+CFA+ +C CNC NC N R EA+ +ERNPNAF+ K
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
R N E+ H GC C+KS CLKKYCECF A CS+ C C+ C+N
Sbjct: 285 -------RPNGEDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330
Query: 235 EG 236
G
Sbjct: 331 AG 332
>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1040
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 89/162 (54%), Gaps = 39/162 (24%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNP 167
T K++ +CNCK S+CLKLYC+CF G +C+ CNC C N N+A R A++ ERNP
Sbjct: 707 TSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERESAIKQLQERNP 766
Query: 168 NAFRPKIASS-------------------------------PHGTRDNKEETGEVLMLGK 196
+AF+PKI + P +D++ E+L+
Sbjct: 767 DAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKNQIQYLIPKIIKDDE----EILI--- 819
Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
HNKGC+CKKSGC KKYCEC+ + CS+ C+C C+N + SE
Sbjct: 820 HNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSE 861
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE-ERQA 242
C+CKKS CLK YC+CF C+ C C +C N E ++ ER++
Sbjct: 714 CNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERES 756
>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
Length = 561
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 89/155 (57%), Gaps = 8/155 (5%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
SPK + R + +V K+CNCK +RCLK YC+CFA+GIYC + C C C N +E E
Sbjct: 407 SPKKK-RQDHKVSSTDGDGCKRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYE 465
Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML----GKHNKGCHCKKSGC 208
EA + NP AF PKI H T + + + G+H GC+CK+S C
Sbjct: 466 DTVLEARQQKESHNPLAFAPKIVQ--HVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMC 523
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
+KKYCEC+QAN+ CS CRC CKN G +E + L
Sbjct: 524 VKKYCECYQANVGCSNACRCEGCKNIHGRKEGEKL 558
>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
thaliana]
Length = 639
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 14/151 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C C+ S+CLKLYCECF++G++C + C+C NC N +E ++ E RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ S+ D E + +H +GC+C+KSGC KKYCECF + CS NCRCM CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572
Query: 233 NFEGSEERQ-------------ALFHGDHAN 250
N G Q A +GDH +
Sbjct: 573 NTFGHTNEQCAGDSDVVTINDEAKHYGDHGD 603
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ ++ CNC+ S C K YCECF G+ C C C+ C N
Sbjct: 536 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKN 573
>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
Length = 896
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 64/137 (46%), Positives = 77/137 (56%), Gaps = 10/137 (7%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYC+CFA+G YC D C C C N E E + RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533
Query: 173 KIAS-----SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
KI S H +N +H +GC+CK+S CLKKYCEC+QAN+ CS CR
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSS----ARHKRGCNCKRSMCLKKYCECYQANVGCSSGCR 589
Query: 228 CMDCKNFEGSEERQALF 244
C CKN G +E F
Sbjct: 590 CEGCKNVHGKKEDYVAF 606
>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
lyrata]
Length = 611
Score = 125 bits (315), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 59/124 (47%), Positives = 78/124 (62%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C CK S+CLKLYCECFA+G+YC + C+C NC N +E ++ E RNP AF P
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKSREVIEARNPLAFAP 490
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ S+ D E + +H +GC+C+KSGCLKKYCEC + CS +CRC+ CK
Sbjct: 491 KVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVRCSSSCRCIGCK 550
Query: 233 NFEG 236
N G
Sbjct: 551 NVFG 554
>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
Length = 760
Score = 125 bits (313), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 92/153 (60%), Gaps = 20/153 (13%)
Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN-- 147
+ ++N+PKK+ K+CNCK S+CLKLYCECFA+G+YC + C C +C N
Sbjct: 452 DFQQNSPKKKRCKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACHDCFNKP 511
Query: 148 -NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
+VE R+ +E+ RNP AF PK+ S + ++ + +H +GC+CKKS
Sbjct: 512 IHVETVLQTRQQIES---RNPLAFAPKVIRSSDSVSEIGDDPNKTPASARHKRGCNCKKS 568
Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
CLKKYCEC+Q + CS +CRC CKN G ++
Sbjct: 569 SCLKKYCECYQGGVGCSISCRCEGCKNTYGRKD 601
>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
[Nannochloropsis gaditana CCMP526]
Length = 511
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 70/122 (57%), Gaps = 15/122 (12%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK SRCLKLYC+CFA+ +C CNC NC N R EA+ +ERNPNAF+ K
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
R N E+ H GC C+KS CLKKYCECF A CS+ C C+ C+N
Sbjct: 285 -------RPNGEDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330
Query: 235 EG 236
G
Sbjct: 331 AG 332
>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
Length = 503
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/165 (40%), Positives = 96/165 (58%), Gaps = 27/165 (16%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ C+ NC NN+ +E R +A++A L+RNP AF+PKI + G D K
Sbjct: 302 YCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLG--DVKP 359
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
+HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN++ S ER+ L
Sbjct: 360 ---------RHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDESPERKTLM-- 408
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP----PVSKKRKGQELF 287
NA G + S ++SP + K RK + F
Sbjct: 409 ----------SLPNAMEIGRLDDSHFSSPAKFSALPKSRKDRRSF 443
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 44/84 (52%), Gaps = 10/84 (11%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
R P+ PK + + +VK P+ K CNC+ S CLK YCEC+ + I C C C+ C
Sbjct: 340 RNPEAFQPKIGTGNLGDVK---PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 396
Query: 147 NNVENEAARREAVEATLERNPNAF 170
N +E+ R+ TL PNA
Sbjct: 397 NY--DESPERK----TLMSLPNAM 414
>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
Length = 780
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/135 (45%), Positives = 87/135 (64%), Gaps = 8/135 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECF +G++C + C+C C N N+E + RE +E+ RNP A
Sbjct: 477 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 533
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + G ++ +++ + +H +GC+CKKS CLKKYCEC+Q + CS +CRC
Sbjct: 534 FAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCE 592
Query: 230 DCKNFEGSEERQALF 244
CKN G E AL
Sbjct: 593 GCKNAFGRREGVALL 607
>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
distachyon]
Length = 787
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 77/124 (62%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C+ C N +E + RNP AF P
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 534
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ ++ +E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC CK
Sbjct: 535 KVIRLSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCK 594
Query: 233 NFEG 236
N G
Sbjct: 595 NTFG 598
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 13/62 (20%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---------NVENEAA 154
NTP + ++ CNCK S CLK YCEC+ G+ C C C +C N +VENE
Sbjct: 552 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNTFGMRDGAVSVENEEM 611
Query: 155 RR 156
++
Sbjct: 612 KQ 613
>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 760
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/135 (45%), Positives = 87/135 (64%), Gaps = 8/135 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECF +G++C + C+C C N N+E + RE +E+ RNP A
Sbjct: 457 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 513
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + G ++ +++ + +H +GC+CKKS CLKKYCEC+Q + CS +CRC
Sbjct: 514 FAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCE 572
Query: 230 DCKNFEGSEERQALF 244
CKN G E AL
Sbjct: 573 GCKNAFGRREGVALL 587
>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
Length = 774
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 14/153 (9%)
Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
+ +N+PKK+ K+CNCK S+CLKLYCECFA+G+YC + C+C +C N
Sbjct: 463 DFNQNSPKKKRRKSETPGETEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 522
Query: 150 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
+E + + RNP AF PK+ + + ++ + +H +GC+CKKS CL
Sbjct: 523 IHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCL 582
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
KKYCEC+Q + CS +CRC CKN G ++ A
Sbjct: 583 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615
>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
Length = 792
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 71/119 (59%), Gaps = 13/119 (10%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA + C CNC +C+NN E+ R +A+ LERNP AF K
Sbjct: 26 CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKF 85
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
R+ ++ V H GC C+KS CLKKYCECF + CSE C C+ C+N
Sbjct: 86 -------RETEKHDAVV-----HKTGCKCRKSACLKKYCECFNKGVACSEKCNCVGCRN 132
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
C C+ S CLK YCECF G+ C + CNCV C N V
Sbjct: 100 CKCRKSACLKKYCECFNKGVACSEKCNCVGCRNTV 134
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%)
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
GC+CK+S CLK YC+CF ++C+ +C C C N
Sbjct: 25 GCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYN 58
>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
Length = 764
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 14/153 (9%)
Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
+ +N+PKK+ K+CNCK S+CLKLYCECFA+G+YC + C+C +C N
Sbjct: 453 DFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 512
Query: 150 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
+E + + RNP AF PK+ + + ++ + +H +GC+CKKS CL
Sbjct: 513 IHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCL 572
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
KKYCEC+Q + CS +CRC CKN G ++ A
Sbjct: 573 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 605
>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
Length = 774
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 81/130 (62%), Gaps = 1/130 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+CNCK S+CLKLYCECFA+G+YC + C+C +C N +E + + RNP AF P
Sbjct: 486 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAP 545
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + ++ + +H +GC+CKKS CLKKYCEC+Q + CS +CRC CK
Sbjct: 546 KVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 605
Query: 233 NFEGSEERQA 242
N G ++ A
Sbjct: 606 NAFGRKDGSA 615
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 22/79 (27%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNA 169
+ ++ CNCK S CLK YCEC+ G+ C C C C N A
Sbjct: 569 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKN---------------------A 607
Query: 170 FRPKIASSPHGTRDNKEET 188
F K S+P G + EET
Sbjct: 608 FGRKDGSAPVGIETDPEET 626
>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
Length = 450
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 71/137 (51%), Positives = 83/137 (60%), Gaps = 14/137 (10%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNP 167
N +K CNC S CLKLYCECFA G C CNC NC NN+ E R A++ TLERNP
Sbjct: 139 NLTNSRKPCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNP 198
Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
AF PKI G R KH KGC+CK+SGCLK YCEC++A I CS+ CR
Sbjct: 199 TAFHPKIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCR 244
Query: 228 CMDCKNFEGSEERQALF 244
C C+N E S ER++L
Sbjct: 245 CQGCRNTEDSVERRSLM 261
>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
Length = 798
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 86/133 (64%), Gaps = 11/133 (8%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C+ CNC NC NN+E+E R +A++ L+RNP AF+PKI G D +
Sbjct: 590 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR- 648
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H
Sbjct: 649 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHL 698
Query: 247 DHANNMAYIQQAA 259
A + QQ A
Sbjct: 699 ADAAEVRVQQQTA 711
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 648 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEES 689
>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
Length = 800
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 74/124 (59%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E + RNP AF P
Sbjct: 482 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAP 541
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC CK
Sbjct: 542 KVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCK 601
Query: 233 NFEG 236
N G
Sbjct: 602 NAFG 605
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C CKKS CLK YCECF A + CSE C C C N
Sbjct: 478 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMN 516
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE 160
NTP + ++ CNCK S CLK YCEC+ G+ C C C +C N A E E
Sbjct: 559 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNAFGRRDAETELPE 615
>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
Length = 712
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 82/150 (54%), Gaps = 28/150 (18%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K CNCK S+CLKLYCECFA+ +C C C C N+ E++ R A E L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508
Query: 173 KIASSP----------HGTRDNKEETGEVLMLG-----------------KHNKGCHCKK 205
K+ + T+ N +E + L+ G +H +GC+CKK
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568
Query: 206 SGCLKKYCECFQANILCSENCRCMDCKNFE 235
S C KKYCECFQ + CS+ C+C +CKN E
Sbjct: 569 SNCQKKYCECFQQGVNCSDLCKCEECKNNE 598
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 44/110 (40%), Gaps = 26/110 (23%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN------------------------- 233
K C+CKKS CLK YCECF N C NC C C N
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508
Query: 234 -FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
E ++ +LF D IQ+ ++A ++G + + VS K K
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDK 558
>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 782
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/146 (42%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 95 PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA 153
PK R +S+ E+ K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E
Sbjct: 458 PKKRRKSDNGDDESC----KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 513
Query: 154 ARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYC 213
+ RNP AF PK+ ++ +E+ +H +GC+CKKS CLKKYC
Sbjct: 514 IVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYC 573
Query: 214 ECFQANILCSENCRCMDCKNFEGSEE 239
EC+Q + CS NCRC CKN G+ +
Sbjct: 574 ECYQGGVGCSNNCRCETCKNTFGTRD 599
>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
Length = 475
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 100/189 (52%), Gaps = 22/189 (11%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 382
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPP-VSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
H + + + + + ++ PP + K R+ + L H ++
Sbjct: 383 PH------YMEPGDFESSHYLSPAKFSGPPKLRKNRQAFSCISWEVVEATCACLLAHGEE 436
Query: 306 GNHIRTSPS 314
H SPS
Sbjct: 437 AEHEHCSPS 445
Score = 46.2 bits (108), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
+ K CNCK S CLK YCEC+ + I C C C+ C N E R+ + P
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 390
Query: 170 FR------PKIASSPHGTRDNKE 186
F P S P R N++
Sbjct: 391 FESSHYLSPAKFSGPPKLRKNRQ 413
>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 490
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 72/119 (60%), Gaps = 5/119 (4%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C CK +RCLKLYC+CF IYC C C++CHN E AR+ A++ L RNP+AF K
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+P ET L+ H GC C+KS C+KKYCEC+ ++ C+ +CRC CKN
Sbjct: 277 QKTP--VDGATVETPSKLLT--HKLGCKCRKSACMKKYCECYAGHVYCNTHCRCTGCKN 331
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 26/40 (65%)
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
GC CKK+ CLK YC+CF I C NCRC+DC N E+
Sbjct: 216 GCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQED 255
>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
Length = 776
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 77/130 (59%), Gaps = 1/130 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E + RNP AF P
Sbjct: 464 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 523
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC CK
Sbjct: 524 KVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCK 583
Query: 233 NFEGSEERQA 242
N G + ++
Sbjct: 584 NTFGRRDAES 593
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C CKKS CLK YCECF A + CSE C C C N
Sbjct: 460 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 498
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
NTP + ++ CNCK S CLK YCEC+ G+ C C C +C N A E E
Sbjct: 541 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCKNTFGRRDAESELTE 597
>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
Length = 620
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 66/110 (60%), Gaps = 13/110 (11%)
Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
LYCECFA+G++C C+C NC N +N + + RNPNAF KI +S G
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG---- 332
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
G+H KGCHCKKS CLKKYCECFQA +LC + C+C CKN E
Sbjct: 333 ---------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373
>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
Length = 710
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 87/169 (51%), Gaps = 36/169 (21%)
Query: 94 SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE-- 150
+PK R +S + E K+C+CK S+CLKLYCECFA+G+YC D C C +C N E
Sbjct: 373 TPKKRRKS---LGEKAGDGCKRCHCKKSKCLKLYCECFAAGVYCLDSCACRDCFNKPEFA 429
Query: 151 -NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML--------------- 194
R+ +E RNP AF PKI T DN G +
Sbjct: 430 DTVMGTRQQIET---RNPLAFAPKIVQ----TSDNTPTEGVGIWFSGHAMTNSLFALQPN 482
Query: 195 -------GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
+H +GC+CKKS CLKKYCEC+QA + CS+ C+C CKN G
Sbjct: 483 CPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 531
>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
Length = 407
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 83/148 (56%), Gaps = 19/148 (12%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
+ P++ K C+CK S CLKLYCEC A+ CD CNC C N E A R AV A LER
Sbjct: 42 QTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILER 101
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
NP AF PK+AS +H +GC+C+KSGC+K YCEC QA + C+
Sbjct: 102 NPLAFLPKVASGS----------------SQHLRGCNCRKSGCMKNYCECHQAGVACTSR 145
Query: 226 CRCMDCKNFEG--SEERQALFHGDHANN 251
C C C+N E S ++ +F G A++
Sbjct: 146 CACHQCRNTETFVSAKKMLVFAGVSADD 173
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEAT 162
+V + + + CNC+ S C+K YCEC +G+ C C C C N +A++ V A
Sbjct: 109 KVASGSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAG 168
Query: 163 LERNPNAFR 171
+ + ++ R
Sbjct: 169 VSADDSSLR 177
>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 394
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 84/148 (56%), Gaps = 19/148 (12%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
+ P++ K C+CK S CLKLYCEC A+ CD CNC C N E A R AV A L+R
Sbjct: 42 QTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDR 101
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
NP AF+PK+AS +H +GC+C+KSGC+K YCEC QA + C+
Sbjct: 102 NPLAFQPKVASGS----------------SQHLRGCNCRKSGCMKNYCECHQAGVACTSR 145
Query: 226 CRCMDCKNFEG--SEERQALFHGDHANN 251
C C C+N E S ++ +F G A++
Sbjct: 146 CACHQCRNTETFVSAKKMLVFAGVSADD 173
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC+ S C+K YCEC +G+ C C C C N +A++ V A + + N R I
Sbjct: 121 CNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSADDNDLRVTI 180
>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 974
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/124 (45%), Positives = 76/124 (61%), Gaps = 7/124 (5%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK ++CLKLYC+CFA G C C CV C NN R +A+E L+RNP AF K
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVK- 694
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
P + +K+E ++ ++K GC+C+KS CLKKYC C+ + + CSE C+C +CK
Sbjct: 695 ---PEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTECK 751
Query: 233 NFEG 236
N E
Sbjct: 752 NLEN 755
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
C+CKK+ CLK YC+CF LC ++C+C++C N E +
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEAT 672
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 87 VRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNC 145
V E+ KS+ N + +N+ K CNC+ S CLK YC C+ SG+ C + C C C
Sbjct: 691 FNVKPEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTEC 750
Query: 146 HNNVEN 151
N+EN
Sbjct: 751 K-NLEN 755
>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
Length = 782
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E + RNP AF P
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 527
Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
K+ +D E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC C
Sbjct: 528 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 587
Query: 232 KNFEGSEE 239
KN G +
Sbjct: 588 KNTFGKRD 595
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C CKKS CLK YCECF A + CSE C C C N
Sbjct: 464 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 502
Score = 46.6 bits (109), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
NTP + ++ CNCK S CLK YCEC+ G+ C C C C N +R+A +T
Sbjct: 546 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 600
>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
Length = 794
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E + RNP AF P
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 527
Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
K+ +D E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC C
Sbjct: 528 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 587
Query: 232 KNFEGSEE 239
KN G +
Sbjct: 588 KNTFGKRD 595
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C CKKS CLK YCECF A + CSE C C C N
Sbjct: 464 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 502
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
NTP + ++ CNCK S CLK YCEC+ G+ C C C C N +R+A +T
Sbjct: 546 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 600
>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
Length = 782
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFA+G+YC + C+C C N +E + RNP AF P
Sbjct: 456 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 515
Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
K+ +D E+ +H +GC+CKKS CLKKYCEC+Q + CS NCRC C
Sbjct: 516 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 575
Query: 232 KNFEGSEE 239
KN G +
Sbjct: 576 KNTFGKRD 583
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G+ K C CKKS CLK YCECF A + CSE C C C N
Sbjct: 452 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 490
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)
Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
NTP + ++ CNCK S CLK YCEC+ G+ C C C C N +R+A +T
Sbjct: 534 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 588
>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
Length = 475
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 78/122 (63%), Gaps = 15/122 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 382
Query: 247 DH 248
H
Sbjct: 383 PH 384
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
+ K CNCK S CLK YCEC+ + I C C C+ C N E R+ + P
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 390
Query: 170 FR------PKIASSPHGTRDNKE 186
F P S P R N++
Sbjct: 391 FESSHYLSPAKFSGPPKLRKNRQ 413
>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
Length = 380
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 93/161 (57%), Gaps = 23/161 (14%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C C+C NN+ +E R +A++A ++RNP AF+PK+ G +
Sbjct: 183 YCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSKLR- 237
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 238 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMNT 287
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELF 287
H +A + + + ++ PP K RK ++ F
Sbjct: 288 PH------YMEAGDFEGNHHLSPAKFSGPP--KLRKNRQTF 320
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 237 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEES 278
>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 805
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 59/131 (45%), Positives = 79/131 (60%), Gaps = 15/131 (11%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
KK + CNCK ++CLKLYC+CFA+G +C CNC C N V N+ AR + VE LE+NP A
Sbjct: 475 KKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFA 534
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNK-----GCHCKKSGCLKKYCECFQANILCSE 224
F +D + E L L K GC+C++SGCLKKYC+C+Q + C E
Sbjct: 535 F---------NVKDIEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCYQDGLQCGE 585
Query: 225 NCRCMDCKNFE 235
+C+C C+N E
Sbjct: 586 HCKCNGCENCE 596
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-RQALFHG 246
K + C+CKK+ CLK YC+CF A C C C +C N ++E R + G
Sbjct: 474 NKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEG 526
>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
Length = 363
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/133 (52%), Positives = 82/133 (61%), Gaps = 14/133 (10%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNC S CLKLYCECFA G C CNC NC NN+ E R A++ TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI G R KH KGC+CK+SGCLK YCEC++A I CS+ CRC C+
Sbjct: 204 KIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249
Query: 233 NFEGSEERQALFH 245
N E S ER++L
Sbjct: 250 NTEDSVERRSLMR 262
>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
Length = 364
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/133 (52%), Positives = 82/133 (61%), Gaps = 14/133 (10%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+K CNC S CLKLYCECFA G C CNC NC NN+ E R A++ TLERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI G R KH KGC+CK+SGCLK YCEC++A I CS+ CRC C+
Sbjct: 205 KIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 250
Query: 233 NFEGSEERQALFH 245
N E S ER++L
Sbjct: 251 NTEDSVERRSLMR 263
>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 471
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 60/135 (44%), Positives = 77/135 (57%), Gaps = 21/135 (15%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
KKQK CNCK ++CLKLYC+CFA+G C G CNC C N N R A+ + ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFN-TSNSEKRAGAIVSIMDRQPDA 185
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + H KGC+C +SGCLKKYCEC+ + C E C+C
Sbjct: 186 FGPKVQQNTHK------------------KGCNCTRSGCLKKYCECYTLGVNCGEYCKCT 227
Query: 230 DCKNFE-GSEERQAL 243
CKN + ER AL
Sbjct: 228 QCKNVDLNHSERLAL 242
>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 76/111 (68%), Gaps = 11/111 (9%)
Query: 124 LKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
LKLYCECF G YC C+C CHN + R++A+++ L+RN +AF+PK+A S G +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS--GKQ 262
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
N + GKH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 263 VNNK--------GKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 6/64 (9%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
KP+ KS + N N K + CNCK S CLK YCEC+ +G+ C D CNCV C N
Sbjct: 253 KPKVAKSGKQVN-----NKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKNRD 307
Query: 150 ENEA 153
+++A
Sbjct: 308 DHDA 311
>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
Length = 664
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/128 (44%), Positives = 75/128 (58%), Gaps = 19/128 (14%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNA 169
++Q+ CNCK S CLK+YCECFA+G +C C+C++C N A E L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 228
F K+ ++ TG H +GC CK+S CLKKYCECF A C+ E C+C
Sbjct: 320 FEVKVTAA----------TG-------HRRGCRCKRSKCLKKYCECFHAGARCNPEVCQC 362
Query: 229 MDCKNFEG 236
DC+N EG
Sbjct: 363 EDCRNTEG 370
>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
CCMP2712]
Length = 113
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/122 (48%), Positives = 79/122 (64%), Gaps = 12/122 (9%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
++K C+CK SRCLKLYCECFA+G C GC CV+C N+ ++E R +AV+ +RN NAF
Sbjct: 4 RKKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFA 63
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
PKI + G H +GC CKKS CLKKYCEC+QA + C++ C+C +C
Sbjct: 64 PKIVDEIQQDK------------GMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKCEEC 111
Query: 232 KN 233
+N
Sbjct: 112 QN 113
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
E++++ + C CK S CLK YCEC+ +G+ C D C C C N
Sbjct: 69 EIQQDKGMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKCEECQN 113
>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific
Length = 475
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 77/122 (63%), Gaps = 15/122 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I CS C+C+ CKN+E S ER+ L
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKMLMST 382
Query: 247 DH 248
H
Sbjct: 383 PH 384
Score = 45.4 bits (106), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEES 373
>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
norvegicus]
Length = 467
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 77/122 (63%), Gaps = 15/122 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 270 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 324
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I CS C+C+ CKN+E S ER+ L
Sbjct: 325 ----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKMLMST 374
Query: 247 DH 248
H
Sbjct: 375 PH 376
Score = 45.4 bits (106), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 324 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEES 365
>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
Length = 394
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/111 (48%), Positives = 75/111 (67%), Gaps = 11/111 (9%)
Query: 124 LKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
LKLYCECF G YC C+C CHN + R++A+++ L+RN +AF+PK+A S G +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS--GKQ 262
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
N + GKH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 263 VNNK--------GKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 6/64 (9%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
KP+ KS + N N K + CNCK S CLK YCEC+ +G+ C D CNCV C N
Sbjct: 253 KPKVAKSGKQVN-----NKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKNRD 307
Query: 150 ENEA 153
+++A
Sbjct: 308 DHDA 311
>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
Length = 662
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 76/132 (57%), Gaps = 1/132 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+ CNC+ S+CLKLYC CFA+ +YC C+C C NN +E + T NP AF P
Sbjct: 365 RHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNPLAFAP 424
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
+ + + +++ + +H +GC+C+KS CLKKYCECFQ+ + C +CRCM CK
Sbjct: 425 TVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSGVGCCISCRCMSCK 484
Query: 233 NFEGSEERQALF 244
N G E L
Sbjct: 485 NSFGKREGVLLL 496
>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
Length = 360
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 27/149 (18%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 163 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 217
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 218 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 267
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP 275
H Y++ G SS Y SP
Sbjct: 268 PH-----YME-------PGDFESSHYLSP 284
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 217 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEES 258
>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 757
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFAS +YC + C+C C N+ +E RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 521
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + E++ +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 522 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 581
Query: 233 NFEG 236
N G
Sbjct: 582 NSFG 585
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502
>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 766
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFAS +YC + C+C C N+ +E RNP AF P
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 530
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + E++ +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 531 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 590
Query: 233 NFEG 236
N G
Sbjct: 591 NSFG 594
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511
>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
Length = 260
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 71 VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRCLK 125
V L Q + + + +++S + + N +K Q+QCNCK S+CLK
Sbjct: 14 VKNKQKKLINQIKKKSSFLTQKSHSQTKSEHTINQQANNIQKLKALQQRQCNCKKSKCLK 73
Query: 126 LYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
LYC+CFA G YC+ C+C C NN N+ R A++ ERNPNAF +I
Sbjct: 74 LYCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAFTSRIIQQSQ----- 128
Query: 185 KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
L H +GC+CKKSGC KKYC+C+ + CS CRC+ C+N E ++
Sbjct: 129 ---------LNNHQRGCNCKKSGCQKKYCQCYLDGMECSRYCRCIGCENCNIFENKKICL 179
Query: 245 HGDHAN--------NMAYIQQAANAAITGAI 267
+ + N YIQ+ I I
Sbjct: 180 NNKNIEKQIQQQKGNSDYIQKDQQQNILNQI 210
>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
Length = 295
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 27/149 (18%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 98 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 152
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 153 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 202
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP 275
H Y++ G SS Y SP
Sbjct: 203 PH-----YME-------PGDFESSHYLSP 219
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
+ K CNCK S CLK YCEC+ + I C C C+ C N E R+ + P
Sbjct: 152 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 210
Query: 170 FR------PKIASSPHGTRDNKE 186
F P S P R N++
Sbjct: 211 FESSHYLSPAKFSGPPKLRKNRQ 233
>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 766
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 7/130 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECF +G++C + C+C +C N N+E + R+ +E+ RNP A
Sbjct: 459 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIES---RNPLA 515
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + ++ E + +H +GC+CKKS CLKKYCEC+Q + CS +CR
Sbjct: 516 FAPKVIRTSEPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRYE 575
Query: 230 DCKNFEGSEE 239
CKN G E
Sbjct: 576 GCKNAFGRRE 585
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 25/39 (64%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
G K C CKKS CLK YCECF A + CSE C C DC N
Sbjct: 455 GTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLN 493
Score = 42.4 bits (98), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNC-HNNVENEAARREAV 159
+ ++ CNCK S CLK YCEC+ G+ C ++C + +N RRE +
Sbjct: 542 RHKRGCNCKKSSCLKKYCECYQGGVGCS----ISCRYEGCKNAFGRREGL 587
>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
Length = 644
Score = 115 bits (288), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C+CK S+CLKLYCECFAS +YC + C+C C N+ +E RNP AF P
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 408
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ + + E++ +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 409 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 468
Query: 233 NFEG 236
N G
Sbjct: 469 NSFG 472
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389
>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 978
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 73/119 (61%), Gaps = 9/119 (7%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK ++CLKLYCECFA YC +GC+C C N ENE R+ A++ RN +AF+PK
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S R NK ++ KGC+C+K+ CLKKYCECF A C+ C+C C+N
Sbjct: 465 ESE--QERKNKANNEQM------KKGCNCRKTKCLKKYCECFNAGTYCNNMCKCDSCEN 515
>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 78/136 (57%), Gaps = 19/136 (13%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF--- 170
QCNCK S+CLKLYCECFA+ C CNC C N ++N R +A+E L RNP AF
Sbjct: 163 QCNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTI 222
Query: 171 ------RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
+P+I P ++ +ET KGC+CKKS C KKYCEC+ N C++
Sbjct: 223 LTNNGQQPQIIPEPKSQKEQSKETK---------KGCNCKKSECKKKYCECYSINQKCTD 273
Query: 225 NCRCMDCKNFEGSEER 240
C+C +C N E +E+
Sbjct: 274 LCKCENCLNKEEPQEQ 289
>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
Length = 781
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 85/141 (60%), Gaps = 3/141 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
++CNCK SRCLKLYCECFA+GIYC D C+CVNC N E E + + T RNP AF P
Sbjct: 441 RRCNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAP 500
Query: 173 KIASSPHGTRDNKEETGEVLMLG--KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
K+ ++ + N E G+ + +H KGC+CKKS C KKYCECFQ + C CRC
Sbjct: 501 KVVNNATNSPANMMEEGKWMKTSSSRHKKGCNCKKSKCSKKYCECFQGGVGCCNGCRCEG 560
Query: 231 CKNFEGSEERQALFHGDHANN 251
C N G++ + + NN
Sbjct: 561 CYNPYGNKTETSYRRAERWNN 581
>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1114
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 71/127 (55%), Gaps = 11/127 (8%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC ++CLKLYC CF G+ C D C C +C+N + R A++A +R P+AF K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773
Query: 175 ASSPHGTRDNKEETGEVL--------MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P R K E E L L H KGC+CKKS C KKYCECF A + C+ C
Sbjct: 774 NFLPELAR--KIEDAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECFLAGVKCTYLC 831
Query: 227 RCMDCKN 233
RC C+N
Sbjct: 832 RCEQCQN 838
>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
Length = 929
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 69/117 (58%), Gaps = 14/117 (11%)
Query: 126 LYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
LYC+CFA+G YC C+C +C N VEN A EA +RNPNAF KI +
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA-------- 591
Query: 185 KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
+ + G+H +GC+CKKS C+KKYCECFQA + C E+C+C C N G R+
Sbjct: 592 -----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
+ ++ CNCK S C+K YCECF +G+ C + C C +CHN + R
Sbjct: 598 QHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643
>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 712
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 89/194 (45%), Gaps = 43/194 (22%)
Query: 83 TQPPVRVPKPESPKSRS-----RSNVEVKENTPKKQKQ---------CNCKHSRCLKLYC 128
T PP P P P S RS++ ++ T + + C CK S+CLKLYC
Sbjct: 291 TNPPANYPAPTIPSRPSVPVAQRSSIVLQAATARLKAHHYTDNSSVTCTCKKSKCLKLYC 350
Query: 129 ECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPK-------------- 173
CF++ C C C C N E R +AV+ATL+RNPNAFR K
Sbjct: 351 HCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAKEW 410
Query: 174 -------------IASSPHGTRDNKEETGEVLMLG-KHNKGCHCKKSGCLKKYCECFQAN 219
+ SP T+ E V + H GC+C+KS CLK+YCECF A
Sbjct: 411 EKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFGAQ 470
Query: 220 ILCSENCRCMDCKN 233
C NC+C+ C+N
Sbjct: 471 TPCGLNCKCLGCQN 484
>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
Length = 386
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 6/130 (4%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
QCNCK S+CLKLYCECFA+ C CNC +C N ++N R +A+E L RNP+AF
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250
Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
+++ + K N KGC+CKKSGC KKYCEC+ N+ CS+ C+C
Sbjct: 251 FQQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310
Query: 229 MDCKNFEGSE 238
C N +E
Sbjct: 311 EQCLNRTDAE 320
>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 1022
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 71/118 (60%), Gaps = 6/118 (5%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
C CK ++CLKLYCECF + YC G NC C N E E R+ A+E +RN +AF PK
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825
Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S K ++G + H+KGC+CKKS C KKYCEC+Q C+ C+C +CKN
Sbjct: 826 KSDQYY---KSDSGTKAI---HSKGCNCKKSDCRKKYCECYQMGAKCTHLCKCYNCKN 877
>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
Length = 478
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/126 (44%), Positives = 75/126 (59%), Gaps = 31/126 (24%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNP 167
N + K+C+CK+S+CLK+YCECFA+ I C+ C C C NN EA +E+
Sbjct: 97 NIADQCKKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNN-----------EANIEKIG 145
Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
N + +++ KH++GCHC+KSGCLKKYCECFQA I C+ENC+
Sbjct: 146 N--------------------DQSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCK 185
Query: 228 CMDCKN 233
C DCKN
Sbjct: 186 CYDCKN 191
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER 240
K CHCK S CLK YCECF ILC+ NC+C C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143
>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
Length = 384
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 6/130 (4%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
QCNCK S+CLKLYCECFA+ C CNC +C N ++N R +A+E L RNP+AF
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250
Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
+++ + K N KGC+CKKSGC KKYCEC+ N+ CS+ C+C
Sbjct: 251 FQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310
Query: 229 MDCKNFEGSE 238
C N +E
Sbjct: 311 EQCLNRTDAE 320
>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
Length = 396
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/121 (46%), Positives = 72/121 (59%), Gaps = 17/121 (14%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK SRCLKLYCECF S YC CNC+NC+NN ++E+ R EAV+ RNP+ F +
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCFENHV 210
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
D K+ G + GC C+++ C KKYCECF + C + C+C DCKN
Sbjct: 211 --------DEKK--------GVNKSGCRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNE 254
Query: 235 E 235
E
Sbjct: 255 E 255
>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 88/147 (59%), Gaps = 11/147 (7%)
Query: 104 EVKENTPKKQKQCNCKHSR---CLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARR 156
+ + +PKK++ + SR L+LYCECFA+G+YC + C C +C N +E A R
Sbjct: 445 DFSQTSPKKKRHVG-RRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATR 503
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
+ +E+ RNP AF PK+ S + +E+ + +H +GC+CKKS CLKKYCEC+
Sbjct: 504 KQIES---RNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKYCECY 560
Query: 217 QANILCSENCRCMDCKNFEGSEERQAL 243
Q + CS NCRC CKN G ++ AL
Sbjct: 561 QGGVGCSLNCRCEGCKNAFGRKDGSAL 587
>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 510
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/108 (47%), Positives = 64/108 (59%), Gaps = 13/108 (12%)
Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
LYCECFA+G +C C+C C N ENEA + + +RNP AF KI +
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA--------- 257
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ G+ +H KGCHCKKS CLKKYCECFQA + C + C+C CKN
Sbjct: 258 -DAGDD---ARHTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301
>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
Length = 348
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/120 (45%), Positives = 74/120 (61%), Gaps = 1/120 (0%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
QCNCK S+CLKLYCECF + C CNC C N ++N R +A+E L RNP AF P
Sbjct: 167 QCNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPI 226
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ ++ + +E+ + + + KGC+CKKS C KKYCEC+ N C+E C+C +C N
Sbjct: 227 LTNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQKCTELCKCENCLN 286
>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 742
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 75/128 (58%), Gaps = 19/128 (14%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
E +P + K CNC++S+CLK YC+CFA+G+YC C C CHN E E R++A++ +R
Sbjct: 493 EKSPTEAKPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKR 552
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
+AF P I ++ H KGCHCK S C KKYC C Q +LCS
Sbjct: 553 IKDAFLPVIQNNSHI------------------KGCHCKNSHCQKKYCVCHQNGVLCSSL 594
Query: 226 CRCMDCKN 233
C+C++C+N
Sbjct: 595 CQCVECEN 602
Score = 43.1 bits (100), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVE 160
K C+CK+S C K YC C +G+ C C CV C N +E+ + +++
Sbjct: 568 KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECENKIESMKKEQSSIQ 615
>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 495
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 91/163 (55%), Gaps = 11/163 (6%)
Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
+ ++ K ++ C CK+S+CLKLYCECFA G YC D C C+ C N E E +EA + TL
Sbjct: 245 INDSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTL 304
Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
RNP+AF K+ +D E MLG + KGC CK++ C KKYCEC+ A + CS
Sbjct: 305 TRNPDAFTSKLEVVGTIVQDEDIEANRG-MLG-YKKGCKCKRTYCKKKYCECYNAGVKCS 362
Query: 224 ENCRCMDCKNFE--------GSEERQALFHGDHANNMAYIQQA 258
C C +C N E + E+Q + +HA +A QA
Sbjct: 363 YLCICENCHNQEDDPSKEKDNAAEQQVHKNPNHAGGVANTNQA 405
>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
Length = 414
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 68/114 (59%), Gaps = 1/114 (0%)
Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
+K YCECFA+G+YC + C+C+ C NN +E E RNP AF PK+ +
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
+ EET + +H +GC+CKKS CLKKYCECFQ + CS CRC CKN G
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 7/47 (14%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGI------YCDGC-NCVNCHNNVE 150
+ ++ CNCK S CLK YCECF G+ C+GC N C N VE
Sbjct: 211 RHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFGCKNGVE 257
Score = 38.9 bits (89), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+K YCECF A + C E C C++C N E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168
>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
Length = 196
Score = 111 bits (277), Expect = 1e-21, Method: Composition-based stats.
Identities = 53/136 (38%), Positives = 73/136 (53%), Gaps = 25/136 (18%)
Query: 99 SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARRE 157
S+ +++++ ++ CNCK S+CLK YC+CFA+G C + CNCV C NN N R+E
Sbjct: 82 SQKSLQIQSKLFINERSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKE 141
Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
A + R+P AF+ KGC+CKKSGC KKYCECF
Sbjct: 142 AKVQIINRDPGAFKQSF------------------------KGCNCKKSGCQKKYCECFL 177
Query: 218 ANILCSENCRCMDCKN 233
+ + C+ CRC C N
Sbjct: 178 SGLACTHLCRCDGCLN 193
>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
Length = 218
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 71/119 (59%), Gaps = 15/119 (12%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA+G C CNC C NN N R + +E +ERNP AF K+
Sbjct: 85 CNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKV 144
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ H+KGC+C+KSGC KKYCEC+Q I CS+NC+C C+N
Sbjct: 145 KEVDYKL--------------AHSKGCNCRKSGCKKKYCECYQLGIECSDNCKCDGCQN 189
>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
Length = 1430
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/128 (41%), Positives = 73/128 (57%), Gaps = 12/128 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR--REAVEAT--LERNPNA 169
K CNCK S+CLKLYCEC+A+G++C CNC NC N E+ A A A L A
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHALGRLRHGDVA 1071
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
+ +S + ++ + GC CKKS CLKKYCECF+A + C +C+C
Sbjct: 1072 VPTAVLTSAQKRKPQSKQ--------RQESGCFCKKSKCLKKYCECFEAGVHCEASCKCE 1123
Query: 230 DCKNFEGS 237
+C+N++GS
Sbjct: 1124 NCENYDGS 1131
>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
Length = 142
Score = 110 bits (275), Expect = 2e-21, Method: Composition-based stats.
Identities = 55/129 (42%), Positives = 76/129 (58%), Gaps = 10/129 (7%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNA 169
+K + C CK SRCL+LYCECF +GI+C+ C C NC N N+ A + RNP+A
Sbjct: 10 RKVEFCKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPDA 69
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F K + N+ E + H KGC+C K+ C KKYCECF A I C+ENC+C
Sbjct: 70 FSQKFLV----VKQNQYEG-----IVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCE 120
Query: 230 DCKNFEGSE 238
+C+N++ +
Sbjct: 121 NCENYKDEQ 129
Score = 48.5 bits (114), Expect = 0.010, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER 240
C CKKS CL+ YCECF I C+++C C +C N E ++++
Sbjct: 15 CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54
Score = 48.1 bits (113), Expect = 0.014, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 5/62 (8%)
Query: 91 KPESPKSRSRSNVEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNC 145
K +P + S+ + VK+N + +K CNC ++C K YCECF +GI C + C C NC
Sbjct: 63 KMRNPDAFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCENC 122
Query: 146 HN 147
N
Sbjct: 123 EN 124
>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
Length = 91
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/96 (59%), Positives = 65/96 (67%), Gaps = 9/96 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFAS YC+ CNC+ C NN ENEA R+ AVEA +ERNPNAF+PKI H T
Sbjct: 1 YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHET----- 55
Query: 187 ETGEVLML----GKHNKGCHCKKSGCLKKYCECFQA 218
T V+ G+H KGC+CKKS CLKKYCECFQ
Sbjct: 56 HTPVVVAAAGASGRHLKGCNCKKSFCLKKYCECFQV 91
>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
Length = 593
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/131 (43%), Positives = 81/131 (61%), Gaps = 8/131 (6%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
P SPK + R + + E K+CNCK S+CLKLYCECFA+G YC + C+C+NC N
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+ + A R+ +E+ RNP AF PK+ + + E+ + +H +GC+CKKS
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422
Query: 208 CLKKYCECFQA 218
CLKKYCEC+Q+
Sbjct: 423 CLKKYCECYQS 433
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 34/58 (58%), Gaps = 5/58 (8%)
Query: 177 SPHGTRDNKEETGEVLMLGKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
SP R E++GE G + K C+CKKS CLK YCECF A C E C C++C N
Sbjct: 309 SPKKKRRKSEQSGE----GDSSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFN 362
>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
Length = 155
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 81/144 (56%), Gaps = 14/144 (9%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C+CK S+CLKLYCECFASG YC D C C NC N E+E E + RNP AF P+
Sbjct: 1 RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60
Query: 174 IASSPHGTRDNKEETGEVLML-------------GKHNKGCHCKKSGCLKKYCECFQANI 220
+ S R + ++ KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61 VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120
Query: 221 LCSENCRCMDCKNFEGSEERQALF 244
CS+ C+C CKN G+++ F
Sbjct: 121 GCSDACKCRGCKNTFGAKQGDDFF 144
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
T K ++ CNCK S CLK YCECF S + C D C C C N
Sbjct: 94 TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKN 133
>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
Length = 243
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/122 (53%), Positives = 87/122 (71%), Gaps = 11/122 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC+NN +EA R +A++A L RNP AF+PKI K
Sbjct: 52 YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIG---------KG 102
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
+ GEV +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +R+ L++G
Sbjct: 103 KLGEVKP--RHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQRRILWNG 160
Query: 247 DH 248
H
Sbjct: 161 PH 162
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
R P+ PK EVK P+ K CNCK S CLK YCECF + I C C C++C
Sbjct: 90 RNPEAFQPKIGKGKLGEVK---PRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCK 146
Query: 147 NNVENEAAR 155
N E+ R
Sbjct: 147 NYEESPQRR 155
>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
Length = 512
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC N++ +E R +A++A L+RNP AF+PKI K
Sbjct: 311 YCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 361
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 362 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 418
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 368 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 410
>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
[Oryctolagus cuniculus]
Length = 450
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/140 (44%), Positives = 89/140 (63%), Gaps = 11/140 (7%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN +E R +A++A L+RNP AF+PKI G+ +
Sbjct: 250 YCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAKPR- 308
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 309 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEESPERKTLMNA 358
Query: 247 DHANNMAYIQQAANAAITGA 266
+ + ++ + + TGA
Sbjct: 359 PNCVDAGDLEGGHHLSPTGA 378
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C CV C N E+
Sbjct: 307 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEES 349
>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
Length = 511
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 310 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409
>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
lyrata]
Length = 651
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 7/119 (5%)
Query: 125 KLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNAFRPKIASSPHG 180
K YCECFA+G+YC + C+C++C N +E A R+ +E+ RNP AF PK+ +
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLAFAPKVIRNSES 419
Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+ ++ + +H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++
Sbjct: 420 VLETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 478
Score = 45.8 bits (107), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ G+ C C C C N
Sbjct: 435 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 472
>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
Length = 509
Score = 108 bits (270), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 85/119 (71%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 308 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 358
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER++L +
Sbjct: 359 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKSLIN 415
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 365 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 407
>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
Length = 658
Score = 108 bits (270), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 75/119 (63%), Gaps = 7/119 (5%)
Query: 125 KLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNAFRPKIASSPHG 180
K YCECFA+G+YC + C+C++C N +E A R+ +E+ RNP AF PK+ +
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLAFAPKVIRNSDS 426
Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
++ ++ + +H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN G ++
Sbjct: 427 VQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 485
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ G+ C C C C N
Sbjct: 442 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 479
>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
Length = 517
Score = 108 bits (270), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/122 (50%), Positives = 84/122 (68%), Gaps = 11/122 (9%)
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
L YC+CFASG +C+ CNC NC NN+ +E R +A++A L RNP AF+PKI
Sbjct: 317 LARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG-------- 368
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
K + G+V +HNKGC C++SGCLK YCEC++A I+CS C+C+DCKN+E S ER+ L
Sbjct: 369 -KGQLGDVK--PRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPERKTL 425
Query: 244 FH 245
+
Sbjct: 426 MN 427
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K C+C+ S CLK YCEC+ + I C C C++C N E+
Sbjct: 377 PRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEES 419
>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
Length = 141
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/139 (43%), Positives = 80/139 (57%), Gaps = 14/139 (10%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C+CK S+CLKLYCECFASG YC D C C NC N E+E E + RNP AF P+
Sbjct: 1 RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60
Query: 174 IASSPHGTRDNKEETGEVLML-------------GKHNKGCHCKKSGCLKKYCECFQANI 220
+ S R + ++ KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61 VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120
Query: 221 LCSENCRCMDCKNFEGSEE 239
CS+ C+C CKN G+++
Sbjct: 121 GCSDACKCRGCKNTFGAKQ 139
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
T K ++ CNCK S CLK YCECF S + C D C C C N
Sbjct: 94 TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKN 133
>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
Length = 671
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 61/108 (56%), Gaps = 13/108 (12%)
Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
LYC+CFA+G++C C+C +C N + R+ RNPNAF KI
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVDDDS------ 411
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ KH KGCHCKKS CLKKYCECFQAN+ C + C+C CKN
Sbjct: 412 -------VDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
K K C+CK S CLK YCECF + + C D C C C N +
Sbjct: 415 KHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKNTTD 455
>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
Length = 373
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/144 (43%), Positives = 83/144 (57%), Gaps = 10/144 (6%)
Query: 115 QCNCKHSRCLKLYCECFAS-GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
QCNCK S+CLKLYCECFA+ G+ CNC +C N ++N R +A+E L RN +AF
Sbjct: 187 QCNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQC 246
Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
++ G ++ + K N KGC+CKKSGC KKYCEC+ NI C+E C+C
Sbjct: 247 FSTK--GGAQFVQQDKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKC 304
Query: 229 MDCKNFEGSE--ERQALFHGDHAN 250
C N +E +Q L DH N
Sbjct: 305 EHCLNKTDAEIQAQQDLGQEDHNN 328
>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
Length = 316
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 94/161 (58%), Gaps = 16/161 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFA+G +C CNC NCHNN+E+E R A++A LERNP+AFRPKI + G
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNER--- 194
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH- 245
+H+KGC+CK+SGCLK YCEC++A I C+ C+C +CKN E ER+ L
Sbjct: 195 ---------RHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245
Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQEL 286
D A+ Q AA + ++ I S +S K G L
Sbjct: 246 ADAADIRVKQQTAAKSKLSTQIDD---LSASISAKNDGSRL 283
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
N + K CNCK S CLK YCEC+ + I C C C NC N
Sbjct: 192 NERRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKN 232
>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 4/116 (3%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
T +K CNCK ++CLKLYC+C A+G+YC D C C NC N ENE + E R+P
Sbjct: 10 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 69
Query: 168 NAFRPKIASSPHGTRDN-KEETGEVLM--LGKHNKGCHCKKSGCLKKYCECFQANI 220
AF P+I + T DN +E G +H +GC+CKKS C KKYCEC+Q I
Sbjct: 70 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQVYI 125
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 25/35 (71%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
K C+CK++ CLK YC+C A + C+++C C +C N
Sbjct: 15 KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLN 49
>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
Length = 276
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 71/124 (57%), Gaps = 15/124 (12%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
++ K CNC+ S+CLKLYC+CFA G C CNC C NN N R ++ +ERNP A
Sbjct: 144 QEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQA 203
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F K+ EV H KGC+C+KSGC KKYCEC+Q I CS+NC+C
Sbjct: 204 FNQKVQ--------------EVESKMTHAKGCNCRKSGCQKKYCECYQMGIECSDNCKCD 249
Query: 230 DCKN 233
C N
Sbjct: 250 GCLN 253
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%)
Query: 191 VLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ L + +K C+C+KS CLK YC+CF LCS C C C N
Sbjct: 139 LFYLNQEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFN 181
>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
Length = 601
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 74/138 (53%), Gaps = 14/138 (10%)
Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
YCECF++G++C + C+C NC N +E ++ E RNP AF PK+ S+ D
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ---- 241
E + +H +GC+C+KSGC KKYCECF + CS NCRCM CKN G Q
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGHTNEQCAGD 547
Query: 242 ---------ALFHGDHAN 250
A +GDH +
Sbjct: 548 SDVVTINDEAKHYGDHGD 565
Score = 42.4 bits (98), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ ++ CNC+ S C K YCECF G+ C C C+ C N
Sbjct: 498 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKN 535
>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
infestans T30-4]
Length = 777
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/124 (45%), Positives = 74/124 (59%), Gaps = 13/124 (10%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C CK + CLK+YC CF+S +C GC C +C N ++ R EA+++ L +P AF
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SF 596
Query: 175 ASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRCM 229
AS P T TG + +L + + +GC CKKS CLKKYCECFQ I C+ +CRCM
Sbjct: 597 ASLPQDT----NTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCM 652
Query: 230 DCKN 233
DC N
Sbjct: 653 DCSN 656
>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
Length = 713
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 82/148 (55%), Gaps = 20/148 (13%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFR--- 171
C CK + CLK+YC CF+S +C GC C +C N +++ R EA++ L +P AF
Sbjct: 536 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFAS 594
Query: 172 -PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
P+ AS+ +++ V+M +GC CKKS CLKKYCECFQ I C+ +CRCMD
Sbjct: 595 LPQSASTTGFLHLLPQKSSAVVM-----RGCRCKKSKCLKKYCECFQNGIACTSHCRCMD 649
Query: 231 CKNFEGS---------EERQALFHGDHA 249
C N S ++R A HG A
Sbjct: 650 CSNHSESTAAHQHPHLQKRSATAHGKRA 677
>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 107 bits (267), Expect = 2e-20, Method: Composition-based stats.
Identities = 56/130 (43%), Positives = 68/130 (52%), Gaps = 28/130 (21%)
Query: 108 NTPK---KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
N PK K CNCK S+CLK YC+CFA+G C + CNCV C NN N R+EA +
Sbjct: 80 NLPKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQII 139
Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
R+P AF+ KGC+CKKSGC KKYCECF + + C+
Sbjct: 140 NRDPGAFKQSF------------------------KGCNCKKSGCQKKYCECFLSGLACT 175
Query: 224 ENCRCMDCKN 233
CRC C N
Sbjct: 176 HLCRCDGCLN 185
>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
Length = 728
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 83/119 (69%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C+ CNC NC+NN+ +E R +A++A L+RNP AF+PKI G D K
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--DIKP 584
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 585 ---------RHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 634
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 584 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 626
>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
Length = 510
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 309 YCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIG---------KG 359
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 360 KLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 416
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 366 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 408
>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
Length = 515
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/118 (51%), Positives = 82/118 (69%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L RNP AF+PKI K
Sbjct: 314 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 364
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 365 QLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 420
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 371 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 413
>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/119 (48%), Positives = 69/119 (57%), Gaps = 15/119 (12%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CFA+ C CNC C NN N R E +ERNP AF K+
Sbjct: 109 CNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKV 168
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
EV H+KGC+C+KSGC KKYCEC+Q I CS+NC+C CKN
Sbjct: 169 K--------------EVDQKMAHSKGCNCRKSGCKKKYCECYQMGIECSDNCKCDGCKN 213
>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
Length = 511
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409
>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
Length = 511
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409
>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
Length = 808
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 98/206 (47%), Gaps = 37/206 (17%)
Query: 68 VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL- 126
V P P P+ AT+ + SPK + R + V E + K+CNCK S+CLKL
Sbjct: 448 VQPTEDCPQPIAYMATEDFNQ----NSPKKKRRKSEPVGE--IEGCKRCNCKKSKCLKLI 501
Query: 127 ------------------YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
YCECFA+G+YC + C+C C N +E + + RNP
Sbjct: 502 DLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIESRNP 561
Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL------ 221
AF PK+ S + + + +H +GC+CKKS CLKKYCEC+Q IL
Sbjct: 562 LAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYCECYQVLILTISCLP 621
Query: 222 -----CSENCRCMDCKNFEGSEERQA 242
CS +CRC CKN G ++ A
Sbjct: 622 QGGVGCSISCRCEGCKNAFGRKDGSA 647
>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
Length = 383
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 182 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 232
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 233 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 289
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 239 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 281
>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
Length = 548
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 58/119 (48%), Positives = 82/119 (68%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C+ CNC NC+NN+ +E R +A++A L+RNP AF+PKI G
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD----- 401
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 402 ------IKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 454
>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
Length = 265
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 99/191 (51%), Gaps = 27/191 (14%)
Query: 71 VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-QKQ------CNCKHSRC 123
+P P P+P +PPV P+ + + ++ + E++ + Q Q CNCK S+C
Sbjct: 1 MPHYPHPIPY-WYRPPVYFPQYDGQQYQNTLSKELQSKVSQLVQYQKVSHFACNCKKSKC 59
Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF---------RPK 173
LKLYCECFA+ C CNC C N ++N R +A E L RNP+AF +P+
Sbjct: 60 LKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAILTNNGQQPQ 119
Query: 174 IASS-----PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
I P + KE ++ KGC+CKKS C KKYCEC+ N C++ C+C
Sbjct: 120 IIQDKTLILPLEEKSQKESQIDITT----RKGCNCKKSECKKKYCECYSINQKCTDLCKC 175
Query: 229 MDCKNFEGSEE 239
+C N S+E
Sbjct: 176 ENCLNKAESQE 186
>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
Length = 508
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L RNP AF+PKI K
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G+V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 358 QLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 413
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 406
>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
Length = 783
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 11/99 (11%)
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
A L+RNP AF+PKI G D + H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652
Query: 221 LCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+CS C+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
K CNCK S CLK YCEC+ + I C C C+ C N
Sbjct: 631 KGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 665
>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
trifallax]
Length = 710
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 94/169 (55%), Gaps = 18/169 (10%)
Query: 81 QATQPPVRVPKPE-SPKSRSRSNVEVKENTPKK----QKQCNCKHSRCLKLYCECFASGI 135
QAT V +P+ + + S N + N+P++ K+C C S+CLKLYCECFA+G+
Sbjct: 487 QATSVGVNIPQKQITHNHHSHHNEQKLGNSPQQPGVSTKKCTCSKSKCLKLYCECFAAGL 546
Query: 136 YCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML 194
C C C +C N ++ A ++A + ++R+P AF K++ + +G +
Sbjct: 547 VCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVS----------QNSGTDKL- 595
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
+H KGC CKKSGC K YCECFQ + C++ C+C C N E ++ L
Sbjct: 596 -QHRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSDQPCL 643
>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
Length = 374
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 17/128 (13%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF---- 170
CNCK S+CLKLYCECFA+ C CNC C N ++N R +A+E L RNP+AF
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228
Query: 171 -----RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
+P+I + KE ++ KGC+CKKS C KKYCEC+ N C++
Sbjct: 229 TNNGQQPQIIQE---EKSQKESQKDITT----RKGCNCKKSECKKKYCECYSINQRCTDL 281
Query: 226 CRCMDCKN 233
C+C +C N
Sbjct: 282 CKCENCLN 289
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 4/71 (5%)
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK----GCHCKKSGCLKKYCECFQANILC 222
P + P+ + +KE +V L ++ K C+CKKS CLK YCECF N +C
Sbjct: 131 PPIYYPQYDGQHYQNTLSKELQSKVSQLVQYQKVSPFACNCKKSKCLKLYCECFANNWVC 190
Query: 223 SENCRCMDCKN 233
S+NC C +CKN
Sbjct: 191 SQNCNCCECKN 201
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
K CNCK S C K YCEC++ C D C C NC N V+
Sbjct: 255 KGCNCKKSECKKKYCECYSINQRCTDLCKCENCLNKVQ 292
>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
fascicularis]
Length = 490
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L RNP AF+PKI K
Sbjct: 289 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 339
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G+V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 340 QLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 395
Score = 47.0 bits (110), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 346 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 388
>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
Length = 283
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/119 (51%), Positives = 82/119 (68%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ C+C NC NN+ +E R A++A L+RNP AF+PKI S G D K
Sbjct: 77 YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLG--DVKP 134
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+HNKGC+CK+SGCLK YCEC++A I+CS C+C+DC N+E S ER+ L +
Sbjct: 135 ---------RHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPERKTLMN 184
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNCK S CLK YCEC+ + I C C C++C+N E+
Sbjct: 134 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEES 176
>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
Length = 516
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/119 (49%), Positives = 84/119 (70%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIG---------KG 365
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ G+V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S E++ L +
Sbjct: 366 KLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPEQKTLMN 422
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 372 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 414
>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
Length = 263
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+++E R +A++A L RNP AF+PKI +S G D K
Sbjct: 53 YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLG--DVKP 110
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+H+KGC C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 111 ---------RHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 159
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
R P+ PK + +VK P+ K C+C+ S CLK YCEC+ + I C C C+ C
Sbjct: 91 RNPEAFQPKIGTSQLGDVK---PRHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCK 147
Query: 147 NNVEN 151
N E+
Sbjct: 148 NYEES 152
>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
Length = 391
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/119 (50%), Positives = 82/119 (68%), Gaps = 11/119 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ +E R A++ L+RNP AF+PKI K
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIG---------KG 236
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+G+V +HNKGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L +
Sbjct: 237 RSGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 293
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
R P+ PK + +VK P+ K CNCK S CLK YCEC+ + I C C C+ C
Sbjct: 224 RNPEAFQPKIGKGRSGDVK---PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 280
Query: 147 NNVEN 151
N E+
Sbjct: 281 NYEES 285
>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
Length = 502
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/118 (48%), Positives = 80/118 (67%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C CNC NC NN+ +E R +A++A L+RNP AF+PKI K
Sbjct: 302 YCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIG---------KG 352
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
G+V +HN+GC+C++SGCLK YCEC++A I+CS C+C+ CKN++ ER+ L
Sbjct: 353 RPGDVT--PRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDEGPERKTLM 408
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
TP+ + CNC+ S CLK YCEC+ + I C C C+ C N E
Sbjct: 358 TPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDE 400
>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
[Homo sapiens]
Length = 529
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 328 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 378
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 379 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 434
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 385 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 427
>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
tropicalis]
Length = 183
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/125 (52%), Positives = 87/125 (69%), Gaps = 11/125 (8%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K +K CNC S+CLKLYC+CFA+G C+ CNC NC+NN E+E R +AV+A L RNP+AF
Sbjct: 9 KTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAF 68
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
+PKI K + G+V +HNKGC+CK+SGC K YCEC++A I CS C+C+
Sbjct: 69 QPKI---------EKGKIGDV--KPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVG 117
Query: 231 CKNFE 235
CKN +
Sbjct: 118 CKNVK 122
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 8/61 (13%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN-------NVENEAARREAVEA 161
P+ K CNCK S C K YCEC+ + I C C CV C N ++ RE +EA
Sbjct: 82 PRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVKSLDSGSIIPTCTSREVIEA 141
Query: 162 T 162
T
Sbjct: 142 T 142
>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
testis-specific; AltName: Full=Testis-specific
metallothionein-like protein
gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
[Homo sapiens]
Length = 508
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 358 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 413
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 406
>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
Length = 508
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 358 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 413
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 406
>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
[Homo sapiens]
Length = 463
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 262 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 312
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 313 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 368
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 319 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 361
>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
Length = 641
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 75/130 (57%), Gaps = 4/130 (3%)
Query: 93 ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVEN 151
+S K R R +++ + K + C+CK S+CLKLYC CFA+ +YC + C+C C NN +
Sbjct: 339 QSHKKRRR---KLQNDDGKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMH 395
Query: 152 EAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
E A + T RNP AF P + + +++ +H +GC+C+KS CLKK
Sbjct: 396 EEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKK 455
Query: 212 YCECFQANIL 221
YCECFQ +L
Sbjct: 456 YCECFQVLML 465
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 26/45 (57%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
GK + C CKKS CLK YC CF A + CSE C C C N EE
Sbjct: 353 GKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEE 397
>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
Length = 622
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 90/156 (57%), Gaps = 16/156 (10%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFASG +C+ CNC NC+NN +E R +A++ L+RNP AF PKI S G
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLGD----- 481
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
+ HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +++
Sbjct: 482 ------IKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT---- 531
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
+ Y+ N + ++ SP + K R+
Sbjct: 532 -QLTTLNYMDIGNNDENNPFLTTTFEISPKLEKDRE 566
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
P K CNCK S CLK YCECF + I C C C+ C N E
Sbjct: 484 PHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525
>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
Length = 377
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/138 (45%), Positives = 86/138 (62%), Gaps = 13/138 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C CNC NC+NN ++E R A++A L+RNP AF PKI G
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGD----- 230
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
+ +HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S ER+ L
Sbjct: 231 ------IKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLI-- 282
Query: 247 DHANNMAYIQQAANAAIT 264
+ N+M N A++
Sbjct: 283 NMPNDMEIKSHEENDAVS 300
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
P+ K CNCK S CLK YCECF + I C C CV C N
Sbjct: 233 PRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKN 271
>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
trifallax]
Length = 600
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 68/115 (59%), Gaps = 10/115 (8%)
Query: 122 RCLKLYCECFASGIYCD-GCNCVNCHNNVEN-EAARREAVEATLERNPNAFRPKIAS-SP 178
+CLKLYC+CFA G C C C+NC N +N E+ EA + + RNP AF+PK+ P
Sbjct: 454 QCLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP 513
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
N + +H KGC+CK SGC K YCECFQ + C++NCRC+ C N
Sbjct: 514 AQMSQNDADK-------QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561
>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
Length = 503
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 92/156 (58%), Gaps = 16/156 (10%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YCECFASG +C+ CNC NC+NN +E R +A++ L+RNP AF PKI S
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQS--------- 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
E G++ HNKGC+CK+SGCLK YCECF+A I+CS C+C+ CKN+E S +++
Sbjct: 358 ELGDIK--PHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT---- 411
Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
+ Y+ N + S+ SP + K R+
Sbjct: 412 -QPTMLNYMDIGNNDENNLFLTSTFEISPKLEKDRE 446
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
E+ + P K CNCK S CLK YCECF + I C C C+ C N E
Sbjct: 358 ELGDIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405
>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
Length = 387
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 75/130 (57%), Gaps = 11/130 (8%)
Query: 115 QCNCKHSRCLKLYCECFAS-GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
QCNCK S+CLKLYCECFA+ G+ CNC C N ++N R +A+E L RN +AF
Sbjct: 194 QCNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAFAQC 253
Query: 174 IAS--SPHGTRD---NKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCS 223
+ +P + N + + K N KGC+CKKSGC KKYCEC+ N+ C+
Sbjct: 254 FTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECYSQNLKCN 313
Query: 224 ENCRCMDCKN 233
+ C+C C N
Sbjct: 314 DLCKCEHCLN 323
>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
Length = 735
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 60/94 (63%), Gaps = 9/94 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
K C+CK+S+CLK+YCECFAS C+GC C CHNN ++ + A +TLERNP+AF PK
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPK 385
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
S +G + E KH KGCHC++SG
Sbjct: 386 FKQSFNGEKPKAE---------KHLKGCHCRRSG 410
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
K CHCK S CLK YCECF + LC+ C+C C N E E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365
>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
Length = 299
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 98 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 148
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 149 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 204
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 155 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 197
>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
Length = 492
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 12/97 (12%)
Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
C+C NC NN+ NE R A+++ LERNP AF+PKI G D + HNK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG---KGRADTER---------LHNK 329
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
GC+CKKS CLK YCEC++A + C+ C+C+ C+N E
Sbjct: 330 GCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEA 366
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
K CNCK S CLK YCEC+ + + C C CV C N
Sbjct: 329 KGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRN 363
>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
Length = 223
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 61/101 (60%), Gaps = 11/101 (10%)
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
+A L+RNP AF+PKI G D + H+KGC+CK+SGCLK YCEC++A
Sbjct: 42 TQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 90
Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
I+CS C+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 91 KIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 131
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 47/106 (44%), Gaps = 15/106 (14%)
Query: 60 PQPQPKPVVPPVPP----PPPP---------LPGQATQPPVRVPKPESPKSRSRSNVEVK 106
P PQ +P + +PP P P LP Q PE+ K + E
Sbjct: 5 PLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKE-G 63
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
E+ + K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 64 ESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEES 109
>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 617
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 46/108 (42%), Positives = 60/108 (55%), Gaps = 14/108 (12%)
Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
YCECF G YC D C C C NN +N R +A++ RN +AF K + + T
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNSTV--- 413
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N
Sbjct: 414 ----------THKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451
>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
Length = 508
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/118 (47%), Positives = 77/118 (65%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI G +
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPR- 365
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+N E S ER+ L
Sbjct: 366 ----------HNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTLM 413
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEES 406
>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
Length = 508
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/118 (49%), Positives = 80/118 (67%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ C+N E S ER+ L
Sbjct: 358 QLGNVK--PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTLM 413
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEES 406
>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 455
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 72/129 (55%), Gaps = 22/129 (17%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARREAVEATLE 164
+ P Q CNC SRCLKLYC CF +G C C C+ C N+VE+ R++A++ TL+
Sbjct: 329 QKLPAGQVACNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQ 388
Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
+ P+AF+ K P G GC CK + C++KYCECF+ + C++
Sbjct: 389 KRPDAFQTK--DRPVGL------------------GCACKNNKCIRKYCECFRNGLSCAD 428
Query: 225 NCRCMDCKN 233
C C++C+N
Sbjct: 429 KCCCLNCEN 437
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 70/138 (50%), Gaps = 20/138 (14%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD--GCNCVNCHNNVEN---EAARREAVEATLERNPNAF 170
C+C S+C+ LYC+CF +G C+ CNC C N + + AR A+ A L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319
Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCM 229
++ + K G+V C+C +S CLK YC CFQA +C C C+
Sbjct: 320 ----VTAGVASAVQKLPAGQV--------ACNCIRSRCLKLYCTCFQAGKMCQPGICTCI 367
Query: 230 DCKNF--EGSEERQALFH 245
C N + E +QA+ H
Sbjct: 368 ACANSVEDHPERKQAIKH 385
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE-NCRCMDCKN 233
T E+ L ++ GC C KS C+ YC+CF+A C+ NC C CKN
Sbjct: 247 TLEIGPLQENPTGCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293
>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
Length = 175
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/121 (45%), Positives = 66/121 (54%), Gaps = 25/121 (20%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K CNCK S+CLK YC+CFA+G C + CNCV C NN N + R+EA R+P AF+
Sbjct: 76 KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQ 135
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KGC+CKKSGC KKYCECFQ N+ C+ CRC C
Sbjct: 136 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 171
Query: 233 N 233
N
Sbjct: 172 N 172
>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
Length = 190
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/121 (44%), Positives = 66/121 (54%), Gaps = 25/121 (20%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K CNCK S+CLK YC+CFA+G C + CNCV C NN N + R++A R+P AF+
Sbjct: 91 KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFKQ 150
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KGC+CKKSGC KKYCECFQ N+ C+ CRC C
Sbjct: 151 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 186
Query: 233 N 233
N
Sbjct: 187 N 187
>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
subellipsoidea C-169]
Length = 90
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/96 (65%), Positives = 69/96 (71%), Gaps = 11/96 (11%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI-ASSPHGTRDNK 185
YCECFASG YCDGCNCVNC NN E+E R+ AVEA LERNPNAFRPKI P T
Sbjct: 1 YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCT---- 56
Query: 186 EETGEVLML----GKHNKGCHCKKSGCLKKYCECFQ 217
TG + + +HNKGC+CKKSGCLKKYCECFQ
Sbjct: 57 --TGTHVAIQGAAARHNKGCNCKKSGCLKKYCECFQ 90
>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 113
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 70/114 (61%), Gaps = 7/114 (6%)
Query: 127 YCECFASGIYCDG-CNCVNCHNNVENEAA---RREAVEATLERNPNAFRPKIASSPHGTR 182
YCECFA+ ++C G C C NC N E EA R+ +EA R+P AF PKI + T
Sbjct: 1 YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEA---RDPLAFAPKIVQAAEPTP 57
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
+++ + +H +GC+CKKS CLKKYCEC+QA + CSE CRC CKN G
Sbjct: 58 IPRDDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ ++ CNCK S CLK YCEC+ +G+ C +GC C C N
Sbjct: 71 RHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKN 108
>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 886
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 66/127 (51%), Gaps = 17/127 (13%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA---RREAVEATLERN 166
KK C C ++CL LYC+CF I+C + C C++C N VE A R AVE LER
Sbjct: 6 KKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERR 65
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF + S R NK++ GC C K+ CLKKYC CF A + C C
Sbjct: 66 PEAFGRE--SGAFAERRNKKDG-----------GCTCAKNSCLKKYCVCFGAKMQCRARC 112
Query: 227 RCMDCKN 233
C +CKN
Sbjct: 113 NCSNCKN 119
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 17/145 (11%)
Query: 95 PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN---NVE 150
P++ R + E KK C C + CLK YC CF + + C CNC NC N +
Sbjct: 66 PEAFGRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGTIR 125
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
E + V T+E A P N L K N C C ++ CLK
Sbjct: 126 IE----DGVTCTIEGGKVNALKGAAFVPIRKLAN---------LAKGNTKCSCTRAACLK 172
Query: 211 KYCECFQANILCSENCRCMDCKNFE 235
YC+CF+ +++C++NC C++CKN E
Sbjct: 173 LYCDCFRQSMVCNDNCSCLECKNSE 197
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 24/132 (18%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA---ARREAVEATL 163
N K +C+C + CLKLYC+CF + C D C+C+ C N+ E R ++ L
Sbjct: 155 NLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDIL 214
Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
E+ P+ F+ G GC C+K+ C YC+C + C
Sbjct: 215 EKRPHIFQKTAKRKAGG-------------------GCSCQKNKCRAGYCDCKGSGGKCD 255
Query: 224 -ENCRCMDCKNF 234
C C++C N
Sbjct: 256 PAVCTCINCDNM 267
>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 356
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 84/185 (45%), Gaps = 23/185 (12%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
P + PP PPP P + + V + + E RS N PK+Q
Sbjct: 5 PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60
Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
K C CK SRCLKLYC CFA G YC D C C C N + A E V+ T L
Sbjct: 61 CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQK 116
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
KI P + E+ +GC+CKKS CLKKYC+C+Q CS CRC
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRC 176
Query: 229 MDCKN 233
DC+N
Sbjct: 177 DDCQN 181
>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
Length = 466
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 79/118 (66%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
+C+CFASG + + CNC NC NN+ +E R +A++A L R+P AF+P+I K
Sbjct: 308 HCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIG---------KG 358
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ GEV +H+KGC+C++ GCL+ YCEC +A I+CS C+C+ CKN+E S ER+ L
Sbjct: 359 QLGEVR--PRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEESPERKTLM 414
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 7/77 (9%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEAT 162
++ E P+ K CNC+ CL+ YCEC + I C C C+ C N E R+ T
Sbjct: 359 QLGEVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGC-KNYEESPERK-----T 412
Query: 163 LERNPNAFRPKIASSPH 179
L PN + + S H
Sbjct: 413 LMSMPNCAQTGLEGSHH 429
>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 586
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 35/60 (58%), Positives = 50/60 (83%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
K++ CNC S+CLKLYC+CFA+G +C CNC +C+NN++NE R++A+ ATLERNP+AF+
Sbjct: 527 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALFHGDHANN 251
C+C KS CLK YC+CF C NC C DC N + EERQ N
Sbjct: 531 CNCTKSQCLKLYCDCFANGEFCY-NCNCRDCYNNLDNEEERQKAIRATLERN 581
>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
Length = 289
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/157 (42%), Positives = 89/157 (56%), Gaps = 19/157 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFA+G +C CNC NC NN +E R A++A LERNP AF PKI K
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIG---------KR 201
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS-EERQALFH 245
+ GE+ HNKGC+CK+SGCL+ YCECF+A I+CS C+C CKN+E S +E + L
Sbjct: 202 KLGEIKH--HHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEESPDENKQL-- 257
Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
N+ N + +S + P S+K +
Sbjct: 258 -----NVINYMDIGNKEGNSPVLTSAFNMLPKSRKDR 289
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
R P+ PK R E+K + K CNCK S CL+ YCECF + I C C C C
Sbjct: 189 RNPEAFLPKIGKRKLGEIKHH---HNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCK 245
Query: 147 NNVE 150
N E
Sbjct: 246 NYEE 249
>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
Length = 434
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 70/131 (53%), Gaps = 21/131 (16%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNA 169
+K+K C C ++CLK YC CF + + C + C CV+CHN+ ++E AR AV +P A
Sbjct: 12 EKEKGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMA 71
Query: 170 FR-------PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
F+ + +P+GT K +GC C++S C KKYCECF A + C
Sbjct: 72 FKGTSLELENQEVHTPNGTL-------------KTVRGCRCRRSKCQKKYCECFGAGLKC 118
Query: 223 SENCRCMDCKN 233
S NC C C N
Sbjct: 119 STNCVCEGCLN 129
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVEVKEN-------TPKKQKQCNCKHSRCLKLYCECFAS 133
QA VR+ + P + +++E++ T K + C C+ S+C K YCECF +
Sbjct: 55 QARMMAVRLVRLNDPMAFKGTSLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECFGA 114
Query: 134 GIYCD-GCNCVNCHN 147
G+ C C C C N
Sbjct: 115 GLKCSTNCVCEGCLN 129
>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
Length = 286
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 76/140 (54%), Gaps = 12/140 (8%)
Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
NC ++ C+K YC+CF YC CN C NN ENE R++A++ L +NP AF K +
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203
Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--MDCKNF 234
+ + +K+++ + C+C KSGC KYC C + ILCS+NC+C CKN
Sbjct: 204 RKNFIKKSKKDSNLI-------HTCNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNK 256
Query: 235 EGS---EERQALFHGDHANN 251
+G ++R + D N
Sbjct: 257 QGDRNQKQRSVIISNDENEN 276
>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
Length = 496
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 63/110 (57%), Gaps = 5/110 (4%)
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
++ YC CF + + C+GC+CV+C+N+ ++E R A+E +P AF ++ +
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASV 356
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ E + +H +GC CK S CLKKYCECF+ + CS C C DC N
Sbjct: 357 SVESKPK-----QHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401
>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 375
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 68/141 (48%), Gaps = 11/141 (7%)
Query: 109 TPKKQK---------QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREA 158
TPK+QK C CK SRCLKLYC CFA G YC + C CV C N + +
Sbjct: 66 TPKRQKVEDAADGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTT 125
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
+ L R KI + E+ +GC+CKKS CLKKYC+C+Q
Sbjct: 126 RKVLLSRQKR-MSLKINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184
Query: 219 NILCSENCRCMDCKNFEGSEE 239
CS CRC DCKN G E
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNE 205
>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 120
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/121 (47%), Positives = 72/121 (59%), Gaps = 14/121 (11%)
Query: 127 YCECFASGIYCDG-CNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIA----SSP 178
YCECFA+G+YC G C C +C N +E R+ +E+ RNP AF PKI SSP
Sbjct: 1 YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIES---RNPLAFAPKIVQGAESSP 57
Query: 179 HGTRDNKEETGEVL---MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
+ ++ EVL +H +GC+CKKS CLKKYCEC+QA + CSE CRC C N
Sbjct: 58 VPGVSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKY 117
Query: 236 G 236
G
Sbjct: 118 G 118
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ + ++ CNCK S CLK YCEC+ +G+ C +GC C C N
Sbjct: 76 SARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 115
>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
Length = 356
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/185 (36%), Positives = 84/185 (45%), Gaps = 23/185 (12%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
P + PP PPP P + + V + + E RS N PK+Q
Sbjct: 5 PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60
Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
K C CK SRCLKLYC CFA G YC D C C C + + A E V+ T L
Sbjct: 61 CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFS----KEAFSETVQTTRKVLLSRQK 116
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
KI P + E+ +GC+CKKS CLKKYC+C+Q CS CRC
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRC 176
Query: 229 MDCKN 233
DC+N
Sbjct: 177 DDCQN 181
>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
distachyon]
Length = 377
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 71/151 (47%), Gaps = 29/151 (19%)
Query: 109 TPKKQ-----------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARR 156
TPK+Q K C CK S+CLKLYC CFA G YC + C CV C N + +
Sbjct: 73 TPKRQRVEEAADGNGCKHCACKKSKCLKLYCPCFAGGGYCSEKCGCVPCFNKSDYAETVQ 132
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHN--------KGCHCKKSGC 208
+ L R KI + + E M HN +GC+CKKS C
Sbjct: 133 TTRKVLLSRQKR-MSLKI--------NRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSC 183
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEE 239
LKKYC+C+Q CS CRC DCKN G E
Sbjct: 184 LKKYCDCYQDGTGCSLFCRCEDCKNPYGKNE 214
>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
gi|194698372|gb|ACF83270.1| unknown [Zea mays]
gi|194708474|gb|ACF88321.1| unknown [Zea mays]
gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
Length = 394
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/124 (41%), Positives = 65/124 (52%), Gaps = 18/124 (14%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K C+CK SRCLKLYC C+ASG +C + C C CHN A P
Sbjct: 78 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------P 123
Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
+ P T + +E E L+ + + C CKKSGCLKKYC+C+Q CS NC+C
Sbjct: 124 VLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 183
Query: 230 DCKN 233
DC+N
Sbjct: 184 DCRN 187
>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
Length = 176
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 8/97 (8%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNCK S+CLKLYC+CF +G+ C CNC +CHNN E+ R ++ +ERNP+AFRPK+
Sbjct: 85 CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKV 144
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
S + N E+ + +H KGC+CKKS KK
Sbjct: 145 ES-----KSNSEDEQD--HKPRHFKGCNCKKSNWSKK 174
Score = 46.2 bits (108), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALF 244
C+CKKS CLK YC+CF A + C ++C C C N E +ER+ +
Sbjct: 85 CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVI 129
>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
Length = 409
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/124 (40%), Positives = 67/124 (54%), Gaps = 17/124 (13%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K C+CK SRC+KLYC CFASG C + C C C N + A R A+ L+
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLK-------- 153
Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
P T + ++ E ++ + + C CKKSGCLKKYC+C+Q CS NC+C
Sbjct: 154 -----PVQTSEAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 208
Query: 230 DCKN 233
DC+N
Sbjct: 209 DCRN 212
>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
Length = 238
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 88/186 (47%), Gaps = 22/186 (11%)
Query: 60 PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCK 119
P PV V P +A P PE+P + R + ++ CNC
Sbjct: 10 PSTTKSPVASSVSPAVGSNETKAAFPATE-STPEAPVLKRRQVI-------NNRRPCNCT 61
Query: 120 HSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
S+CLKLYC+CFA+G +C C+C NC NN+ NE R A+++ LERNP AF+PKI
Sbjct: 62 KSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKI----- 116
Query: 180 GTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
G + ++ + L L GC + +K A + C+ C+C+ C+N E
Sbjct: 117 GNKTSEIMAKDELTLN----GCT-TRDAIVKSQ----AARVSCTVRCKCVGCRNTEADRN 167
Query: 240 RQALFH 245
+ H
Sbjct: 168 HRNRGH 173
>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 75/143 (52%), Gaps = 29/143 (20%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
P + R + +K+ +KQ+QC CK S+CLK+YC+C A G YC C+C NCHN+ +
Sbjct: 18 PTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDK 77
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
+ R + +E +E+N +A C+C+KS C K
Sbjct: 78 HPEQRSKVLEQMVEKNSSA----------------------------QLKCNCRKSNCQK 109
Query: 211 KYCECFQANILCSENCRCMDCKN 233
KYCEC+ A + CSE C+C DCKN
Sbjct: 110 KYCECYNAGVKCSELCKCDDCKN 132
>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
Length = 359
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 65/130 (50%), Gaps = 8/130 (6%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPNA 169
K C CK SRCLKLYC CF+ G YC D C C C N + A E V+ T L
Sbjct: 67 KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQKR 122
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
KI P + E+ +GC+CKKS CLKKYC+C+Q CS CRC
Sbjct: 123 MSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCD 182
Query: 230 DCKNFEGSEE 239
DC+N G E
Sbjct: 183 DCQNPFGKNE 192
>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
Length = 625
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 59/108 (54%), Gaps = 14/108 (12%)
Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
LYCECFA+G +C C+C NC N +N A + RNP AF KI ++ D +
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT-----DGE 334
Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
E+ + KKS CLKKYCECFQA +LC E C+C CKN
Sbjct: 335 EKHKKGCHC---------KKSACLKKYCECFQAGVLCQEYCKCEGCKN 373
>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|194699308|gb|ACF83738.1| unknown [Zea mays]
gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
gi|224028923|gb|ACN33537.1| unknown [Zea mays]
gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
Length = 359
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 61/127 (48%), Gaps = 2/127 (1%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K C CK SRCLKLYC CF+ G YC D C C C N E A L
Sbjct: 67 KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNK-EAFAETVHTTRKVLLSRQKRMSM 125
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KI P + E+ +GC+CKKS CLKKYC+C+Q CS CRC DC+
Sbjct: 126 KINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185
Query: 233 NFEGSEE 239
N G E
Sbjct: 186 NPFGKNE 192
>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
Length = 159
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 63/121 (52%), Gaps = 25/121 (20%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K CNCK S+CLK YC+CFA+G C + CNCV C NN N R+ A + R+P AF+
Sbjct: 60 KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFKQ 119
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
KGC+CKKSGC KKYCECF + C+ CRC C
Sbjct: 120 SF------------------------KGCNCKKSGCQKKYCECFLNGLTCTHLCRCDGCL 155
Query: 233 N 233
N
Sbjct: 156 N 156
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 27/37 (72%)
Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ K C+CKKS CLK+YC+CF +CSENC C+ C N
Sbjct: 58 NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFN 94
>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 159
Score = 89.4 bits (220), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/132 (41%), Positives = 71/132 (53%), Gaps = 22/132 (16%)
Query: 127 YCECFASGIYCDG-CNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIASS----- 177
YCECFA G+YC G C C +C N +E R+ +E+ RNP AF PKI
Sbjct: 1 YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIES---RNPLAFAPKIVQGAEPSP 57
Query: 178 -PHGTRDNKEETGEVLML---------GKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
P T E ++++ +H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58 VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117
Query: 228 CMDCKNFEGSEE 239
C C N G +E
Sbjct: 118 CEGCMNKYGKKE 129
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 7/69 (10%)
Query: 86 PVRVPKPESPKSRSRSNVEVKE---NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-D 138
P VP P + + + V++ +TP + ++ CNCK S CLK YCEC+ +G+ C +
Sbjct: 55 PSPVPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSE 114
Query: 139 GCNCVNCHN 147
GC C C N
Sbjct: 115 GCRCEGCMN 123
>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
Length = 214
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 70/137 (51%), Gaps = 15/137 (10%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C+C S C +LYC CF S +C D CNC C+N E A E + +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
I S D + T + M G KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 220 ILCSENCRCMDCKNFEG 236
+ C+ C+C +C N G
Sbjct: 129 VGCTAKCKCQECSNSFG 145
>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
Length = 371
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 15/93 (16%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ K
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKM---------GKG 324
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
G + +H+KGC+CK+SGCLK YCEC++A+
Sbjct: 325 RLGAAKL--RHSKGCNCKRSGCLKNYCECYEAS 355
>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 405
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 66/124 (53%), Gaps = 12/124 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
+ C+CK SRCLKLYC CFASG +C + C C C+N + + + P
Sbjct: 95 RTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCYN---------KPIHGVQQNTPPGLPL 145
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
++ + +++ E + + C CKKSGCLKKYC+C+Q CS NC+C DCK
Sbjct: 146 QVVRTAEAGQNSAEFA--RYQMDFFRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCK 203
Query: 233 NFEG 236
N G
Sbjct: 204 NPYG 207
>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
Length = 223
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 15/137 (10%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C+C S C +LYC CF S +C D CNC C+N E A E + +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
I S D + T + M G KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 220 ILCSENCRCMDCKNFEG 236
+ C+ C+C++C N G
Sbjct: 129 VGCTAKCKCLECSNSFG 145
>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
[Oryza sativa Japonica Group]
Length = 194
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 15/137 (10%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C+C S C +LYC CF S +C D CNC C+N E A E + +NPNAF P+
Sbjct: 9 KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68
Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
I S D + T + M G KH KGC C+KS C K YCECF+ +
Sbjct: 69 IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128
Query: 220 ILCSENCRCMDCKNFEG 236
+ C+ C+C++C N G
Sbjct: 129 VGCTAKCKCLECSNSFG 145
>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
Length = 164
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 76/143 (53%), Gaps = 29/143 (20%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
P + R + +K+ +KQ+QC CK S+CLK+YC+C A G YC C+C NCHN+ +
Sbjct: 18 PTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDK 77
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
+ R + +E +E+N +P + C+C+KS C K
Sbjct: 78 HPEQRNKVLEQMVEKN----------TPAQIK------------------CNCRKSNCQK 109
Query: 211 KYCECFQANILCSENCRCMDCKN 233
KYCEC+ A + CSE C+C DCKN
Sbjct: 110 KYCECYNAGVKCSELCKCDDCKN 132
>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
Length = 144
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/125 (42%), Positives = 67/125 (53%), Gaps = 22/125 (17%)
Query: 127 YCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
YCECFA+G+YC D C C +C N + R+ +E RNP AF PKI T
Sbjct: 1 YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIET---RNPLAFAPKIVQ----TS 53
Query: 183 DNKEETG-----------EVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
DN G + +H +GC+CKKS CLKKYCEC+QA + CS+ C+C C
Sbjct: 54 DNTPTEGVASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGC 113
Query: 232 KNFEG 236
KN G
Sbjct: 114 KNIYG 118
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ + ++ CNCK S CLK YCEC+ +G+ C DGC C C N
Sbjct: 76 SARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKN 115
>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
Length = 374
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 73/150 (48%), Gaps = 27/150 (18%)
Query: 109 TPKKQK-----------QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARR 156
TPK+QK C CK SRCLKLYC CFA G YC + C C C N +A
Sbjct: 70 TPKRQKVEESADGNGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFN----KALYA 125
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHN-------KGCHCKKSGCL 209
E V+ T R R K S R E E + H+ +GC+CKKS CL
Sbjct: 126 ETVQTT--RKVLLSRQKRMSLKINRRS--EANAEAVEDAHHSSSSTPPRRGCNCKKSSCL 181
Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEE 239
KKYC+C+Q CS CRC DC+N G E
Sbjct: 182 KKYCDCYQDGTGCSLFCRCEDCRNPFGKNE 211
>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
niloticus]
Length = 310
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 60/93 (64%), Gaps = 15/93 (16%)
Query: 147 NNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
++++ A+R+ L NP+AFRPKIA G + G HNKGC+CK+S
Sbjct: 160 SDIQKAASRKWC----LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRS 204
Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
GCLK+YCEC++ANI+C+ +C+C+ C+N++ +
Sbjct: 205 GCLKRYCECYEANIMCTSSCKCVGCRNYDAGSQ 237
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
K CNCK S CLK YCEC+ + I C C CV C N
Sbjct: 197 KGCNCKRSGCLKRYCECYEANIMCTSSCKCVGCRN 231
>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
Length = 191
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 15/93 (16%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ K
Sbjct: 98 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMG---------KG 144
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
G + +H+KGC+CK+SGCLK YCEC++A+
Sbjct: 145 RLGAAKL--RHSKGCNCKRSGCLKNYCECYEAS 175
>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
Length = 554
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 34/77 (44%), Positives = 50/77 (64%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+AF
Sbjct: 475 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFN 534
Query: 172 PKIASSPHGTRDNKEET 188
K + +D K +T
Sbjct: 535 LKHGEDSNNAKDVKRQT 551
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 43/95 (45%), Gaps = 24/95 (25%)
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
P++AS+P T + + K C+C KS CLK YC+CF C ++C C DC
Sbjct: 458 PQLASAP--------TTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDC 508
Query: 232 KN---FEGSEERQA------------LFHGDHANN 251
N +E ER L HG+ +NN
Sbjct: 509 FNNLDYEVERERAIRSCLDRNPSAFNLKHGEDSNN 543
>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 56/103 (54%), Gaps = 6/103 (5%)
Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML----GKHNKG 200
C N E E E + RNP AF PKI H T + +V + G+H G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
C+CK+S C+KKYCEC+QAN+ CS CRC C+N G +E A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
CNCK S C+K YCEC+ + + C + C C C N
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRN 534
>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
Length = 573
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/66 (56%), Positives = 50/66 (75%), Gaps = 8/66 (12%)
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI++S D+K + L GKHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDP---LNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484
Query: 229 MDCKNF 234
++C+N+
Sbjct: 485 IECRNY 490
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
N K K C CK S C+K YCECF + + C D C C+ C N
Sbjct: 449 NFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489
>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
subellipsoidea C-169]
Length = 81
Score = 84.3 bits (207), Expect = 2e-13, Method: Composition-based stats.
Identities = 40/95 (42%), Positives = 55/95 (57%), Gaps = 17/95 (17%)
Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 181
CLKLYC+CFA+G++C D C C +C N + A +A + + ++P AF+PK+ +
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDAS--- 57
Query: 182 RDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
G H KGC C+KS CLKKYCECF
Sbjct: 58 -------------GGHVKGCACRKSRCLKKYCECF 79
Score = 40.4 bits (93), Expect = 2.4, Method: Composition-based stats.
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 208 CLKKYCECFQANILCSENCRCMDCKN 233
CLK YC+CF + C++ C C DC+N
Sbjct: 1 CLKLYCDCFATGLFCNDACMCKDCEN 26
>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 313
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 62/122 (50%), Gaps = 20/122 (16%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNAFR 171
C CK RCLKLYC+CFA +C C C C N V +AARR A LERNP+AF
Sbjct: 201 CTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRLGAVVVQAARR----AVLERNPDAF- 255
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
H + G V GC C +S CLKKYC CFQ ++LC C+C C
Sbjct: 256 -------HRAHSLETRPGSVT----SGAGCGCTRSHCLKKYCVCFQKSLLCGAGCKCEGC 304
Query: 232 KN 233
N
Sbjct: 305 LN 306
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 33/77 (42%), Gaps = 2/77 (2%)
Query: 78 LPGQATQPPVRVPKPESPKSRSRSN-VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
L Q R +P + R++ +E + + C C S CLK YC CF +
Sbjct: 235 LGAVVVQAARRAVLERNPDAFHRAHSLETRPGSVTSGAGCGCTRSHCLKKYCVCFQKSLL 294
Query: 137 CD-GCNCVNCHNNVENE 152
C GC C C N VEN
Sbjct: 295 CGAGCKCEGCLNTVENR 311
>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
Length = 184
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 45/64 (70%)
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
+H KGC+CK+SGCLK YCEC++A ILCS C+C+ CKNFE S ER+ L A +
Sbjct: 3 RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62
Query: 256 QQAA 259
QQAA
Sbjct: 63 QQAA 66
Score = 45.4 bits (106), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
K CNCK S CLK YCEC+ + I C C CV C N
Sbjct: 6 KGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKN 40
>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN 148
K + +++ R NVE N+ + + Q CNC S CLK YC CF SG C D C C +C N
Sbjct: 128 KKQKDQNKLRRNVE---NSKELEIQPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNC 184
Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
E R EA+ ++ RD K E+L L + + GC CK +GC
Sbjct: 185 TEFAEERDEAINHVYKK--------------CHRDKKVPVNELLSL-QMSYGCKCKSTGC 229
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGS 237
KKYCECF+ +C E C C DC N S
Sbjct: 230 QKKYCECFKRGQICGEQCACEDCLNIPFS 258
>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
Length = 466
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 49/91 (53%), Gaps = 14/91 (15%)
Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
C N VEN + + RNPNAF KI E+ G+ +H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-----------EDDGDD---ARHTKGCHCK 216
Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFE 235
KS CLKKYCECFQA + C E C+C C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
S +E + + K C+CK S CLK YCECF +G+ C + C C C N
Sbjct: 198 SKIEDDGDDARHTKGCHCKKSACLKKYCECFQAGVKCQEYCKCEGCMN 245
>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
Length = 482
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 76/162 (46%), Gaps = 21/162 (12%)
Query: 84 QPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNC 142
Q + K + R +++ KE + K C CK S+C+KLYCECF S +C C C
Sbjct: 327 QKTSSIRKNQLSNIRKSTSLMNKEANLSECKGCTCKRSKCIKLYCECFLSKGFCGPSCTC 386
Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG------- 195
+C N ENE + E L RNP AF K+ EE + L +G
Sbjct: 387 NDCENMEENEEEIVKIREKILSRNPKAFEKKLQI---------EEASQPLGMGPKAASAP 437
Query: 196 ----KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+H KGC CK+S C YCEC Q C+E C C DC N
Sbjct: 438 SSNLRHIKGCTCKRSKCSNNYCECHQHGAKCTELCSCTDCNN 479
>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
CCMP2712]
Length = 125
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLE 164
+E+ K++ C C ++CLK YC+CF + I C C C +CHN+ ++E R EA+
Sbjct: 3 EESQQGKERGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRM 62
Query: 165 RNPNAFR-------PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
NP+AF+ + ++P G + K +GC CK+S C KKYCECF
Sbjct: 63 NNPSAFKGTALELEDQEVTTPKGGK-------------KTVRGCRCKRSKCQKKYCECFS 109
Query: 218 ANILCSENCRCMDCKN 233
A I C+ NC C DC N
Sbjct: 110 AGIPCTSNCVCTDCAN 125
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER--QALFH 245
+GC C ++ CLK+YC+CF+ +I C+ +C C DC N EE+ +A+ H
Sbjct: 11 RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRH 59
>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
Length = 204
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/124 (41%), Positives = 65/124 (52%), Gaps = 18/124 (14%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K C+CK SRCLKLYC C+ASG +C + C C CHN A P
Sbjct: 78 KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------P 123
Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
+ P T + +E E L+ + + C CKKSGCLKKYC+C+Q CS NC+C
Sbjct: 124 VLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 183
Query: 230 DCKN 233
DC+N
Sbjct: 184 DCRN 187
>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
Length = 272
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 63/129 (48%), Gaps = 16/129 (12%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC S CLK YC CF SG C D C C +C N + R EA+ ++
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------- 201
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
RD K E+L L + + GC CK +GC KKYCECF+ +C E C C C N
Sbjct: 202 -----CHRDKKVPVNELLSL-QMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNI 255
Query: 235 EGSEERQAL 243
++ ++ L
Sbjct: 256 PSAQNQKEL 264
>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 932
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 61/117 (52%), Gaps = 15/117 (12%)
Query: 124 LKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
LK YC+CFA+G +C C C C N+ +N+ E VE R + ++S +
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQ----EQVE----------RARQSASQRSST 864
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+ GE + ++GC CK++GCLK+YCECFQA C+ C C C N G E
Sbjct: 865 SGEALNGEPSASERSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
+ + C CK + CLK YCECF +G C C C C N
Sbjct: 878 RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLN 915
>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
Length = 341
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 75/169 (44%), Gaps = 23/169 (13%)
Query: 64 PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
P + PP PPP P + + V + + E RS N PK+Q
Sbjct: 5 PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60
Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
K C CK SRCLKLYC CFA G YC D C C C N+ A E V+ T L
Sbjct: 61 CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCF----NKEAFSETVQTTRKVLLSRQK 116
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
KI P + E+ +GC+CKKS CLKKYC+C+Q
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 32/67 (47%), Gaps = 4/67 (5%)
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
++A +P R EE + G K C CKKS CLK YC CF CS+ C C C
Sbjct: 40 QLAIAPDPKRQRVEEAAD----GNGCKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCF 95
Query: 233 NFEGSEE 239
N E E
Sbjct: 96 NKEAFSE 102
>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
Length = 377
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 48/70 (68%), Gaps = 3/70 (4%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVEN 151
P+S +S +E + + +K C+C SRCLKLYC+CF+SG+ C CNC+NCHNN EN
Sbjct: 309 PQSSESLRTLRLECES---RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAEN 365
Query: 152 EAARREAVEA 161
E+ R EA++
Sbjct: 366 ESQRHEAIKV 375
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 26/35 (74%), Gaps = 1/35 (2%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
K CHC KS CLK YC+CF + ++CS +C C++C N
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHN 361
>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
gi|194696492|gb|ACF82330.1| unknown [Zea mays]
Length = 278
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%)
Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
+N ++ E + R+ +E+ RNP AF PK+ + + ++ + + +H +GC+
Sbjct: 1 MNNQSHTETVLSTRQQIES---RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCN 57
Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
CKKS CLKKYCECFQ + CS +CRC CKN G E A+
Sbjct: 58 CKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 99
Score = 45.4 bits (106), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 51 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 88
>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 84
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 55/95 (57%), Gaps = 12/95 (12%)
Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
CLKLYC+CFA+G +C C+C +C N E ++ E R+P AF KI +P G
Sbjct: 1 CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAPGGG- 59
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
GKH +GC+CK+S CLKKYCECFQ
Sbjct: 60 -----------GGKHKRGCNCKRSHCLKKYCECFQ 83
>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
gi|238005872|gb|ACR33971.1| unknown [Zea mays]
Length = 272
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 55/96 (57%), Gaps = 3/96 (3%)
Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
+E + R+ +E+ RNP AF PK+ + D + + + +H +GC+CKKS C
Sbjct: 1 METVLSTRQQIES---RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSC 57
Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
LKKYCECFQ + CS +CRC CKN G E ++
Sbjct: 58 LKKYCECFQGGVGCSVSCRCEGCKNAFGKREGASVL 93
Score = 45.4 bits (106), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
+ ++ CNCK S CLK YCECF G+ C C C C N
Sbjct: 45 RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 82
>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
Length = 330
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 62/121 (51%), Gaps = 30/121 (24%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C CK ++CLK YC+CF G +C + C C++C NN +N +E +E T R +I
Sbjct: 216 CICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQN----KELIEQT--------RSQI 263
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
N+ EV C CKKS CLKKYCEC+ A C +C+C C N+
Sbjct: 264 ---------NRNLNDEV--------KCFCKKSKCLKKYCECYNAGKKCGVDCKCEGCMNY 306
Query: 235 E 235
E
Sbjct: 307 E 307
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 28/41 (68%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+GC CKK+ CLK YC+CFQ C E C C+DC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254
>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
Length = 391
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 80/191 (41%), Gaps = 25/191 (13%)
Query: 54 ANSHPPPQPQPKPV-------VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVK 106
A + P P+P + P P V P P S K
Sbjct: 9 ATNQPWPEPDDAHLSPPPPPPPASAPTPAAEAAAPTASAAVSPPAPAEANGNSDRKRRRK 68
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLER 165
+ K C+CK SRCLKLYC C+ASG +C + C C CHN A
Sbjct: 69 ADDGDGCKACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA-------- 120
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILC 222
P + P T + +E E L+ + + C CKKSGCLKKYC+C+Q C
Sbjct: 121 ------PVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGC 174
Query: 223 SENCRCMDCKN 233
S NC+C DC+N
Sbjct: 175 SINCKCDDCRN 185
>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/136 (38%), Positives = 69/136 (50%), Gaps = 20/136 (14%)
Query: 103 VEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARRE 157
+ VKE +PK K + C+C + CLK+YC CF +G C + C C +C N E R E
Sbjct: 65 IVVKEESPKLESNKWQPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHE 124
Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
AVE I H NK+ T E L K GC+CKK+ CLK+YCECF
Sbjct: 125 AVEY------------IKKKAHR---NKKVTQEKLFETKEVWGCNCKKTRCLKRYCECFI 169
Query: 218 ANILCSENCRCMDCKN 233
C+ +C C C+N
Sbjct: 170 RQKTCTVDCNCNYCEN 185
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
C C K+ CLK YC CF C E C+C DCKN +
Sbjct: 82 CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVD 116
>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
gi|219884557|gb|ACL52653.1| unknown [Zea mays]
gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
Length = 394
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 62/125 (49%), Gaps = 19/125 (15%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K C+CK SRCLKLYC CFASG++C + C C CHN + + R A L
Sbjct: 87 KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL--------- 137
Query: 173 KIASSPHGTRDNKEETGEVLM-LGKHNKGCH---CKKSGCLKKYCECFQANILCSENCRC 228
K + +D E+ M L + C C LKKYC+C+Q CS NC+C
Sbjct: 138 KPVQTFEAGQDIVEQVIRSPMDLIRRKCTCKKSGC-----LKKYCDCYQGGAGCSINCKC 192
Query: 229 MDCKN 233
DC+N
Sbjct: 193 DDCRN 197
>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
Length = 484
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 64/97 (65%), Gaps = 11/97 (11%)
Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
++ YC+CFASG C+ CNC NC NN+ +E R +A++A L+RNP AF+PKI P D
Sbjct: 295 VRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG--PGSLGD 352
Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
K +HN+GC+CK+SGCLK YCEC++ +
Sbjct: 353 AKP---------RHNRGCNCKRSGCLKNYCECYELAL 380
>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 51/119 (42%), Gaps = 23/119 (19%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC S C K+YCEC A G C C C NC N N+ E L +N + K
Sbjct: 72 CNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNKKVLHVIKELDLNKNLKKIKSK- 130
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ GC CKKS CLKKYCECF + C C C +C+N
Sbjct: 131 ---------------------RFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C CK S CLK YCECF SG C GCNC NC N + +EA + + ++ + F K
Sbjct: 136 CTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDEAQKN--ICTPNQKKTSKFSSKF 193
Query: 175 ASSPHGTRDNKEETGEVL 192
S+ TR E G VL
Sbjct: 194 ESANTNTRS--EYYGMVL 209
>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
Length = 710
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 67/140 (47%), Gaps = 40/140 (28%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECF +G++C + C+C C N N+E + RE +E+ RNP A
Sbjct: 439 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 495
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + E G+ L G + CS +CRC
Sbjct: 496 FAPKVI---------RTEPGQELAGG------------------------VGCSVSCRCE 522
Query: 230 DCKNFEGSEERQALFHGDHA 249
CKN G E AL + A
Sbjct: 523 GCKNAFGRREGVALLGIEEA 542
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 26/45 (57%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
G K C CKKS CLK YCECF A + CSE C C C N + E
Sbjct: 435 GTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNME 479
>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
Length = 303
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 60/119 (50%), Gaps = 18/119 (15%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC S CLKLYC+CF C + C C++C N + R+ A+E + K+
Sbjct: 183 CNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALE----------KIKM 232
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S ++ + +V GC C+KS C K YCECF+ N C+ +CRC DC N
Sbjct: 233 KSQRQKHDNDLFDLSKVW-------GCKCQKSQCQKNYCECFRRNQKCNSSCRCKDCAN 284
>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
CCMP1335]
Length = 141
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 58/117 (49%), Gaps = 12/117 (10%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C SRC+KLYC+CF G C+ CNC +C N E R ++ L+ NP AF
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSED- 93
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
S T+ + G GC C S CLKKYC CF A+ C++ C C DC
Sbjct: 94 -SDKFNTKRQRMSRGN---------GCACPSSHCLKKYCSCFGADAGCTDKCSCNDC 140
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN-FEGSEERQ 241
C C KS C+K YC+CF LC+ C C DCKN E EER+
Sbjct: 35 CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREERE 76
>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 65/132 (49%), Gaps = 16/132 (12%)
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEA 161
VE +N K + C+C + CLK+YC CF +G C + C C +C N N R +AVE
Sbjct: 69 VESPKNEIAKWQPCSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEY 128
Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
I H NK+ + E L K GC+CKK+ CLK+YCEC+
Sbjct: 129 ------------IKKKAHR---NKKVSQETLFETKDIWGCNCKKTRCLKRYCECYIRQKA 173
Query: 222 CSENCRCMDCKN 233
C+ C C C+N
Sbjct: 174 CTVECNCTHCEN 185
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 8/79 (10%)
Query: 86 PVRVPKPESPKSRSRSNVEVKENTPKKQKQ---CNCKHSRCLKLYCECFASGIYCD-GCN 141
P R E K ++ N +V + T + K CNCK +RCLK YCEC+ C CN
Sbjct: 120 PQRDKAVEYIKKKAHRNKKVSQETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECN 179
Query: 142 CVNCHN----NVENEAARR 156
C +C N ++ NE R+
Sbjct: 180 CTHCENGKDEDLYNEIRRQ 198
>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
Length = 163
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 66/140 (47%), Gaps = 15/140 (10%)
Query: 48 QSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKE 107
+SQA A Q P P QAT +P E+ + + +
Sbjct: 7 ESQAAVAAGSSSYQSMSTMQTPRDSLHSIPFEQQATLCEGTMPSSENADIIELNQMSPER 66
Query: 108 NTPKKQ--------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA---R 155
N+ K + ++CNCK SRCLKLYCECFA+G YC D C+C NC N E A
Sbjct: 67 NSKKAKYSTESEGCRRCNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHM 126
Query: 156 REAVEATLERNPNAFRPKIA 175
R+ +EA RNP AF PK+
Sbjct: 127 RQQIEA---RNPVAFAPKVV 143
Score = 47.0 bits (110), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
+ C+CK+S CLK YCECF A C + C C +C F SE A+ H
Sbjct: 81 RRCNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLH 125
>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
Length = 255
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 54/119 (45%), Gaps = 18/119 (15%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C S CLKLYC CF C D C C C N E+ R A+E K+
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALE------------KV 183
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
H +++ + L GC C+KS C K YCECF N CS +CRC DC N
Sbjct: 184 KQKLH-----RQKNDDDLFDRSKIWGCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237
>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
Length = 275
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 26/213 (12%)
Query: 33 FTGGFVGSVILPEHP------QSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPP 86
+ FV ++P+ P + N H P + P P Q +Q
Sbjct: 60 YKSWFVYHPVMPDDPFLSYLSYVSNIVQNPHQQYPDNPIKIENDQDIPQPQFQNQTSQIN 119
Query: 87 VRVPKPESPKSRSRSNVEVKENT----PKKQKQCNCKHSRCLKLYCECFASGIYC-DGCN 141
+ + S+ ++ +++K T + + CNC S CLK YC CF SG C D C
Sbjct: 120 LNKTIQQQKHSQPKNQIKIKTITNLYKASEIQPCNCSQSSCLKRYCTCFHSGRMCMDECQ 179
Query: 142 CVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGC 201
C +C N R A+ ++ RD E+L L + + GC
Sbjct: 180 CKDCKNCDLFSEEREAAINYVFKK--------------CNRDKNVPANELLSL-QISYGC 224
Query: 202 HCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
CK +GC KKYCEC + C + C C DC N
Sbjct: 225 KCKSTGCQKKYCECLKRGQACGDLCSCEDCLNI 257
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE-GSEERQA 242
C+C +S CLK+YC CF + +C + C+C DCKN + SEER+A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195
>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 56/119 (47%), Gaps = 18/119 (15%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C S CLKLYC+CF C + C C C N ++ R+ A+E + KI
Sbjct: 201 CKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKI 250
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S D+ + V GC C+KS C K YCEC+ N CS +CRC DC N
Sbjct: 251 KSQRQKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302
>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 319
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 11/81 (13%)
Query: 155 RREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
R +A+ + LERNP AF+PK+ + N ET + H +GC+C+KS CLK YCE
Sbjct: 20 RSQAIASILERNPLAFQPKVVA-------NGSETNNL----HHLRGCNCRKSNCLKNYCE 68
Query: 215 CFQANILCSENCRCMDCKNFE 235
C QA + C+ C C C N E
Sbjct: 69 CHQARVSCTNRCACHKCCNTE 89
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV 159
N + CNC+ S CLK YCEC + + C + C C C N E +A++ +
Sbjct: 47 NNLHHLRGCNCRKSNCLKNYCECHQARVSCTNRCACHKCCNTEEFHSAKKMLI 99
>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
Length = 372
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 54/111 (48%), Gaps = 18/111 (16%)
Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
YC C+ASG +C + C C CHN A P + P T +
Sbjct: 69 YCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------PVLPLKPVQTLEAG 114
Query: 186 EETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+E E L+ + + C CKKSGCLKKYC+C+Q CS NC+C DC+N
Sbjct: 115 QEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 165
>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
Length = 916
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 61/139 (43%), Gaps = 14/139 (10%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT--------- 162
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ +
Sbjct: 753 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIPWSVIPAHLRCR 812
Query: 163 -LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
+E P+ I + KE G L + N+ ++ C + I+
Sbjct: 813 NMEDRPDVDMDSIDNFADVANKRKELNGTYL---QDNRSNVYLTDDVIEATIMCMISRIV 869
Query: 222 CSENCR-CMDCKNFEGSEE 239
E R C D E EE
Sbjct: 870 MHEKQRMCKDSTEREVMEE 888
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 4/49 (8%)
Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN---FEGSEER 240
G K C+C KS CLK YC+CF C ++C C DC N +E ER
Sbjct: 751 GSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKRER 798
>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
Length = 524
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 42/59 (71%), Gaps = 3/59 (5%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
K++ CNC S+CLKLYC+CFA+G +C CNC +CHN +++ AR++A+ RN + F
Sbjct: 325 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIRC---RNTDQF 380
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
C+C KS CLK YC+CF C NC C DC N
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHN 360
>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
Length = 96
Score = 70.9 bits (172), Expect = 2e-09, Method: Composition-based stats.
Identities = 39/84 (46%), Positives = 47/84 (55%), Gaps = 9/84 (10%)
Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
AV RNPNAF K + K++ V+ H KGC CKKS CLKKYCECF
Sbjct: 2 AVNDAKNRNPNAFNIKFQ-----IIEEKQDDKIVI----HKKGCTCKKSNCLKKYCECFN 52
Query: 218 ANILCSENCRCMDCKNFEGSEERQ 241
A +LC++NC C CKN + E Q
Sbjct: 53 AGVLCNQNCVCDSCKNMQKEENNQ 76
Score = 44.7 bits (104), Expect = 0.13, Method: Composition-based stats.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARRE 157
+K C CK S CLK YCECF +G+ C+ C C +C N + E +++
Sbjct: 33 KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKNMQKEENNQQQ 78
>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
Length = 341
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 56/119 (47%), Gaps = 18/119 (15%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C S CLKLYC CF I C + C C +CHN + R +A+E + K+
Sbjct: 225 CKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKV 274
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
D+ + V GC C+KS C K YCECF N CS C+C +C+N
Sbjct: 275 KQQRRKHDDDLFDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 26/43 (60%), Gaps = 3/43 (6%)
Query: 194 LGKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
L +HN C C KS CLK YC CF NI CSE C+C DC N
Sbjct: 215 LDEHNLNNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257
>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 65/131 (49%), Gaps = 18/131 (13%)
Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
++ ENT K + C+C + CLK+YC CF +G C + C C +C N E R AVE
Sbjct: 72 QILENT--KWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEY- 128
Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
I H NK+ T E L K GC+CKK+ C K+YCECF C
Sbjct: 129 -----------IRKKAHR---NKKVTQEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTC 174
Query: 223 SENCRCMDCKN 233
+ +C C C+N
Sbjct: 175 TVDCNCNHCEN 185
>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
Length = 298
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 55/115 (47%), Gaps = 8/115 (6%)
Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPNAFRPKIASSP 178
C+ YC CFA G YC D C C C N + A E V+ T L KI P
Sbjct: 13 CICSYCPCFAGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQKRMSMKINRRP 68
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ E+ +GC+CKKS CLKKYC+C+Q CS CRC DC+N
Sbjct: 69 EANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 123
Score = 38.9 bits (89), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
++ CNCK S CLK YC+C+ G C C C +C N
Sbjct: 88 RRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 123
>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
Length = 421
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 55/111 (49%), Gaps = 18/111 (16%)
Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
YC CFASG +C + C C C+N + R P + T +
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRST--------------PALPLKAVQTLETG 173
Query: 186 EETGEVLMLGKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+++ E L+ + G C CKKSGCLKKYC+C+Q CS NC+C DCKN
Sbjct: 174 QDSVEQLIRSPMDFGRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKN 224
>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
Length = 246
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 60/125 (48%), Gaps = 17/125 (13%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVN-CHNNVENEAARREAVEATLERNPN 168
K Q+ CNCK S C+K YC CF SG C C C + C N N R EA++ N
Sbjct: 131 KCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEK 187
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
+R N++ E L GC C KS C K+YCECF N C+E C+C
Sbjct: 188 CYR------------NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKC 235
Query: 229 MDCKN 233
DC N
Sbjct: 236 FDCCN 240
>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 381
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 60/118 (50%), Gaps = 31/118 (26%)
Query: 98 RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC--DGCNCVNCHNNVE-NEAA 154
RSR + E PK K C CK++ CLKLYC CF G +C D C C++C+N E NE
Sbjct: 264 RSR---RLFEQVPKTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETG 320
Query: 155 --RREAVEATLERNPNAF--RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
R+EAV L R +AF RP K++TGE GC CKK+ C
Sbjct: 321 GKRQEAVSEILLRRIDAFESRP------------KKKTGE---------GCACKKNRC 357
Score = 45.4 bits (106), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKNFEGSEE 239
K C CK + CLK YC CFQ C + C+C+DC N E
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNE 318
>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 73/150 (48%), Gaps = 22/150 (14%)
Query: 89 VPKPESPKSRSRSNVEVKENTP----KKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNC 142
+ K + + ++ ++ +N P K Q+Q CNCK+S CLK YC CF SG C C C
Sbjct: 116 LQKKQENEEQAFKYFQILQNDPSEIQKLQQQPCNCKNSGCLKRYCRCFHSGRMCLKECQC 175
Query: 143 V-NCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGC 201
+C NN +++ R A+ + + +R N+ + L GC
Sbjct: 176 SEDCQNNEQHQEQRNNAI---IHVDQKCYR------------NRRMPRDALFKLDVIYGC 220
Query: 202 HCKKSGCLKKYCECFQANILCSENCRCMDC 231
C KS C K+YCEC+ N C++ C+C DC
Sbjct: 221 SCTKSKCRKRYCECYLRNQKCTDKCKCFDC 250
>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
Length = 240
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 56/129 (43%), Gaps = 23/129 (17%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
CNC S C K+YCEC A G C C C NCHN N+ E
Sbjct: 72 CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHNKTSNKLVLNVIEEL------------- 118
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
++ + + GC C+KS CLKKYCECF + C C+C++C+N+
Sbjct: 119 ---------DQNRNRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENY 169
Query: 235 EGSEERQAL 243
E + +
Sbjct: 170 VLDEVEKII 178
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C+ S CLK YCECF SG C GC CVNC N V +E + + + ++ + F K
Sbjct: 136 CTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENYVLDEVEK--IICSPNQKKTSKFSSKF 193
Query: 175 ASSPHGTR 182
S+ TR
Sbjct: 194 ESANTNTR 201
>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
Length = 216
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 16/127 (12%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
N K + C+C + CLK+YC CF +G C + C C +C N + R EAVE
Sbjct: 74 NESSKWQPCSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEY----- 128
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
I H NK+ E L K GC+CKK+ CLK+YCEC+ C+ C
Sbjct: 129 -------IKKKAHR---NKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVEC 178
Query: 227 RCMDCKN 233
C C+N
Sbjct: 179 NCNHCEN 185
>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
Length = 123
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 64/120 (53%), Gaps = 18/120 (15%)
Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
+C C + CLKLYCEC +G C D C CV+C N + R++A++ + NP AF
Sbjct: 14 RCTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKF-PERKKALKHIKKTNPKAFLR- 71
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S+ G D++ C C+KS C KKYC C+Q+ C+ C+C+DCKN
Sbjct: 72 -VSNLKGHEDSR--------------CCACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER-QALFHGDHANNMAYIQ 256
C C K+ CLK YCEC +A C++ C C+DC N ER +AL H N A+++
Sbjct: 15 CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71
>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
Length = 188
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 73/171 (42%), Gaps = 55/171 (32%)
Query: 69 PPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYC 128
P P P P Q Q ++ + E P + + +C C S+C K YC
Sbjct: 46 PKFFSPIRPFPIQ--QKITKIKQIEKPILNTIT-------------KCKCMKSKCQKSYC 90
Query: 129 ECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
ECFA+G C+ CNC C+N + +++ + K +
Sbjct: 91 ECFAAGKACNMDCNCQGCNNTICSQSIK-----------------------------KVQ 121
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
TG GC+CKK+GCLKKYCEC+ C+ C C+DC N E SE
Sbjct: 122 TG----------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSE 162
Score = 38.9 bits (89), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAAR 155
CNCK + CLK YCEC+ C CNC++C N ++E+ +
Sbjct: 125 CNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSESEK 165
>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 43/66 (65%), Gaps = 7/66 (10%)
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
FRPK+ S + N E+ + +H KGC+CKKS CLKKYCEC+Q + CSE C+C
Sbjct: 244 FRPKVES-----KSNSEDEQD--HKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296
Query: 230 DCKNFE 235
+CKN E
Sbjct: 297 ECKNCE 302
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 95 PKSRSRSNVE-VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
PK S+SN E +++ P+ K CNCK S CLK YCEC+ G+ C + C C C N
Sbjct: 246 PKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKN 300
>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 22/122 (18%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLE---RNPNAFR 171
C+C+ + CLK+YC CF +G +C C C C N E + R +AV+ + RN +
Sbjct: 89 CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKVPK 148
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI ET E+ GC+C K+ C+KKYCECF C+ C C C
Sbjct: 149 EKIF-----------ETVEIW-------GCNCSKTRCVKKYCECFIRGKKCTVECNCDHC 190
Query: 232 KN 233
N
Sbjct: 191 DN 192
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
C C+K+ CLK YC CF C ++CRC +C+N E
Sbjct: 89 CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTE 123
>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
Length = 523
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 51/64 (79%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 451 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 510
Query: 167 PNAF 170
P AF
Sbjct: 511 PEAF 514
>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
Length = 459
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 29/36 (80%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
KGC+CKK+GCLK+YCECFQ N C+ CRC CKN+
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 33/72 (45%), Gaps = 4/72 (5%)
Query: 80 GQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD- 138
G +PP KP K + + + +N K CNCK + CLK YCECF + C
Sbjct: 165 GDQEEPPTTRAKPIELKLQHQQQPKFVDNNAKG---CNCKKTGCLKRYCECFQNNKRCTI 221
Query: 139 GCNCVNCHNNVE 150
C C C N E
Sbjct: 222 KCRCQGCKNYHE 233
>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
Length = 285
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 17/128 (13%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVN-CHNNVENEAARREAVEATLER 165
N+ +Q+ C C++S CLK YC CF SG C C CV C NN ++ R +A++ E+
Sbjct: 152 NSELEQQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEK 211
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
NK + L K GC+CKK+ C YCECF C+ +
Sbjct: 212 ---------------CHRNKNVPKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMD 256
Query: 226 CRCMDCKN 233
C+C +C+N
Sbjct: 257 CQCDNCEN 264
>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
Length = 259
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 72/151 (47%), Gaps = 17/151 (11%)
Query: 84 QPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNC 142
+ P+ + S++ NV+ + K C+C + CL+LYC CF + C C C
Sbjct: 111 EEPIVNQRTRRKASKANYNVDDSPTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKC 170
Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
C N+ ++ +A+E + K + H D+ + T +V GC
Sbjct: 171 SECFNDGQHSDEVLKAIEQ--------IKIKEQRASHHDLDSFD-TRQVW-------GCK 214
Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKN 233
CKK+ C+K YCECF + CS +C+C +C+N
Sbjct: 215 CKKTQCIKGYCECFIRHKKCSSHCQCSNCQN 245
>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 78
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 37/58 (63%), Gaps = 1/58 (1%)
Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
CLKLYC+CFA + C D C C NC+N+ E+ RR AV L R+P+AF K + PH
Sbjct: 2 CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 2/38 (5%)
Query: 208 CLKKYCECFQANILCSENCRCMDCKNF--EGSEERQAL 243
CLK YC+CF ++CS+ CRC +C N G E R A+
Sbjct: 2 CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAV 39
>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 55/122 (45%), Gaps = 22/122 (18%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV---EATLERNPNAFR 171
C+C S CL++YC CF +G C C C C N +N R +AV + +RN
Sbjct: 89 CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPE 148
Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
KI ET E+ GC+C K+ C+K YCECF CS C C C
Sbjct: 149 GKIF-----------ETTEIW-------GCNCSKTRCVKNYCECFIRGKKCSVECNCQHC 190
Query: 232 KN 233
N
Sbjct: 191 DN 192
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
C C KS CL+ YC CF +C ++C+C +C+N E +
Sbjct: 88 SCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDN 125
>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 84/169 (49%), Gaps = 26/169 (15%)
Query: 80 GQATQPPVRVPKPESPKSRSR--SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC 137
QA+Q P ++ + S + S +V +T +CNC+ ++C+K YC C+A+ +C
Sbjct: 214 SQASQTPTKLISLQEQTSSQQISSFTKVHVDT-----ECNCQTTQCIKKYCSCYANNNFC 268
Query: 138 --DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD---NKEETGEVL 192
C C +C N + ++ +T ++ +PK+ SS + + L
Sbjct: 269 VPSICKCQSCMNTFQ------QSYNSTSQK-----KPKLISSQKSLSQVILYQIQNINQL 317
Query: 193 MLGKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
+ K N C C+KS C K+YC+C+++ CS C+C +C N +G +E
Sbjct: 318 KIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFN-QGQKE 365
Score = 39.3 bits (90), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARR 156
C C+ S C K YC+C+ SG C C C NC N + E +R
Sbjct: 328 CTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFNQGQKEVVQR 369
>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
Length = 139
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 40/133 (30%), Positives = 65/133 (48%), Gaps = 21/133 (15%)
Query: 103 VEVKENTPKKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE 160
++++ N ++ K+ C C+ ++CLKLYC+CF G C C C C N++ + R +A+
Sbjct: 8 LQIQGNDSEETKEGCQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNKAIG 67
Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
+ + + ++ K++ G +GC CKKS C YCECF
Sbjct: 68 KIIHKYKDDYKQKLS-------------------GDKGQGCCCKKSKCKLNYCECFIKGR 108
Query: 221 LCSENCRCMDCKN 233
C E C C C+N
Sbjct: 109 SCGEQCHCKKCEN 121
>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
Length = 262
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 37/155 (23%)
Query: 83 TQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQ--CNCKHSRCLKLYCECFASGIYCDG- 139
+QP + + + + + ++ +++ + K+K+ C+C +C YC C G C
Sbjct: 96 SQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTKRKCSSKYCFCAKVGQSCSNL 155
Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
C CV+C NN ++ + ++ T NA + K
Sbjct: 156 CKCVDCSNN---KSFMSQQIQVT-----NA--------------------------QEQK 181
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
GC CKK+ CLK YC+CF + CS NC+C+ CKN
Sbjct: 182 GCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKNM 216
>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
Length = 278
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 58/124 (46%), Gaps = 17/124 (13%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVN-CHNNVENEAARREAVEATLERNPNA 169
+Q+ C C++S CLK YC CF SG C C CV C NN ++ R A++ E+
Sbjct: 149 EQQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEK---- 204
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
NK + L K GC CKKS C YCECF C+ +C C
Sbjct: 205 -----------CHRNKNIPKDALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDCECQ 253
Query: 230 DCKN 233
+C+N
Sbjct: 254 NCEN 257
>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
Length = 242
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 61/132 (46%), Gaps = 26/132 (19%)
Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEA 161
+EVK+ P Q +C C S+C + YCECFA G C C C NC N N+ +E
Sbjct: 86 IEVKDLQP--QIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQNKQMNKK-----LEK 138
Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
++ R K +S ++ KGC C KS C K+C C Q
Sbjct: 139 IQKKLIKQKRIKQRTS------------------RYLKGCTCGKSKCQNKFCSCHQDGKY 180
Query: 222 CSENCRCMDCKN 233
C+ CRC+DCKN
Sbjct: 181 CNSGCRCVDCKN 192
>gi|38637223|dbj|BAD03489.1| hypothetical protein [Oryza sativa Japonica Group]
gi|38637276|dbj|BAD03540.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 141
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 11/85 (12%)
Query: 428 SDGSDGQKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVY 485
+D D QK RP+SPGT ALMCDEQD +F + +P+ ++ + + +++
Sbjct: 57 ADPFDIQKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIF 107
Query: 486 AEQERIVLTKFRDCLNRLITFGEIK 510
EQE+ VLT FRD L +L T G+++
Sbjct: 108 KEQEKRVLTNFRDYLCKLATCGKLQ 132
>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
Length = 254
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 35/117 (29%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN-NVENEAARREAVEATLERNPNAFRPK 173
C CK ++C+K YCECF G C C C C N V+ + + + V T+E
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCNKQVKQSSGKMQLVSPTIEF-------- 195
Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
+GC C+K C KKYCEC + N C++ C+C+D
Sbjct: 196 -------------------------QGCKCQKQNCSKKYCECLKRNQRCTDLCKCID 227
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
C CKK+ C+K YCECFQ CS C C C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176
>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 103
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 29/34 (85%)
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
+H KGC+CK+SGCLK YCEC++A I CS NC+C+
Sbjct: 60 RHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCI 93
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVN-CHNNV 149
P PK + S E + T + K CNCK S CLK YCEC+ + I C G C C+ CH
Sbjct: 43 PVQPKIGATSAAE--DATRRHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCIGECHAER 100
Query: 150 E 150
E
Sbjct: 101 E 101
>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
Length = 113
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 44/65 (67%), Gaps = 7/65 (10%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
K+C+CK S+CLKLYCECFAS +YC + C+C C N+ +E + R +E+ RNP A
Sbjct: 19 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIES---RNPLA 75
Query: 170 FRPKI 174
F PK+
Sbjct: 76 FAPKV 80
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
K C CKKS CLK YCECF + + CSE+C C C N EE
Sbjct: 19 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59
>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
Length = 677
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 199 KGCHCKKSGCLKKY--CECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQ 256
K C+C KS CLK + A I+CS C+C+ CKNFE S ER+ L H A + Q
Sbjct: 523 KPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQ 582
Query: 257 QAA 259
Q A
Sbjct: 583 QTA 585
>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 957
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 14/78 (17%)
Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARR------- 156
K N K K C CK ++CLKLYC CF+ I CD C+C C N ++E AR
Sbjct: 308 KSNKKKTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERC-ANTQDELARDGDNGMAM 366
Query: 157 ----EAVEATLERNPNAF 170
A ++ LERNP AF
Sbjct: 367 SAVVTARKSILERNPRAF 384
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
L K + C CKKS CLKKYCECF ++ C+ C+C +C N
Sbjct: 652 LTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
K+ + C CK S+CLK YCECFA + C C C NC N
Sbjct: 654 KQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGNQA 693
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKN 233
K +KGC CKK+ CLK YC CF +I C + C C C N
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCAN 351
>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
Length = 590
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 71/176 (40%), Gaps = 45/176 (25%)
Query: 67 VVPPVPPPPPPL--PGQATQPP----VRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKH 120
+VP + P G +T P +++P P S + S V+ K ++ C +
Sbjct: 422 LVPEIRPQTRNFRESGNSTIPDKDDRLQMPGPYSLFEITSSQVDKKIKCSCQKNSCTQAY 481
Query: 121 SRCLKLYCECFASGIYCD--GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
C+C +G CD C+C +C N +EN+ R E E ++
Sbjct: 482 -------CDCLKNGQACDPSSCSCASCLNTIENQNLRLEIQEKKQKQ------------- 521
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCMDCKN 233
G+ +GC CK S C K+YCECFQ C C+C DCKN
Sbjct: 522 ----------------GQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKN 561
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 11/55 (20%)
Query: 103 VEVKENTPKKQKQ------CNCKHSRCLKLYCECFASGIYCD--GCNCVNCHNNV 149
+E++E KKQKQ C+CK+S+C K YCECF +G CD C C +C NN+
Sbjct: 512 LEIQE---KKQKQGQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNL 563
>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
Length = 192
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 76/181 (41%), Gaps = 57/181 (31%)
Query: 59 PPQPQPKPVVPPVPPPPPPLPGQATQP-PVR--VPKPESPKSRSRSNVEVKENTPKKQKQ 115
P + P P++ P P A +P P++ + KP+ E++++ K
Sbjct: 28 PQKESPMPILKFFDPDESPNILSAKKPFPIQKNIVKPK----------EIEKSIVK---- 73
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C S+C K YCECFA G C CNC+ C+N
Sbjct: 74 CKCLKSKCQKSYCECFAVGRSCSIDCNCLGCNN--------------------------- 106
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
S + K + G GC+CKK+GC+KKYCEC+ C+ C C+DC N
Sbjct: 107 --SNCSQKIKKSQCG----------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQ 154
Query: 235 E 235
E
Sbjct: 155 E 155
>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
Length = 296
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQA 258
KGC CK++ C+K YC+C+Q+ +C E C+C+DCKN S ER + N+ A
Sbjct: 38 KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN---SVERPLVTDNLPKNSRRQRASA 94
Query: 259 ANAAITGAIGSSGYASPPVS 278
A + + G P +
Sbjct: 95 VAAKVEAGVLKVGLKPAPTT 114
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 11/61 (18%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
+P +PK+ S + K C CK + C+K YC+C+ S C + C CV+C N+V
Sbjct: 25 QPRTPKTPKESTI----------KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKNSV 74
Query: 150 E 150
E
Sbjct: 75 E 75
>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
Length = 296
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQA 258
KGC CK++ C+K YC+C+Q+ +C E C+C+DCKN S ER + N+ A
Sbjct: 38 KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN---SVERPLVTDNLPKNSRRQRASA 94
Query: 259 ANAAITGAIGSSGYASPPVS 278
A + + G P +
Sbjct: 95 VAAKVEAGVLKVGLKPAPTT 114
Score = 45.4 bits (106), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 11/61 (18%)
Query: 91 KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
+P +PK+ S + K C CK + C+K YC+C+ S C + C CV+C N+V
Sbjct: 25 QPRTPKTPKESTI----------KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKNSV 74
Query: 150 E 150
E
Sbjct: 75 E 75
>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
Length = 754
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 79/177 (44%), Gaps = 33/177 (18%)
Query: 20 GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPP--VPPP--- 74
G IP RQ+ SV+ P S+ + HP Q Q + P V PP
Sbjct: 149 GQIPHHAYQRQMS-----SASVVQGPGPSSRGFRS-YHPIEQFQQRQAPPSHYVYPPYVH 202
Query: 75 --PPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFA 132
P +P TQ P P+P+ R RS + K C+C ++CLKLYC CFA
Sbjct: 203 VAPTVVPTVHTQKP---PQPQVLTLRQRSASD---------KGCSCSKTQCLKLYCSCFA 250
Query: 133 SGIYC--DGC---NCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
+ + C D C +C NC + A +A + L +NP AF+ K S GTR++
Sbjct: 251 NNLLCNKDVCICSDCKNCEAESTEDGAILKARRSILRKNPKAFKSKFQS---GTREH 304
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)
Query: 190 EVLMLGKH---NKGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKNFEG 236
+VL L + +KGC C K+ CLK YC CF N+LC+++ C C DCKN E
Sbjct: 220 QVLTLRQRSASDKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEA 270
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 24/37 (64%)
Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
H GC C+KS CLKKYCECF + C +C C +C N
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENE 152
C C+ SRCLK YCECF++G+ C C+C NC N E
Sbjct: 425 CKCRKSRCLKKYCECFSNGVKCGASCSCENCGNQPAQE 462
>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
Length = 151
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 65/141 (46%), Gaps = 20/141 (14%)
Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
N+ + ++ C C+ ++CLKLYC+CF G C + C+C C N++ + R +A+ + +
Sbjct: 19 NSEETKEGCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKY 78
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
+ + K + G GC CKKS C YCECF C + C
Sbjct: 79 KDESKLKFS-------------------GDKGSGCCCKKSKCQLNYCECFIKGRSCGDQC 119
Query: 227 RCMDCKNFEGSEERQALFHGD 247
C C+N + ++ H D
Sbjct: 120 HCKKCQNGKKQNQKICANHED 140
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 27/55 (49%), Gaps = 9/55 (16%)
Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
H N EET E GC C+K+ CLK YC+CF C E C C CKN
Sbjct: 14 HSYAMNSEETKE---------GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKN 59
>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
Length = 324
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 30/37 (81%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N E
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRNTE 139
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
+N ++ K C CK S+C+K YC+C+ S + C C CV C N
Sbjct: 90 ANAQIPVQCQATMKGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN 137
>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
Length = 250
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 34/43 (79%), Gaps = 3/43 (6%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
KGC CK+S C+K YC+C+Q+ +CS++CRC+ C+N ++ER+
Sbjct: 24 KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRN---TQERK 63
Score = 42.7 bits (99), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENE 152
K C+CK S+C+K YC+C+ S C C C+ C N E +
Sbjct: 24 KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRNTQERK 63
>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 79/183 (43%), Gaps = 40/183 (21%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
+N PK K C C S CLK YC+CFA+G C C C CHN + R EA+ +
Sbjct: 50 QNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQ 108
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
N + + P G +++ LK+YC+CFQ N CSE+
Sbjct: 109 NQSIEKE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCSES 146
Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
C+C +C N + + + GD Q+ +I+ +S A P+ R+G
Sbjct: 147 CQCNNCSN-----QVEKVEQGD--------QRLMMNSISQFDENSRLAKSPIVYNRQG-- 191
Query: 286 LFF 288
FF
Sbjct: 192 -FF 193
>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
Length = 244
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 54/126 (42%), Gaps = 24/126 (19%)
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
+PK Q +C C S+C + YCECFA G C C C NC N N+
Sbjct: 90 SPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQNMQMNKKL------------- 136
Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
KI + K+ T L KGC C KS C +C C Q C+ C+
Sbjct: 137 ----GKIQKRLVKQKRIKQRTPRYL------KGCTCGKSKCQNNFCSCHQDGKYCNSECK 186
Query: 228 CMDCKN 233
C+DCKN
Sbjct: 187 CVDCKN 192
>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila]
gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
thermophila SB210]
Length = 622
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 54/116 (46%), Gaps = 14/116 (12%)
Query: 64 PKPVVPPVPPPPPPL-PGQATQPPVRVPKPESPKSRSRSNVEVKE-------NTPKKQKQ 115
P+ ++P + QA + K + P ++ +S KE TP
Sbjct: 274 PQQILPNFDQSSLSINANQAIVINNNMQKKQQPNNKLQSIDSSKEGEILRVSKTP----- 328
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
C C+ S CLKLYCECFA G C C C+ C N+ + R EA++ +NP+AF
Sbjct: 329 CKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384
>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
Length = 289
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 98/233 (42%), Gaps = 35/233 (15%)
Query: 14 SQSESGGDIPAKKLA-----RQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVV 68
S SE GD+P +L LD G+ + E A++ + P Q Q +
Sbjct: 50 SSSELIGDVPVPRLTLFVSQMSLDENQGYYKKQMSLE-----ALSPTKYSPQQKQENQIR 104
Query: 69 PPVPPPPPPLPGQATQPPVR----VPKPESPKSRSRSNVEVKENTPKKQKQ---CNCKHS 121
+ Q ++ V + ++ + +++N V +N+P ++ + C+C +
Sbjct: 105 QDAQEIKQLEIQEIKQVDIQEVQNVQQIQTRRKVNKANYNV-DNSPVEEDEMKPCHCTKT 163
Query: 122 RCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHG 180
CL+LYC CF + C C C +C+N+ ++E +AVE + A + S
Sbjct: 164 HCLQLYCSCFHNRRQCMSECKCNDCYNDGQHEEDVLKAVEQIKMKEQRASHHDLDSF--- 220
Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
D K+ G K KG YCECF C+ +C+C +C+N
Sbjct: 221 --DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 260
>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
Length = 237
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
KGC CK+S C+K YC+C+Q+ +C+ CRC+DC+N E
Sbjct: 26 KGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNTE 62
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 4/58 (6%)
Query: 94 SPKSRSRSNVE---VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
+PK +S E K P K C CK S+C+K YC+C+ S C C C++C N
Sbjct: 3 TPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRN 60
>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
Length = 514
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 43/114 (37%), Gaps = 44/114 (38%)
Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
L YC+C A+G+YC D C C NC N ENE + E R+P AF P+I
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
A I CS+ CRC DC+N G
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFG 314
>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
Length = 229
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 24/128 (18%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
+N PK K C C S CLK YC+CFA+G C C C CHN + + R EA+ +
Sbjct: 50 QNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQ 108
Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
N + + P G +++ + K+YC+CFQ N C E+
Sbjct: 109 NQSIEKE----VPVGISCKCKKSKCL------------------KRYCDCFQNNQKCHES 146
Query: 226 CRCMDCKN 233
C+C +C N
Sbjct: 147 CQCSNCSN 154
Score = 42.7 bits (99), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 24/39 (61%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
C C KS CLK+YC+CF +C+ C+C C N E +E
Sbjct: 58 CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKE 96
>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
Length = 239
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 29/37 (78%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
KGC CK+S C+K YC+C+Q+ +C++ CRC+ C+N E
Sbjct: 26 KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 7/55 (12%)
Query: 100 RSNVEVKENT------PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
+ NVE E+ P K C CK S+C+K YC+C+ S C C C+ C N
Sbjct: 6 KKNVERTESRRGKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRN 60
>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
Length = 225
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N +EER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 96 KSRSRSNVEVKENTPKKQ--KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENE 152
+S + N E + PK+Q K C CK S+C+K YC+C+ S C C C+ C N E
Sbjct: 5 RSGNYFNKEQSKRQPKQQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRNTEERL 64
Query: 153 AA----RREAVEATLER 165
A ++ A+ A ER
Sbjct: 65 AVATPPKKTALAAKRER 81
>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 636
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%)
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
GC C++S CLKKYCECFQ C NC+C +C+N
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
Q C C+ S CLK YCECF + C C C NC N
Sbjct: 243 QFGCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQN 278
>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
Length = 225
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
+GC CK+S C+K YC+C+Q+ +CS +C+C+ C+N +E+R A+
Sbjct: 22 RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAA----RREAVEATLER 165
C CK S+C+K YC+C+ S C C C+ C N + A + A+ A ER
Sbjct: 24 CCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRNTEQRMAVTTPPLKTAIAAKRER 78
>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
Length = 97
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 112 KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
+ + C C S C+KLYC CF++ YC C C NC N E E V+ +NP AF
Sbjct: 5 QHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAF 64
Query: 171 RPKIA 175
PKI
Sbjct: 65 DPKIV 69
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%)
Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
M ++ C C KS C+K YC CF A CS+NCRC +CKN E E+
Sbjct: 1 MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47
>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
Length = 234
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
KGC CK+S C+K YC+C+Q+ +CS C+C+ C+N ++ER A+
Sbjct: 25 KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 11/80 (13%)
Query: 97 SRSRSNVEVKENTPKKQ------KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV 149
S+ RS + + + PK+Q K C CK S+C+K YC+C+ S C C CV C N
Sbjct: 2 SQVRSGIYLSKEHPKRQPKPQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRNTQ 61
Query: 150 ENEAA----RREAVEATLER 165
E A ++ A+ A ER
Sbjct: 62 ERLAVATPPKKTALAAKRER 81
>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
Length = 240
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 27/35 (77%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
KGC CK+S C+K YC+C+Q+ +C++ CRC C+N
Sbjct: 23 KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 5/59 (8%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFR 171
K C+CK S+C+K YC+C+ S C C C C N V E +V +N NA +
Sbjct: 23 KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRNTVVRELVDPNSV----AKNSNAVK 77
>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
Length = 264
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 61/138 (44%), Gaps = 17/138 (12%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
S++ NV+ + K C+C + CL+LYC CF + C C C +C+N+ ++
Sbjct: 129 SKANYNVDESPTLDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEV 188
Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
+AVE + A + S D ++ G K KG YCEC
Sbjct: 189 LKAVEQIKIKEQRASHHDLDSF-----DTRQVWGCKCKKTKCVKG-----------YCEC 232
Query: 216 FQANILCSENCRCMDCKN 233
F N CS +C C DC+N
Sbjct: 233 FIRNKKCSSHCHCSDCQN 250
>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
Length = 280
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 17/119 (14%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C+C + CL+LYC CF + C C C +C+N+ ++E ++AVE + A +
Sbjct: 151 CHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEEEVQKAVEQIKIKEQRASHHDL 210
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
S D K+ G K KG YCECF C+ +C+C +C+N
Sbjct: 211 DSF-----DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 253
>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 47/114 (41%), Gaps = 33/114 (28%)
Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
L YCECFA+G+YC + C+C++C NN +E E RNP AF PK+
Sbjct: 8 LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIR------ 61
Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
N + E F+ + CS CRC CKN G
Sbjct: 62 -NLDSASE-------------------------FRGGVGCSSYCRCEGCKNTFG 89
>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
Length = 243
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 28/38 (73%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
C CK+S C+K YC+C+Q+ +C++ CRC+ C+N E E
Sbjct: 28 CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65
>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 75/161 (46%), Gaps = 21/161 (13%)
Query: 77 PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---KQCNCKHSRCLKLYCECFAS 133
P+ + + +P ++ + +++N V +++P ++ K C+C + CL+LYC CF +
Sbjct: 85 PIEMKQVEQVEEIPHIQTRRKANKTNYNV-DDSPTEEDDFKACHCSKTHCLQLYCSCFHN 143
Query: 134 GIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL 192
C + C C +C N+ ++E +AVE + A + S D K+ G
Sbjct: 144 RRPCTNECKCNDCFNDGKHEDEVIKAVEQIKLKEQRASHHDLDSF-----DTKQVWGCKC 198
Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
K KG YCECF C+ +C+C +C+N
Sbjct: 199 KKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 228
>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
Length = 177
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 48/109 (44%), Gaps = 3/109 (2%)
Query: 88 RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL--YCECFASGIYC-DGCNCVN 144
R+ P + K S + V +K + +K C+ K K YC CF++ YC C C N
Sbjct: 59 RLLSPAASKQGSTAGVALKRDEKEKGGICHFKKRARGKQESYCLCFSAYGYCSQNCRCTN 118
Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLM 193
C N E E V+ +NP AF PKI + + + VLM
Sbjct: 119 CKNREYYEDFVEERVDMIKMKNPRAFDPKIVRVQDASEIEPQSSNAVLM 167
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 31/60 (51%), Gaps = 12/60 (20%)
Query: 182 RDNKEETGEVLMLGKHNKGCHCKKS--GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
RD KE+ G CH KK G + YC CF A CS+NCRC +CKN E E+
Sbjct: 78 RDEKEKGGI----------CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127
>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
Length = 226
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 61/155 (39%), Gaps = 43/155 (27%)
Query: 81 QATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-G 139
+ Q V V K P N+ EN C C+ S+C++ YC C A+ C
Sbjct: 99 ELNQNNVIVNKLVKPSDDELFNLSRDENIV-----CKCRKSKCIQNYCVCSANNQECSFK 153
Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
C C NC N +PK A S K+ E L
Sbjct: 154 CECYNCSNK----------------------QPKSAIS-------KQSGSEYL------- 177
Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMD-CKN 233
GC+C+K C K YCEC + I C+ CRC D CKN
Sbjct: 178 GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212
>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 54/121 (44%), Gaps = 17/121 (14%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C+C + CL+LYC CF + C + C C +C+N+ ++E +AVE + A +
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAVEQIKLKEQRASHHDL 208
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
S D K+ G K KG YCECF C+ +C+C + K
Sbjct: 209 DSF-----DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTELKTI 252
Query: 235 E 235
E
Sbjct: 253 E 253
>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
Length = 289
Score = 46.6 bits (109), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
C CK+S C+KKYCECFQ + C+ C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190
Score = 38.5 bits (88), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNV 149
C CK SRC+K YCECF + C C C +C N +
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGNTI 192
>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
Length = 263
Score = 45.4 bits (106), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 17/119 (14%)
Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C+C ++CL+LYC CF C C C C N+ ++ E V+A + +R +
Sbjct: 147 CHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKV---EMVKALQKVKLKEYR--V 201
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
+ S + D ++ G KG YCECF C+ C+C +C N
Sbjct: 202 SQSDLDSFDTRQVLGCKCKKTHCKKG-----------YCECFIRGRKCTSQCKCCECNN 249
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
K CHC K+ CL+ YC CF N CS C+C C N
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFN 179
>gi|145483717|ref|XP_001427881.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394964|emb|CAK60483.1| unnamed protein product [Paramecium tetraurelia]
Length = 219
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 51/122 (41%), Gaps = 38/122 (31%)
Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C+ S+C++ YC C A+ C C C NC N ++P + PK+
Sbjct: 122 CKCRKSKCIQNYCVCSANNQECKSNCECYNCSN-----------------KSPKSAPPKM 164
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM-DCKN 233
+S GC+C+K C K+YCEC + C+ C C DC+N
Sbjct: 165 SSIEFN-------------------GCNCRKQNCSKRYCECQKRMTRCTSRCNCCEDCEN 205
Query: 234 FE 235
E
Sbjct: 206 QE 207
>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
Length = 235
Score = 45.1 bits (105), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 47/116 (40%), Gaps = 37/116 (31%)
Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
C C+ S+C++ YC C A+ C C C NC N + P + ++
Sbjct: 138 CKCRRSKCIQNYCVCSANNQECKLNCECYNCSN-----------------KQPKSTPSQL 180
Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
+S GC+C+K C K+YCEC + NI C+ C C +
Sbjct: 181 SSIEFN-------------------GCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217
>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
Length = 261
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV 159
+N PK + C C S CLK YC+CFA+G C + C C CHN+ E R EA+
Sbjct: 82 QNVPKSLR-CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHNSEEYMEEREEAI 134
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
C C KS CLK+YC+CF +C+ C+C C N
Sbjct: 90 CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122
>gi|221487871|gb|EEE26103.1| Radical SAM domain-containing protein, putative [Toxoplasma gondii
GT1]
Length = 1083
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)
Query: 363 SEAAKSVAVQRNTTEKQAEDRAGT--SVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
SE AK++A+ EK+ D G + + QG +QS + P + + S++ AD+
Sbjct: 412 SEPAKAIAL--TGVEKEGADLLGEDGAAGTDMQGTIQSREIPVFNNIPSTGGSASRTADR 469
Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
P S + S P GT E+ T A SSP+ L G + Q P
Sbjct: 470 --PLQSRTSASA-----PRETGTTGTTVREEPTSETANSSPSPLIPFGFGRAPQPPTEMA 522
Query: 481 MTEVYAEQERI-------VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGT 533
++ + A + VL ++R+ L+RL +F E ++ + R E G+ LSN
Sbjct: 523 LSRLLAPTSPLPVMGRPRVLGEYRNGLHRLQSFAEPLTLRGPATLRVENGAALLFLSNAW 582
Query: 534 ASSRTGT 540
S+ + T
Sbjct: 583 TSAESAT 589
>gi|123410061|ref|XP_001303597.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884989|gb|EAX90667.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 162
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
+++ PK CNC+ CL C CF YC DGC C C N E E R + E L
Sbjct: 1 MEQTEPKCASCCNCE-GNCLSTTCPCFLHSKYCCDGCKCQKCRNKKEYEQERVASFEQHL 59
Query: 164 ERNPNAF 170
NP AF
Sbjct: 60 LENPLAF 66
>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
Length = 176
Score = 42.4 bits (98), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 16/22 (72%), Positives = 17/22 (77%)
Query: 113 QKQCNCKHSRCLKLYCECFASG 134
+K CNC S CLKLYCECFA G
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQG 165
>gi|237830953|ref|XP_002364774.1| radical SAM domain-containing protein [Toxoplasma gondii ME49]
gi|211962438|gb|EEA97633.1| radical SAM domain-containing protein [Toxoplasma gondii ME49]
gi|221507654|gb|EEE33258.1| Radical SAM domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 1083
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 20/188 (10%)
Query: 363 SEAAKSVA---VQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQAD 419
SE AK++A VQ+ + ED A + QG +QS + P + + S++ AD
Sbjct: 412 SEPAKAIALTGVQKEGADLLGEDGAA---GTDMQGTIQSREIPVFNNIPSTGGSASRTAD 468
Query: 420 KVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQ 479
+ P S + S P GT E+ T A SSP+ L G + Q P
Sbjct: 469 R--PLQSRTSASA-----PRETGTTGTTVREEPTSETANSSPSPLIPFGFGRAPQPPTEM 521
Query: 480 GMTEVYAEQERI-------VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
++ + A + VL ++R+ L+RL +F E ++ + R E G LSN
Sbjct: 522 ALSRLPAPTSPLPVMGRPRVLGEYRNGLHRLQSFAEPLTLRGPATLRVENGGALLFLSNA 581
Query: 533 TASSRTGT 540
S+ + T
Sbjct: 582 WTSAASAT 589
>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
C-169]
Length = 556
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 44/70 (62%), Gaps = 8/70 (11%)
Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
+RNP AF+ KI +G + +G + G H +GCHCKKS C KKYCECFQA + C
Sbjct: 162 QRNPQAFQEKIEKQANG-----QGSG---VSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213
Query: 224 ENCRCMDCKN 233
+C+C +C N
Sbjct: 214 VHCKCDECHN 223
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.127 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,784,014,009
Number of Sequences: 23463169
Number of extensions: 434168873
Number of successful extensions: 4556111
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 7289
Number of HSP's successfully gapped in prelim test: 11037
Number of HSP's that attempted gapping in prelim test: 3852991
Number of HSP's gapped (non-prelim): 396011
length of query: 602
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 453
effective length of database: 8,863,183,186
effective search space: 4015021983258
effective search space used: 4015021983258
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 80 (35.4 bits)