BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007469
         (602 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
          Length = 592

 Score =  848 bits (2190), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/612 (71%), Positives = 487/612 (79%), Gaps = 30/612 (4%)

Query: 1   MGEGEGGEFPPKKSQSESGG------DIPAKKLARQLDFT--GGFVGSVILPEHPQSQAV 52
           M +GEGG+FPPKK QS++ G      DIP KKLARQLDFT  GG   +V+LPEHPQSQ  
Sbjct: 1   MEQGEGGDFPPKKVQSDTAGTAAVTSDIPLKKLARQLDFTAFGGASAAVVLPEHPQSQPQ 60

Query: 53  TANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK 112
               HP        V  PVPP       Q T P VRV KPESP+SR R N++VK+ TPKK
Sbjct: 61  AQAQHP--------VTMPVPP-------QTTHPSVRVVKPESPRSRPRPNIDVKDGTPKK 105

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE TLERNPNAFRP
Sbjct: 106 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRP 165

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KIASSPHG RD++EE+GE L+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCK
Sbjct: 166 KIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 225

Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
           NFEGSEERQALFHGDHAN+MAYIQQAANAAITGAIGSSG+ SPPVSKKRKGQELF+G T+
Sbjct: 226 NFEGSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTS 285

Query: 293 KDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDL 351
           KDPS+HRL  FQQ +  +    +S+LSS PV+R  ++AA G+SKFTYRSLLADIIQPQDL
Sbjct: 286 KDPSLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDL 345

Query: 352 KELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDD 411
           KELCSVLVV+S EAA++ A +RN  EK AE++  TS ASS Q RLQS KE D+++ V DD
Sbjct: 346 KELCSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADD 405

Query: 412 CSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT 471
           CSS NQADK+GPDDS SDG D  K RPMSPGTLALMCDEQDT+FMAA+SPN L GH CNT
Sbjct: 406 CSSRNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNT 465

Query: 472 SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSN 531
           SSQLPYGQG+TE YAEQERI+LTKFRDCLNRLITFGEIKE K SSLARTE G Q +  SN
Sbjct: 466 SSQLPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSN 525

Query: 532 GTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE 591
           G A++RT TGNQ   VSNGVSK   P     PA KMSQ VTA     NN L   P+   E
Sbjct: 526 GPANARTDTGNQQVPVSNGVSKPPFP-----PATKMSQTVTAAAASPNNTLLSKPSTHTE 580

Query: 592 -GDVKLKTENKM 602
            GD+ LKTE ++
Sbjct: 581 NGDINLKTEKEL 592


>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
 gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
          Length = 601

 Score =  828 bits (2138), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/614 (70%), Positives = 473/614 (77%), Gaps = 29/614 (4%)

Query: 1   MGEGEGGEFPPKK---SQSESGGDIPAKKLARQLDFT-----GGFVGSVILPEHPQSQ-- 50
           MGEGEG EFPPKK   SQSES  D P KKLARQLDFT          +V+LPEHPQSQ  
Sbjct: 1   MGEGEGNEFPPKKASSSQSESL-DFPTKKLARQLDFTTQGNNNNNNNNVVLPEHPQSQQQ 59

Query: 51  --AVTANSHPP---PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEV 105
             A  A    P   PQ QP+ V  P  P  PP      QPPVR  KPESPKS+ R N E+
Sbjct: 60  PPATVALPQEPVVAPQ-QPQAVAIPEAPLLPP------QPPVRAIKPESPKSKPRPNAEL 112

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLER 165
           K+ TPK+QKQCNCKHSRCLKLYCECFASG YCDGCNCVNC+NNVENEAARREAVEATLER
Sbjct: 113 KDGTPKRQKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLER 172

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           NPNAFRPKIASSPHGTRD++EE GE L+LGKHNKGCHCKKSGCLKKYCECFQANILCSEN
Sbjct: 173 NPNAFRPKIASSPHGTRDSREENGEGLILGKHNKGCHCKKSGCLKKYCECFQANILCSEN 232

Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
           C+CMDCKNFEGSEERQALFHGDH NNMAYIQQAANAAITGAIGSSGY SPP+SKKRKGQE
Sbjct: 233 CKCMDCKNFEGSEERQALFHGDHTNNMAYIQQAANAAITGAIGSSGYVSPPISKKRKGQE 292

Query: 286 LFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSI-PVARASSTAAVGSSKFTYRSLLAD 344
           L FG T KDPS HRLGHFQQ  HIR S SS+  S  P+ARA S+A +G SKFTYRSLLAD
Sbjct: 293 LLFGPTTKDPSFHRLGHFQQAVHIRPSTSSSSLSSNPIARAGSSATLGPSKFTYRSLLAD 352

Query: 345 IIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDV 404
           IIQPQDLKELCSVLVVLS EAAK++A QRN TE   ED+   S+ASS Q RLQS K    
Sbjct: 353 IIQPQDLKELCSVLVVLSGEAAKTLAGQRNATENWVEDQTEPSLASSPQERLQSQKGAGA 412

Query: 405 DRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRL 464
             ++  DCSSANQADK G   S SDG D  K RPMSPGTLALMCDE+DT+FM A+SPN L
Sbjct: 413 GNSIAHDCSSANQADKTGHGSSSSDGVDVPKGRPMSPGTLALMCDEEDTMFMTAASPNGL 472

Query: 465 TGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGS 524
           TG GC+T+SQ P GQGMTE+Y EQERIVLTKFRDCLNRLITFGEIKE K SS ARTE+G+
Sbjct: 473 TGRGCSTTSQFPCGQGMTEIYTEQERIVLTKFRDCLNRLITFGEIKETKCSSFARTELGN 532

Query: 525 QKDALSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRH 584
            KD +SNGT ++R   GNQ    SNGV +   P     P +K SQMV +TV  S+NDL  
Sbjct: 533 PKDPVSNGTTNARAEVGNQQVPYSNGVGRHVIP-----PTSKTSQMVGSTVATSSNDLSR 587

Query: 585 NPALPGEGDVKLKT 598
            P LP  GD K KT
Sbjct: 588 IPTLPQNGDNKPKT 601


>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
          Length = 524

 Score =  792 bits (2046), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/528 (75%), Positives = 440/528 (83%), Gaps = 7/528 (1%)

Query: 77  PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
           P+P Q T P VRV KPESP+SR R N++VK+ TPKKQKQCNCKHSRCLKLYCECFASGIY
Sbjct: 2   PVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCECFASGIY 61

Query: 137 CDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGK 196
           CDGCNCVNCHNNVENEAARREAVE TLERNPNAFRPKIASSPHG RD++EE+GE L+LGK
Sbjct: 62  CDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGEALVLGK 121

Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQ 256
           HNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGDHAN+MAYIQ
Sbjct: 122 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQ 181

Query: 257 QAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSS 315
           QAANAAITGAIGSSG+ SPPVSKKRKGQELF+G T+KDPS+HRL  FQQ +  +    +S
Sbjct: 182 QAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTS 241

Query: 316 TLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNT 375
           +LSS PV+R  ++AA G+SKFTYRSLLADIIQPQDLKELCSVLVV+S EAA++ A +RN 
Sbjct: 242 SLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTFADKRNL 301

Query: 376 TEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK 435
            EK AE++  TS ASS Q RLQS KE D+++ V DDCSS NQADK+GPDDS SDG D  K
Sbjct: 302 VEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADDCSSRNQADKLGPDDSSSDGGDVPK 361

Query: 436 ARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTK 495
            RPMSPGTLALMCDEQDT+FMAA+SPN L GH CNTSSQLPYGQG+TE YAEQERI+LTK
Sbjct: 362 GRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNTSSQLPYGQGITEAYAEQERIILTK 421

Query: 496 FRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSKAY 555
           FRDCLNRLITFGEIKE K SSLARTE G Q +  SNG A++RT TGNQ   VSNGVSK  
Sbjct: 422 FRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSNGPANARTDTGNQQVPVSNGVSKPP 481

Query: 556 GPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE-GDVKLKTENKM 602
            P     PA KMSQ VTA     NN L   P+   E GD+ LKTE ++
Sbjct: 482 FP-----PATKMSQTVTAAAASPNNTLLSKPSTHTENGDINLKTEKEL 524


>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
          Length = 559

 Score =  744 bits (1921), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/532 (71%), Positives = 418/532 (78%), Gaps = 29/532 (5%)

Query: 23  PAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA 82
           PAKKLARQLDFTG        PEHPQ       S PP  P               LP Q 
Sbjct: 24  PAKKLARQLDFTGA-------PEHPQL------SQPPQLPVAV------------LPLQP 58

Query: 83  TQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNC 142
             P  RV KPESPKSRSR N E+K+ TPKKQKQCNCKHS+CLKLYCECFASGIYCDGCNC
Sbjct: 59  QAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCNCKHSKCLKLYCECFASGIYCDGCNC 118

Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
           VNC NNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD++EE GEVL+LGKHNKGCH
Sbjct: 119 VNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREEAGEVLILGKHNKGCH 178

Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAA 262
           CKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD  NNMAYIQQAANAA
Sbjct: 179 CKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNMAYIQQAANAA 238

Query: 263 ITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIP 321
           ITGAIGSSGY+SPPVSKKRKGQELFF  T KDPSI +LG  QQ NH+R  +PSS+LS + 
Sbjct: 239 ITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSISKLG--QQVNHVRGPAPSSSLSPVS 296

Query: 322 VARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE 381
            AR   TA +G SKF YRSLLADIIQPQ LKELCSVLV++S +AAK++  Q+   EKQAE
Sbjct: 297 GARV-GTATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAE 355

Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSP 441
           D+  TS+ASS+Q +L S KE  V++TV DDCSSANQ DK+ PD+S SDG+D  K RPMSP
Sbjct: 356 DQTETSLASSSQEQLPSQKEGRVEKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSP 415

Query: 442 GTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLN 501
           GTLALMCDEQDT+FM A+SP     H CNTSSQ PYGQ MTEVYAEQERIVLTKFRD LN
Sbjct: 416 GTLALMCDEQDTMFMTAASPIAPMAHACNTSSQFPYGQEMTEVYAEQERIVLTKFRDFLN 475

Query: 502 RLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
           R+IT GEI E K SSLAR E+ SQKD + NG  ++ +    Q  T SNGV+K
Sbjct: 476 RVITMGEINETKCSSLARNELESQKDPIINGIGNTSSEIVLQHGTTSNGVTK 527


>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
          Length = 559

 Score =  743 bits (1918), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/608 (64%), Positives = 446/608 (73%), Gaps = 59/608 (9%)

Query: 1   MGEGEGGEFPPKKSQSES--------GGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAV 52
           MGEGEGG    +K+ S S          ++PAKKLARQLDFTG        PEHP     
Sbjct: 1   MGEGEGGGDCVQKNASLSEVVAPSAASLEVPAKKLARQLDFTGA-------PEHP----- 48

Query: 53  TANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK 112
                           P +P    PLP QA  PP RV KPESPKSRSR N E+K+ TPKK
Sbjct: 49  ----------------PQLPVAVLPLPLQA--PPARVGKPESPKSRSRPNFEMKDATPKK 90

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           QKQCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNVENEAARREAVEATLERNPNAFRP
Sbjct: 91  QKQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRP 150

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KIASSPHGTRD++E+ GEVL+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCK
Sbjct: 151 KIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 210

Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
           NFEGSEERQALFHGD  NNMAYIQQAANAAITGAIGSSGY+SPPVSKKRKGQELFF  T 
Sbjct: 211 NFEGSEERQALFHGDQ-NNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTT 269

Query: 293 KDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDL 351
           KDPSI +LG  QQ NH+R  +PSS+LS +  AR   TA +G SKF YRSLLADIIQPQ L
Sbjct: 270 KDPSISKLG--QQVNHVRGPAPSSSLSPVSGARV-GTATLGPSKFMYRSLLADIIQPQHL 326

Query: 352 KELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDD 411
           KELCSVLV++S +AAK++  Q+   EKQAED+  TS+ASS+Q +L S KE  +++TV DD
Sbjct: 327 KELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETSLASSSQEQLPSQKEGHIEKTVADD 386

Query: 412 CSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT 471
           CSSANQ DK+ PD+S SDG+D  K RPMSPGTLALMCDEQDT+FM  +SP     H CNT
Sbjct: 387 CSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALMCDEQDTMFMTTASPIAPMAHACNT 446

Query: 472 SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSN 531
           SSQ PYGQGMTEVY EQERIVLTKFRD LNR+IT GEI E K SSL R E+ SQKD + N
Sbjct: 447 SSQFPYGQGMTEVYPEQERIVLTKFRDFLNRVITMGEINETKCSSLTRNELESQKDPIIN 506

Query: 532 GTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGE 591
           G  ++ +    Q    SNGV+K   P            +VT +  +   ++  N    GE
Sbjct: 507 GIGNASSEIVLQQGATSNGVTKPGSP------------LVTTSTSMIPGNIAEN----GE 550

Query: 592 GDVKLKTE 599
             VK++ E
Sbjct: 551 TKVKIEKE 558


>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
 gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
          Length = 519

 Score =  739 bits (1908), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/515 (74%), Positives = 414/515 (80%), Gaps = 12/515 (2%)

Query: 1   MGEGEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPE-HPQSQAVT---ANS 56
           MGE EG EFPPKK Q     D PAKKLARQLDFT G     +LPE  P    VT   A  
Sbjct: 1   MGESEGSEFPPKKLQQSETADFPAKKLARQLDFTQG-----VLPELQPSPVVVTPASAQC 55

Query: 57  HPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC 116
            P PQP P+P    V  P  P      QP  R  KPESPKSR R N E+K+ TPKKQKQC
Sbjct: 56  QPQPQPHPQPQPQVVSIPVAPQQQTQQQPAARAVKPESPKSRPRLNAELKDGTPKKQKQC 115

Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
           NCKHSRCLKLYCECFASG YCDGCNCVNC+NNVENEAARREAVEATLERNPNAFRPKIAS
Sbjct: 116 NCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIAS 175

Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
           SPHGTRD++EETGE L+  KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEG
Sbjct: 176 SPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEG 235

Query: 237 SEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS 296
           SEERQALFHGDH NNMAYIQQAANAAITGAIGSSGYAS PVS+KRKGQELFFG T KDPS
Sbjct: 236 SEERQALFHGDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPS 295

Query: 297 IHRLGHFQQGNHIR-TSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELC 355
            HRLGHFQQ +HIR  +PSS L S PVARA +T  +GSSK TYRSLLADIIQ QDLKELC
Sbjct: 296 FHRLGHFQQASHIRPAAPSSPLPSNPVARAGNTTTLGSSKITYRSLLADIIQSQDLKELC 355

Query: 356 SVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSA 415
           SVLVVLS EAAK+++ QRN+ +K+ ED+  T +ASS Q RLQS KE D D+ V++DCSSA
Sbjct: 356 SVLVVLSGEAAKTLSDQRNSMKKRVEDQRDTFLASSTQERLQSQKEIDADKIVSNDCSSA 415

Query: 416 NQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQL 475
           N     G DDS  DG D  K RPMSPGT+ALMCDEQDTIFMAA+SPN L GHGC++SSQL
Sbjct: 416 NHEG--GLDDSSLDGVDMPKGRPMSPGTMALMCDEQDTIFMAAASPNGLMGHGCDSSSQL 473

Query: 476 PYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIK 510
           P GQGMTE +AEQERIVL KFRDCLNRLITFGEIK
Sbjct: 474 PCGQGMTEAHAEQERIVLMKFRDCLNRLITFGEIK 508


>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
 gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
          Length = 545

 Score =  726 bits (1874), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/546 (69%), Positives = 416/546 (76%), Gaps = 49/546 (8%)

Query: 1   MGEGEGGEFPPKKSQSES-GGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP 59
           MGE EG EFPPKK Q +S       KKLARQLDFT G     +LP+HPQSQ       P 
Sbjct: 1   MGESEGSEFPPKKQQQQSENAYFQTKKLARQLDFTQG-----VLPDHPQSQ-------PS 48

Query: 60  PQPQPKPVVPPVPPPPPPLPGQATQ--------------------------------PPV 87
           P  +P  +V P      P P   +Q                                PP+
Sbjct: 49  PLQKPPLLVSPAAAQSQPQPHPQSQLQLQPQVAEIQVVPQQQTQQQQQPARAVLRLLPPM 108

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHN 147
              KPESPKS    N E+K+ TPKKQ+QCNCKHSRCLKLYCECFASG YCDGCNCVNC+N
Sbjct: 109 -FRKPESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYN 167

Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           NVENEAARREAVEATLERNPNAFRPKIASSPHGTRD++EETG+ L+  KHNKGCHCKKSG
Sbjct: 168 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSG 227

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAI 267
           CLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGDH NNMAYIQQAANAAITGAI
Sbjct: 228 CLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAI 287

Query: 268 GSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ--GNHIR-TSPSSTLSSIPVAR 324
           GSSGYASPPVS+KRKGQELFFG T KD S  RLGHFQQ  G+H R  +PSS+L S P+AR
Sbjct: 288 GSSGYASPPVSRKRKGQELFFGHTVKDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIAR 347

Query: 325 ASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRA 384
           A +   +G SK TYRSLLADIIQPQDLKELCSVLVVLS EAAK+ + QRN+ EK+ ED+ 
Sbjct: 348 AGNAITLGPSKITYRSLLADIIQPQDLKELCSVLVVLSGEAAKTFSDQRNSMEKRVEDQR 407

Query: 385 GTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTL 444
            T +ASS Q RLQSHKE D D+ V +DCSSAN ADKVGPDDS SDG+D  K RPMSPGTL
Sbjct: 408 ETLLASSTQERLQSHKESDADKIVANDCSSANHADKVGPDDSSSDGADMPKGRPMSPGTL 467

Query: 445 ALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLI 504
            LMCDEQDT+ MAA+SP+ L GHGCNTSSQLP GQGM E +AEQERIVLTKFRDCLNRLI
Sbjct: 468 ELMCDEQDTMLMAAASPSGLMGHGCNTSSQLPCGQGMAEAHAEQERIVLTKFRDCLNRLI 527

Query: 505 TFGEIK 510
           TFGEIK
Sbjct: 528 TFGEIK 533


>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
          Length = 561

 Score =  709 bits (1829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/567 (66%), Positives = 422/567 (74%), Gaps = 42/567 (7%)

Query: 2   GEGEGGEFPPKKSQSESG-------GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTA 54
           GE EG +  PK   S           D+PAKKLARQLDFT  F G   +P          
Sbjct: 4   GEAEGSDGAPKNIVSNDVVANALLPFDVPAKKLARQLDFTD-FGG---MPA--------- 50

Query: 55  NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
               PPQP   PV+ P+PP             VRV KPESPKSR R   E+KE TPKKQK
Sbjct: 51  ----PPQP---PVILPLPP----------SASVRVGKPESPKSRPRPGFEIKEATPKKQK 93

Query: 115 QCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           QCNCKHS+CLKLYCECFASGIYCDGCNCVNC+NNVENEAARREAVEATLERNPNAFRPKI
Sbjct: 94  QCNCKHSKCLKLYCECFASGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKI 153

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           ASSPHGTRD++E+ GEVL+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNF
Sbjct: 154 ASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNF 213

Query: 235 EGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
           EGSEERQALFHGD  NNM YIQQAANAAITGAIGSSGY+SPP+SKKRKGQEL FG T KD
Sbjct: 214 EGSEERQALFHGDQNNNMVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKD 273

Query: 295 PSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
           PS+ R G  QQ N++R  +PSS+LS IP AR    A +G SK  YRSLLADIIQPQ LKE
Sbjct: 274 PSVGRQG--QQANNVRVPAPSSSLSPIPGARVGP-ATLGPSKLMYRSLLADIIQPQHLKE 330

Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT-DDC 412
           LCSVLV++S +AAK     +N  +K AED+  TS ASS Q +L S KE +V++ +  DDC
Sbjct: 331 LCSVLVLVSGQAAKMFTDHKNFMDKHAEDQTETSRASSTQEQLPSQKEANVEKAMADDDC 390

Query: 413 SSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
           SSANQ D + PD+S SDG+D  K RPMSPGTLALMCDEQDT+FM A+S      H CNTS
Sbjct: 391 SSANQTDNISPDNSCSDGADVPKGRPMSPGTLALMCDEQDTMFMTAASTVGSRAHACNTS 450

Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
            Q PYGQ MTEVYAEQERIVLTKFRD LNR+IT GEI E K SSLAR+E+ SQKD   N 
Sbjct: 451 LQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQKDPNINC 510

Query: 533 TASSRTGTGNQPATVSNGVSKAYGPSA 559
           T ++ T T +Q    +NGV+K  G S+
Sbjct: 511 TGNASTETQHQQGATNNGVAKTAGNSS 537


>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
 gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
 gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
 gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
          Length = 629

 Score =  659 bits (1701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/647 (56%), Positives = 431/647 (66%), Gaps = 65/647 (10%)

Query: 1   MGEGEGGEFPPKK---------------------SQSESGGDIPAKKLARQLDFTGGFVG 39
           M EGEGG+ PPK                      + + +  D+P+KKLARQLDF    + 
Sbjct: 1   MAEGEGGDCPPKNVVHPEFVTVTAAAATAAAAANATATAWLDVPSKKLARQLDFNAMLME 60

Query: 40  SVILPEHPQSQAVTANSHPPPQPQPKPV-VPPVPPPPPPLPGQATQPPVRVPKPESPKSR 98
                  PQ Q VT  S        KPV V  +P P P          VRV K ESPK R
Sbjct: 61  Q----SKPQQQVVTQGSVMVQ----KPVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPR 112

Query: 99  SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
           SR N EVKE TPKKQ+QCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNV+NEAARREA
Sbjct: 113 SRPNFEVKEGTPKKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREA 172

Query: 159 VEATLERNPNAFRPKIASSPHGTRDNK---------EETGEVLMLGKHNKGCHCKKSGCL 209
           VEATLERNPNAFRPKIASSP G RD++         EE GE L+L KH+KGCHCKKSGCL
Sbjct: 173 VEATLERNPNAFRPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCL 232

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGS 269
           KKYCECFQAN+LCSENCRCMDCKNFEGSEERQALF GD  NN+ Y+QQAANAAITGAIGS
Sbjct: 233 KKYCECFQANVLCSENCRCMDCKNFEGSEERQALFRGDQNNNV-YLQQAANAAITGAIGS 291

Query: 270 SGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASS-- 327
            G++SPP SKKRKGQELF   TAKDPSI + G   Q        S T + + + +  +  
Sbjct: 292 YGFSSPPASKKRKGQELFLWPTAKDPSISKPG---QQVKFAYCTSRTGTGVNLVKGPAPS 348

Query: 328 --------------TAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQR 373
                         T     SK  YRSLL+D++QP  LKELCSVLV++S +AAK++A Q+
Sbjct: 349 SSASPVSSARGTNPTLGQSPSKLKYRSLLSDVVQPHHLKELCSVLVLVSGQAAKTLADQK 408

Query: 374 NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDG 433
            T EK+ ED+  TS+ASS Q +L S KE DV++ + DDCSSANQ DK+ P +S SDG+D 
Sbjct: 409 KTVEKRTEDQTETSLASSTQEQLLSQKEVDVEKAMDDDCSSANQTDKISPGNSCSDGADV 468

Query: 434 QKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVL 493
            K RPMSPGTLALMCDEQD++FM A+SP   T H CNTSSQ P GQG+TEVYAEQERIVL
Sbjct: 469 PK-RPMSPGTLALMCDEQDSMFMTAASPIGQTTHACNTSSQFPDGQGVTEVYAEQERIVL 527

Query: 494 TKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
           T+FRD LNR+IT GEI E K SSLAR+E+ ++KD ++N T ++ T T +Q    SNG +K
Sbjct: 528 TQFRDFLNRVITMGEINETKCSSLARSELENKKDLINNETGNASTETVHQQEATSNGDAK 587

Query: 554 AYGPSALAAPAAKMSQMVTATVPISNNDLRHNPALPGEGDVKLKTEN 600
           A  P   A     +  M T + P+  +D      +   G+ KLK EN
Sbjct: 588 AAIPPMAATSTPAVPPMATTSTPVVPSD-----TVAENGESKLKMEN 629


>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
          Length = 551

 Score =  654 bits (1687), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/576 (63%), Positives = 417/576 (72%), Gaps = 43/576 (7%)

Query: 2   GEGEGGEFPPKKSQSESG-------GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTA 54
           GE EG +  PK   +           D+PAKKLARQLDFTG F G           + T 
Sbjct: 4   GEAEGSDGAPKNVVANDVVANALLPFDVPAKKLARQLDFTG-FGG----------MSTTP 52

Query: 55  NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
              PP     +P                    VRV KPESPKSR R   E+KE TPKKQK
Sbjct: 53  QPQPPVVLPLRPSTS-----------------VRVGKPESPKSRPRPGFEIKEATPKKQK 95

Query: 115 QCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           QCNCKHS+CLKLYCECFASGIYCDGCNCVNC NNVENEAARREAVEATLERNPNAFRPKI
Sbjct: 96  QCNCKHSKCLKLYCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKI 155

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           ASSPH TRD +E+ GE+L+LGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNF
Sbjct: 156 ASSPHRTRDIREDAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNF 215

Query: 235 EGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
           EGSEERQALFHGD  NN+ YIQQAANAAITGAIGSSGY+SPP+SKKRKGQEL FG T KD
Sbjct: 216 EGSEERQALFHGDQNNNLVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKD 275

Query: 295 PSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
           PS+ R G  QQ N++R  +PSS+LS IP AR    A +G SK  YRSLLADIIQPQ LKE
Sbjct: 276 PSVGRQG--QQANNVRVPAPSSSLSPIPGARVGP-ATLGPSKLMYRSLLADIIQPQHLKE 332

Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT-DDC 412
           LCSVLV++S +AAK+   Q+N  +K AED+  TS ASS Q +L S KE +VD+ +  DDC
Sbjct: 333 LCSVLVLVSGQAAKTFTDQKNLMDKHAEDQTETSRASSTQEQLPSQKEANVDKAMADDDC 392

Query: 413 SSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
           SSANQ D +  D+S SDG+D  K RPMSPGTLALMCDEQDT+FM A+SP     H CNTS
Sbjct: 393 SSANQTDNISLDNSCSDGADAPKGRPMSPGTLALMCDEQDTMFMTAASPVGSRAHACNTS 452

Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
            Q PYGQ MTEVYAEQERIVLTKFRD LNR+IT GEI E K SSLAR+E+ SQK    N 
Sbjct: 453 LQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQKGPNMNC 512

Query: 533 TASSRTGTGNQPATVS---NGVSKAYGPSALAAPAA 565
           + +  T T +Q A  +   NGV+KA   +++  P +
Sbjct: 513 SGNDSTETAHQQAATNNNINGVAKAAAGNSILFPGS 548


>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
          Length = 602

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 345/558 (61%), Positives = 400/558 (71%), Gaps = 31/558 (5%)

Query: 21  DIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPP---- 76
           D+PAKKLARQLDFTG   G  +LPEHP  Q     S    Q    P V  V         
Sbjct: 31  DVPAKKLARQLDFTG--TGGAVLPEHPHFQ-----SRSRCQESESPAVMVVQSQSQPQSP 83

Query: 77  ------PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC---NCKHSRCLKLY 127
                 P+  +A  PP R  KP+SPKSRSRSNVE K+ TPKKQKQC   NCKHSRCLKLY
Sbjct: 84  QQLVVLPIGTKAPLPPAR--KPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLY 141

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           CECFASG+YCDGCNC NCHNNVE+EA+RREAVE TLERNPNAFRPKIA+SPHGTR++++E
Sbjct: 142 CECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRDE 201

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
            GE++MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD
Sbjct: 202 IGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGD 261

Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
           HANN+AYIQQAANAAITGAIGSSGYA  PVSKKRKG EL FG   KD  ++ +    Q N
Sbjct: 262 HANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQAN 321

Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
           +I +S  ST S  P A   S  A G SKF++RSLLAD+IQP DLKELCSVLVVL+SE AK
Sbjct: 322 NIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381

Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
             A QR + EKQ  D   TS ASS+   +Q  K    ++    +C S+NQ+D+   D S 
Sbjct: 382 IQAEQR-SDEKQINDPQQTSRASSSGDGVQHQKAE--EKAADGECGSSNQSDRSVHDSSN 438

Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAE 487
           S+ SD  +ARPMSPGTLALMCDEQDT+FM A   +    H CNTSS +P  + ++EVY E
Sbjct: 439 SNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADGSAAHDCNTSSHVP-DRSLSEVYIE 497

Query: 488 QERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATV 547
           QERIVLTKFRDCLN+LIT GEIKE K +S + +EVG+Q     N + +  + +G Q  ++
Sbjct: 498 QERIVLTKFRDCLNKLITLGEIKETKLTSRSESEVGNQ-----NQSNNFTSNSGCQQRSI 552

Query: 548 SNGVSKAYGPSALAAPAA 565
           SNGV K    SA   P A
Sbjct: 553 SNGVVKNVALSAPRIPPA 570


>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
           [Cucumis sativus]
          Length = 556

 Score =  613 bits (1580), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 326/504 (64%), Positives = 371/504 (73%), Gaps = 26/504 (5%)

Query: 21  DIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPP---- 76
           D+PAKKLARQLDFTG   G  +LPEHP  Q     S    Q    P V  V         
Sbjct: 31  DVPAKKLARQLDFTG--TGGAVLPEHPHFQ-----SRSRCQESESPAVMVVQSQSQPQSP 83

Query: 77  ------PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQC---NCKHSRCLKLY 127
                 P+  +A  PP R  KP+SPKSRSRSNVE K+ TPKKQKQC   NCKHSRCLKLY
Sbjct: 84  QQLVVLPIGTKAPLPPAR--KPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLY 141

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           CECFASG+YCDGCNC NCHNNVE+EA+RREAVE TLERNPNAFRPK A+SPHGTR++++E
Sbjct: 142 CECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRDE 201

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
            GE++MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFHGD
Sbjct: 202 IGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGD 261

Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
           HANN+AYIQQAANAAITGAIGSSGYA  PVSKKRKG EL FG   KD  ++ +    Q N
Sbjct: 262 HANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQAN 321

Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
           +I +S  ST S  P A   S  A G SKF++RSLLAD+IQP DLKELCSVLVVL+SE AK
Sbjct: 322 NIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381

Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
             A QR + EKQ  D   TS ASS+   +Q  K    ++    +C S+NQ+D+   D S 
Sbjct: 382 IQAEQR-SDEKQINDPQQTSRASSSGDGVQHQKAE--EKAADGECGSSNQSDRSVHDSSN 438

Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAE 487
           S+ SD  +ARPMSPGTLALMCDEQDT+FM A   +    H CNTSS +P  + ++EVY E
Sbjct: 439 SNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADGSAAHDCNTSSHVP-DRSLSEVYIE 497

Query: 488 QERIVLTKFRDCLNRLITFGEIKE 511
           QERIVLTKFRDCLN+LIT GEIKE
Sbjct: 498 QERIVLTKFRDCLNKLITLGEIKE 521


>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
 gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
 gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
 gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
 gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
 gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  608 bits (1569), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 339/579 (58%), Positives = 397/579 (68%), Gaps = 46/579 (7%)

Query: 1   MGE-GEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP 59
           MGE G GGEFPPKK   E G   P KK ARQLDFTGG     +    P +  V A S  P
Sbjct: 1   MGEDGGGGEFPPKKDGVEEG--FPTKKPARQLDFTGGSDEHSL--SKPAAPTVVATSVKP 56

Query: 60  PQPQPKP----------------VVPPVPPPP----PPLPGQATQPPVRVPKPESPKSRS 99
                 P                V P +P       PP   Q    P+R P PESPK+R 
Sbjct: 57  IISSSVPSTIRPGMTIAIGQVTQVRPTLPMATTMSNPPSQSQIVNAPIRHPIPESPKARG 116

Query: 100 -RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
            R NVE ++ TP+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNV+NE ARREA
Sbjct: 117 PRPNVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREA 176

Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
           VEATLERNP AFRPKIASSPHG RD +E+ GEV++LGKHNKGCHCKKSGCLKKYCECFQA
Sbjct: 177 VEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 236

Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVS 278
           NILCSENC+C+DCKNFEGSEERQALFHG+H+N+MAY+QQAANAAITGA+GSSG+A  P  
Sbjct: 237 NILCSENCKCLDCKNFEGSEERQALFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAP 296

Query: 279 KKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFT 337
           K+RKGQE+ F    KD S  RL HF Q N+ RT  P+S  S  PV+RA   A+   SKF 
Sbjct: 297 KRRKGQEILFNQAIKDSS--RLSHFPQVNNGRTGGPTSGTSPSPVSRAGGNASSVPSKFV 354

Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQ 397
           YRSLLADIIQP D++ LCSVLV ++ EAAK+   +RN  E + +D+  TS+ASSAQ + Q
Sbjct: 355 YRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIENRVDDQTETSLASSAQDQPQ 414

Query: 398 -SHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM 456
            ++   DV+   TD     NQADK GP++S SDG D  K  P+SP TLALMCDEQDTIFM
Sbjct: 415 GNNNAADVEMVATDH----NQADKSGPEESNSDGVDASKVTPLSPATLALMCDEQDTIFM 470

Query: 457 -AASSPN-RLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKY 514
            AA SPN  +  +GC      P  QG +E+YAEQER+VLTKFRDCLNRLI++ EIKE K 
Sbjct: 471 VAAPSPNGSVDPNGCR-----PNSQGQSEIYAEQERLVLTKFRDCLNRLISYAEIKESKC 525

Query: 515 SSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
            SLAR  +  Q  A++  T  +  G   Q   + NG S+
Sbjct: 526 LSLARMHI--QPPAIA--TVKTENGIQQQ-VPIVNGASR 559


>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 601

 Score =  595 bits (1534), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 334/583 (57%), Positives = 395/583 (67%), Gaps = 38/583 (6%)

Query: 1   MGE-GEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQ-------AV 52
           MGE G GGEFPPKK   E G   P KK ARQLDFTGG     +      +         V
Sbjct: 1   MGEDGGGGEFPPKKDGVEEG--FPTKKPARQLDFTGGSDEQSLSKAAAPTVVATAVKPVV 58

Query: 53  TANSHPPPQP-------QPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS-RSNVE 104
           T++     +P       Q +P +P      PP   Q    P+R PKPESPK+R  R  VE
Sbjct: 59  TSSIPSTIRPGVTIAIGQVRPTLPMATTSNPPSQSQILNAPIRHPKPESPKARGPRPIVE 118

Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLE 164
            ++ TP+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNV+NE ARREAVEATLE
Sbjct: 119 GRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLE 178

Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
           RNP AFRPKIASSPHG RD +E+ GEV++LGKHNKGCHCKKSGCLKKYCECFQANILCSE
Sbjct: 179 RNPFAFRPKIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSE 238

Query: 225 NCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQ 284
           NC+C+DCKNFEGSEERQALFHG+HAN+MAY+QQAANAAITGA+GSSG+A  P  K+RKGQ
Sbjct: 239 NCKCLDCKNFEGSEERQALFHGEHANHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQ 298

Query: 285 ELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFTYRSLLA 343
           E+ F    KD S  RLG F Q N  R   P+S  S  PV+RA   A+   SKF YRSLLA
Sbjct: 299 EILFNQATKDSS--RLGQFPQVNSGRASGPTSGSSPSPVSRAGGNASSAPSKFVYRSLLA 356

Query: 344 DIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKE-P 402
           DIIQP D++ LCSVLV ++ EAAK+   +RN  E + ED+  TS+ASSAQ + Q   +  
Sbjct: 357 DIIQPHDVRALCSVLVAVAGEAAKTSTDKRNEIENRVEDQTETSLASSAQDQPQGDNDAA 416

Query: 403 DVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM-AASSP 461
           D++   TD     NQADK G ++S SDG+D  K  P+SP TLALMCDEQDTIFM AA SP
Sbjct: 417 DMEMVATD----GNQADKSGAEESNSDGADASKGNPLSPATLALMCDEQDTIFMVAAPSP 472

Query: 462 NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTE 521
           N     G   ++     QG +E+YAEQER+VLTKFRDCL+RLI++ EIKE K  SLAR  
Sbjct: 473 NGAVDPGGRRTN----SQGQSEIYAEQERLVLTKFRDCLSRLISYAEIKESKCLSLARMH 528

Query: 522 VGSQKDALSNGTASSRTGTG-NQPATVSNGVSKAYGPSALAAP 563
           +        + TA+ +T  G  Q   + NG S+      L  P
Sbjct: 529 I------QPSATATVKTENGVQQQVPIVNGASRTNSQPTLNKP 565


>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
 gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
 gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
 gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 571

 Score =  553 bits (1425), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 316/574 (55%), Positives = 383/574 (66%), Gaps = 69/574 (12%)

Query: 1   MGEGE-GGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHP-QSQAVTANSHP 58
           MGEGE G +FPPK  +        + K ARQLDFTGG   S +   H  Q+ ++ A S P
Sbjct: 1   MGEGEEGDKFPPKTDEVTQ----ESMKSARQLDFTGG--SSDVEHSHSNQASSMAAASIP 54

Query: 59  PP---------QPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
            P           Q  P V    PPPP   G     P+R P+PESP S  R   E ++ T
Sbjct: 55  SPIVTVTRPIITSQAPPTVATPIPPPPQSQGIILHVPIRHPRPESPNSMPRPAGETRDGT 114

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
           P+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNA
Sbjct: 115 PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNA 174

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           FRPKIA+SPHG RDN+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC+C+
Sbjct: 175 FRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCL 234

Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
           DCKNFEGSE RQ+LFHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF 
Sbjct: 235 DCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFN 293

Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
              KD S HRLG   Q N+ RT+ S T S     RA   A++G SK  Y+SLLA+II+P 
Sbjct: 294 QGTKDSSTHRLG---QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPM 345

Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
           D+K LCSVLV ++ EAAK++      TEK+  ++  TSVASS Q                
Sbjct: 346 DVKALCSVLVAVAGEAAKTL------TEKRLANQKETSVASSVQ---------------- 383

Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
           D     N+A+K G +DS +DGS G   R +SP TLALMCDE+DT+ M A+SPN      C
Sbjct: 384 DQGHVNNKAEKSGLEDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------C 434

Query: 470 NT--SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKD 527
           +   +SQLP GQ   +VYAEQE++VLTKFRDCLNR+I+ GE+KE   S        S+ D
Sbjct: 435 SVEPTSQLPNGQD--QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMD 485

Query: 528 ALSNGTASSRTGTGNQPATVSNGVSK-AYGPSAL 560
             +    + R     Q A V+NGVS+ A  PS L
Sbjct: 486 LDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQL 519


>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 578

 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 314/574 (54%), Positives = 381/574 (66%), Gaps = 62/574 (10%)

Query: 1   MGEGE-GGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHP-QSQAVTANSHP 58
           MGEGE G +FPPK  +        + K ARQLDFTGG   S +   H  Q+ ++ A S P
Sbjct: 1   MGEGEEGDKFPPKTDEVTQ----ESMKSARQLDFTGG--SSDVEHSHSNQASSMAAASIP 54

Query: 59  PP---------QPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
            P           Q  P V    PPPP   G     P+R P+PESP S  R   E ++ T
Sbjct: 55  SPIVTVTRPIITSQAPPTVATPIPPPPQSQGIILHVPIRHPRPESPNSMPRPAGETRDGT 114

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
           P+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNA
Sbjct: 115 PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNA 174

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           FRPKIA+SPHG RDN+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC+C+
Sbjct: 175 FRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCL 234

Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
           DCKNFEGSE RQ+LFHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF 
Sbjct: 235 DCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFN 293

Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
              KD S HRLG   Q N+ RT+ S T S     RA   A++G SK  Y+SLLA+II+P 
Sbjct: 294 QGTKDSSTHRLG---QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPM 345

Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
           D+K LCSVLV ++ EAAK++      TEK+  ++  TSVASS Q                
Sbjct: 346 DVKALCSVLVAVAGEAAKTL------TEKRLANQKETSVASSVQ---------------- 383

Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
           D     N+A+K G +DS +DGS G   R +SP TLALMCDE+DT+ M A+SPN      C
Sbjct: 384 DQGHVNNKAEKSGLEDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------C 434

Query: 470 NT--SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKD 527
           +   +SQLP GQ   +VYAEQE++VLTKFRDCLNR+I+ GE+K             S+ D
Sbjct: 435 SVEPTSQLPNGQD--QVYAEQEKVVLTKFRDCLNRIISCGEVKVFCLFVAESNCSMSRMD 492

Query: 528 ALSNGTASSRTGTGNQPATVSNGVSK-AYGPSAL 560
             +    + R     Q A V+NGVS+ A  PS L
Sbjct: 493 LDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQL 526


>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
          Length = 536

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 290/500 (58%), Positives = 350/500 (70%), Gaps = 53/500 (10%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
           P  V  P+PPPP    G     P+R P+PESP S  R   E ++ TP+K+KQCNCKHSRC
Sbjct: 35  PPTVATPIPPPPQS-QGIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94  LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213

Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
           FHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF    KD S HRLG  
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270

Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
            Q N+ RT+ S T S     RA   A++G SK  Y+SLLA+II+P D+K LCSVLV ++ 
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAG 324

Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
           EAAK++      TEK+  ++  TSVASS Q   Q H                N+A+K G 
Sbjct: 325 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 362

Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
           +DS +DGS G   R +SP TLALMCDE+DT+ M A+SPN      C+   +SQLP GQ  
Sbjct: 363 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 412

Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
            +VYAEQE++VLTKFRDCLNR+I+ GE+KE   S        S+ D  +    + R    
Sbjct: 413 -QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMDLDTPVQTTVRIDPV 464

Query: 542 NQPATVSNGVSK-AYGPSAL 560
            Q A V+NGVS+ A  PS L
Sbjct: 465 VQQAPVANGVSQTAKQPSQL 484


>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
          Length = 543

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 288/500 (57%), Positives = 348/500 (69%), Gaps = 46/500 (9%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
           P  V  P+PPPP    G     P+R P+PESP S  R   E ++ TP+K+KQCNCKHSRC
Sbjct: 35  PPTVATPIPPPPQS-QGIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94  LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213

Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
           FHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF    KD S HRLG  
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270

Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
            Q N+ RT+ S T S     RA   A++G SK  Y+SLLA+II+P D+K LCSVLV ++ 
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAG 324

Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
           EAAK++      TEK+  ++  TSVASS Q   Q H                N+A+K G 
Sbjct: 325 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 362

Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
           +DS +DGS G   R +SP TLALMCDE+DT+ M A+SPN      C+   +SQLP GQ  
Sbjct: 363 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 412

Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
            +VYAEQE++VLTKFRDCLNR+I+ GE+K             S+ D  +    + R    
Sbjct: 413 -QVYAEQEKVVLTKFRDCLNRIISCGEVKVFCLFVAESNCSMSRMDLDTPVQTTVRIDPV 471

Query: 542 NQPATVSNGVSK-AYGPSAL 560
            Q A V+NGVS+ A  PS L
Sbjct: 472 VQQAPVANGVSQTAKQPSQL 491


>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 568

 Score =  527 bits (1357), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 297/520 (57%), Positives = 357/520 (68%), Gaps = 48/520 (9%)

Query: 1   MGEGEGGE-FPPKKSQSESGGDIPAKKLARQLDFTG-GFVGSVILPEHPQSQAVTANSHP 58
           MGEGE GE FPPK  +      + + K ARQLDFTG  +  S   P    S A  +   P
Sbjct: 1   MGEGEEGEKFPPKTDEV----TLESMKSARQLDFTGVEYSQSNQPPVASSSTAAASIPSP 56

Query: 59  PPQP---QPKPVVPPV--PPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ 113
             +P   Q +PV   V  P PPPP        P+R P      S  R   E ++ TP+K+
Sbjct: 57  VTRPITSQARPVTATVGTPIPPPPQSQGILHVPIRHPVDTCFFSFVRPVGEARDGTPQKK 116

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           KQCNCKHSRCLKLYCECFASG YCDGCNCVNCHNNVENE  RREA+ +TLERNPNAFRPK
Sbjct: 117 KQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPK 176

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           IA+SPHG RDN+EE GEV++LG+HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKN
Sbjct: 177 IAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKN 236

Query: 234 FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK 293
           FEGSE RQ+LFHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF    K
Sbjct: 237 FEGSEVRQSLFHGEHSHNVAYLQH-ANAAITGAIGSSGFASAPAPKRRKGQEIFFNQGPK 295

Query: 294 DPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
           D S HRLG   Q N  RT+   T S      A   A++  SK  YRSLLADII+P+D+K 
Sbjct: 296 DSSTHRLG---QANSGRTTSPKTGS-----HAGGNASLRPSKVVYRSLLADIIKPKDVKA 347

Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
           LCSVLV ++ EAAK++      TE++  D+  TSVASS Q +  S+ +     T  +  +
Sbjct: 348 LCSVLVSVAGEAAKTL------TEERLADQKETSVASSVQDQGHSNNK----GTEGEKAA 397

Query: 414 SANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS- 472
           S NQA+K GP        DG K R +SP TLALMCDE+DT+ M A+SPN      C+   
Sbjct: 398 SGNQAEKSGP--------DGSKGRSLSPETLALMCDERDTMLMVAASPN------CSVEP 443

Query: 473 -SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKE 511
            SQLP GQ   +VYAEQE++VLTKFRDCLNR+I+ GE+KE
Sbjct: 444 PSQLPNGQ--DQVYAEQEKVVLTKFRDCLNRIISCGEVKE 481


>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
          Length = 676

 Score =  526 bits (1354), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 317/579 (54%), Positives = 384/579 (66%), Gaps = 59/579 (10%)

Query: 21  DIPAKKLARQLDFTGGFVGS--VILPEHPQ-------------------SQAVTANSHPP 59
           D PAK   RQLDFT     S  VILPEHPQ                   SQ+      P 
Sbjct: 89  DFPAK---RQLDFTAMCRASANVILPEHPQLPLQTQPQRSQSQSQWQLQSQSPPQLLPPQ 145

Query: 60  PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCK 119
           P+PQ  PVVP +  P        ++P ++  K ESP+S  RSN EVK+ TPKK+K CNCK
Sbjct: 146 PRPQLVPVVPRITNP--------SRPTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCK 197

Query: 120 HSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
           +SRCLKLYCECFASG+YC+ C+C NC NNV NEAAR+EAV +TL+RNPNAFRPKIA+SPH
Sbjct: 198 NSRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPH 257

Query: 180 GTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           G+ D  EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEE
Sbjct: 258 GSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 317

Query: 240 RQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHR 299
           R+ALFH D+ N MAYIQQAANAAI+GAIGSSGY +   SKKRK Q++ F +TA + S HR
Sbjct: 318 RRALFH-DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHR 375

Query: 300 LGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLV 359
              FQQ NH+R S  S   SI V   ++ A +GSSKFTYRS LADI+Q QD+K+LCS+LV
Sbjct: 376 NTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLV 435

Query: 360 VLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQAD 419
           V+S EA K++      TE +  D   TS+ SS Q R +  +  DV + + D C S NQ +
Sbjct: 436 VVSQEAGKTL------TEAE-RDHIETSIISSTQER-EHLQTDDVQKPLPDACLSENQVE 487

Query: 420 KVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP--- 476
           +V  D+   D +D Q  R MSPGTLALMCDEQD+IFM   SP+ + G     SSQ+P   
Sbjct: 488 RVQTDE--LDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSPDGVVG-----SSQIPTVK 540

Query: 477 --YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTA 534
             +GQ  TEVY EQER+VLTKF D LNRLIT G IKE   S LA++  G+Q++ L +   
Sbjct: 541 SSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSHTGNQQEPLEDDAT 600

Query: 535 SSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTA 573
              TG GN      N + K   PS   A   +MSQ + A
Sbjct: 601 KVGTGKGNHKEPNGNSIVK---PS--FAATTQMSQTIAA 634


>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
          Length = 532

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 289/500 (57%), Positives = 348/500 (69%), Gaps = 57/500 (11%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRC 123
           P  V  P+PPPP    G     P+R P+PESP S  R   E ++ TP+K+KQCNCKHSRC
Sbjct: 35  PPTVATPIPPPPQSQ-GIILHVPIRHPRPESPNSMPRPAGETRDGTPQKKKQCNCKHSRC 93

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           LKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNAFRPKIA+SPHG RD
Sbjct: 94  LKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRD 153

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           N+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC C+DCKNFEGSE RQ+L
Sbjct: 154 NREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQSL 213

Query: 244 FHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHF 303
           FHG+H++N+AY+Q  ANAAITGAIGSSG+AS P  K+RKGQE+FF    KD S HRLG  
Sbjct: 214 FHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-- 270

Query: 304 QQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSS 363
            Q N+ RT+ S T S     RA   A++G SK    SLLA+II+P D+K LCSVLV ++ 
Sbjct: 271 -QANNGRTTSSQTGS-----RAGGNASLGPSK----SLLANIIKPMDVKALCSVLVAVAG 320

Query: 364 EAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGP 423
           EAAK++      TEK+  ++  TSVASS Q   Q H                N+A+K G 
Sbjct: 321 EAAKTL------TEKRLANQKETSVASSVQD--QGHVN--------------NKAEKSGL 358

Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNT--SSQLPYGQGM 481
           +DS +DGS G   R +SP TLALMCDE+DT+ M A+SPN      C+   +SQLP GQ  
Sbjct: 359 EDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------CSVEPTSQLPNGQD- 408

Query: 482 TEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTG 541
            +VYAEQE++VLTKFRDCLNR+I+ GE+KE   S        S+ D  +    + R    
Sbjct: 409 -QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMDLDTPVQTTVRIDPV 460

Query: 542 NQPATVSNGVSK-AYGPSAL 560
            Q A V+NGVS+ A  PS L
Sbjct: 461 VQQAPVANGVSQTAKQPSQL 480


>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
          Length = 579

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 315/585 (53%), Positives = 382/585 (65%), Gaps = 76/585 (12%)

Query: 15  QSESGGDIPAKKLARQLDFTGGFVGS--VILPEHPQ-------------------SQAVT 53
           QSE+  D PAK   RQLDFT     S  VILPEHPQ                   SQ+  
Sbjct: 3   QSEAS-DFPAK---RQLDFTAMCRASANVILPEHPQLPLQTQPQRSQSQSQWQLQSQSPP 58

Query: 54  ANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ 113
               P P+PQ  PVVP +  P        ++P ++  K ESP+S  RSN EVK+ TPKK+
Sbjct: 59  QLLPPQPRPQLVPVVPRITNP--------SRPTLQAVKQESPRSLPRSNSEVKDGTPKKK 110

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K CNCK+SRCLKLYCECFASG+YC+ C+C NC NNV NEAAR+EAV +TL+RNPNAFRPK
Sbjct: 111 KHCNCKNSRCLKLYCECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPK 170

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           IA+SPHG+ D  EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKN
Sbjct: 171 IANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 230

Query: 234 FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK 293
           FEGSEER+ALFH D+ N MAYIQQAANAAI+GAIGSSGY +   SKKRK Q++ F +TA 
Sbjct: 231 FEGSEERRALFH-DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTAN 288

Query: 294 DPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKE 353
           + S HR   FQQ NH+R S  S   SI V   ++ A +GSSKFTYRS LADI+Q QD+K+
Sbjct: 289 NQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKD 348

Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
           LCS+LVV+S EA K++      TE++                LQ+    DV + + D C 
Sbjct: 349 LCSLLVVVSQEAGKTL------TERE---------------HLQTD---DVQKPLPDACL 384

Query: 414 SANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSS 473
           S NQ ++V  D+   D +D Q  R MSPGTLALMCDEQD+IFM   SP+ + G     SS
Sbjct: 385 SENQVERVQTDE--LDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSPDGVVG-----SS 437

Query: 474 QLP-----YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDA 528
           Q+P     +GQ  TEVY EQER+VLTKF D LNRLIT G IKE   S LA++  G+Q++ 
Sbjct: 438 QIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSHTGNQQEP 497

Query: 529 LSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAAKMSQMVTA 573
           L +      TG GN      N + K   PS   A   +MSQ + A
Sbjct: 498 LEDDATKVGTGKGNHKEPNGNSIVK---PS--FAATTQMSQTIAA 537


>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
 gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
          Length = 629

 Score =  480 bits (1236), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 286/634 (45%), Positives = 375/634 (59%), Gaps = 75/634 (11%)

Query: 12  KKSQSESGGDIPAKKLARQLDFTGGFVGSV--ILPEHPQSQAVTANS------------- 56
           ++SQSE+  D   KKLARQLDFT     SV   L EH                       
Sbjct: 2   EQSQSEAISDFAPKKLARQLDFTAVCRASVNVALSEHSSRLQQQQPPPLPPPPPPPMSQP 61

Query: 57  ------------------HPPPQPQPKP--------VVPPVPPPPPPLPGQATQPPVRVP 90
                               PP+ QP+         +VPP     P    + +   +++ 
Sbjct: 62  QLHLKLQQQPQPVLQLKLETPPKQQPQATQARPQHVMVPPAMRRIPHPVQKLSLHTLQLI 121

Query: 91  KPESPKSRSRSNVEVKEN--TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNN 148
           K +SP SR R+N+E K+N  TPKK K CNCK+SRCLKLYCECFA+G++C GCNC NCHNN
Sbjct: 122 KQQSPSSRPRNNIEAKDNSGTPKKAKHCNCKNSRCLKLYCECFAAGVHCTGCNCTNCHNN 181

Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
           VENEA+R+EAV A LERNP AF+PKIASSPHG+RD KE+  EV ++GKHNKGCHCKKSGC
Sbjct: 182 VENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGKHNKGCHCKKSGC 241

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG 268
           LKKYCECFQANILCSENC+CMDCKNFEGSEER+ALFH +H N + Y+QQAANAAI GAIG
Sbjct: 242 LKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNH-NGIVYMQQAANAAICGAIG 300

Query: 269 SSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASST 328
           SSGY +P  S KRK +EL FGST +D    R+   QQ NH R +P+++ S + V    + 
Sbjct: 301 SSGYGTPLTSNKRKSEEL-FGSTNRDQLGRRVAKTQQENHQR-NPAASASPLSVPVPHNA 358

Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTT-----EKQAEDR 383
            A+GSSKFTY+S LA I+QPQD+K++CS+LVVLS EA K+ A + +        ++ E  
Sbjct: 359 TALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARKAYAGKMDMQPGGDDNRKLESS 418

Query: 384 AGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGT 443
           + +S+     G + +    +V +TV +D  + N A+K    DSG DG   +  RP+SP  
Sbjct: 419 SSSSIREREDGLMGN----NVHKTVPNDHVNGNTAEK-DSSDSGKDGGGLENGRPLSP-E 472

Query: 444 LALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRL 503
           + LMC EQ+ +F    S   + G   N   +    Q  +E YAEQE+++LT FRD LNRL
Sbjct: 473 IDLMCHEQEMVFREGESATGMAGLSQNEIQKSSSAQECSEAYAEQEKLILTGFRDFLNRL 532

Query: 504 ITFGEIK---------------EMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVS 548
           IT G IK               E   S LAR++ GS ++        +   T N     +
Sbjct: 533 ITCGSIKGKLLWSETFLCSNFHETTCSPLARSKTGSHQEPADGEMIKAGIETLNDKNAYA 592

Query: 549 NGVSKAYGPSALAAPAAKMSQMVTATVPISNNDL 582
           +G+ K+     LA  A+ +   + A +PI N ++
Sbjct: 593 SGIVKS---PVLATSASNVDLPIKALLPIGNGEV 623


>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
          Length = 576

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 238/444 (53%), Positives = 307/444 (69%), Gaps = 22/444 (4%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK       E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 89  KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 148

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE ARREA++ATLERNP+AFRPKI SSPH  R+N E  G++ ++GKHNKGCHCKKSGCLK
Sbjct: 149 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 208

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 209 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 267

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
            ++SP  S+KRK  +     +AK+   HR  H  Q  + +    S    +P+ +      
Sbjct: 268 AFSSPSTSRKRKHIDPSLDHSAKENGAHRTNHVPQVFYQKNGMPSD-GPLPINQPVHPPT 326

Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK----------QA 380
           +G  K TYR LLADI+QP D+KELC +LVV+S  AAK+ A +++  +K          + 
Sbjct: 327 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQEDKVVEKEDEMGGKR 386

Query: 381 EDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RP 438
           E   G S+AS+   R +++K+P+  +   DD S+   +A K   ++S  D +D QK+ RP
Sbjct: 387 EGEIGGSLASTNHDREENNKDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRP 446

Query: 439 MSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRD 498
           MSPGTLALMCDEQDT+FM        T          P  Q  +E+YAEQER VLT+FRD
Sbjct: 447 MSPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRD 498

Query: 499 CLNRLITFGEIKEMKYSSLARTEV 522
           CL++L+TFG +KE K+S   ++EV
Sbjct: 499 CLHKLVTFGRMKEEKFSMAIKSEV 522


>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
 gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 619

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 241/443 (54%), Positives = 305/443 (68%), Gaps = 25/443 (5%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK       E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 137 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 196

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE ARREA++ATLERNP+AFRPKI SSPH  R+N E  G++ ++GKHNKGCHCKKSGCLK
Sbjct: 197 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 256

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 257 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 315

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
            ++SP  S+KRK  +     +AK+   HR  H  Q N +   PS     +P+ +      
Sbjct: 316 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQKNGM---PSD--GPLPINQPVHPPT 370

Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVA---VQRNTTEKQ------AE 381
           +G  K TYR LLADI+QP D+KELC +LVV+S  AAK+ A    Q    EK+       E
Sbjct: 371 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQDKVVEKEDEMGGKRE 430

Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RPM 439
              G S+AS+   R +++ +P+  +   DD S+   +A K   ++S  D +D QK+ RPM
Sbjct: 431 GEIGGSLASTNHDREENNNDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRPM 490

Query: 440 SPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDC 499
           SPGTLALMCDEQDT+FM        T          P  Q  +E+YAEQER VLT+FRDC
Sbjct: 491 SPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRDC 542

Query: 500 LNRLITFGEIKEMKYSSLARTEV 522
           L++L+TFG +KE K+S   ++EV
Sbjct: 543 LHKLVTFGRMKEEKFSMAIKSEV 565


>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 615

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 265/548 (48%), Positives = 330/548 (60%), Gaps = 67/548 (12%)

Query: 23  PAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP-------PQPQ------------ 63
           P KKL RQLDF  G       P    + A  + S PP       PQP             
Sbjct: 32  PIKKLVRQLDFNYGN------PAMAAAAAAVSRSVPPRALQVGYPQPHQLRAALPMGVPH 85

Query: 64  --------------------PKPVVPPVPPPPPP-----LPGQATQPPVRVP-KPESPKS 97
                               P+P +P   P P P     L     +PPV VP KPESPK 
Sbjct: 86  QLQPRGLPVMRPHQMVHAQLPRPALPMAMPVPVPQARSALSQPVQRPPVAVPLKPESPKP 145

Query: 98  RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARRE 157
           R R   + K+ TP K+K CNC+HS+CLKLYCECFASGIYCDGCNC NC NNVENE ARRE
Sbjct: 146 RGRL-YDGKDGTPTKKKCCNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENEVARRE 204

Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
           A+EATLERNP+AFRPKI SSPH  R+N E  G++ ++GKHNKGCHCKKSGCLKKYCECFQ
Sbjct: 205 AIEATLERNPDAFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKYCECFQ 264

Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
           ANILCSENC+C DCKNF+GSE+R+ L   DH  N+ ++QQAANAA+ GAIG++ ++SP  
Sbjct: 265 ANILCSENCKCRDCKNFDGSEDRKHL-SLDH-KNVVHMQQAANAAVNGAIGATAFSSPST 322

Query: 278 SKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFT 337
           SKKRK  +     + K+    R G   Q  + + + +S   S+PV ++     +G  K T
Sbjct: 323 SKKRKHIDSSLDLSTKEHLAMRNGQLSQVIYQKNAMASD-GSLPVGQSVHPHMMGPFKVT 381

Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGT--SVASSAQGR 395
           YR LLADIIQP D+KELC +LV +S +AAK+   ++   EK AE   GT  S+AS+   R
Sbjct: 382 YRPLLADIIQPGDVKELCKLLVTVSGQAAKAYTGRKIQEEKVAEKEDGTGGSLASTNHDR 441

Query: 396 LQSHKEPDVDRTVTDDCSSA-NQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
               ++ +  +T  DD SS      K   DDS  D +D QK +RPMSPGTLALMCDEQD 
Sbjct: 442 EGKTQDQNHRKTSIDDRSSVGTDMGKASLDDSRPDCNDDQKSSRPMSPGTLALMCDEQDA 501

Query: 454 IFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMK 513
           +F         T         +   Q  +E+YAEQER VLT+FRDCL +L+ +G +KE K
Sbjct: 502 MFA--------TSQNAVAPPAVAADQNRSELYAEQERCVLTEFRDCLRKLVQYGRMKEEK 553

Query: 514 YSSLARTE 521
           YS   R+E
Sbjct: 554 YSMAIRSE 561


>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
          Length = 515

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 263/539 (48%), Positives = 334/539 (61%), Gaps = 70/539 (12%)

Query: 15  QSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQA------------------VTANS 56
           QSE+  D+  +KLARQLDF   +      P H +                     +  + 
Sbjct: 3   QSETASDLAPRKLARQLDFAAVYGD----PSHQKLPPPLPPPQSPLRPPSHSPLRLQLDL 58

Query: 57  HPPPQ-----------------PQPKPVVPP--VPPPPPPLPGQATQPPVRVPKPESPKS 97
            PP Q                 P+PK V P   +P P    P +  Q    V KPESP+S
Sbjct: 59  QPPGQRPWLYSEAQEQQPGLYSPRPKLVSPVRRIPHPVVKFPVKVLQ----VVKPESPRS 114

Query: 98  RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARRE 157
           + R NVE+K+ TPKKQKQCNCK+SRCLKLYCECFA+GIYCDGCNCVNCHNNV+NEAAR+E
Sbjct: 115 QPRCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVDNEAARQE 174

Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
           AV  TLERNPNAFRPKIASSP   RD+KE   E+ ++GKHNKGCHCKKSGCLKKYCECFQ
Sbjct: 175 AVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHNKGCHCKKSGCLKKYCECFQ 232

Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
           ANILCSENC+CMDCKNFEGS+ER A+FH D+  N+ +++QAANA I+GA+GSSGY +   
Sbjct: 233 ANILCSENCKCMDCKNFEGSDERIAIFHKDY--NLVHMKQAANATISGAVGSSGYGTHIT 290

Query: 278 SKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFT 337
            KKRK QE+F G +A D +++    + Q      S  S+LS   ++  S+T   GSS+ T
Sbjct: 291 PKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSDSFISDPSNTRISGSSRST 350

Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED-RAGTSVASSAQGRL 396
           YRS+LAD+++P ++K LCS+ VVLS  AAK+ A  R   ++Q +      SVASSA+  L
Sbjct: 351 YRSVLADVLKPNNVKNLCSLFVVLSRVAAKTNAEMRGKVDQQTKTGNFEASVASSAKS-L 409

Query: 397 QSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM 456
           Q  +  DV +   DD  + ++A+ V               RP+SP TLALMCDEQD +  
Sbjct: 410 QETR--DVHQPACDDHVNKDEAEAVDV---------AHYNRPLSPETLALMCDEQDDMLF 458

Query: 457 AASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLITFGEIK 510
              S +   G  CN++ Q          G T+VY EQER++LTKF D L  L+T G IK
Sbjct: 459 GNCSAD---GVACNSTFQNMIQKSFNSDGCTDVYREQERLILTKFLDILRGLVTHGSIK 514


>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
          Length = 529

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 239/443 (53%), Positives = 304/443 (68%), Gaps = 21/443 (4%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK       E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 43  KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 102

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE ARREA++ATLERNP+AFRPKI SSPH  R+N E  G++ ++GKHNKGCHCKKSGCLK
Sbjct: 103 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCLK 162

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGSEER+ALF GDH N++ ++QQAANAA+ GAIG++
Sbjct: 163 KYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSI-HMQQAANAAVNGAIGAA 221

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
            ++SP  S+KRK  +     +AK+   HR  H  Q  + +    S    +P+ +      
Sbjct: 222 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQVFYQKNGMPSD-GPLPINQPVHPPT 280

Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVA---VQRNTTEKQ------AE 381
           +G  K TYR LLADI+QP D+KELC +LVV+S  AAK+ A    Q    EK+       E
Sbjct: 281 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAYAGKKSQDKVVEKEDEMGGKRE 340

Query: 382 DRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS-ANQADKVGPDDSGSDGSDGQKA-RPM 439
              G S+AS+   R +++ +P+  +   DD S+   +A K   ++S  D +D QK+ RPM
Sbjct: 341 GEIGGSLASTNHDREENNNDPNNQKASADDPSTGGTRAGKASLEESKHDCNDDQKSNRPM 400

Query: 440 SPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDC 499
           SPGTLALMCDEQDT+FM        T          P  Q  +E+YAEQER VLT+FRDC
Sbjct: 401 SPGTLALMCDEQDTMFM--------TSQNVVPQQPAPVNQNQSELYAEQERCVLTEFRDC 452

Query: 500 LNRLITFGEIKEMKYSSLARTEV 522
           L++L+TFG +KE K+S   ++EV
Sbjct: 453 LHKLVTFGRMKEEKFSMAIKSEV 475


>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
          Length = 656

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 264/558 (47%), Positives = 333/558 (59%), Gaps = 33/558 (5%)

Query: 35  GGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQ----PPVRVP 90
            G  G VI      S A + NS P  +   + V          +   A +      +++ 
Sbjct: 119 AGTTGVVIFTAAEASFAGSTNSAPREEASGEGVASGEYSDRAQVYHCAFRMLNFSAIKIV 178

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KP+SP+S+ + NVE+K+ TPKKQKQCNCK+SRCLKLYCECFA+GIYCDGCNCVNCHNNV+
Sbjct: 179 KPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCNCVNCHNNVD 238

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NEAAR+EAV  TLERNPNAFRPKIASSP   RD+KE   E+ ++GKHNKGCHCKKSGCLK
Sbjct: 239 NEAARQEAVGITLERNPNAFRPKIASSPLERRDSKE--CEIQVIGKHNKGCHCKKSGCLK 296

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGS+ER+A+FH D+  N+ +I+QAANA I+   GSS
Sbjct: 297 KYCECFQANILCSENCKCMDCKNFEGSDERRAIFHKDY--NLVHIKQAANATISRVAGSS 354

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
           GY +    KKRK  E+F G +A D + +    +QQ      S  S LS   V+  S+T  
Sbjct: 355 GYGTHLTPKKRKIHEMFPGKSAMDQTDNITAQYQQEIDPMASSPSFLSDSFVSDPSNTRI 414

Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVAS 390
             SS+ TYRS+LAD  QPQ++K LCS+ VVLS  AAK+ A      ++Q   + G   AS
Sbjct: 415 SASSRSTYRSVLADFFQPQNVKNLCSLFVVLSQVAAKTNAEMIGKVDQQT--KTGNFEAS 472

Query: 391 SAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDE 450
            A       +  DV + V DD  + ++AD V   +           RP+SP TLALMCDE
Sbjct: 473 VASSSKSLQETRDVHQLVCDDHVNKDEADAVDIANYN---------RPLSPETLALMCDE 523

Query: 451 QDTIFMAASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLIT 505
           QD +    SS N   G  CN++ Q          G T+V  EQER++LTKF D L  L+T
Sbjct: 524 QDDMIFGNSSAN---GVACNSTLQNMIQKSSNSDGCTDVCREQERLILTKFLDVLRGLVT 580

Query: 506 FGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSKAYGPSALAAPAA 565
            G IKE   SS  +    S+K+   N    + T  G +    +N       PS       
Sbjct: 581 HGSIKETMCSSSTKKGERSKKEPADNANIGAETNVGREKGIHNNSFGNYLIPS-----VT 635

Query: 566 KMSQMVTA-TVPISNNDL 582
           K+SQ   A T    NNDL
Sbjct: 636 KISQTNYAVTNGHGNNDL 653


>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
 gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
          Length = 550

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 241/492 (48%), Positives = 313/492 (63%), Gaps = 47/492 (9%)

Query: 60  PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSRSNVEV 105
           P  +P  VV  VP P   +P     P +R             VP KPESPK R+R   E 
Sbjct: 45  PVMRPHQVVGHVPLPRHAVPMAVPVPQLRPMPPQPVQRPPVAVPLKPESPKPRARL-YEG 103

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLER 165
           K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+EATLER
Sbjct: 104 KDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLER 163

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           NP+AFRPKI SSPH  R+N E + ++ ++GKHNKGCHCKKSGCLKKYCECFQANILCSEN
Sbjct: 164 NPDAFRPKIGSSPHTNRNN-EVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSEN 222

Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
           C+CMDCKNFEGSEER+ LF GDH N +  +QQA NAA+ GAIG++G+ SP  S+KRK  +
Sbjct: 223 CKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQATNAAVNGAIGATGFPSPSTSRKRKHID 281

Query: 286 LFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADI 345
               ++ K+   HR  H  Q  + + +       IP+++++    +G  K TYR LLADI
Sbjct: 282 PSLDNSNKEHVAHRNYHISQSVYQKNAVPD--GPIPISQSAHPPNLGPFKVTYRPLLADI 339

Query: 346 IQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-----------DRAGTSVASSAQG 394
           +Q +D+KELC +LVV+S EAAK+ A ++   E+  E           DRAG S+A +   
Sbjct: 340 VQEEDIKELCKLLVVVSGEAAKTYAGRKTMEERVGEKEDERGGQKEDDRAG-SLALTNHD 398

Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKA-RPMSPGTLAL------- 446
           R  ++++PD   ++ D  S      K   ++S  + +D QK+ RPMSPGTLAL       
Sbjct: 399 REGNNQDPDQKASIDDHSSRGTHTGKAILEESRPNCADDQKSNRPMSPGTLALLCDEEVK 458

Query: 447 MCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITF 506
           MCDEQDT+F         T         +   Q  + +YAEQER+VLT+FRDCL + +  
Sbjct: 459 MCDEQDTVFT--------TSQNAVPQQTITVNQNQSALYAEQERVVLTEFRDCLRKFVER 510

Query: 507 GEIKEMKYSSLA 518
           G  KE +YSS+A
Sbjct: 511 GRNKERRYSSMA 522


>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
 gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
          Length = 682

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 227/424 (53%), Positives = 295/424 (69%), Gaps = 18/424 (4%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           K +SP SR        + TPKK KQCNCK+S CLKLYCECFA+GI+C+GCNC+NC NNVE
Sbjct: 91  KQDSPGSRL-----ANDGTPKKPKQCNCKNSMCLKLYCECFAAGIHCNGCNCLNCRNNVE 145

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE  R+EAVE TL+RNPNAFRPKIASSPHG+RD  E+  EV MLGKHNKGC+CK+SGCLK
Sbjct: 146 NEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGCLK 205

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD-HANNMAYIQQAANAAITGAIGS 269
           KYCECFQ NI CSENC+C++CKNFEGS+ER+ALFHG  HA   AY QQA NAAI+GAIGS
Sbjct: 206 KYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHA--TAYAQQATNAAISGAIGS 263

Query: 270 SGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPS-STLSSIPVARASST 328
           SGY +   SKKRK +EL FG   KD   H    F QGN +R S + S+  S+PV+R ++ 
Sbjct: 264 SGYGTTLASKKRKSEEL-FGIATKDQPAHSTAKFSQGNTLRNSAAFSSPLSVPVSRTANG 322

Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTS 387
           A +GS+K TYRS LA +++PQ +KE+C++LV +S +A +++A +    + Q+E +   T 
Sbjct: 323 AVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEALAGKVGKMDTQSERENIETY 382

Query: 388 VASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALM 447
            AS  Q +  S K  +    + DDC + N+A+K G  DSG    D + ARP SP  + LM
Sbjct: 383 SASPTQEKESSLKGWNAIEHMPDDCLNGNKAEKDGSIDSG----DVENARPSSPD-IDLM 437

Query: 448 CDEQDTIFMAASSPNRLTGHGCNTSSQLP-YGQGMTEVYAEQERIVLTKFRDCLNRLITF 506
           C E++ +FM   SP  +    C + +Q P  G   ++++AEQE+I+LT F D LN + T+
Sbjct: 438 CHEEEMMFMEMGSPIGV-ARLCQSKTQKPDDGYECSQLHAEQEKIILTSFLDILNTVSTY 496

Query: 507 GEIK 510
           G IK
Sbjct: 497 GSIK 500


>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
 gi|224030123|gb|ACN34137.1| unknown [Zea mays]
 gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
          Length = 607

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 218/451 (48%), Positives = 284/451 (62%), Gaps = 39/451 (8%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK R+R   E K++TP K+K  NC++S+CLKLYCECFASG YCDGCNC NC NN +
Sbjct: 128 KPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPD 186

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE ARREAVEA L+RNP+AFRPKI SSPH  R+N E   ++ ++GKHNKGCHCKKSGCLK
Sbjct: 187 NEVARREAVEAILDRNPDAFRPKIGSSPHMHRNN-EVPSDLPLVGKHNKGCHCKKSGCLK 245

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGSEER+ LF GDH N +  +QQ  NAA+ GAIG++
Sbjct: 246 KYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQVTNAAVNGAIGAT 304

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAA 330
           G+ SP  S+KRK  +       K+   H+  H  Q N +         SIP+++++    
Sbjct: 305 GFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQKNAVPD------GSIPISQSAHPPT 358

Query: 331 VGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK----------QA 380
           +G   FTY+ LLAD +Q +D+KELC +LVV+S EAAK+ A Q+   E+          Q 
Sbjct: 359 LGP--FTYKPLLADTVQAEDIKELCKLLVVVSGEAAKTYAGQKTKEERVGEKEDEIGGQK 416

Query: 381 EDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKA-RPM 439
           ED    S+A +   R  ++++ D    + D  S      K   ++S  +  D  K  RPM
Sbjct: 417 EDDRVGSLALTNHDREGNNQDQDQKAPIDDHSSRGTHTGKAIIEESRPNCVDDHKTHRPM 476

Query: 440 SPGTLAL---------MCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQER 490
           SPGTLAL         MCDEQDT+F         T         +      + +YAEQE 
Sbjct: 477 SPGTLALMCDEEAFEKMCDEQDTMFT--------TSQNAAPQQTVIVNHNQSALYAEQEN 528

Query: 491 IVLTKFRDCLNRLITFGEIKEMKYSSLARTE 521
           +VLT+FRDCL + + +G +KE +YS   ++E
Sbjct: 529 VVLTEFRDCLRKFVRYGRMKEERYSMAIKSE 559


>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
          Length = 628

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 228/426 (53%), Positives = 278/426 (65%), Gaps = 37/426 (8%)

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           EE G+V ++GKHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEER+ALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274

Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
            D+ N MAYIQQAANAAI+GAIGSSGY +   SKKRK QE+ F +TA + S HR   FQQ
Sbjct: 275 -DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQ 333

Query: 306 GNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEA 365
            NH+R S  S   SI V   ++ A +GSSKFTYRS LADI+QPQD+K+LCS+LVV+S EA
Sbjct: 334 DNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEA 393

Query: 366 AKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDD 425
            K++      TE +  D   TS+ SS   R +  +  DV + + D C S NQ ++V  D+
Sbjct: 394 GKTL------TEAE-RDHIETSIISSTHER-EHLQTDDVQKPLPDACLSENQVERVQTDE 445

Query: 426 SGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP-----YGQG 480
              D +D Q  R MSPGTLALMCDEQD+IFMA  SP+ + G     SSQ+P     +GQ 
Sbjct: 446 --LDQADVQNGRLMSPGTLALMCDEQDSIFMARGSPDGVVG-----SSQIPTVKSSHGQA 498

Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGT 540
            TEVY EQER+VLTKF D LNRLIT G IKE   S LA++E G+Q++ L +     RTG 
Sbjct: 499 CTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSETGNQQEPLEDDATKVRTGK 558

Query: 541 GNQPATVSNGVSKAYGPSALAAPAAKMSQMVTAT-----------VPISNNDLRHNPALP 589
           GN      N + K   PS   A   +MSQ + A             P +    + NPA+ 
Sbjct: 559 GNHKEPNGNSIVK---PS--FAATTQMSQTIAAAKGNGIAKQKPPFPSTTRISQTNPAVH 613

Query: 590 GEGDVK 595
           G G  K
Sbjct: 614 GNGTAK 619



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 9/79 (11%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE------ATL 163
           K  K C+CK S CLK YCECF + I C + C C++C  N E    RR          A +
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDC-KNFEGSEERRALFHDNGNYMAYI 283

Query: 164 ERNPN-AFRPKIASSPHGT 181
           ++  N A    I SS +GT
Sbjct: 284 QQAANAAISGAIGSSGYGT 302


>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
 gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
          Length = 646

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 250/630 (39%), Positives = 333/630 (52%), Gaps = 129/630 (20%)

Query: 15  QSESGGDIPAKKLARQLDFT---GGFVGSVILPEHPQSQAVT-------------ANSH- 57
           +SE+  D+  +K ARQLDF    GG     + P  PQS + T             A  H 
Sbjct: 3   KSETTLDLANRKFARQLDFAAACGGPSYLTLSPPTPQSLSKTIPRPPTPSYLPPPAQMHL 62

Query: 58  ------------PPPQPQPK------PVVPPVPPPPPPLPGQATQPPVRVPK-------- 91
                        PPQ Q +      PV+P    P P L     + P   PK        
Sbjct: 63  DFQSPVRRPWKQSPPQWQQQRTSQSPPVMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQA 122

Query: 92  ---PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNN 148
                SP+S+S++   +K+NT KKQK+CNCK+S+CLKLYCEC+A+GIYCDGCNC NCHNN
Sbjct: 123 LNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKCLKLYCECYAAGIYCDGCNCQNCHNN 182

Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPH----------------------------- 179
           + NEAAR+EA+  TLE+NPNAFRPKIASSP                              
Sbjct: 183 LNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFELK 242

Query: 180 -GTRDN---------------------KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
            G  +N                     +EE  E+ ++G+HNKGCHCKK GCLKKYCECF 
Sbjct: 243 FGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCECFH 301

Query: 218 ANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----SSGY 272
           AN+LCSENC+C+DCKNFEGS+  + +   + +  +  I+QA NAAI GA+G     S  +
Sbjct: 302 ANVLCSENCKCIDCKNFEGSDVWRIVLQEECS--LVQIRQATNAAINGAVGFGPSISGTH 359

Query: 273 ASPPVSKKRKGQELFFGSTAKD-PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAV 331
            +P   KKRK QE F G +  D P      H ++ + I +SP S  +S      S TA  
Sbjct: 360 ITP---KKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASF----VSDTAYK 412

Query: 332 GSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTSVAS 390
             S+  +RS+LAD++Q Q++K LCS+LVVLS EAAK+ A  R    ++ + ++   S+AS
Sbjct: 413 RISRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIAS 472

Query: 391 SAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDE 450
           S+Q  LQ  +  +V R      +S N A+K   D    D  +    RP+SP TL LMCDE
Sbjct: 473 SSQ-LLQDSR--NVVR------ASENHANKDVADAVDIDIHN----RPLSPETLKLMCDE 519

Query: 451 QDTIFMAASSPNRLTGHGC--NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
            D +F    S N +       N   +     G T VYAEQER++LTKFRD L  LI  G 
Sbjct: 520 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 579

Query: 509 IKEMKYSSLARTEVGSQKDALSNGTASSRT 538
           IKE  +SS  + +V  +K   +NG + + T
Sbjct: 580 IKETMHSSSVKKDVSIEKTPKNNGDSGAET 609


>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
 gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
          Length = 220

 Score =  350 bits (898), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 179/253 (70%), Positives = 192/253 (75%), Gaps = 35/253 (13%)

Query: 1   MGEGEGGEFPPKKSQSESGGDIPAKKLARQLDFT--GGFVGSVILPEHPQSQAVTANSHP 58
           MGEGEG + PPK +  +S      KKLARQLDF   GG   +  LPEHPQ          
Sbjct: 1   MGEGEGSDIPPKNAPLDS-----VKKLARQLDFNAFGGTPVTAPLPEHPQ---------- 45

Query: 59  PPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNC 118
                P P +PP+P               ++ KPESPKS+SR N E K+ TPKKQKQCNC
Sbjct: 46  -----PSPTLPPLPA-------------TKLGKPESPKSKSRPNFETKDATPKKQKQCNC 87

Query: 119 KHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
           KHSRCLKLYCECFASGIYCDGCNCVNC NNV+NEAARREAVEATLERNPNAFRPKIASSP
Sbjct: 88  KHSRCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSP 147

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
           HGTRDNKEETGEV +L KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSE
Sbjct: 148 HGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSE 207

Query: 239 ERQALFHGDHANN 251
           ERQALFHGD  NN
Sbjct: 208 ERQALFHGDQNNN 220


>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
 gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 459

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 200/446 (44%), Positives = 263/446 (58%), Gaps = 54/446 (12%)

Query: 71  VPPPPPPLP--GQATQPPVRVPKPESPKSRSRSNVEVKENTP-KKQKQCNCKHSRCLKLY 127
            PP  PPL   G       RV   +   SR  + VE KENTP K+QK CNCK+S+CLKLY
Sbjct: 49  TPPQKPPLHQFGHNQVQKNRVSNQDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLY 108

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           CECFASG YC+GCNCVNCHN +ENE++R+ A+   LERNP+AF+PKIA SPHG +D +E 
Sbjct: 109 CECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQEN 168

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
             +VL++GKH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+AL HG 
Sbjct: 169 VQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERKALLHGS 228

Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLG-HFQQG 306
             ++  YIQQ  NAA+  AI  S Y  PP S+KRK +++       D  +   G  +Q+ 
Sbjct: 229 QVSD-TYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDI------SDSVVSSYGVQYQRA 281

Query: 307 NHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAA 366
           NH+R +  ++L S+P  +A S    GS+   YRS  ++  QP  ++ELCS+LV      +
Sbjct: 282 NHVRRNGENSLFSLPNNKAVS----GSTTSAYRSSWSNTFQPHHVRELCSLLV------S 331

Query: 367 KSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTD--DCS-SANQADKVGP 423
            SV V    ++K  ++  G S    AQ          V   + D  DC   AN+ D+   
Sbjct: 332 NSVDVANKLSDKGRKNDKGPSSLDGAQR---------VANEINDSPDCVLDANRMDE--- 379

Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTE 483
                        +P+SP T ALMCDE+  I     +  R+      TS +       + 
Sbjct: 380 -------------KPISPATRALMCDEEHEIGSEKETSARVK-----TSQEKEDTDTSSG 421

Query: 484 VYAEQERIVLTKFRDCLNRLITFGEI 509
           +Y EQER +L+ FRD L +L     I
Sbjct: 422 IYLEQERQILSIFRDYLIQLSNRARI 447


>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 412

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 199/390 (51%), Positives = 244/390 (62%), Gaps = 53/390 (13%)

Query: 23  PAKKLARQLDFTGGFVGSVI-----------------LP---EHPQSQAVTANSHPP--- 59
           P KKL RQLDF    +                     LP   +HPQ          P   
Sbjct: 25  PIKKLVRQLDFNSAALAGNPAMAAAAAAVSRALQPRALPAGFQHPQHVRAAVPMVVPQQL 84

Query: 60  -----PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSR 100
                P  +P PVV  VP    PL      P +R             VP KP+SPK  +R
Sbjct: 85  HQRALPVMRPHPVVGHVPRHAVPL--AVPVPQLRPMPPQPVQRPPVAVPLKPDSPKPWAR 142

Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
              E K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+E
Sbjct: 143 L-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIE 201

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
           ATLERNP+AFRPKI SSPH  R+N E + ++ ++GKHNKGCHCKKSGCLKKYCECFQANI
Sbjct: 202 ATLERNPDAFRPKIGSSPHTNRNN-EVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANI 260

Query: 221 LCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKK 280
           LCSENC+CMDCKNFEGSEER+ LF GDH N +  IQQA NAA+ GAIG++G+ SP  S+K
Sbjct: 261 LCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-IQQATNAAVNGAIGTTGFPSPSTSRK 319

Query: 281 RKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRS 340
           RK  +     + K+ + HR  H  Q N +         SIP+++++    +G  K TYR 
Sbjct: 320 RKHIDPSLDHSNKEHAAHRNCHIPQKNAVPD------GSIPISQSAHPPTLGPFKVTYRP 373

Query: 341 LLADIIQPQDLKELCSVLVVLSSEAAKSVA 370
           LLADI+Q +D+KELC +LVV+S EAAK+ A
Sbjct: 374 LLADIVQAEDVKELCKLLVVVSGEAAKTYA 403


>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
 gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
          Length = 649

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 246/633 (38%), Positives = 329/633 (51%), Gaps = 132/633 (20%)

Query: 15  QSESGGDIPAKKLARQLDFT---GGFVGSVILPEHPQSQAVT-------------ANSH- 57
           +SE+  D+  +K ARQLDF    GG     + P  PQS + T             A  H 
Sbjct: 3   KSETTLDLANRKFARQLDFAAACGGPSYLTLSPPTPQSLSKTIPRPPTPSYLPPPAQMHL 62

Query: 58  ------------PPPQPQPK------PVVPPVPPPPPPLPGQATQPPVRVPK-------- 91
                        PPQ Q +      PV+P    P P L     + P   PK        
Sbjct: 63  DFQSPVRRPWKQSPPQWQQQRTSQSPPVMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQA 122

Query: 92  ---PESPKSRSRSNVEVKENTPKKQKQCNCKHSR---CLKLYCECFASGIYCDGCNCVNC 145
                SP+S+S++   +K+NT KKQK+CNCK+S+       YCEC+A+GIYCDGCNC NC
Sbjct: 123 LNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKSDFATARYCECYAAGIYCDGCNCQNC 182

Query: 146 HNNVENEAARREAVEATLERNPNAFRPKIASSPH-------------------------- 179
           HNN+ NEAAR+EA+  TLE+NPNAFRPKIASSP                           
Sbjct: 183 HNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSF 242

Query: 180 ----GTRDN---------------------KEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
               G  +N                     +EE  E+ ++G+HNKGCHCKK GCLKKYCE
Sbjct: 243 ELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCE 301

Query: 215 CFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----S 269
           CF AN+LCSENC+C+DCKNFEGS+  + +   + +  +  I+QA NAAI GA+G     S
Sbjct: 302 CFHANVLCSENCKCIDCKNFEGSDVWRIVLQEECS--LVQIRQATNAAINGAVGFGPSIS 359

Query: 270 SGYASPPVSKKRKGQELFFGSTAKD-PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASST 328
             + +P   KKRK QE F G +  D P      H ++ + I +SP S  +S      S T
Sbjct: 360 GTHITP---KKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASF----VSDT 412

Query: 329 AAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAE-DRAGTS 387
           A    S+  +RS+LAD++Q Q++K LCS+LVVLS EAAK+ A  R    ++ + ++   S
Sbjct: 413 AYKRISRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEAS 472

Query: 388 VASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALM 447
           +ASS+Q  LQ  +  +V R      +S N A+K   D    D  +    RP+SP TL LM
Sbjct: 473 IASSSQ-LLQDSR--NVVR------ASENHANKDVADAVDIDIHN----RPLSPETLKLM 519

Query: 448 CDEQDTIFMAASSPNRLTGHGC--NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLIT 505
           CDE D +F    S N +       N   +     G T VYAEQER++LTKFRD L  LI 
Sbjct: 520 CDELDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELII 579

Query: 506 FGEIKEMKYSSLARTEVGSQKDALSNGTASSRT 538
            G IKE  +SS  + +V  +K   +NG + + T
Sbjct: 580 LGSIKETMHSSSVKKDVSIEKTPKNNGDSGAET 612


>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
 gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
          Length = 497

 Score =  332 bits (851), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 196/432 (45%), Positives = 256/432 (59%), Gaps = 30/432 (6%)

Query: 86  PVRVPKPESPK-SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 144
           P+   K ESP+ S  ++  ++   TPK  +QC+CK S+CLKLYCECFASG YCD C+C N
Sbjct: 64  PLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYCDECSCAN 123

Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
           CHNNVENE  RREA E  LERNPNAF+PKI  SP    D+ +   +VL++ KH KGCHCK
Sbjct: 124 CHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCK 183

Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
           ++GCLKKYCECFQANILCSENC+C+ CKN EG E    +   +H    AYIQQ    + +
Sbjct: 184 RTGCLKKYCECFQANILCSENCKCVSCKNLEGRELGMVVTSENHCKTKAYIQQENANSSS 243

Query: 265 GAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVAR 324
             + SS   S   S+KR  +EL F S  +D  IH         H   +P     SIP  R
Sbjct: 244 AVVSSSHIFSQE-SRKRTFREL-FDSNMRDTEIHEF-----TKHHAVNPVGVHVSIPTLR 296

Query: 325 A------SSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEK 378
           A      +S+AA+GSSK TYRSLLAD I P D +ELCS+LVV+ SEAAK +A +    + 
Sbjct: 297 ADCVCQIASSAALGSSKLTYRSLLADAIHPLDTRELCSLLVVV-SEAAKFLADEH--CKA 353

Query: 379 QAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARP 438
           Q ++     +A   +   +  KEPDV + V  +    ++ D    D+S S   D Q   P
Sbjct: 354 QIKEVKEYQMACDLEKDCK--KEPDVHQVVPHNQFPGSRTDITVMDNSRSCDIDMQDGGP 411

Query: 439 MSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRD 498
            S     LMCD++D +F  ++S ++   HGCN        +  T+   E ER+VLT  RD
Sbjct: 412 FSSEEDYLMCDKKDKLFTRSASLDQNLNHGCN--------ENFTK---EHERLVLTCLRD 460

Query: 499 CLNRLITFGEIK 510
           CL +LI FG +K
Sbjct: 461 CLGKLIAFGTMK 472


>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
           distachyon]
          Length = 487

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 204/476 (42%), Positives = 258/476 (54%), Gaps = 71/476 (14%)

Query: 66  PVVPP---VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSR 122
           PVV P    P  P P P      P+   KP +         EVK  TPKK+K CNCK+SR
Sbjct: 29  PVVAPQQQKPAAPVPRPWPVAFTPM---KPAA---------EVKNVTPKKRKHCNCKNSR 76

Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           CLKLYCECFA+ + CDGCNC NC NN ENE  R+ A+EATL RNP AF+PKI + P    
Sbjct: 77  CLKLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVN 136

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
             K+ +G V +  KHNKGCHCKKSGCLKKYCECFQANILCS+NCRCMDCKNFEGSEER+A
Sbjct: 137 VRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCECFQANILCSKNCRCMDCKNFEGSEERRA 196

Query: 243 LFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGH 302
           L  GD+ ++   IQQAAN A+ GAIGSSGY   P  +K           +++ S+  +  
Sbjct: 197 LVQGDNTSDRNNIQQAANTALNGAIGSSGYKCSPARRK----------ISQEDSLSEM-Q 245

Query: 303 FQQGNHIRTSPSSTLSSIPVARASSTAAVG---------SSKFTYRSLLADIIQPQDLKE 353
           FQQ N         LS +     S T   G         SSK  YRS LA+ I   D+ +
Sbjct: 246 FQQVN---------LSDVSQLTPSCTGFGGHNGCYSQSNSSKMIYRSPLANTIHLSDVND 296

Query: 354 LCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCS 413
           L   +V+   +AA+                A  ++A +    ++  KE   +  + ++ S
Sbjct: 297 LVKHVVIACRKAAE----------------AFPTIADNKGVEVEVEKEFHANYGLNNNRS 340

Query: 414 SANQADKVGPDDSGSDGSDGQKA---RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCN 470
                 +  P D  S     Q     RP SPGT ALMCDEQDT F      +      C+
Sbjct: 341 KVQDLKEASPMDIHSKACTDQHNVDPRPASPGTQALMCDEQDTSFGNDIYRSSFAATSCD 400

Query: 471 TSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQK 526
                   + ++E+ A QE +VLT FRD L  LIT G+I E K SS A  E+ +Q+
Sbjct: 401 --------KNISEINALQENLVLTGFRDYLRVLITRGKINEAKSSSEAAMELNAQR 448


>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 212/429 (49%), Positives = 269/429 (62%), Gaps = 39/429 (9%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           K ESP+ RSRS  + K+ TPKK KQCNCK+SRCLKLYCECFASG YC+GCNCVNC NNVE
Sbjct: 4   KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVE 62

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE  R+EAVEATLERNPNAFRPKI SSP G RD+ E+ GE  + GKHNKGCHCKKSGCLK
Sbjct: 63  NEIVRQEAVEATLERNPNAFRPKIFSSP-GIRDSGEDVGEHPLAGKHNKGCHCKKSGCLK 121

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+C+DCKNF+GSEER+ALF  DH  ++ ++Q    + +     S 
Sbjct: 122 KYCECFQANILCSENCKCVDCKNFDGSEERRALFQADHNMSVNFMQPLGGSGVLS--NSP 179

Query: 271 GYASP-PVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTA 329
            Y SP P+ KKR+  EL F        +   G       +  + SST++S P ++  S  
Sbjct: 180 AYLSPSPLLKKRRTHELVFSGPGLKEQLTSGG--GTAPSLSLALSSTIASTPSSQGVS-P 236

Query: 330 AVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVA 389
           AV  +K  ++SLLA ++Q + + ELC +LVV+SSE           T+K+      + VA
Sbjct: 237 AVPPAKPMHKSLLAGVVQIEAIHELCKLLVVVSSE-----------TQKEILGMLLSDVA 285

Query: 390 SSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCD 449
           +S+  R  S +    DR VT            GPDD  +D    +  RPMSPGTLALMCD
Sbjct: 286 NSSTLRAFS-QSFQCDRAVT----------TTGPDDEDAD----KPQRPMSPGTLALMCD 330

Query: 450 EQDTIFMAASSP-NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
           E++ +F A  SP   L     +TS        +  ++AEQER +L +FRDCL R+ + G 
Sbjct: 331 EKEPLFTAPPSPIGGLASEELSTSGS----PQIALLHAEQERAILLEFRDCLRRITSVGN 386

Query: 509 IKEMKYSSL 517
            + M   S 
Sbjct: 387 RRGMVLGSF 395


>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
 gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
          Length = 448

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 185/473 (39%), Positives = 253/473 (53%), Gaps = 72/473 (15%)

Query: 58  PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKEN-TPKKQKQC 116
           P P+P P   +PP P                               E+K   + K++K C
Sbjct: 2   PTPRPWPVAFIPPKPV-----------------------------AEIKSTPSTKRKKHC 32

Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
           NCK+S+CLKLYCECFA+G+YCDGC+C  C N V NE+AR+EA+ +T +RNP AF+PKI +
Sbjct: 33  NCKNSQCLKLYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIEN 92

Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
             +     K++ G    L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEG
Sbjct: 93  GSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEG 152

Query: 237 SEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS 296
           SEE + +  GD++++   IQ AAN A+ GAIGSSGY   PV +KR  ++  +     + S
Sbjct: 153 SEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGS 212

Query: 297 IHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCS 356
             +   FQ+ NH+    SS ++S        +     S   YRS LA+ I P D   L  
Sbjct: 213 TMQT-QFQEANHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPTDATGLAK 268

Query: 357 VLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSAN 416
            LV++  +AA +           AE++                 E +V+R +  D   A 
Sbjct: 269 HLVIVCRKAADAFLT-------TAENKG----------------EMEVEREIHTDSDGAT 305

Query: 417 QADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP 476
             D+    D G   +  + +RP SPGT ALMCDEQD+ F            G +     P
Sbjct: 306 NMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF------------GTDYRISFP 353

Query: 477 ---YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQK 526
              + Q  +E+ A+QE+ VLT FRD L  +IT G+I E   SS A  ++ ++K
Sbjct: 354 VALHDQDTSELNAQQEKAVLTGFRDYLRTVITRGKINEANRSSEAAMQLDTRK 406


>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 912

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 199/466 (42%), Positives = 266/466 (57%), Gaps = 58/466 (12%)

Query: 80  GQATQPPVRVPK-----PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASG 134
           G  +Q P R P      P  P SRSRS ++ K+ TPKK KQCNCK+SRCLKLYCECFASG
Sbjct: 430 GSPSQSPHRSPSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKLYCECFASG 489

Query: 135 IYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL-M 193
            YCDGCNCVNC NN E+E  R+EAVEATLERNPNAFRPKIA+SP    D KE+  +    
Sbjct: 490 TYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKEDVVDRNGS 549

Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMA 253
             KHNKGCHCKKSGCLKKYCEC+QANILCSENC+C+DC+NF  S+ER+ALF GD A  ++
Sbjct: 550 AAKHNKGCHCKKSGCLKKYCECYQANILCSENCKCVDCRNFITSDERRALFPGDPALGLS 609

Query: 254 YIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSP 313
           ++Q+ +  A       SG+ S P+S+KRK Q++ +       +   L   +  NH  T+P
Sbjct: 610 FVQRPSGCA-------SGHMS-PLSRKRKTQDVEYIMPGLQTNKSNLVKVE--NHAATTP 659

Query: 314 SSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV---- 369
             ++ +               K   RSLLAD++    +++LC +LV++S++    V    
Sbjct: 660 KMSIYTASAEHIPEPMV----KVVNRSLLADVLHTDVIQDLCKLLVIVSADVQHEVSGIP 715

Query: 370 AVQRNTTEKQAEDRAGTS--------VASSAQGRLQSHKE-PDVDRTVTDDCSSA--NQA 418
           AV   T      D   T          A +A  RLQ+      +   V +D +S     A
Sbjct: 716 AVDNGTNNASTVDPTNTEDCPSGKEPSAPAAPDRLQAESTGTTMAEYVRNDPTSTPPVAA 775

Query: 419 DKVGPD------DSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTS 472
            +  P       D  +   + +K RPMSPGTLALMCDE+D +  A SSP           
Sbjct: 776 VRTAPSPSNQALDVVAKREETRKERPMSPGTLALMCDEKDPLLTAPSSPAH--------- 826

Query: 473 SQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLA 518
                     +++  QER++L++FRDCL R+I+ G  +  + S+ A
Sbjct: 827 --------ENQLHVVQERVILSEFRDCLRRIISVGNRRAYQLSTEA 864


>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
 gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
          Length = 509

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 199/468 (42%), Positives = 258/468 (55%), Gaps = 46/468 (9%)

Query: 59  PPQPQPKPVVPPVPPPP---PPLPGQATQP--PVRVPKPESPKSRSRSNVEVKENTP-KK 112
           PP   P+ V  P   P     P P    QP  PV  P P       +  VE+K  TP KK
Sbjct: 10  PPALVPEAVAEPRAAPQLQQQPKPAVQVQPSMPVLRPWPMEITLSMKPVVEMKSGTPAKK 69

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNCK+S+CLKLYCECFA+G YCDGCNC  C N VENE  R++A+  T  RNPNAF+P
Sbjct: 70  KKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQP 129

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI + P      K+  G +  L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+CMDCK
Sbjct: 130 KIENGPIPPSVRKD-AGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCK 188

Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTA 292
           N+EGSEE ++   GD++ +   +QQAAN A+ GAIGSSGY   PV +KR          A
Sbjct: 189 NYEGSEELRSTIQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKR---------PA 239

Query: 293 KDPSIHRLG--------HFQQGNHIRTSPSSTLSSIPVARASSTAAV-GSSKFTYRSLLA 343
           +DP   RL          FQ+ N++  S  ++       R  ST    G SK  YRS LA
Sbjct: 240 EDPHYQRLNVEGSMMQTQFQEANNVDVSQITS----STGRDGSTGNFQGRSKLVYRSPLA 295

Query: 344 DIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPD 403
           + I   D+ +L + LV++  +AA+         + +AE      + ++   +   +K  D
Sbjct: 296 NTIHVTDVNDLANHLVIVCRKAAERFTT---IVDNKAEMEIDREICTNTYQKFDENK--D 350

Query: 404 VDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDTIF-MAASSP 461
           V +            D+   DDS    SD Q+ ARP SPGT AL+CDEQD  F  A  SP
Sbjct: 351 VQKDALSQLGKVTNIDQQIVDDSILYCSDTQEDARPASPGTQALLCDEQDLTFGTAHRSP 410

Query: 462 NRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEI 509
             +  H           Q ++E++A QE  VL +FR+ L  +I  G+I
Sbjct: 411 IPIALH----------DQDISELHAAQENAVLKEFRNYLRLIIARGKI 448


>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
 gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
          Length = 461

 Score =  294 bits (752), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 186/447 (41%), Positives = 252/447 (56%), Gaps = 98/447 (21%)

Query: 75  PPPLPGQATQPPVR-------VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
           PP  PG+  + PVR       +P+P           E+KE   KK KQCNCK++RCLKLY
Sbjct: 60  PPAQPGEPGRKPVRQLNFSSILPRPPV--------YELKEVAAKKCKQCNCKNTRCLKLY 111

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           CECFASG YCDGCNC+NC NNVENEA R+EAVE TLERNPNAFRPKIA++ H  RD+ ++
Sbjct: 112 CECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADD 171

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
               + LGKHNKGC+CKKSGCLKKYCECFQANILCS+NC+C+DCKN++ S+ER+AL+  D
Sbjct: 172 ----MRLGKHNKGCNCKKSGCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPD 227

Query: 248 HANNMAY-IQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK-DPSIHRLGHFQQ 305
           ++ +  +        + TG++     +  P  K+R    +F G + K    I+R     Q
Sbjct: 228 YSQSTYHGGPPPYTTSATGSLACVSSSPMPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQ 287

Query: 306 GNHIRTS---PSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLS 362
              +  S   P+S+L               S+K    SLLA ++QP+ + +LC++LV+ S
Sbjct: 288 DAKLSGSMLPPNSSL---------------SAKTIQTSLLAGVVQPETVSDLCNLLVMAS 332

Query: 363 SEAAKSVAVQRNTTEKQAED--RAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
           +EAAK++      +++ AE+   AG                                   
Sbjct: 333 AEAAKAMIASGPVSKRPAENGQEAG----------------------------------- 357

Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
                SG    D ++ RPMSPGTLALMCDEQD +F  A+                 +G  
Sbjct: 358 -----SGQGDEDIREKRPMSPGTLALMCDEQDPLFTPAT-----------------HGDY 395

Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFG 507
             E+ AEQER++L +FRDCL ++IT G
Sbjct: 396 DKELAAEQERLILEEFRDCLRKIITVG 422


>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
 gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
          Length = 460

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 186/447 (41%), Positives = 252/447 (56%), Gaps = 98/447 (21%)

Query: 75  PPPLPGQATQPPVR-------VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
           PP  PG+  + PVR       +P+P           E+KE   KK KQCNCK++RCLKLY
Sbjct: 60  PPAQPGEPGRKPVRQLNFSSILPRPPV--------YELKEVAAKKCKQCNCKNTRCLKLY 111

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           CECFASG YCDGCNC+NC NNVENEA R+EAVE TLERNPNAFRPKIA++ H  RD+ ++
Sbjct: 112 CECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADD 171

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
               + LGKHNKGC+CKKSGCLKKYCECFQANILCS+NC+C+DCKN++ S+ER+AL+  D
Sbjct: 172 ----MRLGKHNKGCNCKKSGCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPD 227

Query: 248 HANNMAY-IQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAK-DPSIHRLGHFQQ 305
           ++ +  +        + TG++     +  P  K+R    +F G + K    I+R     Q
Sbjct: 228 YSQSTYHGGPPPYTTSATGSLACVSSSPMPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQ 287

Query: 306 GNHIRTS---PSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLS 362
              +  S   P+S+L               S+K    SLLA ++QP+ + +LC++LV+ S
Sbjct: 288 DAKLSGSMLPPNSSL---------------SAKTIQTSLLAGVVQPETVSDLCNLLVMAS 332

Query: 363 SEAAKSVAVQRNTTEKQAED--RAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
           +EAAK++      +++ AE+   AG                                   
Sbjct: 333 AEAAKAMIASGPVSKRPAENGQEAG----------------------------------- 357

Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
                SG    D ++ RPMSPGTLALMCDEQD +F  A+                 +G  
Sbjct: 358 -----SGQGDEDIREKRPMSPGTLALMCDEQDPLFTPAT-----------------HGDY 395

Query: 481 MTEVYAEQERIVLTKFRDCLNRLITFG 507
             E+ AEQER++L +FRDCL ++IT G
Sbjct: 396 DKELAAEQERLILEEFRDCLRKIITVG 422


>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
 gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
          Length = 473

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 183/471 (38%), Positives = 249/471 (52%), Gaps = 73/471 (15%)

Query: 68  VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL- 126
           V P  P P P P  A  PP  V + +S  S             K++K CNCK+S+CLKL 
Sbjct: 48  VQPQMPTPRPWP-VAFIPPKPVAEIKSTPS------------TKRKKHCNCKNSQCLKLA 94

Query: 127 -----------------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
                            YCECFA+G+YCDGC+C  C N V NE+AR+EA+ +T +RNP A
Sbjct: 95  CSSILDLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKA 154

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F+PKI +  +     K++ G    L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C 
Sbjct: 155 FQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQ 214

Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
           DCKNFEGSEE + +  GD++++   IQ AAN A+ GAIGSSGY   PV +KR  ++  + 
Sbjct: 215 DCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQ 274

Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
               + S  +   FQ+ NH+    SS ++S        +     S   YRS LA+ I P 
Sbjct: 275 KLNGEGSTMQT-QFQEANHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPT 330

Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
           D   L   LV++  +AA +           AE++                 E +V+R + 
Sbjct: 331 DATGLAKHLVIVCRKAADAFLT-------TAENKG----------------EMEVEREIH 367

Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
            D   A   D+    D G   +  + +RP SPGT ALMCDEQD+ F            G 
Sbjct: 368 TDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF------------GT 415

Query: 470 NTSSQLP---YGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
           +     P   + Q  +E+ A+QE+ VLT FRD L  +IT G+I   +++ +
Sbjct: 416 DYRISFPVALHDQDTSELNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 466


>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
 gi|223943031|gb|ACN25599.1| unknown [Zea mays]
 gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
          Length = 530

 Score =  289 bits (739), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 195/505 (38%), Positives = 275/505 (54%), Gaps = 49/505 (9%)

Query: 45  EHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS--RSN 102
           +HPQ       S PP       +  P  P   P P     P + VP+ + P + +  +S+
Sbjct: 8   QHPQQL-----SSPPVLAAASEIKAPAVPSQQPKPAA---PVMSVPR-QWPIAFNPLKSS 58

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
            E K  TPKK+K CNC++S+CLK+YCECF    YCDGCNC NC N V NE AR EA+EA 
Sbjct: 59  TEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAI 118

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
            +RNP+AF+PKI +  +     K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ C
Sbjct: 119 RQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQANVFC 178

Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
           S+NCRCM CKN EG+++ +     DHA++  +I+QAAN A  G++GSSGY    +S+KR 
Sbjct: 179 SKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGYIC-SLSRKRS 237

Query: 283 GQE---LFFGSTAKDPSIHRLGHFQQGNHIRTS---PSSTLSSIPVARASSTAAVGSSKF 336
            ++   +   S    P       +QQGNH   S   P ST        A++T     +  
Sbjct: 238 HEDTQCVRINSEGSMPET----QYQQGNHADVSMLPPCST--GFDGHNAANTKPYNPN-- 289

Query: 337 TYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQG-- 394
            YRS LA+ I   ++ +L   LV++   AA ++A   + T+  AE++   S    + G  
Sbjct: 290 -YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIA--DDETDLIAEEKGSQSNNGLSNGNC 346

Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
           + Q  KE      +++ CS     D+   D++GS  S     +RP SP T ALMCDEQD 
Sbjct: 347 KQQDFKEASSMVILSEGCS-----DQPNIDETGSHRSKTLDFSRPASPTTQALMCDEQDA 401

Query: 454 IF---------MAASSPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLN 501
            F          A     + T  G +  S  P     Q ++E+ A QE +VLT  R+ L 
Sbjct: 402 TFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEINAAQENLVLTGLREYLR 461

Query: 502 RLITFGEIKEMKYSSLARTEVGSQK 526
            +IT G+I E K S  A  E+ +Q+
Sbjct: 462 VIITRGKINEHKSSLEAAMELDAQQ 486


>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
          Length = 356

 Score =  284 bits (727), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 139/215 (64%), Positives = 166/215 (77%), Gaps = 3/215 (1%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK R+R   E K++TP K+K CNC++S+CLKLYCECFASG YCDGCNC NC NN +
Sbjct: 128 KPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKLYCECFASGTYCDGCNCTNCFNNPD 186

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           NE ARREAVEA L+RNP+AFRPKI SSPH  R+N E   ++ ++GKHNKGCHCKKSGCLK
Sbjct: 187 NEVARREAVEAILDRNPDAFRPKIGSSPHMHRNN-EVPSDLPLVGKHNKGCHCKKSGCLK 245

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQANILCSENC+CMDCKNFEGSEER+ LF GDH N +  +QQ  NAA+ GAIG++
Sbjct: 246 KYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAIN-MQQVTNAAVNGAIGAT 304

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
           G+ SP  S+KRK  +       K+   H+  H  Q
Sbjct: 305 GFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQ 339


>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
          Length = 485

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 189/488 (38%), Positives = 266/488 (54%), Gaps = 49/488 (10%)

Query: 45  EHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS--RSN 102
           +HPQ       S PP       +  P  P   P P     P + VP+ + P + +  +S+
Sbjct: 8   QHPQQL-----SSPPVLAAASEIKAPAVPSQQPKPAA---PVMSVPR-QWPIAFNPLKSS 58

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
            E K  TPKK+K CNC++S+CLK+YCECF    YCDGCNC NC N V NE AR EA+EA 
Sbjct: 59  TEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAI 118

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
            +RNP+AF+PKI +  +     K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ C
Sbjct: 119 RQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQANVFC 178

Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
           S+NCRCM CKN EG+++ +     DHA++  +I+QAAN A  G++GSSGY    +S+KR 
Sbjct: 179 SKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGYIC-SLSRKRS 237

Query: 283 GQE---LFFGSTAKDPSIHRLGHFQQGNHIRTS---PSSTLSSIPVARASSTAAVGSSKF 336
            ++   +   S    P       +QQGNH   S   P ST        A++T     +  
Sbjct: 238 HEDTQCVRINSEGSMPET----QYQQGNHADVSMLPPCST--GFDGHNAANTKPYNPN-- 289

Query: 337 TYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQG-- 394
            YRS LA+ I   ++ +L   LV++   AA ++A   + T+  AE++   S    + G  
Sbjct: 290 -YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIA--DDETDLIAEEKGSQSNNGLSNGNC 346

Query: 395 RLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQK-ARPMSPGTLALMCDEQDT 453
           + Q  KE      +++ CS     D+   D++GS  S     +RP SP T ALMCDEQD 
Sbjct: 347 KQQDFKEASSMVILSEGCS-----DQPNIDETGSHRSKTLDFSRPASPTTQALMCDEQDA 401

Query: 454 IF---------MAASSPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLN 501
            F          A     + T  G +  S  P     Q ++E+ A QE +VLT  R+ L 
Sbjct: 402 TFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEINAAQENLVLTGLREYLR 461

Query: 502 RLITFGEI 509
            +IT G+I
Sbjct: 462 VIITRGKI 469


>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
           sativa Japonica Group]
          Length = 508

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 173/421 (41%), Positives = 234/421 (55%), Gaps = 49/421 (11%)

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
           VEVK  +PKK+K CNCK+S     YCECFA+ +YCDGC+C  C N +ENE  R+EA+E  
Sbjct: 112 VEVKNASPKKRKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETI 166

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
           L RNP AF+PKI +SP+     K+ +  +  + KHNKGCHC+KS CLKKYCECFQANILC
Sbjct: 167 LLRNPLAFQPKIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILC 226

Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
           S+NCRC DCKNFEGSEER+AL    +A++  +IQ+AAN A+ GA GS GY + PV +++K
Sbjct: 227 SKNCRCQDCKNFEGSEERKALVQMKNASDRNHIQEAANFALNGATGSLGYKNSPV-RRKK 285

Query: 283 GQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLL 342
            QE   G       I     FQ  +H   S  ++  +      +      SS+  YRS L
Sbjct: 286 YQENSLGE-----QILSEAQFQPTDHADVSQLASSCTGFGGDIAINYQSKSSEMIYRSPL 340

Query: 343 ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEP 402
           A+ I   ++ +L    +V   +A ++              +AG  V   A+ + Q++ E 
Sbjct: 341 ANTIPLIEVNDLVKNALVACRKAPEAFLT-----------KAGNKVEMQAEKQYQTNDEI 389

Query: 403 DVDRTVTDDCSSANQAD---KVGPDDSGSDGSDGQKA------RPMSPGTLALMCDEQDT 453
           + D+T   +   A+  D   K   D    +G++   A      RP SPGT ALMCDE DT
Sbjct: 390 NNDKTKEQNLKEASLKDIQNKACIDRQNINGTEPHLANSFKDSRPASPGTQALMCDEMDT 449

Query: 454 IFMAASSPNRLTGHGCNTSSQLPY-----GQGMTEVYAEQERIVLTKFRDCLNRLITFGE 508
            F              N   + P+      Q ++E+ A+QERIVLT  RD L  LIT G 
Sbjct: 450 TFG-------------NEDYRSPFVVPSRDQDISELNADQERIVLTGLRDYLRVLITRGN 496

Query: 509 I 509
           I
Sbjct: 497 I 497


>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
 gi|223944643|gb|ACN26405.1| unknown [Zea mays]
 gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
          Length = 499

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 188/457 (41%), Positives = 259/457 (56%), Gaps = 31/457 (6%)

Query: 86  PVRVPKPESPKSRSRSNVEVKENTP-KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 144
           PV  P P       +  VE+K  TP KK+K CNCK+S+CLKLYCECFA+G YCDGCNC  
Sbjct: 44  PVLRPWPMEITLSMKPVVEMKSGTPAKKKKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQ 103

Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
           C N VENE  R+EA+  T  RNPNAF+PKI + P  +   K+  G +    KHNKGCHCK
Sbjct: 104 CGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPSSVWKD-AGALPSHPKHNKGCHCK 162

Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
           KSGCLKKYCECFQANILCS+NC+CMDCKN+EGSEE ++   GD++ +   IQQAAN A+ 
Sbjct: 163 KSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRSTTQGDNSCDRNNIQQAANVALN 222

Query: 265 GAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVAR 324
           GAIGSSGY   PV +KR  ++    +   + S+ +   F++ N++     S ++S     
Sbjct: 223 GAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT-QFRETNNVDV---SQMTSSTGRD 278

Query: 325 ASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRA 384
            S+    G+SK  YRS LA+ I   D+ +L + LV+   +AA+         + + E   
Sbjct: 279 GSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVIACRKAAERFTT---IVDNKVEMEI 335

Query: 385 GTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQ-KARPMSPGT 443
              + ++   +   +K+  V +            D+   DDS    SD Q  ARP SPGT
Sbjct: 336 DKEICTNTDQKFDENKK-GVQKAALSQLGKVTDIDQQIVDDSRLHCSDTQDDARPASPGT 394

Query: 444 LALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTEVYAEQERIVLTKFRDCL 500
            AL+CDEQD  F            G    S +P   + Q ++E++A QE  VL + R+ L
Sbjct: 395 QALLCDEQDLTF------------GTVHRSSIPVAMHDQVISELHAAQESAVLKELRNNL 442

Query: 501 NRLITFGEIK-EMKYSSLARTEVGSQKDALSNGTASS 536
             +I  G++  E K +S  R E+    DAL +  +S+
Sbjct: 443 RLIIARGKLNAEGKSASGTRMEL----DALRHHESST 475


>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
          Length = 437

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 160/394 (40%), Positives = 220/394 (55%), Gaps = 42/394 (10%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFA+G+YCDGC+C  C N V NE+AR+EA+ +T +RNP AF+PKI +  +     K+
Sbjct: 76  YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
           + G    L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE + +  G
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQG 195

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQG 306
           D++++   IQ AAN A+ GAIGSSGY   PV +KR  ++  +     + S  +   FQ+ 
Sbjct: 196 DNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEA 254

Query: 307 NHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAA 366
           NH+    SS ++S        +     S   YRS LA+ I P D   L   LV++  +AA
Sbjct: 255 NHV---DSSEITSSTGLEGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAA 311

Query: 367 KSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDS 426
            +           AE++                 E +V+R +  D   A   D+    D 
Sbjct: 312 DAFLT-------TAENKG----------------EMEVEREIHTDSDGATNMDQQNGGDF 348

Query: 427 GSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTE 483
           G   +  + +RP SPGT ALMCDEQD+ F            G +     P   + Q  +E
Sbjct: 349 GPCCNSLEDSRPASPGTQALMCDEQDSTF------------GTDYRISFPVALHDQDTSE 396

Query: 484 VYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
           + A+QE+ VLT FRD L  +IT G+I   +++ +
Sbjct: 397 LNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 430



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 9/77 (11%)

Query: 88  RVPKPESPKSRSRSN-VEVKEN-------TPKKQKQCNCKHSRCLKLYCECFASGIYCD- 138
           R PK   PK  + SN + ++++        PK  K C+CK S CLK YCECF + I C  
Sbjct: 114 RNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSK 173

Query: 139 GCNCVNCHNNVENEAAR 155
            C C +C N   +E  R
Sbjct: 174 NCKCQDCKNFEGSEELR 190


>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
 gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
          Length = 497

 Score =  275 bits (702), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 181/471 (38%), Positives = 255/471 (54%), Gaps = 48/471 (10%)

Query: 63  QPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSR 122
           QPKP  P +P P         Q P+    P  P +      EVK  TPKK+K CNC++S+
Sbjct: 32  QPKPAAPVMPVP--------RQWPMAF-NPLKPST------EVKSVTPKKKKHCNCRNSK 76

Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           CLK+YCECF    YCDGCNC NC N V NE AR EA+EA  +RNP+AF+PKI + P+   
Sbjct: 77  CLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIENGPNTLN 136

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
             K+ +G V ++ KH+KGCHCKKSGCLKKYCEC+QAN+ CS+NCRCM CKN EG+E+ + 
Sbjct: 137 VRKDNSGAVPVVAKHHKGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNEDTET 196

Query: 243 LFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSI-HRLG 301
               DHA++ ++IQQAAN A  G +GSSGY    +S+KR  ++        + S+     
Sbjct: 197 SSQRDHASDRSHIQQAANVAFNGTVGSSGYICS-LSRKRSHEDALGVRIKSEGSMPETQT 255

Query: 302 HFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVL 361
            +QQGNH   SP +  S+       + A   S+   YRS LA+ I  +++ +L   LV++
Sbjct: 256 QYQQGNHADVSPLAPCST--GFDRHNAANSKSNNPIYRSPLANTIHLREVNDLVKHLVMV 313

Query: 362 SSEAAKSVAVQR---------NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDC 412
              A   +A ++         N   KQ + +  +S+   ++G   SH +P+++       
Sbjct: 314 CRMAEAKIAEEKGFQSNNGLSNGNCKQQDFKETSSIDILSKG--CSH-QPNINEMGFHWS 370

Query: 413 SSANQADKVGPDDSGSDGSDGQKA----RPMSPGTLALMCDEQDTIF---------MAAS 459
            ++N +    P  + S   D Q       P SP T ALMC+EQ+T F          A  
Sbjct: 371 ETSNVSRPASPT-TQSLMCDEQNTTFGNEPASPTTQALMCEEQNTTFGNEPASPTTQALM 429

Query: 460 SPNRLTGHGCNTSSQLPY---GQGMTEVYAEQERIVLTKFRDCLNRLITFG 507
              + T  G +  S  P    GQ ++E+ A QE +VLT  R  L  +I  G
Sbjct: 430 CEEQNTTFGNDYKSSFPLVSRGQDISEINAVQENLVLTGLRQYLCTIIRRG 480


>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
 gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
          Length = 154

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 129/154 (83%), Positives = 136/154 (88%), Gaps = 1/154 (0%)

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           EETGE L+  KHNKGCHCKKSGCLKKYCECFQANILCSENC+CMDCKNFEGSEERQALFH
Sbjct: 1   EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60

Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
           GDH NNMAYIQQAANAAITGAIGSSGYAS PVS+KRKGQELFFG T KDPS HRLGHFQQ
Sbjct: 61  GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120

Query: 306 GNHIR-TSPSSTLSSIPVARASSTAAVGSSKFTY 338
            +HIR  +PSS L S PVARA +T  +GSSK TY
Sbjct: 121 ASHIRPAAPSSPLPSNPVARAGNTTTLGSSKITY 154



 Score = 45.4 bits (106), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           K  K C+CK S CLK YCECF + I C + C C++C N
Sbjct: 11  KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 48


>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
           distachyon]
          Length = 540

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 184/469 (39%), Positives = 252/469 (53%), Gaps = 47/469 (10%)

Query: 66  PVVP---PVPPPPPPLPGQATQPPVRVPKPESPKSR---SRSNVEVKENTPKKQKQCNCK 119
           PV+P   P+     PLPG        +       SR   S ++ E    TP++ K C CK
Sbjct: 96  PVLPQDLPLSAAKLPLPGVQKVTAGILCSEMMLNSRYPWSETSSESINGTPRR-KNCKCK 154

Query: 120 HSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
           +S+CLKLYC+CFASG YC D CNC NC N+V +E AR++A+ A +ERNP AF PKI + P
Sbjct: 155 NSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIP 214

Query: 179 -HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
            H  ++ +    E   +GKH KGC CK+S CLKKYCECFQ+N+LCSENC+C DCKN+E S
Sbjct: 215 RHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQSNVLCSENCKCTDCKNYESS 274

Query: 238 EERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSI 297
           E+ + +      + + Y+    N A+ G IG S    P  ++K  G  L   S  +D  I
Sbjct: 275 EDMKEMRRMTQQHGV-YVHHVQNLALKGMIGPSAVL-PRAAEKFSG--LSVASLGRDQPI 330

Query: 298 HRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSV 357
           +        N      S  L+S+P     S+  +G    TYR+LLADIIQ +D+  LC V
Sbjct: 331 N-------NNDSSQVLSPLLTSVPTEDTESSVRLGRHGVTYRTLLADIIQIEDVNVLCKV 383

Query: 358 LVVLSSEAAKSV---AVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSS 414
           LV++S +AA +      + NT  K+  DRA + + S+   R    K+ D        CS 
Sbjct: 384 LVLVSRQAAGAFLDSGFKENTNIKKL-DRAESHIPSTNHDRKAVQKQHD-----EQLCSL 437

Query: 415 ANQADKVGPDDSG------SDGSDGQK--ARPMSPGTLALMCDEQDTIFMAASSPNRLTG 466
            N    V P   G      SD SD  K   R +SPGT ALMC+EQD +F+  SS   + G
Sbjct: 438 GNSLIAV-PLSEGRAGMPRSDPSDTWKDNRRSVSPGTQALMCNEQD-MFVHTSS---VAG 492

Query: 467 HGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYS 515
              +T+      + ++E+Y EQE+ VLT   D L  L   G ++E K S
Sbjct: 493 AILSTTK-----ENLSEIYIEQEKRVLTNLLDFLRELAKSGRLQEEKRS 536


>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
          Length = 612

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 174/493 (35%), Positives = 263/493 (53%), Gaps = 27/493 (5%)

Query: 76  PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
           P LP  AT   V +    S K  S    E K+ T  ++  C+CK+S+CLKLYCECF  G 
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193

Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
           YC GCNC NC NNV +E  R++A+   LERNP AF PK+ +S     ++K   G++  +G
Sbjct: 194 YCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNSTAHNCESKAAEGDI--VG 251

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
           KH KGC CK++  LKKYCECF+A++ CSENCRC  CKNF+ +E+R +  +  HA+   ++
Sbjct: 252 KHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAH---HV 308

Query: 256 QQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPS-IHRLGHFQQGNHIRTSPS 314
           Q  A++ + G      +A+   S       +   S   + S +  L H  Q +  +    
Sbjct: 309 QNPASSGMVGQSVIIFHAAENDSSLSLAASVSDHSINNNTSHVILLQHTFQISPFQALSC 368

Query: 315 STLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV--AVQ 372
           S  SS+    A S       + TYR  LADIIQ +D+ ELC++L+++S +A  ++  +  
Sbjct: 369 SLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLESGM 428

Query: 373 RNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSD 432
           +  T+++  +R  + ++S+   R    KEP       +  S+     +   +   +D  D
Sbjct: 429 KENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRADPFD 488

Query: 433 GQKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQER 490
            QK   RP+SPGT ALMCDEQD +F  + +P+         ++     + + +++ EQE+
Sbjct: 489 IQKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIFKEQEK 539

Query: 491 IVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQP---ATV 547
            VLT FRD L +L T G+++E+K  SLA T    Q  A  +G  SS T          T+
Sbjct: 540 RVLTNFRDYLCKLATCGKLQELKL-SLASTNCSEQAFA-DHGNNSSITRVAKVTRIRQTI 597

Query: 548 SNGVSKAYGPSAL 560
               SKA   +AL
Sbjct: 598 PQFSSKAASTTAL 610


>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
          Length = 586

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 172/492 (34%), Positives = 257/492 (52%), Gaps = 51/492 (10%)

Query: 76  PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
           P LP  AT   V +    S K  S    E K+ T  ++  C+CK+S+CLKLYCECF  G 
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193

Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
           YC GCNC NC NNV +E AR++A+   LERNP AF PK+++S     ++K   G++  +G
Sbjct: 194 YCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNSTAHNCESKAAEGDI--VG 251

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
           KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R +  +  HA+++   
Sbjct: 252 KHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHHV--- 308

Query: 256 QQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSS 315
               N A +G +G S               + F +   D S                  S
Sbjct: 309 ---QNPASSGMVGQSV--------------IIFHAAENDSS--------LSLAASALSCS 343

Query: 316 TLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAKSV--AVQR 373
             SS+    A S       + TYR  LADIIQ +D+ ELC++L+++S +A  ++  +  +
Sbjct: 344 LTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLESGMK 403

Query: 374 NTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDG 433
             T+++  +R  + ++S+   R    KEP       +  S+     +   +   +D  D 
Sbjct: 404 ENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRADPFDI 463

Query: 434 QKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERI 491
           QK   RP+SPGT ALMCDEQD +F  + +P+         ++     + + +++ EQE+ 
Sbjct: 464 QKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIFKEQEKR 514

Query: 492 VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGTASSRTGTGNQP---ATVS 548
           VLT FRD L +L T G+++E+K  SLA T    Q  A  +G  SS T          T+ 
Sbjct: 515 VLTNFRDYLCKLATCGKLQELKL-SLASTNCSEQAFA-DHGNNSSITRVAKVTRIRQTIP 572

Query: 549 NGVSKAYGPSAL 560
              SKA   +AL
Sbjct: 573 QFSSKAASTTAL 584


>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
          Length = 426

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 155/453 (34%), Positives = 215/453 (47%), Gaps = 84/453 (18%)

Query: 68  VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLY 127
           V P  P P P P  A  PP  V + +S  S             K++K CNCK+S+CLKL 
Sbjct: 48  VQPQMPTPRPWP-VAFIPPKPVAEIKSTPS------------TKRKKHCNCKNSQCLKL- 93

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
                                       +EA+           + KI +  +     K++
Sbjct: 94  ----------------------------QEAITVRSNGIQKPSKLKIENGSNALNLRKDD 125

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
            G    L KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE + +  GD
Sbjct: 126 AGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGD 185

Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGN 307
           ++++   IQ AAN A+ GAIGSSGY   PV +KR  ++  +     + S  +   FQ+ N
Sbjct: 186 NSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEAN 244

Query: 308 HIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAAK 367
           H+    SS ++S        +     S   YRS LA+ I P D   L   LV++  +AA 
Sbjct: 245 HV---DSSEITSSTGLEGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAAD 301

Query: 368 SVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSG 427
           +           AE++                 E +V+R +  D   A   D+    D G
Sbjct: 302 AFLT-------TAENKG----------------EMEVEREIHTDSHGATNMDQQNGGDFG 338

Query: 428 SDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLP---YGQGMTEV 484
              +  + +RP SPGT ALMCDEQ + F            G +     P   + Q  +E+
Sbjct: 339 PCCNSLEDSRPASPGTQALMCDEQGSTF------------GTDYRISFPVALHDQDTSEL 386

Query: 485 YAEQERIVLTKFRDCLNRLITFGEIKEMKYSSL 517
            A+QE+ VLT FRD L  +IT G+I   +++ +
Sbjct: 387 NAQQEKAVLTGFRDYLRTVITRGKINGTEFAKI 419



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 52/131 (39%), Gaps = 35/131 (26%)

Query: 59  PPQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------------------VPKPE 93
           P  P P+P  P    PP P+    + P  +                         + KP 
Sbjct: 50  PQMPTPRP-WPVAFIPPKPVAEIKSTPSTKRKKHCNCKNSQCLKLQEAITVRSNGIQKPS 108

Query: 94  SPKSRSRSN-VEVKEN-------TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVN 144
             K  + SN + ++++        PK  K C+CK S CLK YCECF + I C   C C +
Sbjct: 109 KLKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQD 168

Query: 145 CHNNVENEAAR 155
           C N   +E  R
Sbjct: 169 CKNFEGSEELR 179


>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
          Length = 553

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 141/354 (39%), Positives = 194/354 (54%), Gaps = 44/354 (12%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-----------------RQ 241
           K C+CK S CLK YCECF + + C + C C +C N   +E                  R 
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218

Query: 242 ALFHGDHAN--NMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHR 299
            +    HAN  NM    QAANAA+ GAIG++ ++SP  S+KRK  +     +AK+   HR
Sbjct: 219 KIGSSPHANRNNM----QAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHR 274

Query: 300 LGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLV 359
             H  Q  + +    S    +P+ +      +G  K TYR LLADI+QP D+KELC +LV
Sbjct: 275 TNHVPQVFYQKNGMPSD-GPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLV 333

Query: 360 VLSSEAAKSVA---VQRNTTEKQ------AEDRAGTSVASSAQGRLQSHKEPDVDRTVTD 410
           V+S  AAK+ A    Q    EK+       E   G S+AS+   R +++ +P+  +   D
Sbjct: 334 VVSGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASAD 393

Query: 411 DCSSA-NQADKVGPDDSGSDGSDGQKA-RPMSPGTLALMCDEQDTIFMAASSPNRLTGHG 468
           D S+   +A K   ++S  D +D QK+ RPMSPGTLALMCDEQDT+FM        T   
Sbjct: 394 DPSTGGTRAGKASLEESKHDCNDDQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 445

Query: 469 CNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEV 522
                  P  Q  +E+YAEQER VLT+FRDCL++L+TFG +KE K+S   ++EV
Sbjct: 446 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 499



 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 68/96 (70%), Positives = 78/96 (81%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           KPESPK       E K+ TP K+K CNCKHS+CLKLYCECFASG+YCDGCNC NC NNV+
Sbjct: 137 KPESPKPPRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNVK 196

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           NE ARREA++ATLERNP+AFRPKI SSPH  R+N +
Sbjct: 197 NETARREAIDATLERNPDAFRPKIGSSPHANRNNMQ 232


>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
          Length = 702

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 92/131 (70%), Positives = 105/131 (80%), Gaps = 9/131 (6%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNCK+SRCLKLYCECFASG YCDGCNC+NC+NN E+EA R+ AVEA LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KIA       DN+   G      +HNKGC+CKKSGCLKKYCECFQA I+CSE C+C+DCK
Sbjct: 213 KIAPG----ADNRRAGG-----ARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCK 263

Query: 233 NFEGSEERQAL 243
           N+EGSE R AL
Sbjct: 264 NYEGSEARDAL 274


>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
          Length = 274

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 96/164 (58%), Positives = 122/164 (74%), Gaps = 1/164 (0%)

Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
            YCECFA+ +YCDGC+C  C N +ENE  R+EA+E  L RNP AF+PKI +SP+     K
Sbjct: 87  WYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRK 146

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + +  +  + KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL  
Sbjct: 147 DNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQ 206

Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
             +A++  +IQ+AAN A++GA GS GY S PV +++K QE   G
Sbjct: 207 MKNASDRNHIQEAANFALSGATGSLGYKSSPV-RRKKYQENSLG 249



 Score = 45.8 bits (107), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREA 158
           + N E     PK  K C+C+ S CLK YCECF + I C   C C +C  N E    R+  
Sbjct: 146 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDC-KNFEGSEERKAL 204

Query: 159 VE 160
           V+
Sbjct: 205 VQ 206


>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
          Length = 321

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 139/246 (56%), Gaps = 27/246 (10%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
           N    +K CNC++SRCLKLYCECFASG YC   CNCV C NNVEN   R+EAVEATLERN
Sbjct: 15  NIQTDRKTCNCRNSRCLKLYCECFASGQYCLQNCNCVECKNNVENSVQRQEAVEATLERN 74

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           PNAFRPK+  + +G   ++          KHNKGCHCKKSGCLKKYCECFQA+I CS+ C
Sbjct: 75  PNAFRPKVLPTLNGGEGDE----------KHNKGCHCKKSGCLKKYCECFQASIFCSDMC 124

Query: 227 RCMDCKNFEGSEERQALFH-------GDHANNMAY--IQQAANAAITGAIGSSGYA---S 274
           RC+DCKNFEGS+  +A+         G  A+ +    +Q   N    GA+    Y    S
Sbjct: 125 RCLDCKNFEGSDAHKAIASQQEFMPAGSDASLLVAPGLQMDVNKTNGGAMKQEAYVPQKS 184

Query: 275 PPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSS 334
           P     + G    F  T K P +H L    + N +     + LS+  VAR  +   V   
Sbjct: 185 PMGFAAKGGASPHFLKT-KKPILHGL---IKPNAVDQLGEALLSAAEVARKRAHDQVAYD 240

Query: 335 KFTYRS 340
               R+
Sbjct: 241 ALKRRT 246



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           E   K  K C+CK S CLK YCECF + I+C D C C++C N
Sbjct: 90  EGDEKHNKGCHCKKSGCLKKYCECFQASIFCSDMCRCLDCKN 131


>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 255

 Score =  182 bits (463), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 85/130 (65%), Positives = 101/130 (77%), Gaps = 6/130 (4%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K++K CNC++SRCLKLYCECFASG++CD CNCVNC NN+ + A R  AVE+TLERNPNAF
Sbjct: 4   KRKKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAF 63

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           RPKIA    G  + + E        +HN+GCHCKKS CLKKYCECFQANI CS+ CRCM+
Sbjct: 64  RPKIAPGV-GPLEKRREGA-----SRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCME 117

Query: 231 CKNFEGSEER 240
           CKNF GS +R
Sbjct: 118 CKNFLGSSQR 127



 Score = 46.6 bits (109), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           +E   +  + C+CK S CLK YCECF + I+C D C C+ C N
Sbjct: 78  REGASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCMECKN 120



 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           M  K  K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 1   MPSKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40


>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
          Length = 695

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 77/151 (50%), Positives = 101/151 (66%), Gaps = 10/151 (6%)

Query: 93  ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
           ES      SN+E   N  + +K CNC  S+CLKLYC+CFA+G +C  CNC+NC NN+ENE
Sbjct: 464 ESITMTPESNLE--SNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 521

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
             R+ A++  LERNPNAFRPKI  +     D+         + KH KGC+CK+SGCLK Y
Sbjct: 522 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 573

Query: 213 CECFQANILCSENCRCMDCKNFEGSEERQAL 243
           CEC++A I CS NC+CM C+N E S E+++L
Sbjct: 574 CECYEAKIACSNNCKCMGCRNIEDSMEKKSL 604


>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
          Length = 752

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 77/151 (50%), Positives = 101/151 (66%), Gaps = 10/151 (6%)

Query: 93  ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
           ES      SN+E   N  + +K CNC  S+CLKLYC+CFA+G +C  CNC+NC NN+ENE
Sbjct: 521 ESITMTPESNLE--SNGIRPRKPCNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 578

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
             R+ A++  LERNPNAFRPKI  +     D+         + KH KGC+CK+SGCLK Y
Sbjct: 579 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 630

Query: 213 CECFQANILCSENCRCMDCKNFEGSEERQAL 243
           CEC++A I CS NC+CM C+N E S E+++L
Sbjct: 631 CECYEAKIACSNNCKCMGCRNIEDSMEKKSL 661


>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
 gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
          Length = 766

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 74/127 (58%), Positives = 90/127 (70%), Gaps = 11/127 (8%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           K CNCK+SRCLKLYCECFA+G  C    C C NC N VE+   R  A+E  L+RNPNAFR
Sbjct: 16  KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVLDRNPNAFR 75

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKIA+    +RD         +  +HN+GCHCK+SGCLKKYCECFQA I C+E CRC+ C
Sbjct: 76  PKIAAVASPSRD---------VALRHNRGCHCKRSGCLKKYCECFQAAIYCAETCRCVSC 126

Query: 232 KNFEGSE 238
           +N+EGS+
Sbjct: 127 ENYEGSD 133



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/59 (45%), Positives = 36/59 (61%), Gaps = 7/59 (11%)

Query: 90  PKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           PK  +  S SR +V ++ N     + C+CK S CLK YCECF + IYC + C CV+C N
Sbjct: 76  PKIAAVASPSR-DVALRHN-----RGCHCKRSGCLKKYCECFQAAIYCAETCRCVSCEN 128


>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 140/455 (30%), Positives = 202/455 (44%), Gaps = 106/455 (23%)

Query: 84  QPPVRVPKPESPKSRSRSNVEVKENTPKK----QKQCNCKHSRCLKLYCECFASGIYCDG 139
           + PV+   P  P SR  S  E K+ T ++    +K+C CK S+CLKLYC+CFASG+ C  
Sbjct: 32  ESPVKEIPPFPPISREHS--ETKDTTDQEGITWRKRCRCKQSKCLKLYCDCFASGVLCTD 89

Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
           C+CV+CHNN +N  AR  AV   L RNPNAF  K+ SS     D + +       G  ++
Sbjct: 90  CDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSS---INDKQCKAAPDTRPGLLSR 146

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKN-------------------FEGSEER 240
           GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN                   +EG +  
Sbjct: 147 GCKCKRTKCLKKYCECFQANVLCSDNCKCINCKNVSEAFQPSVFAWGLNSRKLYEGFDNP 206

Query: 241 QALFHGD------HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKD 294
           +  F  D        +++ +        +  A G S + SP V ++R+ QEL   ++   
Sbjct: 207 EKKFVCDLGIISPSEDSVGFNSHNTAGCMNYAPGFSAHNSPQVYRRRRHQELPEWNSCPA 266

Query: 295 PSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSL-----LADIIQPQ 349
           P           N I+ +  S +SS P             K  YR        A  + P 
Sbjct: 267 PLFS-----IPDNSIQNALGSPMSSSP-------------KLPYRKKKPPLGYASTVVP- 307

Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
           DL ++CS+L+  S  A  +           AED          Q R+    +  VD   T
Sbjct: 308 DLGDICSLLLAASETATAN-----------AED----------QNRICIKPDDKVDNVYT 346

Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
           +  S +         +SG+   + Q  R +              I +  +  N      C
Sbjct: 347 EVLSES---------ESGNVEEEIQSFRRL--------------IELIDAQYNGEEHSKC 383

Query: 470 NTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLI 504
            T + +      T++Y EQER VL  FRDCL   I
Sbjct: 384 KTETSV----HETDIYMEQERAVLETFRDCLQTFI 414


>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 151

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 81/145 (55%), Positives = 101/145 (69%), Gaps = 10/145 (6%)

Query: 94  SPKSRSRSNVEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNV 149
           SP+S     +  + +TP+    ++K+CNCK+S+CLKLYCECFASG YCD CNC  C NN 
Sbjct: 12  SPRS-----IAARASTPRDSASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNND 66

Query: 150 ENEAARREAVEATLERNPNAFRPKIAS-SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
           E+   R+ A+E TLERNPNAFRPKI + + +GT D            +HN+GCHCK+SGC
Sbjct: 67  EHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGC 126

Query: 209 LKKYCECFQANILCSENCRCMDCKN 233
           LKKYCECFQA I C E CRC +CKN
Sbjct: 127 LKKYCECFQAAIYCVERCRCAECKN 151


>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
           CCMP2712]
          Length = 138

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/129 (60%), Positives = 98/129 (75%), Gaps = 4/129 (3%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNP 167
           TP K+K CNCK+S+CLKLYCECFASG+YC+  CNC +CHNNV  E  R+ AV +TLERNP
Sbjct: 10  TPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNP 69

Query: 168 NAFRPKIASS-PHGTRDNKEETGEVLML--GKHNKGCHCKKSGCLKKYCECFQANILCSE 224
           +AFRPKI  +   G  D+  +    LM    +H KGCHCKKSGCLKKYCECFQA ++CS+
Sbjct: 70  HAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAVMCSD 129

Query: 225 NCRCMDCKN 233
            C+C++C+N
Sbjct: 130 TCKCIECRN 138


>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 546

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 71/124 (57%), Positives = 94/124 (75%), Gaps = 12/124 (9%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K+CNCK S+C+KLYCEC+A+G+YC+GC+CV+C NN ++E  R +A++ TL +NP AF PK
Sbjct: 69  KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128

Query: 174 IASSPHGTRDNKEETG----EVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           I+         K+ET     + L LGKHNKGC CK+SGC+KKYCECFQA + CSENC+C 
Sbjct: 129 ISQ--------KQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCT 180

Query: 230 DCKN 233
           DC+N
Sbjct: 181 DCQN 184



 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 30/44 (68%), Gaps = 1/44 (2%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
           G  +K C+CK+S C+K YCEC+ A + C+  C C+ CKN + SE
Sbjct: 65  GYTSKKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSE 107



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 34/64 (53%), Gaps = 2/64 (3%)

Query: 85  PPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCV 143
           P   +PK  S K  + S V    N  K  K C CK S C+K YCECF +G+ C + C C 
Sbjct: 122 PTAFLPKI-SQKQETASKVVDPLNLGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCT 180

Query: 144 NCHN 147
           +C N
Sbjct: 181 DCQN 184


>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
 gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
          Length = 1133

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 68/109 (62%), Positives = 79/109 (72%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFASG YC+ CNCV C NN E+EA R+ AVEA LERNPNAFRPKI S+         
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
                   G+H KGC+CKKS CLKKYCECFQA I CS+NC+C++CKN+E
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYE 252



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/41 (60%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
           +  K CNCK S CLK YCECF +GIYC D C CV C N  E
Sbjct: 213 RHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYEE 253


>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
 gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
          Length = 116

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 75/116 (64%), Positives = 84/116 (72%), Gaps = 9/116 (7%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFASG+YCDGCNC+NC NN ENE  R+EAVE  LERNPNAFRPKI S      D   
Sbjct: 1   YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60

Query: 187 ETGEVLM---------LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
               ++          + +HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61  CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116



 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
             +  K C CK S CLK YCECF + I C + C CV+C N
Sbjct: 77  VARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116


>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
          Length = 596

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/158 (55%), Positives = 102/158 (64%), Gaps = 26/158 (16%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNCK+S+CLKLYCECFASG YC  CNC  C NN+ENE+ R+E +   LERNPNAFRP
Sbjct: 33  KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92

Query: 173 KIAS----SPHGTRDNKEETG----------------------EVLMLGKHNKGCHCKKS 206
           KIAS    SP  T+     TG                      + +  GKH KGC CKKS
Sbjct: 93  KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152

Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           GCLKKYCECFQA I CS+NC+C DCKNF+GS +R+ L 
Sbjct: 153 GCLKKYCECFQAGIFCSDNCKCCDCKNFDGSLDRRILL 190



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 10/109 (9%)

Query: 40  SVILPEHPQSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRS 99
           ++IL  +P +      S PP  PQ    V  V   P   P Q     + +  P       
Sbjct: 80  AIILERNPNAFRPKIASLPPQSPQITQKVVGVTGGPINNPSQIN---IGLSTP------I 130

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
            + V       K  K C CK S CLK YCECF +GI+C D C C +C N
Sbjct: 131 MTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKN 179


>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 491

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 72/126 (57%), Positives = 89/126 (70%), Gaps = 13/126 (10%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           +K CNCK+SRCLKLYCECFASG YCD  C CV+C NN  N+ AR++AV+ATLERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI S+                  +H+KGCHC+KS C K+YCEC+QA I C+ +C+C+DC
Sbjct: 197 PKINSA------------ATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDC 244

Query: 232 KNFEGS 237
           KN   S
Sbjct: 245 KNHSHS 250



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-RQALFHGDHANNMAYIQQ 257
           K C+CK S CLK YCECF +   C  +C+C+ C N E ++E RQ         N A  + 
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFRP 197

Query: 258 AANAAIT 264
             N+A T
Sbjct: 198 KINSAAT 204



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 98  RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNN 148
           R + N    E+  +  K C+C+ S C K YCEC+ +GI C   C CV+C N+
Sbjct: 196 RPKINSAATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDCKNH 247


>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
          Length = 750

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 23/180 (12%)

Query: 74  PPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFAS 133
           P  P  G+ +     V     PKS S  N        K +K CNC  S+CLKLYC+CFA+
Sbjct: 506 PTVPKIGKPSTSTALVLSETKPKSTSDVN--------KTKKPCNCTRSQCLKLYCDCFAN 557

Query: 134 GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLM 193
           G +C  CNC +C+NN+E+E  R +A+ + L+RNPNAFRPKI          K   G+   
Sbjct: 558 GEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKI----------KSFVGQA-- 605

Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMA 253
             +H KGC+CK+SGCLK YCECF+A I CS+NC+C+ C+N E  ++    F GDH  +++
Sbjct: 606 ERQHTKGCNCKRSGCLKNYCECFEARIACSQNCKCIGCRNME--DDSHTSF-GDHPGSLS 662


>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
          Length = 916

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 78/153 (50%), Positives = 100/153 (65%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+E+E  R++A+ A LERN
Sbjct: 657 EDNSRPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGACLERN 716

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           PNAFRPKI     G  +            +H KGC+CK+SGCLK YCEC++A ILCS  C
Sbjct: 717 PNAFRPKIGKGKEGDHER-----------RHTKGCNCKRSGCLKNYCECYEAKILCSSMC 765

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L     A  +   QQAA
Sbjct: 766 KCVGCKNFEDSSERKTLMQLADAAEVRVQQQAA 798


>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
 gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
          Length = 757

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 66/124 (53%), Positives = 89/124 (71%), Gaps = 10/124 (8%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           K++ CNC  S+CLKLYC+CFA+G +C  CNC +C+NN++NE  R++A+ ATLERNP+AF+
Sbjct: 488 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 547

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI     G     E+      L  H KGC+CK+SGCLK YCEC++A I CS NC+C+ C
Sbjct: 548 PKI-----GAVSADEDA-----LRLHTKGCNCKRSGCLKNYCECYEAKIACSANCKCIGC 597

Query: 232 KNFE 235
           +N E
Sbjct: 598 RNTE 601



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALFHGDHANN 251
           C+C KS CLK YC+CF     C  NC C DC  N +  EERQ         N
Sbjct: 492 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEERQKAIRATLERN 542


>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
          Length = 605

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 76/163 (46%), Positives = 100/163 (61%), Gaps = 18/163 (11%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K +K CNC  S CLKLYCECFA+G +CD CNC+NCHNN+E +  R +A+++ LERNP AF
Sbjct: 375 KIRKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAF 434

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           RPKI       R              H KGC+CK+SGCLK YCEC++A I C+  C+C+ 
Sbjct: 435 RPKIGRGRDANRT-------------HQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIG 481

Query: 231 CKNFE----GSEERQALFHGDHANNMAYIQQAANAA-ITGAIG 268
           CKN E      +E+  L +   A  +   QQAA+ + I+  IG
Sbjct: 482 CKNLEEDSVALKEQHPLMNLADAAAVRCQQQAASKSRISTHIG 524


>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
 gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
          Length = 587

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 70/159 (44%), Positives = 100/159 (62%), Gaps = 13/159 (8%)

Query: 77  PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
           P+P  A +P   + KP S  + S   V+ + N   ++K CNC  S+CLKLYC+CFA+G +
Sbjct: 330 PVPAPANKPNFEILKPPSSAA-STPAVDAQLNLASRRKHCNCSKSQCLKLYCDCFANGEF 388

Query: 137 CDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGK 196
           C  C C +C NN++NE  R +A+ + L+RNP+AF+PKI +   G             +  
Sbjct: 389 CKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASGD------------MRL 436

Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C+N E
Sbjct: 437 HNKGCNCKRSGCLKNYCECYEAKIPCSSMCKCVGCRNME 475


>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
           magnipapillata]
          Length = 530

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/174 (45%), Positives = 110/174 (63%), Gaps = 12/174 (6%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           + +S KS S++NV     +   +K CNC  S+CLKLYCECFA+G +C+ CNC  C NN+ 
Sbjct: 296 RIDSNKSASKANVGPITESKILKKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIA 355

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           +E  R +A+++ L+RNP AF PKI     G    K +T       +H KGC+C++SGCLK
Sbjct: 356 HEVERSKAIKSCLDRNPYAFHPKI-----GKGKIKGDTER-----RHTKGCNCRRSGCLK 405

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
            YCEC++A ILCS  C+C  CKNFE S +R+ L     A  +  +QQ  NAAI+
Sbjct: 406 NYCECYEAKILCSTLCKCSGCKNFEESADRKTLMQLADAAEVRVLQQ--NAAIS 457


>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
          Length = 448

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 69/134 (51%), Positives = 91/134 (67%), Gaps = 11/134 (8%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNC  S CLKLYC+CFA+G +C  CNC +CHNN++NEA R  A++A+LERNPNAF+P
Sbjct: 221 KKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKP 280

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI  +  G  D++           H KGCHCKKS CLK YCEC++A + C+E C+C  C+
Sbjct: 281 KIGVASRGRVDSER---------LHQKGCHCKKSNCLKNYCECYEAKVPCTERCKCSSCR 331

Query: 233 NFEGSEERQALFHG 246
           N E   +R + F G
Sbjct: 332 NTES--DRASKFRG 343


>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
           occidentalis]
          Length = 609

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 67/131 (51%), Positives = 87/131 (66%), Gaps = 11/131 (8%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K CNC  S+CLKLYC+CFA+G +C  CNCV C+NN+E+E  R  AV+  LERNPNAF PK
Sbjct: 393 KPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHPK 452

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           I     G ++            +H KGC+CK+SGCLK YCEC++A ILCS  CRC+ C N
Sbjct: 453 IGKYKPGDKER-----------RHTKGCNCKRSGCLKNYCECYEARILCSGVCRCVGCHN 501

Query: 234 FEGSEERQALF 244
            E + + ++L 
Sbjct: 502 IEENMDAESLL 512


>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
          Length = 999

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 87/155 (56%), Gaps = 33/155 (21%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K+  CNCK S+CLKLYCECFASG YCD  CNC++C N    E  R++A+ A LE+NPNAF
Sbjct: 563 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAF 622

Query: 171 RPKIASSPHGTRDNKEE--------------------------------TGEVLMLGKHN 198
           +PKI ++P  T  +                                   +  +L    H 
Sbjct: 623 KPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHK 682

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            GCHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 683 HGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 717



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 27/39 (69%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           C+CKKS CLK YCECF +   C E+C C+DC N   +EE
Sbjct: 567 CNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEE 605


>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
 gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
          Length = 944

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 73/186 (39%), Positives = 110/186 (59%), Gaps = 14/186 (7%)

Query: 50  QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
           Q +     P  Q  P  V P +P  PP      T+P   + K   P + + + V+     
Sbjct: 671 QRIRQQQLPTEQSAPTKVEPKLPTQPPGFNPVPTKPLFDILK--QPATGAAATVDALGGM 728

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
             ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+A
Sbjct: 729 TSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERERAIRSCLDRNPSA 788

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F+PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I C+  C+C+
Sbjct: 789 FQPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCV 836

Query: 230 DCKNFE 235
            C+N E
Sbjct: 837 GCRNME 842


>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 73/153 (47%), Positives = 88/153 (57%), Gaps = 31/153 (20%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K+  CNCK S+CLKLYCECFASG YCD  CNC++C N    E  R++A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAF 619

Query: 171 RPKIASSPH-------GTR----DNKEETGEVL-------------------MLGKHNKG 200
           +PKI ++         G R    D +  TG  L                       H  G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHG 679

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVSCGENCRCIDCKN 712



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 191 VLMLGKHNKG-CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           V+M+    K  C+CKKS CLK YCECF +   C E+C C+DC N   +E+
Sbjct: 553 VVMIPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAED 602


>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 73/153 (47%), Positives = 88/153 (57%), Gaps = 31/153 (20%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K+  CNCK S+CLKLYCECFASG YCD  CNC++C N    E  R++A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAF 619

Query: 171 RPKIASSPH-------GTR----DNKEETGEVLM-------------------LGKHNKG 200
           +PKI ++         G R    D +  TG  L                       H  G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHG 679

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 1/50 (2%)

Query: 191 VLMLGKHNKG-CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           V+M+    K  C+CKKS CLK YCECF +   C E+C C+DC N   +E+
Sbjct: 553 VVMIPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAED 602



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           NT   +  C+CK S C K YCECF +G+ C + C C++C N  
Sbjct: 672 NTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKNQT 714


>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
          Length = 356

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)

Query: 95  PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           P SR  S  + K +    T +K K C CK S+CLKLYC+CFASG+ C  C+CV+CHNN E
Sbjct: 43  PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
              AR  A+   L RNPNAF  K   S     DN+ +       G  ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQAN+LCS+NC+C++CKN   + +  A                           S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
            + SP V ++R+ +EL   ++   P                     L SIP   +  A  
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232

Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
           +    S K  Y   RSL+     + P DL +LCS+LV  +SE+A + A  +N    + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290

Query: 383 RAGTSVASSAQGR 395
           +    ++S ++ R
Sbjct: 291 KEAIELSSESESR 303


>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
           CCMP2712]
          Length = 118

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/124 (62%), Positives = 93/124 (75%), Gaps = 9/124 (7%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K+KQCNCK+S+CLKLYCECFASG YC+  CNC  C NN   E  R+ A++ATLERNP AF
Sbjct: 2   KRKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAF 61

Query: 171 RPKIASS-PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           +PKIASS P G  D+        + G+H KGCHCKKSGCLKKYCECFQA +LCS  C+C 
Sbjct: 62  QPKIASSAPPGIEDSP-------VQGRHTKGCHCKKSGCLKKYCECFQAGVLCSAQCKCH 114

Query: 230 DCKN 233
           +C+N
Sbjct: 115 ECRN 118



 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 28/43 (65%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
           K C+CK S CLK YCECF +   C+ +C C  C+N E  E+++
Sbjct: 4   KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQR 46


>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
 gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
           protein 54
 gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
          Length = 435

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 100/170 (58%), Gaps = 9/170 (5%)

Query: 97  SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
           S  R   + K   P ++K CNC  S+CLKLYC+CFA+G +C  CNC +CHNN+E ++ R 
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 217

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
           +A+  +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC+
Sbjct: 218 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 268

Query: 217 QANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGA 266
           +A + C++ C+C  C+N E     +    G   +N   +    NA+ T  
Sbjct: 269 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASSTAT 318


>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
 gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
 gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 368

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)

Query: 95  PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           P SR  S  + K +    T +K K C CK S+CLKLYC+CFASG+ C  C+CV+CHNN E
Sbjct: 43  PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
              AR  A+   L RNPNAF  K   S     DN+ +       G  ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQAN+LCS+NC+C++CKN   + +  A                           S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
            + SP V ++R+ +EL   ++   P                     L SIP   +  A  
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232

Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
           +    S K  Y   RSL+     + P DL +LCS+LV  +SE+A + A  +N    + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290

Query: 383 RAGTSVASSAQGR 395
           +    ++S ++ R
Sbjct: 291 KEAIELSSESESR 303


>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
          Length = 369

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 108/313 (34%), Positives = 154/313 (49%), Gaps = 64/313 (20%)

Query: 95  PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
           P SR  S  + K +    T +K K C CK S+CLKLYC+CFASG+ C  C+CV+CHNN E
Sbjct: 43  PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
              AR  A+   L RNPNAF  K   S     DN+ +       G  ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159

Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
           KYCECFQAN+LCS+NC+C++CKN   + +  A                           S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193

Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
            + SP V ++R+ +EL   ++   P                     L SIP   +  A  
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232

Query: 328 TAAVGSSKFTY---RSLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
           +    S K  Y   RSL+     + P DL +LCS+LV  +SE+A + A  +N    + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290

Query: 383 RAGTSVASSAQGR 395
           +    ++S ++ R
Sbjct: 291 KEAIELSSESESR 303


>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
          Length = 408

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 106/172 (61%), Gaps = 15/172 (8%)

Query: 89  VPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           +P P   +S+ +S   +  N P+K   CNCK+S+CLKLYC+CFA+G +C D CNC NC N
Sbjct: 156 IPPPSHQQSKPKSGANLASNKPRKP--CNCKNSQCLKLYCDCFANGEFCRDSCNCQNCKN 213

Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           + ++E  R  AV+A LERNPNAF+PK+    HG   ++          +H KGC+CKKS 
Sbjct: 214 SFQHEGDRSRAVKACLERNPNAFKPKVG---HGRVGDER---------RHIKGCNCKKSS 261

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           CLK YCEC++A I CS  CRC+ C+N     E + L    +A ++   QQ A
Sbjct: 262 CLKNYCECYEAKIPCSHLCRCVGCQNLADRPEGKGLMQLANAADLRTQQQKA 313


>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
 gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
          Length = 429

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 100/170 (58%), Gaps = 9/170 (5%)

Query: 97  SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
           S  R   + K   P ++K CNC  S+CLKLYC+CFA+G +C  CNC +CHNN+E ++ R 
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 211

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
           +A+  +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC+
Sbjct: 212 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 262

Query: 217 QANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGA 266
           +A + C++ C+C  C+N E     +    G   +N   +    NA+ T  
Sbjct: 263 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASSTAT 312


>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
          Length = 705

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 77/147 (52%), Positives = 101/147 (68%), Gaps = 11/147 (7%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+++E  R +AV+A L+RNP+AF P
Sbjct: 482 RKPCNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHP 541

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI    HG++ N+          +HNKGC+CK+SGCLK YCEC++A ILCS  C+C+ CK
Sbjct: 542 KIGKG-HGSQTNR----------RHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGCK 590

Query: 233 NFEGSEERQALFHGDHANNMAYIQQAA 259
           NFE S ER+ L H   A  +   QQ A
Sbjct: 591 NFEESPERKTLMHLADAAEVRVQQQTA 617


>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
 gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
          Length = 879

 Score =  149 bits (375), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 86/124 (69%), Gaps = 12/124 (9%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R EA+ + LERNP+AF+
Sbjct: 663 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFK 722

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI +           +G++ +   HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C
Sbjct: 723 PKITA---------PNSGDIRL---HNKGCNCKRSGCLKNYCECYEAKIPCSTMCKCVGC 770

Query: 232 KNFE 235
           +N E
Sbjct: 771 RNME 774


>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
          Length = 437

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 9/136 (6%)

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
           R   + K   P ++K CNC  S+CLKLYC+CFA+G +C  CNC +CHNN++ ++ R +A+
Sbjct: 161 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 220

Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
             +LERNPNAF+PKI  +  GT D         +   H KGCHCKKSGCLK YCEC++A 
Sbjct: 221 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 271

Query: 220 ILCSENCRCMDCKNFE 235
           + C++ C+C  C+N E
Sbjct: 272 VPCTDRCKCKGCQNTE 287


>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
 gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
 gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
 gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
 gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
          Length = 749

 Score =  148 bits (374), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+T + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
          Length = 793

 Score =  148 bits (373), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+T + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 620 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 668

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 669 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 701


>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
          Length = 821

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/157 (48%), Positives = 101/157 (64%), Gaps = 11/157 (7%)

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT 162
           V   E T + +K CNC  S+CLKLYC+CFA+G +C+ CNC NC NN+E+E  R +A++  
Sbjct: 584 VSAAETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTC 643

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
           L+RNP AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+C
Sbjct: 644 LDRNPEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMC 692

Query: 223 SENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           S  C+C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 693 SSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 729


>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
 gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
          Length = 433

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 66/137 (48%), Positives = 91/137 (66%), Gaps = 9/137 (6%)

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
           R   + K   P ++K CNC  S+CLKLYC+CFA+G +C  CNC +CHNN++ ++ R +A+
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 218

Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
             +LERNPNAF+PKI  +  GT D         +   H KGCHCKKSGCLK YCEC++A 
Sbjct: 219 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 269

Query: 220 ILCSENCRCMDCKNFEG 236
           + C++ C+C  C+N E 
Sbjct: 270 VPCTDRCKCKCCQNTEA 286


>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
          Length = 709

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 983

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 74/152 (48%), Positives = 91/152 (59%), Gaps = 24/152 (15%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K+  CNCK S+CLKLYCECFA+G YCD  CNC++C N +E    R+ AV + LE+NPNAF
Sbjct: 583 KKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAF 642

Query: 171 RPKIA------------------SSPHGTRDNKEET--GEVLMLGK---HNKGCHCKKSG 207
           +PKI                   S+  G    +  T  G     GK   H  GCHCKKS 
Sbjct: 643 KPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSA 702

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           C KKYCECFQA +LC +NCRC+DC+N E  +E
Sbjct: 703 CQKKYCECFQAGVLCGDNCRCIDCRNVEKKKE 734


>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
          Length = 950

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)

Query: 50  QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
           Q +     P  Q  P  V P +P  PP +      +P   V KP +  + +   V+    
Sbjct: 675 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 733

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
              ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           AF+PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I CS  C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 841

Query: 229 MDCKNFE 235
           + C+N E
Sbjct: 842 VGCRNME 848


>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
          Length = 807

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 100/153 (65%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E  P+ +K CNC  S+CLKLYC+CFA+G +C+ CNC NC NN+E+E  R +A++  L+RN
Sbjct: 574 ETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 633

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 634 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 682

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 683 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 715


>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
           protein of 120 kDa
 gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
          Length = 950

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)

Query: 50  QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
           Q +     P  Q  P  V P +P  PP +      +P   V KP +  + +   V+    
Sbjct: 675 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 733

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
              ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           AF+PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I CS  C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 841

Query: 229 MDCKNFE 235
           + C+N E
Sbjct: 842 VGCRNME 848


>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
 gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
          Length = 952

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)

Query: 50  QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
           Q +     P  Q  P  V P +P  PP +      +P   V KP +  + +   V+    
Sbjct: 677 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 735

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
              ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+
Sbjct: 736 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 795

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           AF+PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I CS  C+C
Sbjct: 796 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 843

Query: 229 MDCKNFE 235
           + C+N E
Sbjct: 844 VGCRNME 850


>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
          Length = 749

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
 gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
 gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
 gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
           domain-containing protein 1
 gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
 gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
 gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
 gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
 gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
 gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
 gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
           gorilla]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
 gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
          Length = 950

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 111/187 (59%), Gaps = 15/187 (8%)

Query: 50  QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
           Q +     P  Q  P  V P +P  P  L   A  +P   + KP  P + + + V+    
Sbjct: 676 QRIRQQQLPTEQNAPTKVEPKLPTLPTGLKANAPAKPLFDILKP--PATGAAAAVDALGG 733

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
              ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           AF+PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I C+  C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKC 841

Query: 229 MDCKNFE 235
           + C+N E
Sbjct: 842 VGCRNME 848


>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 902

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 71/138 (51%), Positives = 92/138 (66%), Gaps = 14/138 (10%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K+C+CK+S+CLK+YCECFA+ + C+GC C  C NN  N      A   TLERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           I   P  ++ N +     +++ KH+KGCHC+KS CLKKYCECFQA+I C+ENC+C DCKN
Sbjct: 420 IK--PSNSKINNQ-----IIIDKHSKGCHCRKSSCLKKYCECFQASIPCNENCKCYDCKN 472

Query: 234 FEGSEERQALFHGDHANN 251
                  QA  H  H +N
Sbjct: 473 -------QAHHHLHHPDN 483


>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
 gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
 gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
          Length = 748

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 575 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 623

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 624 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656


>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
 gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
 gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
          Length = 748

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 515 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 574

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 575 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 623

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 624 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656


>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
          Length = 407

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 10/136 (7%)

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAV 159
           R   + K   P ++K CNC  S+CLKLYC+CFA+G +C  CNC +CHNN++ ++ R +A+
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 193

Query: 160 EATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
             +LERNPNAF+PKI  +  GT D         +   H KGCHCKKSGCLK YCEC++A 
Sbjct: 194 RQSLERNPNAFKPKIGIA-RGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 243

Query: 220 ILCSENCRCMDCKNFE 235
           + C++ C+C  C+N E
Sbjct: 244 VPCTDRCKCKGCQNTE 259


>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
          Length = 748

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
          Length = 1772

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 87/133 (65%), Gaps = 17/133 (12%)

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLE 164
           K +  K+ K CNC++SRCLKLYC+CFA+G +C+  C CV+CHN+  +   R +A++ATLE
Sbjct: 323 KASATKRFKPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAIKATLE 382

Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
           +NP AFR K+         + E T        H  GCHCKK+ CLKKYCECF+A I C  
Sbjct: 383 KNPKAFRAKV---------DAEAT-------THQNGCHCKKTKCLKKYCECFEAGITCGA 426

Query: 225 NCRCMDCKNFEGS 237
            C+C DC+N+ GS
Sbjct: 427 KCKCADCENYPGS 439



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 100 RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           R+ V+ +  T   Q  C+CK ++CLK YCECF +GI C   C C +C N
Sbjct: 389 RAKVDAEATT--HQNGCHCKKTKCLKKYCECFEAGITCGAKCKCADCEN 435


>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
          Length = 749

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
          Length = 749

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
 gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
          Length = 980

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 88/124 (70%), Gaps = 12/124 (9%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + LERNP+AF+
Sbjct: 765 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFK 824

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C
Sbjct: 825 PKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSAMCKCVGC 872

Query: 232 KNFE 235
           +N E
Sbjct: 873 RNME 876



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 4/49 (8%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN---FEGSEER 240
           G   K C+C KS CLK YC+CF     C ++C C DC N   +E   ER
Sbjct: 763 GSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKRER 810


>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
          Length = 749

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R+ A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
 gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
          Length = 749

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
 gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
          Length = 749

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
          Length = 775

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 79/172 (45%), Positives = 106/172 (61%), Gaps = 12/172 (6%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E T + +K CNC  S+CLKLYC+CFA+G +C+ CNC NC NN+E+E  R +A++  L+RN
Sbjct: 542 ETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRN 601

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 602 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 650

Query: 227 RCMDCKNFEGSEERQALFH-GDHANNMAYIQQAANAAITGAIGSSGYASPPV 277
           +C+ CKNFE S ER+ L H  D A      Q AA   ++  I      + PV
Sbjct: 651 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTAAKTKLSSQISDLLMRTTPV 702


>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
           familiaris]
          Length = 749

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
          Length = 749

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
 gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
 gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
          Length = 771

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 78/161 (48%), Positives = 105/161 (65%), Gaps = 11/161 (6%)

Query: 99  SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
           S + +   E   + +K CNC  S+CLKLYC+CFA+G +C+ CNCVNC NN+++E+ R +A
Sbjct: 531 SLNGLSTSEANSRPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKA 590

Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
           ++A L+RNP AF+PKI     G  D            +H+KGC+CKKSGCLK YCEC++A
Sbjct: 591 IKACLDRNPVAFKPKIGKGKEGESDR-----------RHSKGCNCKKSGCLKNYCECYEA 639

Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
            I+CS  C+CM CKNFE S ER+ L H   A  +   QQ A
Sbjct: 640 KIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680


>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
          Length = 813

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 580 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 639

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 640 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 688

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 689 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 721


>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
 gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
 gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
          Length = 741

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 508 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 567

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 568 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 616

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 617 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 649


>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
          Length = 827

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 594 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 653

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 654 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 702

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 703 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 735


>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
 gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
          Length = 978

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 63/143 (44%), Positives = 94/143 (65%), Gaps = 12/143 (8%)

Query: 93  ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE 152
           +SP+  S    +       ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 804

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
             R  A+ + L+RNP+AF+PKI ++P+        +G++ +   HNKGC+CK+SGCLK Y
Sbjct: 805 VERERAIRSCLDRNPSAFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNY 852

Query: 213 CECFQANILCSENCRCMDCKNFE 235
           CEC++A I CS  C+C+ C+N E
Sbjct: 853 CECYEAKIPCSSICKCVGCRNME 875


>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
 gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
          Length = 921

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 59/124 (47%), Positives = 88/124 (70%), Gaps = 12/124 (9%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+AF+
Sbjct: 706 RRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFK 765

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI ++P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C
Sbjct: 766 PKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGC 813

Query: 232 KNFE 235
           +N E
Sbjct: 814 RNME 817


>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
          Length = 763

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 530 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 589

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 590 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 638

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 639 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 671


>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
 gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
           gorilla]
 gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
          Length = 660

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 660

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
          Length = 659

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 426 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 485

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 486 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 534

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 535 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 567


>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
          Length = 660

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
          Length = 660

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
          Length = 660

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
          Length = 515

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+T + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 282 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 341

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 342 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 390

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 391 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 423


>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
          Length = 621

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 388 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 447

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 448 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 496

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 497 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 529


>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
          Length = 660

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
           familiaris]
          Length = 660

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
          Length = 660

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 427 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 486

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 487 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 535

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 536 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
 gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
          Length = 725

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 65/137 (47%), Positives = 88/137 (64%), Gaps = 18/137 (13%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           K++ CNC  S+CLKLYC+CFA+G YC  CNC +C N  +++  R++A+ +TLERNPNAF+
Sbjct: 454 KKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFK 513

Query: 172 PK---IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           PK   I S+  GTR              H KGC+CK+SGCLK YCEC++  I CS NC+C
Sbjct: 514 PKIGSIGSTDDGTR-------------LHTKGCNCKRSGCLKNYCECYEGKIACSSNCKC 560

Query: 229 MDCKNFEGSEERQALFH 245
           +   +F   +    LFH
Sbjct: 561 VGTPSFHLLQSH--LFH 575


>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
 gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
          Length = 949

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 103/178 (57%), Gaps = 12/178 (6%)

Query: 58  PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCN 117
           P  Q  P  V P +P  PP         P+      +    +   V+       ++K CN
Sbjct: 682 PTEQSSPIKVEPKLPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCN 741

Query: 118 CKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASS 177
           C  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+AF+PKI ++
Sbjct: 742 CSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKI-TA 800

Query: 178 PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 801 PN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 847


>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Saccoglossus kowalevskii]
          Length = 946

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/158 (48%), Positives = 103/158 (65%), Gaps = 11/158 (6%)

Query: 102 NVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEA 161
           +V+V+    + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+E+E+ R +A+++
Sbjct: 491 SVQVEAPGTRPRKPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKS 550

Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
            LERNP AF PKI     G  D            +HNKGC+CK+SGCLK YCEC++A I+
Sbjct: 551 CLERNPLAFHPKIGKGKEGQADR-----------RHNKGCNCKRSGCLKNYCECYEAKIM 599

Query: 222 CSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           CS  C+C+ CKNFE S +R+ L H   A  +   QQ A
Sbjct: 600 CSSICKCVGCKNFEESPDRKTLMHLADAAEVRVQQQTA 637


>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
          Length = 804

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 571 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 630

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 631 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 679

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 680 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 712


>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 560

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 327 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 386

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 387 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 435

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 436 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 468


>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
          Length = 751

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 518 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 577

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 578 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 626

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 627 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 659


>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
          Length = 756

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC NN+E+E  R++A++A L+RN
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 582

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 583 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 631

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 632 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 664


>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
 gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
          Length = 469

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/172 (43%), Positives = 102/172 (59%), Gaps = 21/172 (12%)

Query: 69  PPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRC 123
           P   P  PPL  +A Q  V+       ++R  + V   +  PKK     +K C+C  S C
Sbjct: 196 PQTGPLSPPLAYRAAQASVQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMC 248

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           LKLYC+CFA+G +C+ C+C +C N +E E  R  A+  +LERNPNAF+PKI     G   
Sbjct: 249 LKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVAT 303

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           N++   E L    H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E
Sbjct: 304 NRQVEPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTE 351



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
           QK C+CK S CLK YCEC+ + + C D C C+ C N   + AAR
Sbjct: 314 QKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 357


>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
          Length = 284

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 70/139 (50%), Positives = 99/139 (71%), Gaps = 14/139 (10%)

Query: 105 VKENTPKKQ---KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
           ++EN+  K    K C C  S+CLK+YC+CF SG+YC   C CV C+NN +++  R  A++
Sbjct: 1   MEENSSTKTDVVKGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIK 60

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
           + LE+NP+AF+PK+A+     RD    +G +++  KHNKGC+C+K+ CLKKYCECFQ+ +
Sbjct: 61  SVLEKNPDAFQPKVAT----FRD----SGGLIV--KHNKGCNCRKTACLKKYCECFQSGV 110

Query: 221 LCSENCRCMDCKNFEGSEE 239
           LCSE C+C  CKNFEGS E
Sbjct: 111 LCSELCKCSGCKNFEGSLE 129


>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
 gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
          Length = 950

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 104/178 (58%), Gaps = 12/178 (6%)

Query: 58  PPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCN 117
           P  Q  P  V P +P  PP +       P+      +    +   V+       ++K CN
Sbjct: 683 PTEQSSPIKVEPKLPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCN 742

Query: 118 CKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASS 177
           C  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+AF+PKI ++
Sbjct: 743 CSKSQCLKLYCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKI-TA 801

Query: 178 PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           P+        +G++ +   HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 802 PN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848


>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
          Length = 752

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 519 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 578

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 579 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 627

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 628 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 660


>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
           carolinensis]
          Length = 749

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 516 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRN 575

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
          Length = 756

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 523 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 582

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 583 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 631

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 632 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 664


>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
          Length = 495

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 111/192 (57%), Gaps = 22/192 (11%)

Query: 73  PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRCLKLY 127
           P  PPL  +A Q  ++       ++R  + V   +  PKK     +K C+C  S CLKLY
Sbjct: 226 PLSPPLAYRAAQASIQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMCLKLY 278

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           C+CFA+G +C+ C+C +C N +E E  R  A+  +LERNPNAF+PKI     G   N++ 
Sbjct: 279 CDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQV 333

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
             E L    H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E S+    L +  
Sbjct: 334 EPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRNTE-SDRAARLANTK 388

Query: 248 HANNMAYIQQAA 259
            A  +  I+ AA
Sbjct: 389 SATALTDIRSAA 400


>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
          Length = 460

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 63/125 (50%), Positives = 84/125 (67%), Gaps = 11/125 (8%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K ++ CNC  S+CLKLYCECFA+G +C  C C NC NN E+   R +AV+A L+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           RPKI S   G            + G H KGC+CK+SGCLK YCEC++A I+C+  C+C+ 
Sbjct: 322 RPKIGSRKQGN-----------VKGCHTKGCNCKRSGCLKNYCECYEAKIMCTSTCKCVG 370

Query: 231 CKNFE 235
           C+N++
Sbjct: 371 CRNYD 375


>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
 gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
 gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
 gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
 gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
 gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
 gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
 gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
 gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
 gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
 gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
 gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
          Length = 528

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
          Length = 757

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 524 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 583

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 584 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 632

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 633 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 665


>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 521

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 288 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 347

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 348 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 396

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 397 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 429


>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
          Length = 528

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
          Length = 527

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 294 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 353

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 354 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 402

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 403 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 435


>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
          Length = 528

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
          Length = 528

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 528

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
          Length = 528

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
          Length = 452

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 111/192 (57%), Gaps = 22/192 (11%)

Query: 73  PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ-----KQCNCKHSRCLKLY 127
           P  PPL  +A Q  ++       ++R  + V   +  PKK+     K C+C  S CLKLY
Sbjct: 183 PLSPPLAYRAAQASIQ-------RNRVFTPVLAAKLAPKKRPLGSKKPCHCTRSMCLKLY 235

Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           C+CFA+G +C+ C+C +C N +E E  R  A+  +LERNPNAF+PKI     G   N++ 
Sbjct: 236 CDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQV 290

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
             E L    H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E S+    L +  
Sbjct: 291 EPERL----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRNTE-SDRAARLANTK 345

Query: 248 HANNMAYIQQAA 259
            A  +  I+ AA
Sbjct: 346 SATALTDIRSAA 357


>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
           familiaris]
 gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
          Length = 528

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
           garnettii]
          Length = 544

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 68/146 (46%), Positives = 92/146 (63%), Gaps = 11/146 (7%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K C C  S CLKL C+CFA+G + + CNC + +N++E+E  R++A++A L+RNP AF+PK
Sbjct: 345 KPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPK 404

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                  T   KEE  +      H+KGC+CK SGCLK YCEC++A I+CS  C+C  CKN
Sbjct: 405 -------TGKGKEEDSD----XHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453

Query: 234 FEGSEERQALFHGDHANNMAYIQQAA 259
           FE   ER+ L H   A  +   QQ A
Sbjct: 454 FEEXPERKTLMHLAKAAKVRVQQQTA 479


>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
           both male and female reproductiv [Ectocarpus
           siliculosus]
          Length = 735

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 77/122 (63%), Gaps = 14/122 (11%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
           CNCK S+CLKLYCECF    YC+GCNCV C N    E  R+ A++ T+ERNP AF  K  
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKFE 499

Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
                 R  K           HN GC+CKKS CLKKYCECFQA + C  NC+C++CKN+E
Sbjct: 500 R-----RAGKRS---------HNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKNYE 545

Query: 236 GS 237
           G+
Sbjct: 546 GA 547


>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 91/122 (74%), Gaps = 9/122 (7%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK+S+C+KLYCEC+ + ++CD  C+C NC NN  N   R++A++ TLE+NP+AF+P
Sbjct: 16  KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQP 75

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI +S        +   + L  GKHNKGC CKKSGC+KKYCECFQA + CS+ C+C++C+
Sbjct: 76  KITTS--------DSKPDPLNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECR 127

Query: 233 NF 234
           N+
Sbjct: 128 NY 129



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           EE+ +V   G  +K C+CK S C+K YCEC+Q  + C  NC C +C N
Sbjct: 4   EESDQVFEEGP-SKKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFN 50



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           N  K  K C CK S C+K YCECF + + C D C C+ C N
Sbjct: 88  NFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 128


>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
          Length = 597

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 68/129 (52%), Positives = 85/129 (65%), Gaps = 13/129 (10%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K C+CK+S+CLKLYCECFA+ + C+GC+C  CHNN  N    + +    LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K           KEET +     KH KGCHC+KS CLKKYCECFQA I C E+C+C+DC+
Sbjct: 401 KFK--------QKEETKQ----HKHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448

Query: 233 NF-EGSEER 240
           N  E  +ER
Sbjct: 449 NTGEDIKER 457



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 8/86 (9%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCH 146
           R P+  +PK + +   E K++  K  K C+C+ S CLK YCECF +GI C + C C++C 
Sbjct: 393 RNPEAFNPKFKQKE--ETKQH--KHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448

Query: 147 NNVENEAARREAVEATL---ERNPNA 169
           N  E+   R +  E  L   ER P A
Sbjct: 449 NTGEDIKERSKPKEQPLKKEERKPTA 474


>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1852

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 19/162 (11%)

Query: 78   LPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC 137
             P    + P+ VP  E+P      N     +TP+ +  CNCK S+CLKLYCECF++ + C
Sbjct: 1481 FPSSIVKTPLAVPGKENP------NGITPMSTPRNR--CNCKKSQCLKLYCECFSAEVMC 1532

Query: 138  DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKH 197
            +GCNC +C N     A R +A+   L +N +AF+P+IA++P G  D          L  H
Sbjct: 1533 EGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAP-GAAD----------LQGH 1581

Query: 198  NKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
            + GC CKKS CLKKYCECFQA +LC   C+C  CKN  GS++
Sbjct: 1582 SMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKNIAGSQK 1623



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 48/104 (46%), Gaps = 2/104 (1%)

Query: 170  FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
            F   I  +P      +   G   M    N+ C+CKKS CLK YCECF A ++C E C C 
Sbjct: 1481 FPSSIVKTPLAVPGKENPNGITPMSTPRNR-CNCKKSQCLKLYCECFSAEVMC-EGCNCT 1538

Query: 230  DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYA 273
            DC+N       +A    D  N  A+  +   AA  GA    G++
Sbjct: 1539 DCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAPGAADLQGHS 1582


>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
          Length = 802

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 648

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 649 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 698

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 699 YEAKIPCSANCKCIGCRNVE 718


>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
          Length = 829

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 592 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 651

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 652 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 701

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 702 YEAKIPCSANCKCIGCRNVE 721


>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
          Length = 835

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 598 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 657

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 658 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 707

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 708 YEAKIPCSANCKCIGCRNVE 727


>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
          Length = 835

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 598 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 657

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 658 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 707

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 708 YEAKIPCSANCKCIGCRNVE 727


>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
          Length = 804

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 590 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 649

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 650 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 699

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 700 YEAKIPCSANCKCIGCRNVE 719


>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
          Length = 667

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+++E  R++A++A L+RN
Sbjct: 434 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRN 493

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 494 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 542

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 543 KCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 575


>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
          Length = 826

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 648

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 649 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 698

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 699 YEAKIPCSANCKCIGCRNVE 718


>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
          Length = 829

 Score =  142 bits (357), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 592 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 651

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 652 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 701

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 702 YEAKIPCSANCKCIGCRNVE 721


>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 235

 Score =  142 bits (357), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 96/214 (44%), Positives = 114/214 (53%), Gaps = 45/214 (21%)

Query: 23  PAKKLARQLDFTGGFVGSVI-----------------LP---EHPQSQAVTANSHPP--- 59
           P KKL RQLDF    +                     LP   +HPQ          P   
Sbjct: 25  PIKKLVRQLDFNSAALAGNPAMAAAAAAVSRALQPRALPAGFQHPQHVRAAVPMVVPQQL 84

Query: 60  -----PQPQPKPVVPPVPPPPPPLPGQATQPPVR-------------VP-KPESPKSRSR 100
                P  +P PVV  VP    PL      P +R             VP KP+SPK  +R
Sbjct: 85  HQRALPVMRPHPVVGHVPRHAVPL--AVPVPQLRPMPPQPVQRPPVAVPLKPDSPKPWAR 142

Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
              E K++TP K+K CNC++SRCLKLYCECFASG +CDGCNC NC NN ENE ARREA+E
Sbjct: 143 L-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARREAIE 201

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLML 194
           ATLERNP+AFRPKI SSPH  R+N     ++L +
Sbjct: 202 ATLERNPDAFRPKIGSSPHTNRNNVCSVVQILFV 235



 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           K C+C+ S CLK YCECF +   C + C C +C N
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFN 188


>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
          Length = 827

 Score =  142 bits (357), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 74/140 (52%), Positives = 96/140 (68%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           KS  RS   V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 590 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 649

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 650 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 699

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 700 YEAKIPCSANCKCIGCRNVE 719


>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
          Length = 495

 Score =  142 bits (357), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 262 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 321

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 322 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 370

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 371 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 403


>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 444

 Score =  142 bits (357), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 67/119 (56%), Positives = 81/119 (68%), Gaps = 5/119 (4%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYCECFASG YC   C C  C NN +N    + A E TLERNP AF PKI
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           + S         E G+VL +  H++GC+CK+S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 378 SESTAAV----TEEGQVLRVAAHHRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432


>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
          Length = 494

 Score =  141 bits (356), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 261 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 320

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 321 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 369

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 370 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 402


>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
 gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
          Length = 458

 Score =  141 bits (356), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 117/201 (58%), Gaps = 14/201 (6%)

Query: 95  PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAA 154
           P+ +    +  ++   + +K CNC  S+CLKLYC+CFA+G +C+ CNC NC NN+E+EA 
Sbjct: 217 PRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHEAE 276

Query: 155 RREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
           R ++++A LERNP AF PKI     G  D            +HNKGC+CK+SGCLK YCE
Sbjct: 277 RAKSIKACLERNPYAFHPKIGKGKEGEGDR-----------RHNKGCNCKRSGCLKNYCE 325

Query: 215 CFQANILCSENCRCMDCKNFEGSEERQALFH-GDHANNMAYIQQAANAAITGAIGSSGYA 273
           C++A I+CS  C+C+ CKNFE S ER+ L H  D A      Q AA   ++  I      
Sbjct: 326 CYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTAAKTKLSSQIQDIPAR 385

Query: 274 SPPVSKKRKGQELFFGSTAKD 294
            P V+    G+ L F    +D
Sbjct: 386 PPAVNT--AGERLPFSFVTRD 404


>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
 gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
          Length = 818

 Score =  141 bits (356), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 75/139 (53%), Positives = 96/139 (69%), Gaps = 12/139 (8%)

Query: 97  SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
           SRS +N +     P+K   CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R+
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQ 640

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
            A+++ LERNPNAFRPKI            ETGE +   +HNKGC+CK+SGCLK YCEC+
Sbjct: 641 RAIKSCLERNPNAFRPKIGKG--------RETGEDIR--RHNKGCNCKRSGCLKNYCECY 690

Query: 217 QANILCSENCRCMDCKNFE 235
           +A I CSENC+C+ C+N E
Sbjct: 691 EAKIPCSENCKCIGCRNVE 709


>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
 gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
 gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
          Length = 419

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+T + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 186 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 246 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 295 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 327


>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
          Length = 1001

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 68/133 (51%), Positives = 83/133 (62%), Gaps = 2/133 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+GIYC +GC CV C N  E +    E  +    RNP AF P
Sbjct: 567 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 626

Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           KI    +G+  N  E G       +H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  C
Sbjct: 627 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGC 686

Query: 232 KNFEGSEERQALF 244
           KN  G +E    F
Sbjct: 687 KNVYGRKEEYGAF 699



 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+ + + C  GC C  C N
Sbjct: 651 RHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKN 688


>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
          Length = 614

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 75/149 (50%), Positives = 98/149 (65%), Gaps = 10/149 (6%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           + +K CNC  S+CLKLYC+CFA+G +C  CNCVNC NN+E+E  R  +++  LERNP AF
Sbjct: 391 RPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAF 450

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
            PKI  S  G  D          + +HNKGC+CK+SGCLK YCEC++A ILC+  C+C+ 
Sbjct: 451 HPKIGKSKAGQGD----------IRRHNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVG 500

Query: 231 CKNFEGSEERQALFHGDHANNMAYIQQAA 259
           CKNFE S +R+ L H   A  +   QQ A
Sbjct: 501 CKNFEESPDRKTLMHLADAAEVRVQQQTA 529


>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
          Length = 438

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 205 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 264

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 265 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 313

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 314 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 346


>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
 gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
          Length = 369

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 88/195 (45%), Positives = 119/195 (61%), Gaps = 11/195 (5%)

Query: 76  PPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGI 135
           P LP  AT   V +    S K  S    E K+ T  ++  C+CK+S+CLKLYCECF  G 
Sbjct: 137 PLLPRPATFIDVML---SSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCECFEKGR 193

Query: 136 YCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG 195
           YC GCNC NC NNV +E AR++A+   LERNP AF PK+ +S     ++K   G++  +G
Sbjct: 194 YCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNSTAHNCESKAAEGDI--VG 251

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
           KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R +        N  + 
Sbjct: 252 KHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRIS------QKNTVHA 305

Query: 256 QQAANAAITGAIGSS 270
               N A +G +G S
Sbjct: 306 HHVQNPASSGMVGQS 320


>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
          Length = 1094

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 68/133 (51%), Positives = 83/133 (62%), Gaps = 2/133 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+GIYC +GC CV C N  E +    E  +    RNP AF P
Sbjct: 574 KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAP 633

Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           KI    +G+  N  E G       +H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  C
Sbjct: 634 KIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGC 693

Query: 232 KNFEGSEERQALF 244
           KN  G +E    F
Sbjct: 694 KNVYGRKEEYGAF 706



 Score = 45.4 bits (106), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+ + + C  GC C  C N
Sbjct: 658 RHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKN 695


>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
          Length = 391

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/163 (42%), Positives = 97/163 (59%), Gaps = 16/163 (9%)

Query: 73  PPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFA 132
           P  PPL  +A Q  V+  +  +P       V   +     +K C+C  S CLKLYC+CFA
Sbjct: 201 PLSPPLAYRAAQASVQRNRVFTP-------VLAAKRPLGSKKPCHCTRSMCLKLYCDCFA 253

Query: 133 SGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL 192
           +G +C+ C+C +C N +E E  R  A+  +LERNPNAF+PKI     G   N++   E L
Sbjct: 254 NGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKI-----GVATNRQVEPERL 308

Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
               H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N E
Sbjct: 309 ----HQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTE 347



 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
           QK C+CK S CLK YCEC+ + + C D C C+ C N   + AAR
Sbjct: 310 QKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 353


>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
          Length = 419

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 186 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 246 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 295 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 327


>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1174

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 72/177 (40%), Positives = 99/177 (55%), Gaps = 31/177 (17%)

Query: 76  PPLPGQATQPPVRV---PKPESPKSRSR-SNVEV----------KENTPKKQKQCNCKHS 121
           P  PG    PPV +   P P  P S SR +  E           KEN+ + +  C CK+S
Sbjct: 809 PSNPGSLAHPPVMMAVAPTPTHPSSVSRMTTFETPAVAKRGGGGKENS-ENESTCKCKNS 867

Query: 122 RCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 181
           +CLKLYC+CF++  YC GC C++C N    E+ R +A+  T  RNPNAF+ K+ +S    
Sbjct: 868 KCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNAS---- 923

Query: 182 RDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
                       + +H  GC CKKS CLKKYCECF   ++C E+C+C +CKN+ GS+
Sbjct: 924 ------------VTEHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQ 968



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 26/39 (66%), Gaps = 1/39 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA 153
           C CK S CLK YCECFA+ + C + C C NC N V ++A
Sbjct: 931 CKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQA 969


>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
          Length = 528

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/153 (48%), Positives = 101/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+C A+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
          Length = 209

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 67/137 (48%), Positives = 91/137 (66%), Gaps = 12/137 (8%)

Query: 97  SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
           S++++   ++ N  K +K CNC  S+CLKLYC+CFA+G +C GCNCV C NN+E+E  R 
Sbjct: 6   SQAKAQSVLESNGLKPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRL 65

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
            A+ + L+RNP+AF+PKI              G      +HNKGCHCK+SGCLK YCEC+
Sbjct: 66  RAIRSCLDRNPHAFKPKIG------------VGWGPEPRRHNKGCHCKRSGCLKNYCECY 113

Query: 217 QANILCSENCRCMDCKN 233
           +A I CS  C+C+ CKN
Sbjct: 114 EAKIACSAICKCIGCKN 130



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 7/64 (10%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAA------RREAVEATL 163
           +  K C+CK S CLK YCEC+ + I C   C C+ C N V+   +      +R +  A L
Sbjct: 93  RHNKGCHCKRSGCLKNYCECYEAKIACSAICKCIGCKNCVDPAGSPPGPGEKRLSKMAAL 152

Query: 164 ERNP 167
           + NP
Sbjct: 153 QTNP 156


>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
          Length = 822

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 92/131 (70%), Gaps = 10/131 (7%)

Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLE 164
           V+ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R+ A+++ LE
Sbjct: 594 VEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLE 653

Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
           RNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC++A I CS 
Sbjct: 654 RNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCECYEAKIPCSA 703

Query: 225 NCRCMDCKNFE 235
           NC+C+ C+N E
Sbjct: 704 NCKCIGCRNIE 714



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 47/97 (48%), Gaps = 20/97 (20%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAV--- 159
           E  ++  +  K CNCK S CLK YCEC+ + I C   C C+ C  N+E+    ++++   
Sbjct: 668 ETGDDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR-NIEDPILEKKSLKDL 726

Query: 160 --------------EATLERNPNAFRPKIASSPHGTR 182
                         +A L+ +  AFRP  A S  GTR
Sbjct: 727 TETTEVRTTQLSLNKAQLQLSEMAFRPP-AVSNTGTR 762


>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
          Length = 348

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 115 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 174

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 175 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 223

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 224 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 256


>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
          Length = 305

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 63/134 (47%), Positives = 82/134 (61%), Gaps = 13/134 (9%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
           P     CNC+ SRCLKLYCECFAS ++C+GCNCVNC NN E+   R E  +A ++R+P+ 
Sbjct: 183 PNHSHTCNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHV 242

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F P +      TR N              +GC C+K+ C+KKYC+CFQ  I C  NCRC+
Sbjct: 243 FEPHVVWDTPATRTNL-------------RGCRCQKTRCVKKYCDCFQNGICCGPNCRCV 289

Query: 230 DCKNFEGSEERQAL 243
           DCKN E +   +AL
Sbjct: 290 DCKNTEEAALNRAL 303


>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
          Length = 859

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/223 (41%), Positives = 126/223 (56%), Gaps = 21/223 (9%)

Query: 15  QSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQ-SQAVTANSHPPPQPQPKPVVPPVPP 73
           QS++   IPA  L++         G  +L  +P     V   +    Q QP   V    P
Sbjct: 546 QSKNAVAIPASALSQLAS------GQAVLSTNPNVGNIVVLPAQYIQQQQPIDEVKLKSP 599

Query: 74  PPPPLPG--QATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECF 131
             P L G  Q++   + V   ++ +    SNVE   N  + +K CNC  S+CLKLYC+CF
Sbjct: 600 TTPGLLGSSQSSTATLCVIDGKNSQRGVYSNVE--PNGIRPRKPCNCTKSQCLKLYCDCF 657

Query: 132 ASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEV 191
           A+G +C  CNC NC NN+ NE  R+ A+++ LERNPNAFRPKI            ETGE 
Sbjct: 658 ANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKG--------RETGED 709

Query: 192 LMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           +   +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N 
Sbjct: 710 IR--RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCRNI 750


>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
          Length = 825

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/140 (50%), Positives = 95/140 (67%), Gaps = 10/140 (7%)

Query: 96  KSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR 155
           K+  R    ++ N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R
Sbjct: 588 KNSQRGYGNIEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEER 647

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
           + A+++ LERNPNAFRPKI            ETG+ +   +HNKGC+CK+SGCLK YCEC
Sbjct: 648 QRAIKSCLERNPNAFRPKIGKG--------RETGDDIR--RHNKGCNCKRSGCLKNYCEC 697

Query: 216 FQANILCSENCRCMDCKNFE 235
           ++A I CS NC+C+ C+N E
Sbjct: 698 YEAKIPCSANCKCIGCRNIE 717


>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
          Length = 834

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/134 (54%), Positives = 93/134 (69%), Gaps = 12/134 (8%)

Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVE 160
           SNVE   N  + +K CNC  S+CLKLYC+CFA+G +C  CNC NC NN+ NE  R+ A++
Sbjct: 604 SNVE--PNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIK 661

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
           + LERNPNAFRPKI            ETGE +   +HNKGC+CK+SGCLK YCEC++A I
Sbjct: 662 SCLERNPNAFRPKIGKG--------RETGEDIR--RHNKGCNCKRSGCLKNYCECYEAKI 711

Query: 221 LCSENCRCMDCKNF 234
            CS NC+C+ C+N 
Sbjct: 712 PCSGNCKCIGCRNI 725


>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1223

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 90/156 (57%), Gaps = 9/156 (5%)

Query: 93  ESPKSRSRSNVEVKENTPKKQ--------KQCNCKHSRCLKLYCECFASGIYCDG-CNCV 143
           +SP+S     +  +++T   Q        K+CNCK S+CLKLYCECFA+  +C G C C 
Sbjct: 708 DSPQSSKNRRLSRRKSTASSQGEKPGDGCKRCNCKKSKCLKLYCECFAAREFCVGSCACR 767

Query: 144 NCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHC 203
           NC N  E EA      +    RNP AF PKI  +P  +    +E  +     +H +GC+C
Sbjct: 768 NCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPGDEALDTPASARHKRGCNC 827

Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           KKS CLKKYCEC+QA + CSE CRC  C+N  G +E
Sbjct: 828 KKSLCLKKYCECYQAGVGCSEGCRCEGCRNMYGRKE 863


>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 542

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 83/239 (34%), Positives = 118/239 (49%), Gaps = 29/239 (12%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K++K C CK+S CLKLYCECFA+G  C  C+C NC N+ ++E    EA  A L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
            PK+ + P       +E G + +  KH KGC+CK+SGC K YCECF A + CS  C+C D
Sbjct: 288 EPKMTAVP-------KEDGALAI--KHQKGCNCKRSGCQKNYCECFHAGVFCSNICKCND 338

Query: 231 CKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIG-----SSGY-----ASPPVSKK 280
           C N E         + + + N+++I   +   I GA+      ++GY        P  + 
Sbjct: 339 CGNREDD-------NSEKSTNLSHILSDSGIGIRGALSPLRGETNGYQHLLFMESPRKRW 391

Query: 281 RKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPS---STLSSIPVARASSTAAVGSSKF 336
           R+     +   +       L HF    + R + S    TLS + V  A     +    F
Sbjct: 392 RESSSSSYSDQSSFRKSRSLTHFDSAFYERKTWSVNADTLSLLKVVEAEKKKLMEEESF 450


>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
           [Cucumis sativus]
          Length = 581

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 1/132 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK SRCLKLYCECFA+G+YC + C+C +C N   +EA   E       RNP AF P
Sbjct: 287 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 346

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     + ++++ +     +H +GC+CKKSGCLKKYCEC+Q  + CS NCRC  CK
Sbjct: 347 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 406

Query: 233 NFEGSEERQALF 244
           N  G ++  AL 
Sbjct: 407 NAFGRKDESALL 418


>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
 gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
          Length = 602

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 1/132 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK SRCLKLYCECFA+G+YC + C+C +C N   +EA   E       RNP AF P
Sbjct: 308 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 367

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     + ++++ +     +H +GC+CKKSGCLKKYCEC+Q  + CS NCRC  CK
Sbjct: 368 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 427

Query: 233 NFEGSEERQALF 244
           N  G ++  AL 
Sbjct: 428 NAFGRKDESALL 439


>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
 gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
          Length = 873

 Score =  135 bits (340), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 88/253 (34%), Positives = 125/253 (49%), Gaps = 25/253 (9%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK ++CLKLYC+CFA+GIYC D C C +C N  E E    E  +    RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512

Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           KI         ++E+    +  L +H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  C
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGC 572

Query: 232 KNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGST 291
           KN  G +E   +     ++ +   +         AI ++     P        EL+    
Sbjct: 573 KNGYGRKEEYGIIEETVSDRVGEERLEGRVDDKLAIVATDEDLLPA-------ELY---- 621

Query: 292 AKDPSIHRLG----HFQQGNHIRTSPSSTLSS---IPVARASSTAAVGSSKFTYRSLLAD 344
                +H L      FQ  +H +++P S LSS   +P   +  +    ++K T  S   D
Sbjct: 622 ----DLHNLTPSTPSFQHSDHGKSTPKSPLSSSRHVPSPESDISILPSNAKSTRSSRYCD 677

Query: 345 IIQPQDLKELCSV 357
            I P+  KE   +
Sbjct: 678 TI-PEASKETVDI 689


>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
          Length = 360

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 101/153 (66%), Gaps = 11/153 (7%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+  + +K CNC  S CLKLYC+CFA+G +C+ CNC NC NN+E+E  R++A++A L+RN
Sbjct: 127 ESANRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRN 186

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF+PKI     G  D            +H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 187 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 235

Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 236 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 268


>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 707

 Score =  135 bits (339), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 67/174 (38%), Positives = 91/174 (52%), Gaps = 45/174 (25%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           + +K CNCK+S+CLKLYC+CFA+G YC+GC+C NC N    +  R+ A+  +LERNP+AF
Sbjct: 99  RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158

Query: 171 RPKIASSPHGT-------------------------RDNKEETGEVL------------- 192
           R K+ +  + T                          D+ E   E +             
Sbjct: 159 RSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPR 218

Query: 193 -------MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
                     +H KGC C +S CLKKYCECFQ  + CS +CRC +C+NFEGS E
Sbjct: 219 TTGTTTQRHAQHIKGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAE 272



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN---NVENEAARREAVEATL 163
           K C+C  S CLK YCECF +G+YC   C C NC N   + E   AR   V A L
Sbjct: 232 KGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARERLVLAEL 285



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
           +A R    +SP G   N  E G V    +  K C+CK S CLK YC+CF A   C+  C 
Sbjct: 75  SALRKSAVASP-GLSSNALEGGSVT---RPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCH 129

Query: 228 CMDCKNF 234
           C +C N 
Sbjct: 130 CTNCLNL 136


>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
 gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
          Length = 729

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 63/126 (50%), Positives = 81/126 (64%), Gaps = 1/126 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+G+YC + C+C +C N   +E    E  +    RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  S     D  +ET +     +H +GC+CKKS CLKKYCECFQ  + CS NCRC  CK
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCK 569

Query: 233 NFEGSE 238
           N  GS+
Sbjct: 570 NSFGSK 575



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 26/40 (65%)

Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           +G   K C+CK+S CLK YC+CF A + C E C C DC N
Sbjct: 445 VGASCKRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFN 484



 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 533 RHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCKN 570


>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
          Length = 429

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 60/121 (49%), Positives = 78/121 (64%), Gaps = 10/121 (8%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           + CNCK+S+CLKLYCECFA+G YC   CNCV+C N    E  R++A+E TL +NP+AFRP
Sbjct: 216 RTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRP 275

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K      G   +++          H KGC C +S C+K+YCECFQA I C+  C+C DC 
Sbjct: 276 KF-----GANKSRQSAAP----SAHVKGCRCARSKCIKRYCECFQAGIPCTSACQCRDCH 326

Query: 233 N 233
           N
Sbjct: 327 N 327


>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/121 (53%), Positives = 83/121 (68%), Gaps = 8/121 (6%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA+G+ C   CNC +CHNN ++   R   ++  +ERNP AFRPK+
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV 263

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
            S     R N E+  E+    +H KGC+CKKS CLKKYCEC+Q  + CSE C+C DCKN 
Sbjct: 264 DS-----RSNSED--EIDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNC 316

Query: 235 E 235
           E
Sbjct: 317 E 317


>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 995

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/139 (51%), Positives = 98/139 (70%), Gaps = 12/139 (8%)

Query: 98  RSRS-NVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
           RS S N  ++ N  + +K CNC  S+CLKLYCECFA+G +C  CNC NC+NN+E+E AR+
Sbjct: 765 RSNSPNRVLEANGIRMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQ 824

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
           +++++ LERNP AFRPKI     G  +  E         +HNKGCHCK+SGCLK YCEC+
Sbjct: 825 KSIKSCLERNPCAFRPKIQ---IGETEEDER--------RHNKGCHCKRSGCLKNYCECY 873

Query: 217 QANILCSENCRCMDCKNFE 235
           +A I CS++C+C+ C+N E
Sbjct: 874 EAKITCSKSCKCIGCRNTE 892


>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 684

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 63/134 (47%), Positives = 86/134 (64%), Gaps = 7/134 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   + E   A R+ +E+   RNP A
Sbjct: 390 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 446

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  S     +  ++  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC 
Sbjct: 447 FAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSVNCRCE 506

Query: 230 DCKNFEGSEERQAL 243
            CKN  G +E   L
Sbjct: 507 GCKNVFGRKEGSLL 520


>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 1042

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 76/119 (63%), Gaps = 19/119 (15%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA G+ C   CNC +C N   NE  R++A++   ERNPNAF+PKI
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEE-RKQAMDQITERNPNAFKPKI 712

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                               G H+KGCHCKKSGCLKKYCEC+Q+ ++C+  C C  CKN
Sbjct: 713 QEK-----------------GFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754



 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
           C+CKKS CLK YC+CF   + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARRE 157
           K C+CK S CLK YCEC+ SG+ C   C C  C N  EN A+ R+
Sbjct: 720 KGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKNCDENMASSRK 764


>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/145 (47%), Positives = 89/145 (61%), Gaps = 13/145 (8%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA+G+ C   CNC +CHNN ++   R   ++  +ERNP AFRPK+
Sbjct: 195 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKV 254

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
            S     R N E+  E+    +H KGC+CKKS CLKKYCEC+Q  + CSE C+C +CKN 
Sbjct: 255 ES-----RSNSED--EIDQKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNC 307

Query: 235 E-----GSEERQALFHGDHANNMAY 254
           E      S +R    H    N   Y
Sbjct: 308 EMPVKKESRKRVKHTHSSFKNTEMY 332


>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 623

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 99/163 (60%), Gaps = 9/163 (5%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
           P SPK + R + +  E      K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   
Sbjct: 318 PISPKKKRRKSEQSGEGG-SSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPI 376

Query: 151 NE---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           +E    A R+ +E+   RNP AF PK+  +     +  E+  +     +H +GC+CKKS 
Sbjct: 377 HEDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 433

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
           CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D  N
Sbjct: 434 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 475


>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
 gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
          Length = 226

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 80/152 (52%), Positives = 99/152 (65%), Gaps = 13/152 (8%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNC  S+CLKLYC+CFA G +C  CNCVNC NN+E+E  R +A++A LERNP+AF P
Sbjct: 4   RKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHP 63

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI     G  +            +HNKGCHCK+SGCLK YCEC++A ILC+  C+C  CK
Sbjct: 64  KIGKGKVGESER-----------RHNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCK 112

Query: 233 NFEGSEERQALFHGDHANNMAYIQQAANAAIT 264
           NFE S ER+ L H   A  +   QQ  NAA T
Sbjct: 113 NFEESPERKTLMHLADAAEVRVKQQ--NAAKT 142



 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +V E+  +  K C+CK S CLK YCEC+ + I C   C C  C N  E+
Sbjct: 69  KVGESERRHNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCKNFEES 117


>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
           AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
 gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
 gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
 gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 609

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 9/163 (5%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
           P SPK + R + +  E      K+CNCK S+CLKLYCECFA+G YC + C+C+NC N   
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365

Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           + +   A R+ +E+   RNP AF PK+  +     +  E+  +     +H +GC+CKKS 
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
           CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D  N
Sbjct: 423 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 464


>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
          Length = 609

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 9/163 (5%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
           P SPK + R + +  E      K+CNCK S+CLKLYCECFA+G YC + C+C+NC N   
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365

Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           + +   A R+ +E+   RNP AF PK+  +     +  E+  +     +H +GC+CKKS 
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422

Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
           CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D  N
Sbjct: 423 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 464


>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
          Length = 348

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 94/257 (36%), Positives = 131/257 (50%), Gaps = 59/257 (22%)

Query: 44  PEHPQSQAVTANSHPPPQPQPKPVVPPVPPP---------PP-----PLPGQATQPPVRV 89
           P +P SQ VT +     QPQ + ++P  P P         PP     P PG        +
Sbjct: 18  PINPTSQIVTTS-----QPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVL 72

Query: 90  PKPESPKSRSRSNVEVKENT---------------------------PKKQKQCNCKHSR 122
           P     + +  S V +  N+                           P+K +  NC  S 
Sbjct: 73  PAQYVTQLQQSSYVSIASNSTFTGTSGIQTQARLPFNGIIPSESASRPRKPR--NCTKSL 130

Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           CL+LYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RNP AF+PKI     G  
Sbjct: 131 CLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGES 190

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
           D            +H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ 
Sbjct: 191 DR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKT 239

Query: 243 LFHGDHANNMAYIQQAA 259
           L H   A  +   QQ A
Sbjct: 240 LMHLADAAEVRVQQQTA 256


>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
          Length = 806

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R + +  EN  +  K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E
Sbjct: 485 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 542

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
               +  +    RNP AF PK+  S     +  +E+ +     +H +GC+CKKS CLKKY
Sbjct: 543 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 602

Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
           CECFQ  + CS +CRC  CKN  G ++
Sbjct: 603 CECFQGGVGCSISCRCEGCKNTFGRKD 629



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           HG+   K    E     +  K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
          Length = 829

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R + +  EN  +  K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E
Sbjct: 485 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 542

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
               +  +    RNP AF PK+  S     +  +E+ +     +H +GC+CKKS CLKKY
Sbjct: 543 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 602

Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
           CECFQ  + CS +CRC  CKN  G ++
Sbjct: 603 CECFQGGVGCSISCRCEGCKNTFGRKD 629



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           HG+   K    E     +  K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
          Length = 777

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/147 (44%), Positives = 91/147 (61%), Gaps = 3/147 (2%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R + +  EN  +  K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E
Sbjct: 456 SPKRKRRRSEQAAEN--EACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHE 513

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKY 212
               +  +    RNP AF PK+  S     +  +E+ +     +H +GC+CKKS CLKKY
Sbjct: 514 DTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKY 573

Query: 213 CECFQANILCSENCRCMDCKNFEGSEE 239
           CECFQ  + CS +CRC  CKN  G ++
Sbjct: 574 CECFQGGVGCSISCRCEGCKNTFGRKD 600



 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 30/55 (54%)

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           HG+   K    E     +  K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 454 HGSPKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508


>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
 gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
          Length = 140

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 59/86 (68%), Positives = 71/86 (82%)

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
           KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL    +A++  +I
Sbjct: 23  KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82

Query: 256 QQAANAAITGAIGSSGYASPPVSKKR 281
           Q+AAN A+ GA GS GY + PV +K+
Sbjct: 83  QEAANFALNGATGSLGYKNSPVRRKK 108



 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
            PK  K C+C+ S CLK YCECF + I C   C C +C  N E    R+  V+
Sbjct: 21  IPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDC-KNFEGSEERKALVQ 72


>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
          Length = 613

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/132 (48%), Positives = 81/132 (61%), Gaps = 4/132 (3%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
           T   +K CNCK ++CLKLYC+C A+G+YC D C C NC N  ENE   +   E    R+P
Sbjct: 282 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 341

Query: 168 NAFRPKIASSPHGTRDN-KEETGEVLM--LGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
            AF P+I +    T DN  +E G        +H +GC+CKKS C KKYCEC+QA I CS+
Sbjct: 342 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAGIGCSD 401

Query: 225 NCRCMDCKNFEG 236
            CRC DC+N  G
Sbjct: 402 GCRCEDCRNSFG 413



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 7/83 (8%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVEVKE---NTP---KKQKQCNCKHSRCLKLYCECFASG 134
           ++  P    P+  +P + +  NV  ++    TP   + ++ CNCK S C K YCEC+ +G
Sbjct: 337 ESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAG 396

Query: 135 IYC-DGCNCVNCHNNVENEAARR 156
           I C DGC C +C N+   +A RR
Sbjct: 397 IGCSDGCRCEDCRNSFGIKADRR 419


>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1084

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 79/118 (66%), Gaps = 1/118 (0%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
           CNCK ++CLKLYCECFA+   C+GC C  C N  E EA R+EA +A LERN +AF PKI 
Sbjct: 533 CNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKID 592

Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           S     +   EE G       H+KGC+CKKS CLKKYCEC+Q  + C++ C+C +C+N
Sbjct: 593 SQSKLKQLCNEE-GVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRN 649



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           CNCK S CLK YCEC+  G+ C   C C NC NN+ N     E  ++ L   P+A
Sbjct: 617 CNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRNNI-NHPHSHEEPDSRLNATPSA 670


>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
           gaditana CCMP526]
          Length = 365

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 77/210 (36%), Positives = 108/210 (51%), Gaps = 36/210 (17%)

Query: 55  NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
           NSHP    +P  V   +    P    + + PP+      +P+ R  S +    ++    K
Sbjct: 106 NSHPGGSSRPVRVEAAIRRQNPSQEARRS-PPLNGSGGGTPQGRGTSGLAATRSS----K 160

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +CNC+ S+CLKLYCECFA G YC D C C++C+N +  E  R +AVE+ L RNP+AF  K
Sbjct: 161 KCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEK 220

Query: 174 I------ASSPHGTRDNKE-----------ETGEVLMLG-------------KHNKGCHC 203
           I      A+ P     ++            + GE  M+G              H +GC+C
Sbjct: 221 IQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNC 280

Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKN 233
           KKS C+KKYCECF A + C ENC+C  C+N
Sbjct: 281 KKSLCIKKYCECFFAGVYCGENCQCEGCQN 310



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA 154
           + CNCK S C+K YCECF +G+YC + C C  C NN E  A 
Sbjct: 276 RGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQNNPEASAG 317


>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
 gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
          Length = 1987

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 63/147 (42%), Positives = 84/147 (57%), Gaps = 25/147 (17%)

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENE---AARREAVEAT 162
           +  +    K C CK S+CLKLYC+CFA+G YC  C+C++CHN  E+      RRE + A 
Sbjct: 780 RRTSENSSKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAA- 838

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
             R+P AF  KI  +P+G              GKH +GC+C+KS CLKKYCEC+Q  + C
Sbjct: 839 --RDPQAFTRKIQLAPNGN-------------GKHKRGCNCRKSHCLKKYCECYQGGVKC 883

Query: 223 SENCRCMDCKNF------EGSEERQAL 243
              C CM+C+N       EG+  R AL
Sbjct: 884 GIQCTCMECENMDVGSSQEGAGARGAL 910


>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
 gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
          Length = 897

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 61/128 (47%), Positives = 79/128 (61%), Gaps = 2/128 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+G++C D C+C +C N  E      E  +    RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568

Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           KI  S      N E+         +H +GC+CK+S CLKKYCEC+Q+N+ CS  CRC  C
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 628

Query: 232 KNFEGSEE 239
           KN  G +E
Sbjct: 629 KNVYGKKE 636



 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 5/72 (6%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVE-VKENTP---KKQKQCNCKHSRCLKLYCECFASGIY 136
           ++  P    PK     + + SN+E V   TP   +  + CNCK S CLK YCEC+ S + 
Sbjct: 559 ESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVG 618

Query: 137 C-DGCNCVNCHN 147
           C  GC C  C N
Sbjct: 619 CSSGCRCEGCKN 630


>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
 gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
           AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
 gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
          Length = 674

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/130 (46%), Positives = 86/130 (66%), Gaps = 7/130 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   +E    A R+ +E+   RNP A
Sbjct: 375 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLA 431

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +    ++  ++  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC 
Sbjct: 432 FAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 491

Query: 230 DCKNFEGSEE 239
            CKN  G ++
Sbjct: 492 GCKNAFGRKD 501


>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
          Length = 821

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 61/128 (47%), Positives = 79/128 (61%), Gaps = 2/128 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+G++C D C+C +C N  E      E  +    RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492

Query: 173 KIASSPHGTRDNKEETGEVL-MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           KI  S      N E+         +H +GC+CK+S CLKKYCEC+Q+N+ CS  CRC  C
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 552

Query: 232 KNFEGSEE 239
           KN  G +E
Sbjct: 553 KNVYGKKE 560



 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 5/72 (6%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVE-VKENTP---KKQKQCNCKHSRCLKLYCECFASGIY 136
           ++  P    PK     + + SN+E V   TP   +  + CNCK S CLK YCEC+ S + 
Sbjct: 483 ESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVG 542

Query: 137 C-DGCNCVNCHN 147
           C  GC C  C N
Sbjct: 543 CSSGCRCEGCKN 554


>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 638

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 70/178 (39%), Positives = 101/178 (56%), Gaps = 28/178 (15%)

Query: 71  VPPP--PPPLPGQATQPPVRVPKPESPKSRSRSNVE-------VKENTPKKQ-----KQC 116
           +PP   PP LP  A +P    P P S  S S+S ++         +N P++      ++C
Sbjct: 107 IPPATKPPDLPTFAAKPIAPRP-PVSTISTSQSPIQHPLKPHATLQNKPQQDSVLTSRRC 165

Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
           +C  S CLKLYCECFA+G++C  C+CV C+N  ++E  R++A+   + R P+AF+ KIA 
Sbjct: 166 SCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSKIAH 225

Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           S         E   V     H +GC+CK+SGCLK YCEC++A + C+  CRC  C N 
Sbjct: 226 S--------LENATV-----HARGCNCKRSGCLKNYCECYEARVRCTSRCRCQYCYNM 270


>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
 gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
          Length = 348

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 76/124 (61%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYCECFA+G+YC D C C +C N  E E    +  +    RNP AF P
Sbjct: 20  KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+      +  N+E         +H +GC+CKKS CLKKYCEC+QA + CS  CRC  CK
Sbjct: 80  KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCK 139

Query: 233 NFEG 236
           N  G
Sbjct: 140 NSYG 143



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVEVKENTP---KKQKQCNCKHSRCLKLYCECFASGIYC 137
           +A  P    PK     + S +N E    TP   + ++ CNCK S+CLK YCEC+ +G+ C
Sbjct: 70  EARNPLAFAPKVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGC 129

Query: 138 -DGCNCVNCHNNVENEAA 154
             GC C  C N+   +A 
Sbjct: 130 SSGCRCEGCKNSYGKKAV 147


>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
          Length = 772

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 61/135 (45%), Positives = 86/135 (63%), Gaps = 7/135 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN    E   + R+ +E+   RNP A
Sbjct: 462 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIES---RNPLA 518

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     + ++ + +     +H +GC+CKKS CLKKYCECFQ  + CS +CRC 
Sbjct: 519 FAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 578

Query: 230 DCKNFEGSEERQALF 244
            CKN  G  E  A+ 
Sbjct: 579 GCKNAFGKREGAAVL 593



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 27/45 (60%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           G   K C CKKS CLK YCECF A + CSE C C+ C N +   E
Sbjct: 458 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQSHTE 502



 Score = 45.8 bits (107), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 545 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 582


>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
          Length = 717

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 94/158 (59%), Gaps = 20/158 (12%)

Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           E+ +++PKK+             K+CNCK S+CLKLYCECFA+G+YC + C+C  C N  
Sbjct: 400 ELNQSSPKKKRRRLEHGGETEGCKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKP 459

Query: 150 ENE---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
            +E    A R+ +E+   RNP AF PK+  S     +  +E  +     +H +GC+CKKS
Sbjct: 460 IHEDTVLATRKQIES---RNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKS 516

Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
            CLKKYCEC+Q  + CS NCRC  CKN  G ++  AL 
Sbjct: 517 SCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 554


>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 365

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/210 (36%), Positives = 108/210 (51%), Gaps = 36/210 (17%)

Query: 55  NSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQK 114
           NSHP    +P  V   +    P    + + PP+      +P+ R  S +    ++    K
Sbjct: 106 NSHPGGSSRPVRVEAAIRRQNPSQEARRS-PPLNGSGGGTPQGRGTSGLAATRSS----K 160

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +CNC+ S+CLKLYCECFA G YC D C C++C+N +  E  R +AVE+ L RNP+AF  K
Sbjct: 161 KCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAFTEK 220

Query: 174 I------ASSPHGTRDNKE-----------ETGEVLMLG-------------KHNKGCHC 203
           I      A+ P     ++            + GE  M+G              H +GC+C
Sbjct: 221 IQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRGCNC 280

Query: 204 KKSGCLKKYCECFQANILCSENCRCMDCKN 233
           KKS C++KYCECF A + C ENC+C  C+N
Sbjct: 281 KKSLCIQKYCECFFAGVYCGENCQCEGCQN 310



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA 154
           + CNCK S C++ YCECF +G+YC + C C  C NN E  A 
Sbjct: 276 RGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQNNPEASAG 317


>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
 gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
 gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 695

 Score =  128 bits (322), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   + E   A R+ +E+   RNP A
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 457

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     +  ++  +     +H +GC+CKKS C+KKYCEC+Q  + CS NCRC 
Sbjct: 458 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 517

Query: 230 DCKNFEGSEERQAL 243
            C N  G ++   L
Sbjct: 518 GCTNVFGRKDGSLL 531



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
           + ++ CNCK S C+K YCEC+  G+ C   C C  C N     + +    +E+ LE N  
Sbjct: 484 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 543

Query: 169 AFRPKIASSPHGTRDNKE 186
            +  +IA   H    +KE
Sbjct: 544 TYEKRIAKIQHNVEVSKE 561


>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
 gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
          Length = 695

 Score =  128 bits (321), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   + E   A R+ +E+   RNP A
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 457

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     +  ++  +     +H +GC+CKKS C+KKYCEC+Q  + CS NCRC 
Sbjct: 458 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 517

Query: 230 DCKNFEGSEERQAL 243
            C N  G ++   L
Sbjct: 518 GCTNVFGRKDGSLL 531



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
           + ++ CNCK S C+K YCEC+  G+ C   C C  C N     + +    +E+ LE N  
Sbjct: 484 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 543

Query: 169 AFRPKIASSPHGTRDNKE 186
            +  +IA   H    +KE
Sbjct: 544 TYEKRIAKIQHNVEVSKE 561


>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
          Length = 738

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/135 (47%), Positives = 85/135 (62%), Gaps = 7/135 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C  C N   +E    A R+ +E+   RNP A
Sbjct: 444 KRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIES---RNPLA 500

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  S     +  +E  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC 
Sbjct: 501 FAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCE 560

Query: 230 DCKNFEGSEERQALF 244
            CKN  G ++  AL 
Sbjct: 561 GCKNAFGRKDGSALI 575



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+  G+ C   C C  C N
Sbjct: 527 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKN 564


>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
          Length = 844

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/149 (48%), Positives = 98/149 (65%), Gaps = 9/149 (6%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVEN 151
           P +P       V  +    + +K CNC  S+CLKLYC+CFA+G +C+ CNC NCHNN+EN
Sbjct: 607 PTTPALSETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNLEN 666

Query: 152 EAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
           E  R++A+   L+RNPNAFRPKI  S  G  +         ++ +HNKGC+CK+SGCLK 
Sbjct: 667 EELRQKAIRGCLDRNPNAFRPKIGKSKAGGPE---------IIRRHNKGCNCKRSGCLKN 717

Query: 212 YCECFQANILCSENCRCMDCKNFEGSEER 240
           YCEC++A I CS  C+C+ C+N E + ER
Sbjct: 718 YCECYEAKIACSSICKCVGCRNVEETLER 746


>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
           [Cucumis sativus]
          Length = 788

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 95/155 (61%), Gaps = 8/155 (5%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R  +E    T    K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E
Sbjct: 474 SPKKKRR-RLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHE 532

Query: 153 ---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
               A R+ +E+   RNP AF PK+  +     +  +E+ +     +H +GC+CKKS CL
Sbjct: 533 DTVLATRKQIES---RNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCL 589

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           KKYCEC+Q  + CS +CRC  CKN  G ++  ++ 
Sbjct: 590 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624


>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
          Length = 788

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 95/155 (61%), Gaps = 8/155 (5%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R  +E    T    K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E
Sbjct: 474 SPKKKRR-RLEPAGETESSCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHE 532

Query: 153 ---AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
               A R+ +E+   RNP AF PK+  +     +  +E+ +     +H +GC+CKKS CL
Sbjct: 533 DTVLATRKQIES---RNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCL 589

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           KKYCEC+Q  + CS +CRC  CKN  G ++  ++ 
Sbjct: 590 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624


>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
          Length = 774

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 92/153 (60%), Gaps = 20/153 (13%)

Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN-- 147
           + ++N+PKK+             K+CNCK S+CLKLYCECFA+G+YC + C C +C N  
Sbjct: 465 DFQQNSPKKKRRKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACRDCFNKP 524

Query: 148 -NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
            +VE     R+ +E+   RNP AF PK+  S     +  ++  +     +H +GC+CKKS
Sbjct: 525 IHVETVLQTRQQIES---RNPLAFAPKVIRSSDSVSEIGDDLNKTPASARHKRGCNCKKS 581

Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
            CLKKYCEC+Q  + CS +CRC  CKN  G ++
Sbjct: 582 SCLKKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614


>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
 gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
          Length = 767

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/135 (45%), Positives = 85/135 (62%), Gaps = 7/135 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN    E   + R+ +E+   RNP A
Sbjct: 457 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIES---RNPLA 513

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     +  + + +     +H +GC+CKKS CLKKYCECFQ  + CS +CRC 
Sbjct: 514 FAPKVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 573

Query: 230 DCKNFEGSEERQALF 244
            CKN  G  E  A+ 
Sbjct: 574 GCKNAFGKREGAAVL 588



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 49/95 (51%), Gaps = 9/95 (9%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-----RQALFHGDHA 249
           G   K C CKKS CLK YCECF A + CSE C C+ C N +   E     RQ +   +  
Sbjct: 453 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQI---ESR 509

Query: 250 NNMAYIQQAANAAITGA-IGSSGYASPPVSKKRKG 283
           N +A+  +  + + +G  +G     +P  ++ ++G
Sbjct: 510 NPLAFAPKVIHTSESGLELGDFSNKTPASARHKRG 544



 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 540 RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 577


>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
 gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
          Length = 587

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 7/130 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECFA+G+YC + C+C+ C NN   +E   + R+ +E+   RNP A
Sbjct: 460 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIES---RNPLA 516

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     D  + + +     +H +GC+CKKS CLKKYCECFQ  + CS +CRC 
Sbjct: 517 FAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCE 576

Query: 230 DCKNFEGSEE 239
            CKN  G  E
Sbjct: 577 GCKNAFGKRE 586



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 28/45 (62%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           G   K C CKKS CLK YCECF A + CSE C C+ C+N +   E
Sbjct: 456 GTSCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQSHME 500



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 543 RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 580


>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
          Length = 584

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 58/111 (52%), Positives = 73/111 (65%), Gaps = 12/111 (10%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           + CNC  SRCLKLYCEC+A+G YC   C C NC NN+ENE  R+ AV +TL++NP+AFRP
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRP 466

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
           KI          K+ET        H +GC C KS CL++YCECFQA + C+
Sbjct: 467 KI----------KQET-TTNRKANHLRGCKCTKSKCLQRYCECFQAGVPCT 506



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           + C+C KS CLK YCEC+ A   C   C+C +C+N   +E R+ L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451


>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
 gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
          Length = 727

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 84/133 (63%), Gaps = 7/133 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C  C N   +E    A R+ +E+   RNP A
Sbjct: 435 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIES---RNPLA 491

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  S     +  +E  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC 
Sbjct: 492 FAPKVIRSSEPVTEIGDEFSKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 551

Query: 230 DCKNFEGSEERQA 242
            CKN  G ++  A
Sbjct: 552 GCKNAFGRKDGSA 564



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 26/39 (66%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C+CKKS CLK YCECF A + C E C C +C N
Sbjct: 431 GEACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFN 469



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+  G+ C   C C  C N
Sbjct: 518 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 555


>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
 gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
          Length = 921

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 76/131 (58%), Gaps = 7/131 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K CNCK S+CLKLYC+CF +GI+C DGC C  C N VE +    E  +    RNP AF P
Sbjct: 520 KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAP 579

Query: 173 KIASSPHGTRDNKEETGEVLML----GKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           KI         N  E  +V M      +H +GC+CK+S C KKYCECFQAN+ CS  CRC
Sbjct: 580 KIVPCAADVPPNNME--DVNMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSTGCRC 637

Query: 229 MDCKNFEGSEE 239
             C N  G  E
Sbjct: 638 DGCMNAFGKRE 648


>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
          Length = 665

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 88/155 (56%), Gaps = 15/155 (9%)

Query: 96  KSRSRSNVEVKENTPKKQ----KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
           K+R  SN   K+ +        K+CNCK ++CLKLYC+CFA+G YC + C C  C N  E
Sbjct: 465 KTRQASNCLTKKASTSTDDDTTKRCNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPE 524

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL------MLGKHNKGCHCK 204
            E    E  +    RNP AF PK+  +   +      TG+V+         +H +GC+CK
Sbjct: 525 YEDTVLETRQQIESRNPLAFAPKVMQTVTAS----PTTGKVVEKLSTPSSARHKRGCNCK 580

Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           KS CLKKYCEC+QAN+ CSE CRC  CKN  G  E
Sbjct: 581 KSMCLKKYCECYQANVGCSEGCRCEGCKNIYGVRE 615


>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
           distachyon]
          Length = 771

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/171 (40%), Positives = 95/171 (55%), Gaps = 25/171 (14%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECF +G++C + C+C  C N   N+E   + RE +E+   RNP A
Sbjct: 465 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIES---RNPLA 521

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +    ++  E++ +     +H +GC+CKKS CLKKYCECFQ  + CS +CRC 
Sbjct: 522 FAPKVIRTSEPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCE 581

Query: 230 DCKNFEGSEERQALFHGDHA------------------NNMAYIQQAANAA 262
            CKN  G  E   L   + A                  +N   I QAAN A
Sbjct: 582 GCKNAFGRREGVVLLSIEQAKQGSVENIACVKEETSENDNQLVIYQAANPA 632


>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   + E   A R+ +E+   RNP A
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 288

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     +  ++  +     +H +GC+CKKS C+KKYCEC+Q  + CS NCRC 
Sbjct: 289 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 348

Query: 230 DCKNFEGSEERQAL 243
            C N  G ++   L
Sbjct: 349 GCTNVFGRKDGSLL 362



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
           + ++ CNCK S C+K YCEC+  G+ C   C C  C N     + +    +E+ LE N  
Sbjct: 315 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 374

Query: 169 AFRPKIASSPHGTRDNKE 186
            +  +IA   H    +KE
Sbjct: 375 TYEKRIAKIQHNVEVSKE 392


>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
          Length = 764

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 86/133 (64%), Gaps = 11/133 (8%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C  CNC NC NN+++E  R  A+++ L+RNP AF PKI     G RD + 
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRR- 587

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE S ER+ L H 
Sbjct: 588 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFEESPERKTLMHL 637

Query: 247 DHANNMAYIQQAA 259
             A  +   QQ A
Sbjct: 638 ADAAEVRVQQQTA 650


>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+CNCK S+CLKLYCECFA+G+YC + C+C++C N   + E   A R+ +E+   RNP A
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 288

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +     +  ++  +     +H +GC+CKKS C+KKYCEC+Q  + CS NCRC 
Sbjct: 289 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 348

Query: 230 DCKNFEGSEERQAL 243
            C N  G ++   L
Sbjct: 349 GCTNVFGRKDGSLL 362



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
           + ++ CNCK S C+K YCEC+  G+ C   C C  C N     + +    +E+ LE N  
Sbjct: 315 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 374

Query: 169 AFRPKIASSPHGTRDNKE 186
            +  +IA   H    +KE
Sbjct: 375 TYEKRIAKIQHNVEVSKE 392


>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 510

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 59/122 (48%), Positives = 70/122 (57%), Gaps = 15/122 (12%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK SRCLKLYC+CFA+  +C   CNC NC N       R EA+   +ERNPNAF+ K 
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                  R N E+         H  GC C+KS CLKKYCECF A   CS+ C C+ C+N 
Sbjct: 285 -------RPNGEDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330

Query: 235 EG 236
            G
Sbjct: 331 AG 332


>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1040

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 64/162 (39%), Positives = 89/162 (54%), Gaps = 39/162 (24%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNP 167
           T K++ +CNCK S+CLKLYC+CF  G +C+  CNC  C N   N+A R  A++   ERNP
Sbjct: 707 TSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERESAIKQLQERNP 766

Query: 168 NAFRPKIASS-------------------------------PHGTRDNKEETGEVLMLGK 196
           +AF+PKI +                                P   +D++    E+L+   
Sbjct: 767 DAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKNQIQYLIPKIIKDDE----EILI--- 819

Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
           HNKGC+CKKSGC KKYCEC+   + CS+ C+C  C+N + SE
Sbjct: 820 HNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSE 861



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE-ERQA 242
           C+CKKS CLK YC+CF     C+  C C +C N E ++ ER++
Sbjct: 714 CNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAERES 756


>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
 gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
          Length = 561

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 89/155 (57%), Gaps = 8/155 (5%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE 152
           SPK + R + +V        K+CNCK +RCLK YC+CFA+GIYC + C C  C N +E E
Sbjct: 407 SPKKK-RQDHKVSSTDGDGCKRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYE 465

Query: 153 AARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML----GKHNKGCHCKKSGC 208
               EA +     NP AF PKI    H T  +     +  +     G+H  GC+CK+S C
Sbjct: 466 DTVLEARQQKESHNPLAFAPKIVQ--HVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMC 523

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           +KKYCEC+QAN+ CS  CRC  CKN  G +E + L
Sbjct: 524 VKKYCECYQANVGCSNACRCEGCKNIHGRKEGEKL 558


>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
 gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
 gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
 gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 639

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 14/151 (9%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C C+ S+CLKLYCECF++G++C + C+C NC N   +E    ++ E    RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+ S+     D   E  +     +H +GC+C+KSGC KKYCECF   + CS NCRCM CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572

Query: 233 NFEGSEERQ-------------ALFHGDHAN 250
           N  G    Q             A  +GDH +
Sbjct: 573 NTFGHTNEQCAGDSDVVTINDEAKHYGDHGD 603



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + ++ CNC+ S C K YCECF  G+ C   C C+ C N
Sbjct: 536 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKN 573


>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
 gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
          Length = 896

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 64/137 (46%), Positives = 77/137 (56%), Gaps = 10/137 (7%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYC+CFA+G YC D C C  C N  E      E  +    RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533

Query: 173 KIAS-----SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
           KI       S H   +N           +H +GC+CK+S CLKKYCEC+QAN+ CS  CR
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSS----ARHKRGCNCKRSMCLKKYCECYQANVGCSSGCR 589

Query: 228 CMDCKNFEGSEERQALF 244
           C  CKN  G +E    F
Sbjct: 590 CEGCKNVHGKKEDYVAF 606


>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 611

 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 59/124 (47%), Positives = 78/124 (62%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C CK S+CLKLYCECFA+G+YC + C+C NC N   +E    ++ E    RNP AF P
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKSREVIEARNPLAFAP 490

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+ S+     D   E  +     +H +GC+C+KSGCLKKYCEC    + CS +CRC+ CK
Sbjct: 491 KVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVRCSSSCRCIGCK 550

Query: 233 NFEG 236
           N  G
Sbjct: 551 NVFG 554


>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
          Length = 760

 Score =  125 bits (313), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 92/153 (60%), Gaps = 20/153 (13%)

Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN-- 147
           + ++N+PKK+             K+CNCK S+CLKLYCECFA+G+YC + C C +C N  
Sbjct: 452 DFQQNSPKKKRCKLEPAGEGESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACHDCFNKP 511

Query: 148 -NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
            +VE     R+ +E+   RNP AF PK+  S     +  ++  +     +H +GC+CKKS
Sbjct: 512 IHVETVLQTRQQIES---RNPLAFAPKVIRSSDSVSEIGDDPNKTPASARHKRGCNCKKS 568

Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
            CLKKYCEC+Q  + CS +CRC  CKN  G ++
Sbjct: 569 SCLKKYCECYQGGVGCSISCRCEGCKNTYGRKD 601


>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 511

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/122 (48%), Positives = 70/122 (57%), Gaps = 15/122 (12%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK SRCLKLYC+CFA+  +C   CNC NC N       R EA+   +ERNPNAF+ K 
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                  R N E+         H  GC C+KS CLKKYCECF A   CS+ C C+ C+N 
Sbjct: 285 -------RPNGEDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330

Query: 235 EG 236
            G
Sbjct: 331 AG 332


>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
          Length = 503

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/165 (40%), Positives = 96/165 (58%), Gaps = 27/165 (16%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ C+  NC NN+ +E  R +A++A L+RNP AF+PKI +   G  D K 
Sbjct: 302 YCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLG--DVKP 359

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN++ S ER+ L   
Sbjct: 360 ---------RHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDESPERKTLM-- 408

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP----PVSKKRKGQELF 287
                        NA   G +  S ++SP     + K RK +  F
Sbjct: 409 ----------SLPNAMEIGRLDDSHFSSPAKFSALPKSRKDRRSF 443



 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 44/84 (52%), Gaps = 10/84 (11%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
           R P+   PK  + +  +VK   P+  K CNC+ S CLK YCEC+ + I C   C C+ C 
Sbjct: 340 RNPEAFQPKIGTGNLGDVK---PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 396

Query: 147 NNVENEAARREAVEATLERNPNAF 170
           N   +E+  R+    TL   PNA 
Sbjct: 397 NY--DESPERK----TLMSLPNAM 414


>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
          Length = 780

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/135 (45%), Positives = 87/135 (64%), Gaps = 8/135 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECF +G++C + C+C  C N   N+E   + RE +E+   RNP A
Sbjct: 477 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 533

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +  G ++  +++ +     +H +GC+CKKS CLKKYCEC+Q  + CS +CRC 
Sbjct: 534 FAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCE 592

Query: 230 DCKNFEGSEERQALF 244
            CKN  G  E  AL 
Sbjct: 593 GCKNAFGRREGVALL 607


>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
           distachyon]
          Length = 787

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 77/124 (62%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C+ C N   +E       +    RNP AF P
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 534

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+       ++ +E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CK
Sbjct: 535 KVIRLSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCK 594

Query: 233 NFEG 236
           N  G
Sbjct: 595 NTFG 598



 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 13/62 (20%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---------NVENEAA 154
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C +C N         +VENE  
Sbjct: 552 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNTFGMRDGAVSVENEEM 611

Query: 155 RR 156
           ++
Sbjct: 612 KQ 613


>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 760

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/135 (45%), Positives = 87/135 (64%), Gaps = 8/135 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECF +G++C + C+C  C N   N+E   + RE +E+   RNP A
Sbjct: 457 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 513

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +  G ++  +++ +     +H +GC+CKKS CLKKYCEC+Q  + CS +CRC 
Sbjct: 514 FAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCE 572

Query: 230 DCKNFEGSEERQALF 244
            CKN  G  E  AL 
Sbjct: 573 GCKNAFGRREGVALL 587


>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
          Length = 774

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 14/153 (9%)

Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           +  +N+PKK+             K+CNCK S+CLKLYCECFA+G+YC + C+C +C N  
Sbjct: 463 DFNQNSPKKKRRKSETPGETEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 522

Query: 150 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
            +E    +  +    RNP AF PK+  +     +  ++  +     +H +GC+CKKS CL
Sbjct: 523 IHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCL 582

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
           KKYCEC+Q  + CS +CRC  CKN  G ++  A
Sbjct: 583 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615


>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
          Length = 792

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 71/119 (59%), Gaps = 13/119 (10%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA  + C   CNC +C+NN   E+ R +A+   LERNP AF  K 
Sbjct: 26  CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAFDAKF 85

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                  R+ ++    V     H  GC C+KS CLKKYCECF   + CSE C C+ C+N
Sbjct: 86  -------RETEKHDAVV-----HKTGCKCRKSACLKKYCECFNKGVACSEKCNCVGCRN 132



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 23/35 (65%), Gaps = 1/35 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           C C+ S CLK YCECF  G+ C + CNCV C N V
Sbjct: 100 CKCRKSACLKKYCECFNKGVACSEKCNCVGCRNTV 134



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 24/34 (70%)

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           GC+CK+S CLK YC+CF   ++C+ +C C  C N
Sbjct: 25  GCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYN 58


>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
          Length = 764

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 89/153 (58%), Gaps = 14/153 (9%)

Query: 104 EVKENTPKKQ-------------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           +  +N+PKK+             K+CNCK S+CLKLYCECFA+G+YC + C+C +C N  
Sbjct: 453 DFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 512

Query: 150 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCL 209
            +E    +  +    RNP AF PK+  +     +  ++  +     +H +GC+CKKS CL
Sbjct: 513 IHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCL 572

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEERQA 242
           KKYCEC+Q  + CS +CRC  CKN  G ++  A
Sbjct: 573 KKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 605


>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
          Length = 774

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 81/130 (62%), Gaps = 1/130 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+CNCK S+CLKLYCECFA+G+YC + C+C +C N   +E    +  +    RNP AF P
Sbjct: 486 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAP 545

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     +  ++  +     +H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CK
Sbjct: 546 KVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 605

Query: 233 NFEGSEERQA 242
           N  G ++  A
Sbjct: 606 NAFGRKDGSA 615



 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 34/79 (43%), Gaps = 22/79 (27%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNA 169
           + ++ CNCK S CLK YCEC+  G+ C   C C  C N                     A
Sbjct: 569 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKN---------------------A 607

Query: 170 FRPKIASSPHGTRDNKEET 188
           F  K  S+P G   + EET
Sbjct: 608 FGRKDGSAPVGIETDPEET 626


>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 450

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 71/137 (51%), Positives = 83/137 (60%), Gaps = 14/137 (10%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNP 167
           N    +K CNC  S CLKLYCECFA G  C  CNC NC NN+  E  R  A++ TLERNP
Sbjct: 139 NLTNSRKPCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNP 198

Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
            AF PKI     G R             KH KGC+CK+SGCLK YCEC++A I CS+ CR
Sbjct: 199 TAFHPKIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCR 244

Query: 228 CMDCKNFEGSEERQALF 244
           C  C+N E S ER++L 
Sbjct: 245 CQGCRNTEDSVERRSLM 261


>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 798

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/133 (48%), Positives = 86/133 (64%), Gaps = 11/133 (8%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C+ CNC NC NN+E+E  R +A++  L+RNP AF+PKI     G  D + 
Sbjct: 590 YCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR- 648

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H 
Sbjct: 649 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHL 698

Query: 247 DHANNMAYIQQAA 259
             A  +   QQ A
Sbjct: 699 ADAAEVRVQQQTA 711



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 648 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEES 689


>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
          Length = 800

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 74/124 (59%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E       +    RNP AF P
Sbjct: 482 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAP 541

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+        +  E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CK
Sbjct: 542 KVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCK 601

Query: 233 NFEG 236
           N  G
Sbjct: 602 NAFG 605



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/39 (58%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C CKKS CLK YCECF A + CSE C C  C N
Sbjct: 478 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMN 516



 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE 160
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C +C N      A  E  E
Sbjct: 559 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNAFGRRDAETELPE 615


>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
          Length = 712

 Score =  121 bits (304), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 63/150 (42%), Positives = 82/150 (54%), Gaps = 28/150 (18%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K CNCK S+CLKLYCECFA+  +C   C C  C N+ E++  R  A E  L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508

Query: 173 KIASSP----------HGTRDNKEETGEVLMLG-----------------KHNKGCHCKK 205
           K+ +              T+ N +E  + L+ G                 +H +GC+CKK
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568

Query: 206 SGCLKKYCECFQANILCSENCRCMDCKNFE 235
           S C KKYCECFQ  + CS+ C+C +CKN E
Sbjct: 569 SNCQKKYCECFQQGVNCSDLCKCEECKNNE 598



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 44/110 (40%), Gaps = 26/110 (23%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN------------------------- 233
           K C+CKKS CLK YCECF  N  C  NC C  C N                         
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508

Query: 234 -FEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
             E  ++  +LF  D       IQ+ ++A ++G      + +  VS K K
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDK 558


>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 782

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/146 (42%), Positives = 86/146 (58%), Gaps = 5/146 (3%)

Query: 95  PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA 153
           PK R +S+    E+     K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E 
Sbjct: 458 PKKRRKSDNGDDESC----KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 513

Query: 154 ARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYC 213
                 +    RNP AF PK+       ++ +E+        +H +GC+CKKS CLKKYC
Sbjct: 514 IVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYC 573

Query: 214 ECFQANILCSENCRCMDCKNFEGSEE 239
           EC+Q  + CS NCRC  CKN  G+ +
Sbjct: 574 ECYQGGVGCSNNCRCETCKNTFGTRD 599


>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
          Length = 475

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 100/189 (52%), Gaps = 22/189 (11%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 382

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPP-VSKKRKGQELFFGSTAKDPSIHRLGHFQQ 305
            H        +  +   +  +  + ++ PP + K R+          +      L H ++
Sbjct: 383 PH------YMEPGDFESSHYLSPAKFSGPPKLRKNRQAFSCISWEVVEATCACLLAHGEE 436

Query: 306 GNHIRTSPS 314
             H   SPS
Sbjct: 437 AEHEHCSPS 445



 Score = 46.2 bits (108), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           +  K CNCK S CLK YCEC+ + I C   C C+ C  N E    R+  +       P  
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 390

Query: 170 FR------PKIASSPHGTRDNKE 186
           F       P   S P   R N++
Sbjct: 391 FESSHYLSPAKFSGPPKLRKNRQ 413


>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 490

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 55/119 (46%), Positives = 72/119 (60%), Gaps = 5/119 (4%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C CK +RCLKLYC+CF   IYC   C C++CHN    E AR+ A++  L RNP+AF  K 
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
             +P        ET   L+   H  GC C+KS C+KKYCEC+  ++ C+ +CRC  CKN
Sbjct: 277 QKTP--VDGATVETPSKLLT--HKLGCKCRKSACMKKYCECYAGHVYCNTHCRCTGCKN 331



 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 26/40 (65%)

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           GC CKK+ CLK YC+CF   I C  NCRC+DC N    E+
Sbjct: 216 GCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQED 255


>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
 gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
          Length = 776

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 77/130 (59%), Gaps = 1/130 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E       +    RNP AF P
Sbjct: 464 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 523

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+        +  E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CK
Sbjct: 524 KVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCK 583

Query: 233 NFEGSEERQA 242
           N  G  + ++
Sbjct: 584 NTFGRRDAES 593



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/39 (58%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C CKKS CLK YCECF A + CSE C C  C N
Sbjct: 460 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 498



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 4/57 (7%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVE 160
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C +C N      A  E  E
Sbjct: 541 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCKNTFGRRDAESELTE 597


>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
 gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
          Length = 620

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 66/110 (60%), Gaps = 13/110 (11%)

Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           LYCECFA+G++C  C+C NC N  +N    +   +    RNPNAF  KI +S  G     
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG---- 332

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
                    G+H KGCHCKKS CLKKYCECFQA +LC + C+C  CKN E
Sbjct: 333 ---------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373


>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
 gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
          Length = 710

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 68/169 (40%), Positives = 87/169 (51%), Gaps = 36/169 (21%)

Query: 94  SPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE-- 150
           +PK R +S   + E      K+C+CK S+CLKLYCECFA+G+YC D C C +C N  E  
Sbjct: 373 TPKKRRKS---LGEKAGDGCKRCHCKKSKCLKLYCECFAAGVYCLDSCACRDCFNKPEFA 429

Query: 151 -NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML--------------- 194
                 R+ +E    RNP AF PKI      T DN    G  +                 
Sbjct: 430 DTVMGTRQQIET---RNPLAFAPKIVQ----TSDNTPTEGVGIWFSGHAMTNSLFALQPN 482

Query: 195 -------GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
                   +H +GC+CKKS CLKKYCEC+QA + CS+ C+C  CKN  G
Sbjct: 483 CPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 531


>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
          Length = 407

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 83/148 (56%), Gaps = 19/148 (12%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
           +  P++ K C+CK S CLKLYCEC A+   CD  CNC  C N  E  A R  AV A LER
Sbjct: 42  QTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILER 101

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           NP AF PK+AS                   +H +GC+C+KSGC+K YCEC QA + C+  
Sbjct: 102 NPLAFLPKVASGS----------------SQHLRGCNCRKSGCMKNYCECHQAGVACTSR 145

Query: 226 CRCMDCKNFEG--SEERQALFHGDHANN 251
           C C  C+N E   S ++  +F G  A++
Sbjct: 146 CACHQCRNTETFVSAKKMLVFAGVSADD 173



 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 36/69 (52%), Gaps = 1/69 (1%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEAT 162
           +V   + +  + CNC+ S C+K YCEC  +G+ C   C C  C N     +A++  V A 
Sbjct: 109 KVASGSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAG 168

Query: 163 LERNPNAFR 171
           +  + ++ R
Sbjct: 169 VSADDSSLR 177


>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 394

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 84/148 (56%), Gaps = 19/148 (12%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
           +  P++ K C+CK S CLKLYCEC A+   CD  CNC  C N  E  A R  AV A L+R
Sbjct: 42  QTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDR 101

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           NP AF+PK+AS                   +H +GC+C+KSGC+K YCEC QA + C+  
Sbjct: 102 NPLAFQPKVASGS----------------SQHLRGCNCRKSGCMKNYCECHQAGVACTSR 145

Query: 226 CRCMDCKNFEG--SEERQALFHGDHANN 251
           C C  C+N E   S ++  +F G  A++
Sbjct: 146 CACHQCRNTETFVSAKKMLVFAGVSADD 173



 Score = 42.4 bits (98), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC+ S C+K YCEC  +G+ C   C C  C N     +A++  V A +  + N  R  I
Sbjct: 121 CNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSADDNDLRVTI 180


>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 974

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 56/124 (45%), Positives = 76/124 (61%), Gaps = 7/124 (5%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK ++CLKLYC+CFA G  C   C CV C NN      R +A+E  L+RNP AF  K 
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVK- 694

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
              P   + +K+E    ++   ++K  GC+C+KS CLKKYC C+ + + CSE C+C +CK
Sbjct: 695 ---PEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTECK 751

Query: 233 NFEG 236
           N E 
Sbjct: 752 NLEN 755



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 27/37 (72%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
           C+CKK+ CLK YC+CF    LC ++C+C++C N E +
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEAT 672



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 36/66 (54%), Gaps = 2/66 (3%)

Query: 87  VRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNC 145
             V   E+ KS+   N  + +N+  K   CNC+ S CLK YC C+ SG+ C + C C  C
Sbjct: 691 FNVKPEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTEC 750

Query: 146 HNNVEN 151
             N+EN
Sbjct: 751 K-NLEN 755


>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
 gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
 gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
          Length = 782

 Score =  119 bits (298), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E       +    RNP AF P
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 527

Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           K+       +D   E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  C
Sbjct: 528 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 587

Query: 232 KNFEGSEE 239
           KN  G  +
Sbjct: 588 KNTFGKRD 595



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/39 (58%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C CKKS CLK YCECF A + CSE C C  C N
Sbjct: 464 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 502



 Score = 46.6 bits (109), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C  C N       +R+A  +T
Sbjct: 546 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 600


>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
          Length = 794

 Score =  119 bits (298), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E       +    RNP AF P
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 527

Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           K+       +D   E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  C
Sbjct: 528 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 587

Query: 232 KNFEGSEE 239
           KN  G  +
Sbjct: 588 KNTFGKRD 595



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/39 (58%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C CKKS CLK YCECF A + CSE C C  C N
Sbjct: 464 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 502



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C  C N       +R+A  +T
Sbjct: 546 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 600


>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
          Length = 782

 Score =  119 bits (298), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 58/128 (45%), Positives = 76/128 (59%), Gaps = 2/128 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFA+G+YC + C+C  C N   +E       +    RNP AF P
Sbjct: 456 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAP 515

Query: 173 KIASSPHGTRD-NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           K+       +D   E+        +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  C
Sbjct: 516 KVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGC 575

Query: 232 KNFEGSEE 239
           KN  G  +
Sbjct: 576 KNTFGKRD 583



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/39 (58%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G+  K C CKKS CLK YCECF A + CSE C C  C N
Sbjct: 452 GEACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLN 490



 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 8/59 (13%)

Query: 108 NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
           NTP   + ++ CNCK S CLK YCEC+  G+ C   C C  C N       +R+A  +T
Sbjct: 534 NTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTF----GKRDAAVST 588


>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
 gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
 gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
 gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
          Length = 475

 Score =  119 bits (297), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 78/122 (63%), Gaps = 15/122 (12%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 382

Query: 247 DH 248
            H
Sbjct: 383 PH 384



 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           +  K CNCK S CLK YCEC+ + I C   C C+ C  N E    R+  +       P  
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 390

Query: 170 FR------PKIASSPHGTRDNKE 186
           F       P   S P   R N++
Sbjct: 391 FESSHYLSPAKFSGPPKLRKNRQ 413


>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
          Length = 380

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 61/161 (37%), Positives = 93/161 (57%), Gaps = 23/161 (14%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C  C+C    NN+ +E  R +A++A ++RNP AF+PK+     G    + 
Sbjct: 183 YCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSKLR- 237

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L + 
Sbjct: 238 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMNT 287

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELF 287
            H        +A +      +  + ++ PP  K RK ++ F
Sbjct: 288 PH------YMEAGDFEGNHHLSPAKFSGPP--KLRKNRQTF 320



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 237 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEES 278


>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 805

 Score =  118 bits (296), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 59/131 (45%), Positives = 79/131 (60%), Gaps = 15/131 (11%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           KK + CNCK ++CLKLYC+CFA+G +C   CNC  C N V N+ AR + VE  LE+NP A
Sbjct: 475 KKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEKNPFA 534

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNK-----GCHCKKSGCLKKYCECFQANILCSE 224
           F           +D + E    L L    K     GC+C++SGCLKKYC+C+Q  + C E
Sbjct: 535 F---------NVKDIEIEEPSQLSLKAQKKLASKKGCNCRRSGCLKKYCQCYQDGLQCGE 585

Query: 225 NCRCMDCKNFE 235
           +C+C  C+N E
Sbjct: 586 HCKCNGCENCE 596



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE-RQALFHG 246
            K  + C+CKK+ CLK YC+CF A   C   C C +C N   ++E R  +  G
Sbjct: 474 NKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEG 526


>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
          Length = 363

 Score =  118 bits (296), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 70/133 (52%), Positives = 82/133 (61%), Gaps = 14/133 (10%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNC  S CLKLYCECFA G  C  CNC NC NN+  E  R  A++ TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI     G R             KH KGC+CK+SGCLK YCEC++A I CS+ CRC  C+
Sbjct: 204 KIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249

Query: 233 NFEGSEERQALFH 245
           N E S ER++L  
Sbjct: 250 NTEDSVERRSLMR 262


>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
          Length = 364

 Score =  118 bits (296), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 70/133 (52%), Positives = 82/133 (61%), Gaps = 14/133 (10%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           +K CNC  S CLKLYCECFA G  C  CNC NC NN+  E  R  A++ TLERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI     G R             KH KGC+CK+SGCLK YCEC++A I CS+ CRC  C+
Sbjct: 205 KIGRG-EGER-------------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 250

Query: 233 NFEGSEERQALFH 245
           N E S ER++L  
Sbjct: 251 NTEDSVERRSLMR 263


>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 471

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 77/135 (57%), Gaps = 21/135 (15%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           KKQK CNCK ++CLKLYC+CFA+G  C G CNC  C N   N   R  A+ + ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFN-TSNSEKRAGAIVSIMDRQPDA 185

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  + H                   KGC+C +SGCLKKYCEC+   + C E C+C 
Sbjct: 186 FGPKVQQNTHK------------------KGCNCTRSGCLKKYCECYTLGVNCGEYCKCT 227

Query: 230 DCKNFE-GSEERQAL 243
            CKN +    ER AL
Sbjct: 228 QCKNVDLNHSERLAL 242


>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 76/111 (68%), Gaps = 11/111 (9%)

Query: 124 LKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           LKLYCECF  G YC   C+C  CHN  +    R++A+++ L+RN +AF+PK+A S  G +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS--GKQ 262

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            N +        GKH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 263 VNNK--------GKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 6/64 (9%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           KP+  KS  + N     N  K  + CNCK S CLK YCEC+ +G+ C D CNCV C N  
Sbjct: 253 KPKVAKSGKQVN-----NKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKNRD 307

Query: 150 ENEA 153
           +++A
Sbjct: 308 DHDA 311


>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
          Length = 664

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 57/128 (44%), Positives = 75/128 (58%), Gaps = 19/128 (14%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNA 169
           ++Q+ CNCK S CLK+YCECFA+G +C   C+C++C N          A E  L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 228
           F  K+ ++          TG       H +GC CK+S CLKKYCECF A   C+ E C+C
Sbjct: 320 FEVKVTAA----------TG-------HRRGCRCKRSKCLKKYCECFHAGARCNPEVCQC 362

Query: 229 MDCKNFEG 236
            DC+N EG
Sbjct: 363 EDCRNTEG 370


>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
           CCMP2712]
          Length = 113

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/122 (48%), Positives = 79/122 (64%), Gaps = 12/122 (9%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           ++K C+CK SRCLKLYCECFA+G  C GC CV+C N+ ++E  R +AV+   +RN NAF 
Sbjct: 4   RKKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNNAFA 63

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           PKI       +            G H +GC CKKS CLKKYCEC+QA + C++ C+C +C
Sbjct: 64  PKIVDEIQQDK------------GMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKCEEC 111

Query: 232 KN 233
           +N
Sbjct: 112 QN 113



 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           E++++     + C CK S CLK YCEC+ +G+ C D C C  C N
Sbjct: 69  EIQQDKGMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKCEECQN 113


>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
 gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific
          Length = 475

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 77/122 (63%), Gaps = 15/122 (12%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CKN+E S ER+ L   
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKMLMST 382

Query: 247 DH 248
            H
Sbjct: 383 PH 384



 Score = 45.4 bits (106), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEES 373


>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
           norvegicus]
          Length = 467

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 77/122 (63%), Gaps = 15/122 (12%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 270 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 324

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CKN+E S ER+ L   
Sbjct: 325 ----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKMLMST 374

Query: 247 DH 248
            H
Sbjct: 375 PH 376



 Score = 45.4 bits (106), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 324 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEES 365


>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 75/111 (67%), Gaps = 11/111 (9%)

Query: 124 LKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           LKLYCECF  G YC   C+C  CHN  +    R++A+++ L+RN +AF+PK+A S  G +
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS--GKQ 262

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            N +        GKH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 263 VNNK--------GKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 6/64 (9%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           KP+  KS  + N     N  K  + CNCK S CLK YCEC+ +G+ C D CNCV C N  
Sbjct: 253 KPKVAKSGKQVN-----NKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKNRD 307

Query: 150 ENEA 153
           +++A
Sbjct: 308 DHDA 311


>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
          Length = 662

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 76/132 (57%), Gaps = 1/132 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           + CNC+ S+CLKLYC CFA+ +YC   C+C  C NN  +E       + T   NP AF P
Sbjct: 365 RHCNCRRSKCLKLYCPCFAAKVYCSSLCSCQGCSNNHAHEELVSCTRKRTESHNPLAFAP 424

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
            +  +     +  +++ +     +H +GC+C+KS CLKKYCECFQ+ + C  +CRCM CK
Sbjct: 425 TVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSGVGCCISCRCMSCK 484

Query: 233 NFEGSEERQALF 244
           N  G  E   L 
Sbjct: 485 NSFGKREGVLLL 496


>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
          Length = 360

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 27/149 (18%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 163 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 217

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 218 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 267

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP 275
            H     Y++        G   SS Y SP
Sbjct: 268 PH-----YME-------PGDFESSHYLSP 284



 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 217 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEES 258


>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 757

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFAS +YC + C+C  C N+  +E            RNP AF P
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 521

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     +  E++       +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 522 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 581

Query: 233 NFEG 236
           N  G
Sbjct: 582 NSFG 585



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502


>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 766

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFAS +YC + C+C  C N+  +E            RNP AF P
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 530

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     +  E++       +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 531 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 590

Query: 233 NFEG 236
           N  G
Sbjct: 591 NSFG 594



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511


>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
          Length = 260

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)

Query: 71  VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-----QKQCNCKHSRCLK 125
           V      L  Q  +    + +    +++S   +  + N  +K     Q+QCNCK S+CLK
Sbjct: 14  VKNKQKKLINQIKKKSSFLTQKSHSQTKSEHTINQQANNIQKLKALQQRQCNCKKSKCLK 73

Query: 126 LYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
           LYC+CFA G YC+  C+C  C NN  N+  R  A++   ERNPNAF  +I          
Sbjct: 74  LYCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAFTSRIIQQSQ----- 128

Query: 185 KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
                    L  H +GC+CKKSGC KKYC+C+   + CS  CRC+ C+N    E ++   
Sbjct: 129 ---------LNNHQRGCNCKKSGCQKKYCQCYLDGMECSRYCRCIGCENCNIFENKKICL 179

Query: 245 HGDHAN--------NMAYIQQAANAAITGAI 267
           +  +          N  YIQ+     I   I
Sbjct: 180 NNKNIEKQIQQQKGNSDYIQKDQQQNILNQI 210


>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
 gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
 gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
 gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
 gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
 gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
          Length = 295

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 87/149 (58%), Gaps = 27/149 (18%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+     G    + 
Sbjct: 98  YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 152

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 153 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 202

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASP 275
            H     Y++        G   SS Y SP
Sbjct: 203 PH-----YME-------PGDFESSHYLSP 219



 Score = 43.1 bits (100), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           +  K CNCK S CLK YCEC+ + I C   C C+ C  N E    R+  +       P  
Sbjct: 152 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 210

Query: 170 FR------PKIASSPHGTRDNKE 186
           F       P   S P   R N++
Sbjct: 211 FESSHYLSPAKFSGPPKLRKNRQ 233


>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 766

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 7/130 (5%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECF +G++C + C+C +C N   N+E   + R+ +E+   RNP A
Sbjct: 459 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIES---RNPLA 515

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+  +    ++  E +       +H +GC+CKKS CLKKYCEC+Q  + CS +CR  
Sbjct: 516 FAPKVIRTSEPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRYE 575

Query: 230 DCKNFEGSEE 239
            CKN  G  E
Sbjct: 576 GCKNAFGRRE 585



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/39 (61%), Positives = 25/39 (64%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           G   K C CKKS CLK YCECF A + CSE C C DC N
Sbjct: 455 GTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLN 493



 Score = 42.4 bits (98), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 5/50 (10%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNC-HNNVENEAARREAV 159
           + ++ CNCK S CLK YCEC+  G+ C     ++C +   +N   RRE +
Sbjct: 542 RHKRGCNCKKSSCLKKYCECYQGGVGCS----ISCRYEGCKNAFGRREGL 587


>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
          Length = 644

 Score =  115 bits (288), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 1/124 (0%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K+C+CK S+CLKLYCECFAS +YC + C+C  C N+  +E            RNP AF P
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAP 408

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           K+  +     +  E++       +H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CK
Sbjct: 409 KVIRTCGPGLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCK 468

Query: 233 NFEG 236
           N  G
Sbjct: 469 NSFG 472



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389


>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 978

 Score =  115 bits (287), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 73/119 (61%), Gaps = 9/119 (7%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK ++CLKLYCECFA   YC +GC+C  C N  ENE  R+ A++    RN +AF+PK 
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            S     R NK    ++       KGC+C+K+ CLKKYCECF A   C+  C+C  C+N
Sbjct: 465 ESE--QERKNKANNEQM------KKGCNCRKTKCLKKYCECFNAGTYCNNMCKCDSCEN 515


>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  115 bits (287), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 78/136 (57%), Gaps = 19/136 (13%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF--- 170
           QCNCK S+CLKLYCECFA+   C   CNC  C N ++N   R +A+E  L RNP AF   
Sbjct: 163 QCNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTI 222

Query: 171 ------RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
                 +P+I   P   ++  +ET          KGC+CKKS C KKYCEC+  N  C++
Sbjct: 223 LTNNGQQPQIIPEPKSQKEQSKETK---------KGCNCKKSECKKKYCECYSINQKCTD 273

Query: 225 NCRCMDCKNFEGSEER 240
            C+C +C N E  +E+
Sbjct: 274 LCKCENCLNKEEPQEQ 289


>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
 gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
          Length = 781

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 85/141 (60%), Gaps = 3/141 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           ++CNCK SRCLKLYCECFA+GIYC D C+CVNC N  E E    +  + T  RNP AF P
Sbjct: 441 RRCNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAP 500

Query: 173 KIASSPHGTRDNKEETGEVLMLG--KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           K+ ++   +  N  E G+ +     +H KGC+CKKS C KKYCECFQ  + C   CRC  
Sbjct: 501 KVVNNATNSPANMMEEGKWMKTSSSRHKKGCNCKKSKCSKKYCECFQGGVGCCNGCRCEG 560

Query: 231 CKNFEGSEERQALFHGDHANN 251
           C N  G++   +    +  NN
Sbjct: 561 CYNPYGNKTETSYRRAERWNN 581


>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1114

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 71/127 (55%), Gaps = 11/127 (8%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC  ++CLKLYC CF  G+ C D C C +C+N +     R  A++A  +R P+AF  K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773

Query: 175 ASSPHGTRDNKEETGEVL--------MLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
              P   R  K E  E L         L  H KGC+CKKS C KKYCECF A + C+  C
Sbjct: 774 NFLPELAR--KIEDAERLDADLELQNQLLAHKKGCNCKKSACKKKYCECFLAGVKCTYLC 831

Query: 227 RCMDCKN 233
           RC  C+N
Sbjct: 832 RCEQCQN 838


>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
          Length = 929

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 69/117 (58%), Gaps = 14/117 (11%)

Query: 126 LYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
           LYC+CFA+G YC   C+C +C N VEN A      EA  +RNPNAF  KI +        
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA-------- 591

Query: 185 KEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
                +  + G+H +GC+CKKS C+KKYCECFQA + C E+C+C  C N  G   R+
Sbjct: 592 -----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
           + ++ CNCK S C+K YCECF +G+ C + C C +CHN   +   R
Sbjct: 598 QHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643


>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 712

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 89/194 (45%), Gaps = 43/194 (22%)

Query: 83  TQPPVRVPKPESPKSRS-----RSNVEVKENTPKKQKQ---------CNCKHSRCLKLYC 128
           T PP   P P  P   S     RS++ ++  T + +           C CK S+CLKLYC
Sbjct: 291 TNPPANYPAPTIPSRPSVPVAQRSSIVLQAATARLKAHHYTDNSSVTCTCKKSKCLKLYC 350

Query: 129 ECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPK-------------- 173
            CF++   C   C C  C N  E    R +AV+ATL+RNPNAFR K              
Sbjct: 351 HCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAKEW 410

Query: 174 -------------IASSPHGTRDNKEETGEVLMLG-KHNKGCHCKKSGCLKKYCECFQAN 219
                        +  SP  T+    E   V +    H  GC+C+KS CLK+YCECF A 
Sbjct: 411 EKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFGAQ 470

Query: 220 ILCSENCRCMDCKN 233
             C  NC+C+ C+N
Sbjct: 471 TPCGLNCKCLGCQN 484


>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
          Length = 386

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 6/130 (4%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           QCNCK S+CLKLYCECFA+   C   CNC +C N ++N   R +A+E  L RNP+AF   
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250

Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
                      +++   +    K N     KGC+CKKSGC KKYCEC+  N+ CS+ C+C
Sbjct: 251 FQQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310

Query: 229 MDCKNFEGSE 238
             C N   +E
Sbjct: 311 EQCLNRTDAE 320


>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1022

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/118 (46%), Positives = 71/118 (60%), Gaps = 6/118 (5%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIA 175
           C CK ++CLKLYCECF +  YC G NC  C N  E E  R+ A+E   +RN +AF PK  
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825

Query: 176 SSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            S       K ++G   +   H+KGC+CKKS C KKYCEC+Q    C+  C+C +CKN
Sbjct: 826 KSDQYY---KSDSGTKAI---HSKGCNCKKSDCRKKYCECYQMGAKCTHLCKCYNCKN 877


>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
 gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
          Length = 478

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/126 (44%), Positives = 75/126 (59%), Gaps = 31/126 (24%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNP 167
           N   + K+C+CK+S+CLK+YCECFA+ I C+ C C  C NN           EA +E+  
Sbjct: 97  NIADQCKKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNN-----------EANIEKIG 145

Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
           N                     + +++ KH++GCHC+KSGCLKKYCECFQA I C+ENC+
Sbjct: 146 N--------------------DQSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCK 185

Query: 228 CMDCKN 233
           C DCKN
Sbjct: 186 CYDCKN 191



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER 240
           K CHCK S CLK YCECF   ILC+ NC+C  C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143


>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
          Length = 384

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 75/130 (57%), Gaps = 6/130 (4%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           QCNCK S+CLKLYCECFA+   C   CNC +C N ++N   R +A+E  L RNP+AF   
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250

Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
                      +++   +    K N     KGC+CKKSGC KKYCEC+  N+ CS+ C+C
Sbjct: 251 FQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310

Query: 229 MDCKNFEGSE 238
             C N   +E
Sbjct: 311 EQCLNRTDAE 320


>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
          Length = 396

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 72/121 (59%), Gaps = 17/121 (14%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK SRCLKLYCECF S  YC   CNC+NC+NN ++E+ R EAV+    RNP+ F   +
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCFENHV 210

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                   D K+        G +  GC C+++ C KKYCECF   + C + C+C DCKN 
Sbjct: 211 --------DEKK--------GVNKSGCRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNE 254

Query: 235 E 235
           E
Sbjct: 255 E 255


>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
 gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
          Length = 749

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 61/147 (41%), Positives = 88/147 (59%), Gaps = 11/147 (7%)

Query: 104 EVKENTPKKQKQCNCKHSR---CLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARR 156
           +  + +PKK++    + SR    L+LYCECFA+G+YC + C C +C N   +E    A R
Sbjct: 445 DFSQTSPKKKRHVG-RRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATR 503

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
           + +E+   RNP AF PK+  S     +  +E+ +     +H +GC+CKKS CLKKYCEC+
Sbjct: 504 KQIES---RNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKYCECY 560

Query: 217 QANILCSENCRCMDCKNFEGSEERQAL 243
           Q  + CS NCRC  CKN  G ++  AL
Sbjct: 561 QGGVGCSLNCRCEGCKNAFGRKDGSAL 587


>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 510

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 51/108 (47%), Positives = 64/108 (59%), Gaps = 13/108 (12%)

Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           LYCECFA+G +C  C+C  C N  ENEA   +  +   +RNP AF  KI +         
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA--------- 257

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            + G+     +H KGCHCKKS CLKKYCECFQA + C + C+C  CKN
Sbjct: 258 -DAGDD---ARHTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301


>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
          Length = 348

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 74/120 (61%), Gaps = 1/120 (0%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           QCNCK S+CLKLYCECF +   C   CNC  C N ++N   R +A+E  L RNP AF P 
Sbjct: 167 QCNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPI 226

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           + ++    +  +E+  +  +  +  KGC+CKKS C KKYCEC+  N  C+E C+C +C N
Sbjct: 227 LTNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQKCTELCKCENCLN 286


>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 742

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 54/128 (42%), Positives = 75/128 (58%), Gaps = 19/128 (14%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
           E +P + K CNC++S+CLK YC+CFA+G+YC   C C  CHN  E E  R++A++   +R
Sbjct: 493 EKSPTEAKPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKR 552

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
             +AF P I ++ H                   KGCHCK S C KKYC C Q  +LCS  
Sbjct: 553 IKDAFLPVIQNNSHI------------------KGCHCKNSHCQKKYCVCHQNGVLCSSL 594

Query: 226 CRCMDCKN 233
           C+C++C+N
Sbjct: 595 CQCVECEN 602



 Score = 43.1 bits (100), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVE 160
           K C+CK+S C K YC C  +G+ C   C CV C N +E+    + +++
Sbjct: 568 KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECENKIESMKKEQSSIQ 615


>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 495

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 67/163 (41%), Positives = 91/163 (55%), Gaps = 11/163 (6%)

Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
           + ++  K ++ C CK+S+CLKLYCECFA G YC D C C+ C N  E E   +EA + TL
Sbjct: 245 INDSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTL 304

Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
            RNP+AF  K+       +D   E     MLG + KGC CK++ C KKYCEC+ A + CS
Sbjct: 305 TRNPDAFTSKLEVVGTIVQDEDIEANRG-MLG-YKKGCKCKRTYCKKKYCECYNAGVKCS 362

Query: 224 ENCRCMDCKNFE--------GSEERQALFHGDHANNMAYIQQA 258
             C C +C N E         + E+Q   + +HA  +A   QA
Sbjct: 363 YLCICENCHNQEDDPSKEKDNAAEQQVHKNPNHAGGVANTNQA 405


>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
 gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
          Length = 414

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 53/114 (46%), Positives = 68/114 (59%), Gaps = 1/114 (0%)

Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           +K YCECFA+G+YC + C+C+ C NN  +E    E       RNP AF PK+  +     
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
           +  EET +     +H +GC+CKKS CLKKYCECFQ  + CS  CRC  CKN  G
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251



 Score = 45.8 bits (107), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 7/47 (14%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGI------YCDGC-NCVNCHNNVE 150
           + ++ CNCK S CLK YCECF  G+       C+GC N   C N VE
Sbjct: 211 RHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFGCKNGVE 257



 Score = 38.9 bits (89), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           +K YCECF A + C E C C++C N    E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168


>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
          Length = 196

 Score =  111 bits (277), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 53/136 (38%), Positives = 73/136 (53%), Gaps = 25/136 (18%)

Query: 99  SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARRE 157
           S+ +++++      ++ CNCK S+CLK YC+CFA+G  C + CNCV C NN  N   R+E
Sbjct: 82  SQKSLQIQSKLFINERSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKE 141

Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
           A    + R+P AF+                           KGC+CKKSGC KKYCECF 
Sbjct: 142 AKVQIINRDPGAFKQSF------------------------KGCNCKKSGCQKKYCECFL 177

Query: 218 ANILCSENCRCMDCKN 233
           + + C+  CRC  C N
Sbjct: 178 SGLACTHLCRCDGCLN 193


>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
          Length = 218

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 56/119 (47%), Positives = 71/119 (59%), Gaps = 15/119 (12%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA+G  C   CNC  C NN  N   R + +E  +ERNP AF  K+
Sbjct: 85  CNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKV 144

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
               +                 H+KGC+C+KSGC KKYCEC+Q  I CS+NC+C  C+N
Sbjct: 145 KEVDYKL--------------AHSKGCNCRKSGCKKKYCECYQLGIECSDNCKCDGCQN 189


>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
          Length = 1430

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/128 (41%), Positives = 73/128 (57%), Gaps = 12/128 (9%)

Query: 114  KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAAR--REAVEAT--LERNPNA 169
            K CNCK S+CLKLYCEC+A+G++C  CNC NC N  E+  A     A  A   L     A
Sbjct: 1012 KPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHALGRLRHGDVA 1071

Query: 170  FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
                + +S    +   ++        +   GC CKKS CLKKYCECF+A + C  +C+C 
Sbjct: 1072 VPTAVLTSAQKRKPQSKQ--------RQESGCFCKKSKCLKKYCECFEAGVHCEASCKCE 1123

Query: 230  DCKNFEGS 237
            +C+N++GS
Sbjct: 1124 NCENYDGS 1131


>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
          Length = 142

 Score =  110 bits (275), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 55/129 (42%), Positives = 76/129 (58%), Gaps = 10/129 (7%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNA 169
           +K + C CK SRCL+LYCECF +GI+C+  C C NC N   N+     A +    RNP+A
Sbjct: 10  RKVEFCKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKMRNPDA 69

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F  K        + N+ E      +  H KGC+C K+ C KKYCECF A I C+ENC+C 
Sbjct: 70  FSQKFLV----VKQNQYEG-----IVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCE 120

Query: 230 DCKNFEGSE 238
           +C+N++  +
Sbjct: 121 NCENYKDEQ 129



 Score = 48.5 bits (114), Expect = 0.010,   Method: Composition-based stats.
 Identities = 19/40 (47%), Positives = 28/40 (70%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER 240
           C CKKS CL+ YCECF   I C+++C C +C N E ++++
Sbjct: 15  CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54



 Score = 48.1 bits (113), Expect = 0.014,   Method: Composition-based stats.
 Identities = 25/62 (40%), Positives = 36/62 (58%), Gaps = 5/62 (8%)

Query: 91  KPESPKSRSRSNVEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNC 145
           K  +P + S+  + VK+N  +     +K CNC  ++C K YCECF +GI C + C C NC
Sbjct: 63  KMRNPDAFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCENC 122

Query: 146 HN 147
            N
Sbjct: 123 EN 124


>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
 gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 91

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/96 (59%), Positives = 65/96 (67%), Gaps = 9/96 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFAS  YC+ CNC+ C NN ENEA R+ AVEA +ERNPNAF+PKI    H T     
Sbjct: 1   YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHET----- 55

Query: 187 ETGEVLML----GKHNKGCHCKKSGCLKKYCECFQA 218
            T  V+      G+H KGC+CKKS CLKKYCECFQ 
Sbjct: 56  HTPVVVAAAGASGRHLKGCNCKKSFCLKKYCECFQV 91


>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
          Length = 593

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/131 (43%), Positives = 81/131 (61%), Gaps = 8/131 (6%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
           P SPK + R + +  E      K+CNCK S+CLKLYCECFA+G YC + C+C+NC N   
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365

Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
           + +   A R+ +E+   RNP AF PK+  +     +  E+  +     +H +GC+CKKS 
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422

Query: 208 CLKKYCECFQA 218
           CLKKYCEC+Q+
Sbjct: 423 CLKKYCECYQS 433



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/58 (48%), Positives = 34/58 (58%), Gaps = 5/58 (8%)

Query: 177 SPHGTRDNKEETGEVLMLGKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           SP   R   E++GE    G  + K C+CKKS CLK YCECF A   C E C C++C N
Sbjct: 309 SPKKKRRKSEQSGE----GDSSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFN 362


>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
 gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
          Length = 155

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/144 (43%), Positives = 81/144 (56%), Gaps = 14/144 (9%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C+CK S+CLKLYCECFASG YC D C C NC N  E+E    E  +    RNP AF P+
Sbjct: 1   RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60

Query: 174 IASSPHGTRDNKEETGEVLML-------------GKHNKGCHCKKSGCLKKYCECFQANI 220
           + S     R  +       ++              KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61  VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120

Query: 221 LCSENCRCMDCKNFEGSEERQALF 244
            CS+ C+C  CKN  G+++    F
Sbjct: 121 GCSDACKCRGCKNTFGAKQGDDFF 144



 Score = 42.4 bits (98), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           T K ++ CNCK S CLK YCECF S + C D C C  C N
Sbjct: 94  TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKN 133


>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
          Length = 243

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 65/122 (53%), Positives = 87/122 (71%), Gaps = 11/122 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC+NN  +EA R +A++A L RNP AF+PKI          K 
Sbjct: 52  YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIG---------KG 102

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
           + GEV    +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +R+ L++G
Sbjct: 103 KLGEVKP--RHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQRRILWNG 160

Query: 247 DH 248
            H
Sbjct: 161 PH 162



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 36/69 (52%), Gaps = 4/69 (5%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
           R P+   PK       EVK   P+  K CNCK S CLK YCECF + I C   C C++C 
Sbjct: 90  RNPEAFQPKIGKGKLGEVK---PRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCK 146

Query: 147 NNVENEAAR 155
           N  E+   R
Sbjct: 147 NYEESPQRR 155


>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
          Length = 512

 Score =  108 bits (271), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC N++ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 311 YCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 361

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 362 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 418



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 368 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 410


>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Oryctolagus cuniculus]
          Length = 450

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 62/140 (44%), Positives = 89/140 (63%), Gaps = 11/140 (7%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN  +E  R +A++A L+RNP AF+PKI     G+   + 
Sbjct: 250 YCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAKPR- 308

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                     HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L + 
Sbjct: 309 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEESPERKTLMNA 358

Query: 247 DHANNMAYIQQAANAAITGA 266
            +  +   ++   + + TGA
Sbjct: 359 PNCVDAGDLEGGHHLSPTGA 378



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C CV C N  E+
Sbjct: 307 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEES 349


>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
          Length = 511

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 310 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417



 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409


>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 651

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 7/119 (5%)

Query: 125 KLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNAFRPKIASSPHG 180
           K YCECFA+G+YC + C+C++C N   +E    A R+ +E+   RNP AF PK+  +   
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLAFAPKVIRNSES 419

Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
             +  ++  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++
Sbjct: 420 VLETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 478



 Score = 45.8 bits (107), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+  G+ C   C C  C N
Sbjct: 435 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 472


>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
          Length = 509

 Score =  108 bits (270), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 85/119 (71%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 308 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 358

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER++L +
Sbjct: 359 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKSLIN 415



 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 365 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 407


>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
          Length = 658

 Score =  108 bits (270), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 75/119 (63%), Gaps = 7/119 (5%)

Query: 125 KLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNAFRPKIASSPHG 180
           K YCECFA+G+YC + C+C++C N   +E    A R+ +E+   RNP AF PK+  +   
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLAFAPKVIRNSDS 426

Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
            ++  ++  +     +H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++
Sbjct: 427 VQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 485



 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCEC+  G+ C   C C  C N
Sbjct: 442 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 479


>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
          Length = 517

 Score =  108 bits (270), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 62/122 (50%), Positives = 84/122 (68%), Gaps = 11/122 (9%)

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           L  YC+CFASG +C+ CNC NC NN+ +E  R +A++A L RNP AF+PKI         
Sbjct: 317 LARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG-------- 368

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
            K + G+V    +HNKGC C++SGCLK YCEC++A I+CS  C+C+DCKN+E S ER+ L
Sbjct: 369 -KGQLGDVK--PRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPERKTL 425

Query: 244 FH 245
            +
Sbjct: 426 MN 427



 Score = 45.8 bits (107), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K C+C+ S CLK YCEC+ + I C   C C++C N  E+
Sbjct: 377 PRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEES 419


>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
 gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
          Length = 141

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 80/139 (57%), Gaps = 14/139 (10%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C+CK S+CLKLYCECFASG YC D C C NC N  E+E    E  +    RNP AF P+
Sbjct: 1   RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60

Query: 174 IASSPHGTRDNKEETGEVLML-------------GKHNKGCHCKKSGCLKKYCECFQANI 220
           + S     R  +       ++              KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61  VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120

Query: 221 LCSENCRCMDCKNFEGSEE 239
            CS+ C+C  CKN  G+++
Sbjct: 121 GCSDACKCRGCKNTFGAKQ 139



 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           T K ++ CNCK S CLK YCECF S + C D C C  C N
Sbjct: 94  TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKN 133


>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
          Length = 671

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/108 (45%), Positives = 61/108 (56%), Gaps = 13/108 (12%)

Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           LYC+CFA+G++C  C+C +C N   +    R+       RNPNAF  KI           
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRYQIESRNPNAFANKIVDDDS------ 411

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                  +  KH KGCHCKKS CLKKYCECFQAN+ C + C+C  CKN
Sbjct: 412 -------VDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452



 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
           K  K C+CK S CLK YCECF + + C D C C  C N  +
Sbjct: 415 KHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKNTTD 455


>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
          Length = 373

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/144 (43%), Positives = 83/144 (57%), Gaps = 10/144 (6%)

Query: 115 QCNCKHSRCLKLYCECFAS-GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           QCNCK S+CLKLYCECFA+ G+    CNC +C N ++N   R +A+E  L RN +AF   
Sbjct: 187 QCNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFVQC 246

Query: 174 IASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 228
            ++   G     ++   +    K N     KGC+CKKSGC KKYCEC+  NI C+E C+C
Sbjct: 247 FSTK--GGAQFVQQDKPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKC 304

Query: 229 MDCKNFEGSE--ERQALFHGDHAN 250
             C N   +E   +Q L   DH N
Sbjct: 305 EHCLNKTDAEIQAQQDLGQEDHNN 328


>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
 gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
          Length = 316

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 94/161 (58%), Gaps = 16/161 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFA+G +C  CNC NCHNN+E+E  R  A++A LERNP+AFRPKI  +  G      
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNER--- 194

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH- 245
                    +H+KGC+CK+SGCLK YCEC++A I C+  C+C +CKN E   ER+ L   
Sbjct: 195 ---------RHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245

Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQEL 286
            D A+     Q AA + ++  I      S  +S K  G  L
Sbjct: 246 ADAADIRVKQQTAAKSKLSTQIDD---LSASISAKNDGSRL 283



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           N  +  K CNCK S CLK YCEC+ + I C   C C NC N
Sbjct: 192 NERRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKN 232


>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/116 (46%), Positives = 69/116 (59%), Gaps = 4/116 (3%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
           T   +K CNCK ++CLKLYC+C A+G+YC D C C NC N  ENE   +   E    R+P
Sbjct: 10  TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 69

Query: 168 NAFRPKIASSPHGTRDN-KEETGEVLM--LGKHNKGCHCKKSGCLKKYCECFQANI 220
            AF P+I +    T DN  +E G        +H +GC+CKKS C KKYCEC+Q  I
Sbjct: 70  LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQVYI 125



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 25/35 (71%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           K C+CK++ CLK YC+C  A + C+++C C +C N
Sbjct: 15  KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLN 49


>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
          Length = 276

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 71/124 (57%), Gaps = 15/124 (12%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
           ++ K CNC+ S+CLKLYC+CFA G  C   CNC  C NN  N   R   ++  +ERNP A
Sbjct: 144 QEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQA 203

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F  K+               EV     H KGC+C+KSGC KKYCEC+Q  I CS+NC+C 
Sbjct: 204 FNQKVQ--------------EVESKMTHAKGCNCRKSGCQKKYCECYQMGIECSDNCKCD 249

Query: 230 DCKN 233
            C N
Sbjct: 250 GCLN 253



 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%)

Query: 191 VLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           +  L + +K C+C+KS CLK YC+CF    LCS  C C  C N
Sbjct: 139 LFYLNQEHKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFN 181


>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
          Length = 601

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 55/138 (39%), Positives = 74/138 (53%), Gaps = 14/138 (10%)

Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           YCECF++G++C + C+C NC N   +E    ++ E    RNP AF PK+ S+     D  
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ---- 241
            E  +     +H +GC+C+KSGC KKYCECF   + CS NCRCM CKN  G    Q    
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGHTNEQCAGD 547

Query: 242 ---------ALFHGDHAN 250
                    A  +GDH +
Sbjct: 548 SDVVTINDEAKHYGDHGD 565



 Score = 42.4 bits (98), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + ++ CNC+ S C K YCECF  G+ C   C C+ C N
Sbjct: 498 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKN 535


>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
 gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
          Length = 777

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/124 (45%), Positives = 74/124 (59%), Gaps = 13/124 (10%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C CK + CLK+YC CF+S  +C  GC C +C N   ++  R EA+++ L  +P AF    
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SF 596

Query: 175 ASSPHGTRDNKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           AS P  T      TG + +L + +     +GC CKKS CLKKYCECFQ  I C+ +CRCM
Sbjct: 597 ASLPQDT----NTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCM 652

Query: 230 DCKN 233
           DC N
Sbjct: 653 DCSN 656


>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
          Length = 713

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/148 (40%), Positives = 82/148 (55%), Gaps = 20/148 (13%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFR--- 171
           C CK + CLK+YC CF+S  +C  GC C +C N  +++  R EA++  L  +P AF    
Sbjct: 536 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAFSFAS 594

Query: 172 -PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
            P+ AS+        +++  V+M     +GC CKKS CLKKYCECFQ  I C+ +CRCMD
Sbjct: 595 LPQSASTTGFLHLLPQKSSAVVM-----RGCRCKKSKCLKKYCECFQNGIACTSHCRCMD 649

Query: 231 CKNFEGS---------EERQALFHGDHA 249
           C N   S         ++R A  HG  A
Sbjct: 650 CSNHSESTAAHQHPHLQKRSATAHGKRA 677


>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score =  107 bits (267), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 56/130 (43%), Positives = 68/130 (52%), Gaps = 28/130 (21%)

Query: 108 NTPK---KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
           N PK     K CNCK S+CLK YC+CFA+G  C + CNCV C NN  N   R+EA    +
Sbjct: 80  NLPKIITNTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQII 139

Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
            R+P AF+                           KGC+CKKSGC KKYCECF + + C+
Sbjct: 140 NRDPGAFKQSF------------------------KGCNCKKSGCQKKYCECFLSGLACT 175

Query: 224 ENCRCMDCKN 233
             CRC  C N
Sbjct: 176 HLCRCDGCLN 185


>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
          Length = 728

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 83/119 (69%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C+ CNC NC+NN+ +E  R +A++A L+RNP AF+PKI     G  D K 
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--DIKP 584

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
                    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 585 ---------RHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 634



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 584 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 626


>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
 gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
          Length = 510

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 309 YCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIG---------KG 359

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 360 KLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 416



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 366 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 408


>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
          Length = 515

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/118 (51%), Positives = 82/118 (69%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L RNP AF+PKI          K 
Sbjct: 314 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 364

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 365 QLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 420



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 371 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 413


>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/119 (48%), Positives = 69/119 (57%), Gaps = 15/119 (12%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CFA+   C   CNC  C NN  N   R    E  +ERNP AF  K+
Sbjct: 109 CNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKV 168

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                          EV     H+KGC+C+KSGC KKYCEC+Q  I CS+NC+C  CKN
Sbjct: 169 K--------------EVDQKMAHSKGCNCRKSGCKKKYCECYQMGIECSDNCKCDGCKN 213


>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
          Length = 511

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409


>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
 gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
          Length = 511

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 310 YCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 360

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 361 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 417



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 367 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 409


>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
 gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
          Length = 808

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 98/206 (47%), Gaps = 37/206 (17%)

Query: 68  VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL- 126
           V P    P P+   AT+   +     SPK + R +  V E   +  K+CNCK S+CLKL 
Sbjct: 448 VQPTEDCPQPIAYMATEDFNQ----NSPKKKRRKSEPVGE--IEGCKRCNCKKSKCLKLI 501

Query: 127 ------------------YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
                             YCECFA+G+YC + C+C  C N   +E    +  +    RNP
Sbjct: 502 DLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIESRNP 561

Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL------ 221
            AF PK+  S     +   +  +     +H +GC+CKKS CLKKYCEC+Q  IL      
Sbjct: 562 LAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYCECYQVLILTISCLP 621

Query: 222 -----CSENCRCMDCKNFEGSEERQA 242
                CS +CRC  CKN  G ++  A
Sbjct: 622 QGGVGCSISCRCEGCKNAFGRKDGSA 647


>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
          Length = 383

 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 182 YCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG---------KG 232

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 233 KLGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 289



 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 239 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 281


>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
          Length = 548

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 58/119 (48%), Positives = 82/119 (68%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C+ CNC NC+NN+ +E  R +A++A L+RNP AF+PKI     G      
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLGD----- 401

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
                 +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 402 ------IKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 454


>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
          Length = 265

 Score =  105 bits (262), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 99/191 (51%), Gaps = 27/191 (14%)

Query: 71  VPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKK-QKQ------CNCKHSRC 123
           +P  P P+P    +PPV  P+ +  + ++  + E++    +  Q Q      CNCK S+C
Sbjct: 1   MPHYPHPIPY-WYRPPVYFPQYDGQQYQNTLSKELQSKVSQLVQYQKVSHFACNCKKSKC 59

Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF---------RPK 173
           LKLYCECFA+   C   CNC  C N ++N   R +A E  L RNP+AF         +P+
Sbjct: 60  LKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAILTNNGQQPQ 119

Query: 174 IASS-----PHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           I        P   +  KE   ++       KGC+CKKS C KKYCEC+  N  C++ C+C
Sbjct: 120 IIQDKTLILPLEEKSQKESQIDITT----RKGCNCKKSECKKKYCECYSINQKCTDLCKC 175

Query: 229 MDCKNFEGSEE 239
            +C N   S+E
Sbjct: 176 ENCLNKAESQE 186


>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
          Length = 508

 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L RNP AF+PKI          K 
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 357

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G+V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 358 QLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 413



 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 406


>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
          Length = 783

 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 11/99 (11%)

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
           A L+RNP AF+PKI     G  D +           H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652

Query: 221 LCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
           +CS  C+C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           K CNCK S CLK YCEC+ + I C   C C+ C N
Sbjct: 631 KGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 665


>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 710

 Score =  105 bits (261), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 94/169 (55%), Gaps = 18/169 (10%)

Query: 81  QATQPPVRVPKPE-SPKSRSRSNVEVKENTPKK----QKQCNCKHSRCLKLYCECFASGI 135
           QAT   V +P+ + +    S  N +   N+P++     K+C C  S+CLKLYCECFA+G+
Sbjct: 487 QATSVGVNIPQKQITHNHHSHHNEQKLGNSPQQPGVSTKKCTCSKSKCLKLYCECFAAGL 546

Query: 136 YCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML 194
            C   C C +C N  ++  A ++A +  ++R+P AF  K++          + +G   + 
Sbjct: 547 VCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVS----------QNSGTDKL- 595

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
            +H KGC CKKSGC K YCECFQ  + C++ C+C  C N E   ++  L
Sbjct: 596 -QHRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSDQPCL 643


>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
          Length = 374

 Score =  105 bits (261), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 17/128 (13%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF---- 170
           CNCK S+CLKLYCECFA+   C   CNC  C N ++N   R +A+E  L RNP+AF    
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228

Query: 171 -----RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
                +P+I       +  KE   ++       KGC+CKKS C KKYCEC+  N  C++ 
Sbjct: 229 TNNGQQPQIIQE---EKSQKESQKDITT----RKGCNCKKSECKKKYCECYSINQRCTDL 281

Query: 226 CRCMDCKN 233
           C+C +C N
Sbjct: 282 CKCENCLN 289



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 4/71 (5%)

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK----GCHCKKSGCLKKYCECFQANILC 222
           P  + P+     +    +KE   +V  L ++ K     C+CKKS CLK YCECF  N +C
Sbjct: 131 PPIYYPQYDGQHYQNTLSKELQSKVSQLVQYQKVSPFACNCKKSKCLKLYCECFANNWVC 190

Query: 223 SENCRCMDCKN 233
           S+NC C +CKN
Sbjct: 191 SQNCNCCECKN 201



 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVE 150
           K CNCK S C K YCEC++    C D C C NC N V+
Sbjct: 255 KGCNCKKSECKKKYCECYSINQRCTDLCKCENCLNKVQ 292


>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
           fascicularis]
          Length = 490

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L RNP AF+PKI          K 
Sbjct: 289 YCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIG---------KG 339

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G+V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 340 QLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 395



 Score = 47.0 bits (110), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 346 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 388


>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
          Length = 283

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 82/119 (68%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ C+C NC NN+ +E  R  A++A L+RNP AF+PKI  S  G  D K 
Sbjct: 77  YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKSRLG--DVKP 134

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
                    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+DC N+E S ER+ L +
Sbjct: 135 ---------RHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPERKTLMN 184



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNCK S CLK YCEC+ + I C   C C++C+N  E+
Sbjct: 134 PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEES 176


>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
          Length = 516

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/119 (49%), Positives = 84/119 (70%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIG---------KG 365

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + G+V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S E++ L +
Sbjct: 366 KLGDVK--PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPEQKTLMN 422



 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 372 PRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEES 414


>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
          Length = 263

 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/118 (50%), Positives = 82/118 (69%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+++E  R +A++A L RNP AF+PKI +S  G  D K 
Sbjct: 53  YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLG--DVKP 110

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
                    +H+KGC C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 111 ---------RHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 159



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
           R P+   PK  +    +VK   P+  K C+C+ S CLK YCEC+ + I C   C C+ C 
Sbjct: 91  RNPEAFQPKIGTSQLGDVK---PRHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCK 147

Query: 147 NNVEN 151
           N  E+
Sbjct: 148 NYEES 152


>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
          Length = 391

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/119 (50%), Positives = 82/119 (68%), Gaps = 11/119 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ +E  R  A++  L+RNP AF+PKI          K 
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIG---------KG 236

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
            +G+V    +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 237 RSGDVK--PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 293



 Score = 47.0 bits (110), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
           R P+   PK     + +VK   P+  K CNCK S CLK YCEC+ + I C   C C+ C 
Sbjct: 224 RNPEAFQPKIGKGRSGDVK---PRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 280

Query: 147 NNVEN 151
           N  E+
Sbjct: 281 NYEES 285


>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
          Length = 502

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/118 (48%), Positives = 80/118 (67%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C  CNC NC NN+ +E  R +A++A L+RNP AF+PKI          K 
Sbjct: 302 YCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIG---------KG 352

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
             G+V    +HN+GC+C++SGCLK YCEC++A I+CS  C+C+ CKN++   ER+ L 
Sbjct: 353 RPGDVT--PRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDEGPERKTLM 408



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
           TP+  + CNC+ S CLK YCEC+ + I C   C C+ C N  E
Sbjct: 358 TPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDE 400


>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
           [Homo sapiens]
          Length = 529

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 328 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 378

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 379 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 434



 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 385 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 427


>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
           tropicalis]
          Length = 183

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/125 (52%), Positives = 87/125 (69%), Gaps = 11/125 (8%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K +K CNC  S+CLKLYC+CFA+G  C+ CNC NC+NN E+E  R +AV+A L RNP+AF
Sbjct: 9   KTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAF 68

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           +PKI          K + G+V    +HNKGC+CK+SGC K YCEC++A I CS  C+C+ 
Sbjct: 69  QPKI---------EKGKIGDV--KPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVG 117

Query: 231 CKNFE 235
           CKN +
Sbjct: 118 CKNVK 122



 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 8/61 (13%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN-------NVENEAARREAVEA 161
           P+  K CNCK S C K YCEC+ + I C   C CV C N       ++      RE +EA
Sbjct: 82  PRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVKSLDSGSIIPTCTSREVIEA 141

Query: 162 T 162
           T
Sbjct: 142 T 142


>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
 gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
 gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
           [Homo sapiens]
          Length = 508

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 358 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 413



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 406


>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
          Length = 508

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 358 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 413



 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 406


>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
           [Homo sapiens]
          Length = 463

 Score =  102 bits (254), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 262 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 312

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 313 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 368



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 319 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 361


>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
 gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
          Length = 641

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 75/130 (57%), Gaps = 4/130 (3%)

Query: 93  ESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVEN 151
           +S K R R   +++ +  K  + C+CK S+CLKLYC CFA+ +YC + C+C  C NN  +
Sbjct: 339 QSHKKRRR---KLQNDDGKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMH 395

Query: 152 EAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
           E A     + T  RNP AF P +        +  +++       +H +GC+C+KS CLKK
Sbjct: 396 EEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKK 455

Query: 212 YCECFQANIL 221
           YCECFQ  +L
Sbjct: 456 YCECFQVLML 465



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 26/45 (57%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           GK  + C CKKS CLK YC CF A + CSE C C  C N    EE
Sbjct: 353 GKSCRHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEE 397


>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
          Length = 622

 Score =  102 bits (253), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 90/156 (57%), Gaps = 16/156 (10%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFASG +C+ CNC NC+NN  +E  R +A++  L+RNP AF PKI  S  G      
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLGD----- 481

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                 +   HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +++     
Sbjct: 482 ------IKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT---- 531

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
                + Y+    N      + ++   SP + K R+
Sbjct: 532 -QLTTLNYMDIGNNDENNPFLTTTFEISPKLEKDRE 566



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
           P   K CNCK S CLK YCECF + I C   C C+ C N  E
Sbjct: 484 PHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525


>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
          Length = 377

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/138 (45%), Positives = 86/138 (62%), Gaps = 13/138 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C  CNC NC+NN ++E  R  A++A L+RNP AF PKI     G      
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLGD----- 230

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
                 +  +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 231 ------IKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLI-- 282

Query: 247 DHANNMAYIQQAANAAIT 264
           +  N+M       N A++
Sbjct: 283 NMPNDMEIKSHEENDAVS 300



 Score = 47.4 bits (111), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           P+  K CNCK S CLK YCECF + I C   C CV C N
Sbjct: 233 PRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKN 271


>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
           trifallax]
          Length = 600

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 52/115 (45%), Positives = 68/115 (59%), Gaps = 10/115 (8%)

Query: 122 RCLKLYCECFASGIYCD-GCNCVNCHNNVEN-EAARREAVEATLERNPNAFRPKIAS-SP 178
           +CLKLYC+CFA G  C   C C+NC N  +N E+   EA +  + RNP AF+PK+    P
Sbjct: 454 QCLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP 513

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                N  +        +H KGC+CK SGC K YCECFQ  + C++NCRC+ C N
Sbjct: 514 AQMSQNDADK-------QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561


>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
          Length = 503

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 92/156 (58%), Gaps = 16/156 (10%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YCECFASG +C+ CNC NC+NN  +E  R +A++  L+RNP AF PKI  S         
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQS--------- 357

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
           E G++     HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +++     
Sbjct: 358 ELGDIK--PHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT---- 411

Query: 247 DHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
                + Y+    N      + S+   SP + K R+
Sbjct: 412 -QPTMLNYMDIGNNDENNLFLTSTFEISPKLEKDRE 446



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
           E+ +  P   K CNCK S CLK YCECF + I C   C C+ C N  E
Sbjct: 358 ELGDIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405


>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
          Length = 387

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 75/130 (57%), Gaps = 11/130 (8%)

Query: 115 QCNCKHSRCLKLYCECFAS-GIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           QCNCK S+CLKLYCECFA+ G+    CNC  C N ++N   R +A+E  L RN +AF   
Sbjct: 194 QCNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAFAQC 253

Query: 174 IAS--SPHGTRD---NKEETGEVLMLGKHN-----KGCHCKKSGCLKKYCECFQANILCS 223
             +  +P   +    N  +   +    K N     KGC+CKKSGC KKYCEC+  N+ C+
Sbjct: 254 FTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVVHKGCNCKKSGCKKKYCECYSQNLKCN 313

Query: 224 ENCRCMDCKN 233
           + C+C  C N
Sbjct: 314 DLCKCEHCLN 323


>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
          Length = 735

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/94 (48%), Positives = 60/94 (63%), Gaps = 9/94 (9%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           K C+CK+S+CLK+YCECFAS   C+GC C  CHNN ++  +   A  +TLERNP+AF PK
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPK 385

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
              S +G +   E         KH KGCHC++SG
Sbjct: 386 FKQSFNGEKPKAE---------KHLKGCHCRRSG 410



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           K CHCK S CLK YCECF +  LC+  C+C  C N E   E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365


>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
          Length = 299

 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 98  YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 148

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 149 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 204



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 155 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 197


>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 492

 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 12/97 (12%)

Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
           C+C NC NN+ NE  R  A+++ LERNP AF+PKI     G  D +           HNK
Sbjct: 282 CHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG---KGRADTER---------LHNK 329

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
           GC+CKKS CLK YCEC++A + C+  C+C+ C+N E 
Sbjct: 330 GCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEA 366



 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           K CNCK S CLK YCEC+ + + C   C CV C N
Sbjct: 329 KGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRN 363


>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
          Length = 223

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 47/101 (46%), Positives = 61/101 (60%), Gaps = 11/101 (10%)

Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
            +A L+RNP AF+PKI     G  D +           H+KGC+CK+SGCLK YCEC++A
Sbjct: 42  TQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 90

Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
            I+CS  C+C+ CKNFE S ER+ L H   A  +   QQ A
Sbjct: 91  KIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 131



 Score = 46.6 bits (109), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 47/106 (44%), Gaps = 15/106 (14%)

Query: 60  PQPQPKPVVPPVPP----PPPP---------LPGQATQPPVRVPKPESPKSRSRSNVEVK 106
           P PQ +P +  +PP     P P         LP Q          PE+ K +     E  
Sbjct: 5   PLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKE-G 63

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           E+  +  K CNCK S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 64  ESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEES 109


>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 617

 Score = 99.0 bits (245), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 46/108 (42%), Positives = 60/108 (55%), Gaps = 14/108 (12%)

Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           YCECF  G YC D C C  C NN +N   R +A++    RN +AF  K   + + T    
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNSTV--- 413

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                      H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N
Sbjct: 414 ----------THKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451


>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
          Length = 508

 Score = 99.0 bits (245), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 56/118 (47%), Positives = 77/118 (65%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI     G    + 
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLGNVKPR- 365

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
                     HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+N E S ER+ L 
Sbjct: 366 ----------HNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTLM 413



 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEES 406


>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
          Length = 508

 Score = 99.0 bits (245), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 58/118 (49%), Positives = 80/118 (67%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFASG +C+ CNC NC NN+ ++  R +A++A L RNP AF+PKI          K 
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + G V    +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+N E S ER+ L 
Sbjct: 358 QLGNVK--PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTLM 413



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
           P+  K CNC+ S CLK YCEC+ + I C   C C+ C N  E+
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEES 406


>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 455

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/129 (37%), Positives = 72/129 (55%), Gaps = 22/129 (17%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARREAVEATLE 164
           +  P  Q  CNC  SRCLKLYC CF +G  C    C C+ C N+VE+   R++A++ TL+
Sbjct: 329 QKLPAGQVACNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQ 388

Query: 165 RNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE 224
           + P+AF+ K    P G                   GC CK + C++KYCECF+  + C++
Sbjct: 389 KRPDAFQTK--DRPVGL------------------GCACKNNKCIRKYCECFRNGLSCAD 428

Query: 225 NCRCMDCKN 233
            C C++C+N
Sbjct: 429 KCCCLNCEN 437



 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 70/138 (50%), Gaps = 20/138 (14%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD--GCNCVNCHNNVEN---EAARREAVEATLERNPNAF 170
           C+C  S+C+ LYC+CF +G  C+   CNC  C N +     + AR  A+ A L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319

Query: 171 RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCM 229
                ++   +   K   G+V         C+C +S CLK YC CFQA  +C    C C+
Sbjct: 320 ----VTAGVASAVQKLPAGQV--------ACNCIRSRCLKLYCTCFQAGKMCQPGICTCI 367

Query: 230 DCKNF--EGSEERQALFH 245
            C N   +  E +QA+ H
Sbjct: 368 ACANSVEDHPERKQAIKH 385



 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSE-NCRCMDCKN 233
           T E+  L ++  GC C KS C+  YC+CF+A   C+  NC C  CKN
Sbjct: 247 TLEIGPLQENPTGCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293


>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
          Length = 175

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/121 (45%), Positives = 66/121 (54%), Gaps = 25/121 (20%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K CNCK S+CLK YC+CFA+G  C + CNCV C NN  N + R+EA      R+P AF+ 
Sbjct: 76  KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQ 135

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
                                     KGC+CKKSGC KKYCECFQ N+ C+  CRC  C 
Sbjct: 136 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 171

Query: 233 N 233
           N
Sbjct: 172 N 172


>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
          Length = 190

 Score = 97.1 bits (240), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/121 (44%), Positives = 66/121 (54%), Gaps = 25/121 (20%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K CNCK S+CLK YC+CFA+G  C + CNCV C NN  N + R++A      R+P AF+ 
Sbjct: 91  KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFKQ 150

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
                                     KGC+CKKSGC KKYCECFQ N+ C+  CRC  C 
Sbjct: 151 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 186

Query: 233 N 233
           N
Sbjct: 187 N 187


>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 90

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/96 (65%), Positives = 69/96 (71%), Gaps = 11/96 (11%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI-ASSPHGTRDNK 185
           YCECFASG YCDGCNCVNC NN E+E  R+ AVEA LERNPNAFRPKI    P  T    
Sbjct: 1   YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCT---- 56

Query: 186 EETGEVLML----GKHNKGCHCKKSGCLKKYCECFQ 217
             TG  + +     +HNKGC+CKKSGCLKKYCECFQ
Sbjct: 57  --TGTHVAIQGAAARHNKGCNCKKSGCLKKYCECFQ 90


>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 113

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 70/114 (61%), Gaps = 7/114 (6%)

Query: 127 YCECFASGIYCDG-CNCVNCHNNVENEAA---RREAVEATLERNPNAFRPKIASSPHGTR 182
           YCECFA+ ++C G C C NC N  E EA     R+ +EA   R+P AF PKI  +   T 
Sbjct: 1   YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEA---RDPLAFAPKIVQAAEPTP 57

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
             +++  +     +H +GC+CKKS CLKKYCEC+QA + CSE CRC  CKN  G
Sbjct: 58  IPRDDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + ++ CNCK S CLK YCEC+ +G+ C +GC C  C N
Sbjct: 71  RHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKN 108


>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 886

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 66/127 (51%), Gaps = 17/127 (13%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA---RREAVEATLERN 166
           KK   C C  ++CL LYC+CF   I+C + C C++C N VE   A   R  AVE  LER 
Sbjct: 6   KKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERR 65

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
           P AF  +  S     R NK++            GC C K+ CLKKYC CF A + C   C
Sbjct: 66  PEAFGRE--SGAFAERRNKKDG-----------GCTCAKNSCLKKYCVCFGAKMQCRARC 112

Query: 227 RCMDCKN 233
            C +CKN
Sbjct: 113 NCSNCKN 119



 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 17/145 (11%)

Query: 95  PKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN---NVE 150
           P++  R +    E   KK   C C  + CLK YC CF + + C   CNC NC N    + 
Sbjct: 66  PEAFGRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGTIR 125

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
            E    + V  T+E          A  P     N         L K N  C C ++ CLK
Sbjct: 126 IE----DGVTCTIEGGKVNALKGAAFVPIRKLAN---------LAKGNTKCSCTRAACLK 172

Query: 211 KYCECFQANILCSENCRCMDCKNFE 235
            YC+CF+ +++C++NC C++CKN E
Sbjct: 173 LYCDCFRQSMVCNDNCSCLECKNSE 197



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 24/132 (18%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEA---ARREAVEATL 163
           N  K   +C+C  + CLKLYC+CF   + C D C+C+ C N+ E       R   ++  L
Sbjct: 155 NLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDIL 214

Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
           E+ P+ F+        G                   GC C+K+ C   YC+C  +   C 
Sbjct: 215 EKRPHIFQKTAKRKAGG-------------------GCSCQKNKCRAGYCDCKGSGGKCD 255

Query: 224 -ENCRCMDCKNF 234
              C C++C N 
Sbjct: 256 PAVCTCINCDNM 267


>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 356

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 84/185 (45%), Gaps = 23/185 (12%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
           P  + PP PPP    P + +   V + + E    RS  N       PK+Q          
Sbjct: 5   PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60

Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
            K C CK SRCLKLYC CFA G YC D C C  C N    + A  E V+ T   L     
Sbjct: 61  CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQK 116

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
               KI   P    +  E+           +GC+CKKS CLKKYC+C+Q    CS  CRC
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRC 176

Query: 229 MDCKN 233
            DC+N
Sbjct: 177 DDCQN 181


>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
          Length = 466

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/118 (45%), Positives = 79/118 (66%), Gaps = 11/118 (9%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           +C+CFASG + + CNC NC NN+ +E  R +A++A L R+P AF+P+I          K 
Sbjct: 308 HCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIG---------KG 358

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           + GEV    +H+KGC+C++ GCL+ YCEC +A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 359 QLGEVR--PRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEESPERKTLM 414



 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 7/77 (9%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEAT 162
           ++ E  P+  K CNC+   CL+ YCEC  + I C   C C+ C  N E    R+     T
Sbjct: 359 QLGEVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGC-KNYEESPERK-----T 412

Query: 163 LERNPNAFRPKIASSPH 179
           L   PN  +  +  S H
Sbjct: 413 LMSMPNCAQTGLEGSHH 429


>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 586

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 35/60 (58%), Positives = 50/60 (83%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           K++ CNC  S+CLKLYC+CFA+G +C  CNC +C+NN++NE  R++A+ ATLERNP+AF+
Sbjct: 527 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 26/52 (50%), Gaps = 2/52 (3%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALFHGDHANN 251
           C+C KS CLK YC+CF     C  NC C DC  N +  EERQ         N
Sbjct: 531 CNCTKSQCLKLYCDCFANGEFCY-NCNCRDCYNNLDNEEERQKAIRATLERN 581


>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
          Length = 289

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/157 (42%), Positives = 89/157 (56%), Gaps = 19/157 (12%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CFA+G +C  CNC NC NN  +E  R  A++A LERNP AF PKI          K 
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIG---------KR 201

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS-EERQALFH 245
           + GE+     HNKGC+CK+SGCL+ YCECF+A I+CS  C+C  CKN+E S +E + L  
Sbjct: 202 KLGEIKH--HHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEESPDENKQL-- 257

Query: 246 GDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRK 282
                N+       N      + +S +   P S+K +
Sbjct: 258 -----NVINYMDIGNKEGNSPVLTSAFNMLPKSRKDR 289



 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 32/64 (50%), Gaps = 4/64 (6%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCH 146
           R P+   PK   R   E+K +     K CNCK S CL+ YCECF + I C   C C  C 
Sbjct: 189 RNPEAFLPKIGKRKLGEIKHH---HNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCK 245

Query: 147 NNVE 150
           N  E
Sbjct: 246 NYEE 249


>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
          Length = 434

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 70/131 (53%), Gaps = 21/131 (16%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNA 169
           +K+K C C  ++CLK YC CF + + C + C CV+CHN+ ++E AR  AV      +P A
Sbjct: 12  EKEKGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMA 71

Query: 170 FR-------PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
           F+        +   +P+GT              K  +GC C++S C KKYCECF A + C
Sbjct: 72  FKGTSLELENQEVHTPNGTL-------------KTVRGCRCRRSKCQKKYCECFGAGLKC 118

Query: 223 SENCRCMDCKN 233
           S NC C  C N
Sbjct: 119 STNCVCEGCLN 129



 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 8/75 (10%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVEVKEN-------TPKKQKQCNCKHSRCLKLYCECFAS 133
           QA    VR+ +   P +   +++E++         T K  + C C+ S+C K YCECF +
Sbjct: 55  QARMMAVRLVRLNDPMAFKGTSLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECFGA 114

Query: 134 GIYCD-GCNCVNCHN 147
           G+ C   C C  C N
Sbjct: 115 GLKCSTNCVCEGCLN 129


>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
          Length = 286

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 76/140 (54%), Gaps = 12/140 (8%)

Query: 117 NCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIAS 176
           NC ++ C+K YC+CF    YC  CN   C NN ENE  R++A++  L +NP AF  K  +
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203

Query: 177 SPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--MDCKNF 234
             +  + +K+++  +         C+C KSGC  KYC C +  ILCS+NC+C    CKN 
Sbjct: 204 RKNFIKKSKKDSNLI-------HTCNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNK 256

Query: 235 EGS---EERQALFHGDHANN 251
           +G    ++R  +   D   N
Sbjct: 257 QGDRNQKQRSVIISNDENEN 276


>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
          Length = 496

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 63/110 (57%), Gaps = 5/110 (4%)

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           ++ YC CF + + C+GC+CV+C+N+ ++E  R  A+E     +P AF  ++ +       
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASV 356

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           + E   +     +H +GC CK S CLKKYCECF+  + CS  C C DC N
Sbjct: 357 SVESKPK-----QHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401


>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 375

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 57/141 (40%), Positives = 68/141 (48%), Gaps = 11/141 (7%)

Query: 109 TPKKQK---------QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREA 158
           TPK+QK          C CK SRCLKLYC CFA G YC + C CV C N  +     +  
Sbjct: 66  TPKRQKVEDAADGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTT 125

Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
            +  L R       KI        +  E+           +GC+CKKS CLKKYC+C+Q 
Sbjct: 126 RKVLLSRQKR-MSLKINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184

Query: 219 NILCSENCRCMDCKNFEGSEE 239
              CS  CRC DCKN  G  E
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNE 205


>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 120

 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/121 (47%), Positives = 72/121 (59%), Gaps = 14/121 (11%)

Query: 127 YCECFASGIYCDG-CNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIA----SSP 178
           YCECFA+G+YC G C C +C N    +E     R+ +E+   RNP AF PKI     SSP
Sbjct: 1   YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIES---RNPLAFAPKIVQGAESSP 57

Query: 179 HGTRDNKEETGEVL---MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
                + ++  EVL      +H +GC+CKKS CLKKYCEC+QA + CSE CRC  C N  
Sbjct: 58  VPGVSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKY 117

Query: 236 G 236
           G
Sbjct: 118 G 118



 Score = 42.0 bits (97), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + + ++ CNCK S CLK YCEC+ +G+ C +GC C  C N
Sbjct: 76  SARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 115


>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
          Length = 356

 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/185 (36%), Positives = 84/185 (45%), Gaps = 23/185 (12%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
           P  + PP PPP    P + +   V + + E    RS  N       PK+Q          
Sbjct: 5   PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60

Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
            K C CK SRCLKLYC CFA G YC D C C  C +    + A  E V+ T   L     
Sbjct: 61  CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFS----KEAFSETVQTTRKVLLSRQK 116

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
               KI   P    +  E+           +GC+CKKS CLKKYC+C+Q    CS  CRC
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRC 176

Query: 229 MDCKN 233
            DC+N
Sbjct: 177 DDCQN 181


>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
           distachyon]
          Length = 377

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/151 (39%), Positives = 71/151 (47%), Gaps = 29/151 (19%)

Query: 109 TPKKQ-----------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARR 156
           TPK+Q           K C CK S+CLKLYC CFA G YC + C CV C N  +     +
Sbjct: 73  TPKRQRVEEAADGNGCKHCACKKSKCLKLYCPCFAGGGYCSEKCGCVPCFNKSDYAETVQ 132

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHN--------KGCHCKKSGC 208
              +  L R       KI        + + E     M   HN        +GC+CKKS C
Sbjct: 133 TTRKVLLSRQKR-MSLKI--------NRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSC 183

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           LKKYC+C+Q    CS  CRC DCKN  G  E
Sbjct: 184 LKKYCDCYQDGTGCSLFCRCEDCKNPYGKNE 214


>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
 gi|194698372|gb|ACF83270.1| unknown [Zea mays]
 gi|194708474|gb|ACF88321.1| unknown [Zea mays]
 gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
          Length = 394

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/124 (41%), Positives = 65/124 (52%), Gaps = 18/124 (14%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K C+CK SRCLKLYC C+ASG +C + C C  CHN     A                  P
Sbjct: 78  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------P 123

Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
            +   P  T +  +E  E L+   +    + C CKKSGCLKKYC+C+Q    CS NC+C 
Sbjct: 124 VLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 183

Query: 230 DCKN 233
           DC+N
Sbjct: 184 DCRN 187


>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
          Length = 176

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 62/97 (63%), Gaps = 8/97 (8%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNCK S+CLKLYC+CF +G+ C   CNC +CHNN E+   R   ++  +ERNP+AFRPK+
Sbjct: 85  CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQIMERNPSAFRPKV 144

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKK 211
            S     + N E+  +     +H KGC+CKKS   KK
Sbjct: 145 ES-----KSNSEDEQD--HKPRHFKGCNCKKSNWSKK 174



 Score = 46.2 bits (108), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDC-KNFEGSEERQALF 244
           C+CKKS CLK YC+CF A + C ++C C  C  N E  +ER+ + 
Sbjct: 85  CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVI 129


>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
 gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
          Length = 409

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 50/124 (40%), Positives = 67/124 (54%), Gaps = 17/124 (13%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K C+CK SRC+KLYC CFASG  C + C C  C N    + A R A+   L+        
Sbjct: 102 KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPLK-------- 153

Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
                P  T +  ++  E ++   +    + C CKKSGCLKKYC+C+Q    CS NC+C 
Sbjct: 154 -----PVQTSEAGQDIVEQVIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 208

Query: 230 DCKN 233
           DC+N
Sbjct: 209 DCRN 212


>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 238

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 88/186 (47%), Gaps = 22/186 (11%)

Query: 60  PQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCK 119
           P     PV   V P       +A  P      PE+P  + R  +         ++ CNC 
Sbjct: 10  PSTTKSPVASSVSPAVGSNETKAAFPATE-STPEAPVLKRRQVI-------NNRRPCNCT 61

Query: 120 HSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
            S+CLKLYC+CFA+G +C  C+C NC NN+ NE  R  A+++ LERNP AF+PKI     
Sbjct: 62  KSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKI----- 116

Query: 180 GTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           G + ++    + L L     GC   +   +K       A + C+  C+C+ C+N E    
Sbjct: 117 GNKTSEIMAKDELTLN----GCT-TRDAIVKSQ----AARVSCTVRCKCVGCRNTEADRN 167

Query: 240 RQALFH 245
            +   H
Sbjct: 168 HRNRGH 173


>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 75/143 (52%), Gaps = 29/143 (20%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
           P   + R  +   +K+   +KQ+QC CK S+CLK+YC+C A G YC   C+C NCHN+ +
Sbjct: 18  PTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDK 77

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           +   R + +E  +E+N +A                               C+C+KS C K
Sbjct: 78  HPEQRSKVLEQMVEKNSSA----------------------------QLKCNCRKSNCQK 109

Query: 211 KYCECFQANILCSENCRCMDCKN 233
           KYCEC+ A + CSE C+C DCKN
Sbjct: 110 KYCECYNAGVKCSELCKCDDCKN 132


>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
 gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
          Length = 359

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 55/130 (42%), Positives = 65/130 (50%), Gaps = 8/130 (6%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPNA 169
           K C CK SRCLKLYC CF+ G YC D C C  C N    + A  E V+ T   L      
Sbjct: 67  KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQKR 122

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
              KI   P    +  E+           +GC+CKKS CLKKYC+C+Q    CS  CRC 
Sbjct: 123 MSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCD 182

Query: 230 DCKNFEGSEE 239
           DC+N  G  E
Sbjct: 183 DCQNPFGKNE 192


>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 625

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 59/108 (54%), Gaps = 14/108 (12%)

Query: 126 LYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           LYCECFA+G +C  C+C NC N  +N A  +        RNP AF  KI ++     D +
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT-----DGE 334

Query: 186 EETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           E+  +             KKS CLKKYCECFQA +LC E C+C  CKN
Sbjct: 335 EKHKKGCHC---------KKSACLKKYCECFQAGVLCQEYCKCEGCKN 373


>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|194699308|gb|ACF83738.1| unknown [Zea mays]
 gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|224028923|gb|ACN33537.1| unknown [Zea mays]
 gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 359

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 61/127 (48%), Gaps = 2/127 (1%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K C CK SRCLKLYC CF+ G YC D C C  C N  E  A         L         
Sbjct: 67  KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNK-EAFAETVHTTRKVLLSRQKRMSM 125

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           KI   P    +  E+           +GC+CKKS CLKKYC+C+Q    CS  CRC DC+
Sbjct: 126 KINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185

Query: 233 NFEGSEE 239
           N  G  E
Sbjct: 186 NPFGKNE 192


>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
          Length = 159

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 63/121 (52%), Gaps = 25/121 (20%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K CNCK S+CLK YC+CFA+G  C + CNCV C NN  N   R+ A    + R+P AF+ 
Sbjct: 60  KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFKQ 119

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
                                     KGC+CKKSGC KKYCECF   + C+  CRC  C 
Sbjct: 120 SF------------------------KGCNCKKSGCQKKYCECFLNGLTCTHLCRCDGCL 155

Query: 233 N 233
           N
Sbjct: 156 N 156



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 21/37 (56%), Positives = 27/37 (72%)

Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           + K C+CKKS CLK+YC+CF    +CSENC C+ C N
Sbjct: 58  NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFN 94


>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 159

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 55/132 (41%), Positives = 71/132 (53%), Gaps = 22/132 (16%)

Query: 127 YCECFASGIYCDG-CNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIASS----- 177
           YCECFA G+YC G C C +C N    +E     R+ +E+   RNP AF PKI        
Sbjct: 1   YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIES---RNPLAFAPKIVQGAEPSP 57

Query: 178 -PHGTRDNKEETGEVLML---------GKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
            P  T    E   ++++           +H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58  VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117

Query: 228 CMDCKNFEGSEE 239
           C  C N  G +E
Sbjct: 118 CEGCMNKYGKKE 129



 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 7/69 (10%)

Query: 86  PVRVPKPESPKSRSRSNVEVKE---NTP---KKQKQCNCKHSRCLKLYCECFASGIYC-D 138
           P  VP    P + +   + V++   +TP   + ++ CNCK S CLK YCEC+ +G+ C +
Sbjct: 55  PSPVPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSE 114

Query: 139 GCNCVNCHN 147
           GC C  C N
Sbjct: 115 GCRCEGCMN 123


>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
          Length = 214

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 70/137 (51%), Gaps = 15/137 (10%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C+C  S C +LYC CF S  +C D CNC  C+N    E A  E  +    +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
           I  S       D +  T  +         M G    KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 220 ILCSENCRCMDCKNFEG 236
           + C+  C+C +C N  G
Sbjct: 129 VGCTAKCKCQECSNSFG 145


>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
          Length = 371

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 15/93 (16%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+          K 
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKM---------GKG 324

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
             G   +  +H+KGC+CK+SGCLK YCEC++A+
Sbjct: 325 RLGAAKL--RHSKGCNCKRSGCLKNYCECYEAS 355


>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 405

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 66/124 (53%), Gaps = 12/124 (9%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           + C+CK SRCLKLYC CFASG +C + C C  C+N         + +    +  P     
Sbjct: 95  RTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCYN---------KPIHGVQQNTPPGLPL 145

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           ++  +    +++ E       +    + C CKKSGCLKKYC+C+Q    CS NC+C DCK
Sbjct: 146 QVVRTAEAGQNSAEFA--RYQMDFFRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCK 203

Query: 233 NFEG 236
           N  G
Sbjct: 204 NPYG 207


>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
          Length = 223

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 15/137 (10%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C+C  S C +LYC CF S  +C D CNC  C+N    E A  E  +    +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
           I  S       D +  T  +         M G    KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 220 ILCSENCRCMDCKNFEG 236
           + C+  C+C++C N  G
Sbjct: 129 VGCTAKCKCLECSNSFG 145


>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
           [Oryza sativa Japonica Group]
          Length = 194

 Score = 89.0 bits (219), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 15/137 (10%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C+C  S C +LYC CF S  +C D CNC  C+N    E A  E  +    +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 174 IASSPHGTR--DNKEETGEV--------LMLG----KHNKGCHCKKSGCLKKYCECFQAN 219
           I  S       D +  T  +         M G    KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 220 ILCSENCRCMDCKNFEG 236
           + C+  C+C++C N  G
Sbjct: 129 VGCTAKCKCLECSNSFG 145


>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 88.6 bits (218), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 76/143 (53%), Gaps = 29/143 (20%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVE 150
           P   + R  +   +K+   +KQ+QC CK S+CLK+YC+C A G YC   C+C NCHN+ +
Sbjct: 18  PTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDK 77

Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
           +   R + +E  +E+N          +P   +                  C+C+KS C K
Sbjct: 78  HPEQRNKVLEQMVEKN----------TPAQIK------------------CNCRKSNCQK 109

Query: 211 KYCECFQANILCSENCRCMDCKN 233
           KYCEC+ A + CSE C+C DCKN
Sbjct: 110 KYCECYNAGVKCSELCKCDDCKN 132


>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
 gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
          Length = 144

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 53/125 (42%), Positives = 67/125 (53%), Gaps = 22/125 (17%)

Query: 127 YCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           YCECFA+G+YC D C C +C N     +     R+ +E    RNP AF PKI      T 
Sbjct: 1   YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIET---RNPLAFAPKIVQ----TS 53

Query: 183 DNKEETG-----------EVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           DN    G           +     +H +GC+CKKS CLKKYCEC+QA + CS+ C+C  C
Sbjct: 54  DNTPTEGVASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGC 113

Query: 232 KNFEG 236
           KN  G
Sbjct: 114 KNIYG 118



 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           + + ++ CNCK S CLK YCEC+ +G+ C DGC C  C N
Sbjct: 76  SARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKN 115


>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
          Length = 374

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 73/150 (48%), Gaps = 27/150 (18%)

Query: 109 TPKKQK-----------QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARR 156
           TPK+QK            C CK SRCLKLYC CFA G YC + C C  C N    +A   
Sbjct: 70  TPKRQKVEESADGNGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFN----KALYA 125

Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHN-------KGCHCKKSGCL 209
           E V+ T  R     R K  S     R   E   E +    H+       +GC+CKKS CL
Sbjct: 126 ETVQTT--RKVLLSRQKRMSLKINRRS--EANAEAVEDAHHSSSSTPPRRGCNCKKSSCL 181

Query: 210 KKYCECFQANILCSENCRCMDCKNFEGSEE 239
           KKYC+C+Q    CS  CRC DC+N  G  E
Sbjct: 182 KKYCDCYQDGTGCSLFCRCEDCRNPFGKNE 211


>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
           niloticus]
          Length = 310

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 60/93 (64%), Gaps = 15/93 (16%)

Query: 147 NNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKS 206
           ++++  A+R+      L  NP+AFRPKIA    G            + G HNKGC+CK+S
Sbjct: 160 SDIQKAASRKWC----LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRS 204

Query: 207 GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           GCLK+YCEC++ANI+C+ +C+C+ C+N++   +
Sbjct: 205 GCLKRYCECYEANIMCTSSCKCVGCRNYDAGSQ 237



 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           K CNCK S CLK YCEC+ + I C   C CV C N
Sbjct: 197 KGCNCKRSGCLKRYCECYEANIMCTSSCKCVGCRN 231


>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
 gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
          Length = 191

 Score = 85.9 bits (211), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 15/93 (16%)

Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
           YC+CF+SG +C+ C+C    NN+ +E  R +A++A L+RNP AF+PK+          K 
Sbjct: 98  YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMG---------KG 144

Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQAN 219
             G   +  +H+KGC+CK+SGCLK YCEC++A+
Sbjct: 145 RLGAAKL--RHSKGCNCKRSGCLKNYCECYEAS 175


>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
 gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
          Length = 554

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 50/77 (64%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+ + L+RNP+AF 
Sbjct: 475 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFN 534

Query: 172 PKIASSPHGTRDNKEET 188
            K     +  +D K +T
Sbjct: 535 LKHGEDSNNAKDVKRQT 551



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 43/95 (45%), Gaps = 24/95 (25%)

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
           P++AS+P         T  +  +    K C+C KS CLK YC+CF     C ++C C DC
Sbjct: 458 PQLASAP--------TTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDC 508

Query: 232 KN---FEGSEERQA------------LFHGDHANN 251
            N   +E   ER              L HG+ +NN
Sbjct: 509 FNNLDYEVERERAIRSCLDRNPSAFNLKHGEDSNN 543


>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
 gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 56/103 (54%), Gaps = 6/103 (5%)

Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLML----GKHNKG 200
           C N  E E    E  +    RNP AF PKI    H T     +  +V +     G+H  G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           C+CK+S C+KKYCEC+QAN+ CS  CRC  C+N  G +E  A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544



 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           CNCK S C+K YCEC+ + + C + C C  C N
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRN 534


>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
          Length = 573

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/66 (56%), Positives = 50/66 (75%), Gaps = 8/66 (12%)

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
           AF+PKI++S     D+K +    L  GKHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDP---LNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484

Query: 229 MDCKNF 234
           ++C+N+
Sbjct: 485 IECRNY 490



 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           N  K  K C CK S C+K YCECF + + C D C C+ C N
Sbjct: 449 NFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489


>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 81

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 40/95 (42%), Positives = 55/95 (57%), Gaps = 17/95 (17%)

Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 181
           CLKLYC+CFA+G++C D C C +C N  +   A  +A +  + ++P AF+PK+  +    
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDAS--- 57

Query: 182 RDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
                        G H KGC C+KS CLKKYCECF
Sbjct: 58  -------------GGHVKGCACRKSRCLKKYCECF 79



 Score = 40.4 bits (93), Expect = 2.4,   Method: Composition-based stats.
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 208 CLKKYCECFQANILCSENCRCMDCKN 233
           CLK YC+CF   + C++ C C DC+N
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCEN 26


>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 313

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/122 (43%), Positives = 62/122 (50%), Gaps = 20/122 (16%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN---VENEAARREAVEATLERNPNAFR 171
           C CK  RCLKLYC+CFA   +C   C C  C N    V  +AARR    A LERNP+AF 
Sbjct: 201 CTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRLGAVVVQAARR----AVLERNPDAF- 255

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
                  H     +   G V        GC C +S CLKKYC CFQ ++LC   C+C  C
Sbjct: 256 -------HRAHSLETRPGSVT----SGAGCGCTRSHCLKKYCVCFQKSLLCGAGCKCEGC 304

Query: 232 KN 233
            N
Sbjct: 305 LN 306



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 33/77 (42%), Gaps = 2/77 (2%)

Query: 78  LPGQATQPPVRVPKPESPKSRSRSN-VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIY 136
           L     Q   R     +P +  R++ +E +  +      C C  S CLK YC CF   + 
Sbjct: 235 LGAVVVQAARRAVLERNPDAFHRAHSLETRPGSVTSGAGCGCTRSHCLKKYCVCFQKSLL 294

Query: 137 CD-GCNCVNCHNNVENE 152
           C  GC C  C N VEN 
Sbjct: 295 CGAGCKCEGCLNTVENR 311


>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
          Length = 184

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 45/64 (70%)

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYI 255
           +H KGC+CK+SGCLK YCEC++A ILCS  C+C+ CKNFE S ER+ L     A  +   
Sbjct: 3   RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62

Query: 256 QQAA 259
           QQAA
Sbjct: 63  QQAA 66



 Score = 45.4 bits (106), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           K CNCK S CLK YCEC+ + I C   C CV C N
Sbjct: 6   KGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKN 40


>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/149 (38%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNN 148
           K +  +++ R NVE   N+ + + Q CNC  S CLK YC CF SG  C D C C +C N 
Sbjct: 128 KKQKDQNKLRRNVE---NSKELEIQPCNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNC 184

Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
            E    R EA+    ++                RD K    E+L L + + GC CK +GC
Sbjct: 185 TEFAEERDEAINHVYKK--------------CHRDKKVPVNELLSL-QMSYGCKCKSTGC 229

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGS 237
            KKYCECF+   +C E C C DC N   S
Sbjct: 230 QKKYCECFKRGQICGEQCACEDCLNIPFS 258


>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
 gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
          Length = 466

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/91 (46%), Positives = 49/91 (53%), Gaps = 14/91 (15%)

Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCK 204
           C N VEN +   +       RNPNAF  KI           E+ G+     +H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-----------EDDGDD---ARHTKGCHCK 216

Query: 205 KSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           KS CLKKYCECFQA + C E C+C  C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           S +E   +  +  K C+CK S CLK YCECF +G+ C + C C  C N
Sbjct: 198 SKIEDDGDDARHTKGCHCKKSACLKKYCECFQAGVKCQEYCKCEGCMN 245


>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
          Length = 482

 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 76/162 (46%), Gaps = 21/162 (12%)

Query: 84  QPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-GCNC 142
           Q    + K +    R  +++  KE    + K C CK S+C+KLYCECF S  +C   C C
Sbjct: 327 QKTSSIRKNQLSNIRKSTSLMNKEANLSECKGCTCKRSKCIKLYCECFLSKGFCGPSCTC 386

Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLG------- 195
            +C N  ENE    +  E  L RNP AF  K+           EE  + L +G       
Sbjct: 387 NDCENMEENEEEIVKIREKILSRNPKAFEKKLQI---------EEASQPLGMGPKAASAP 437

Query: 196 ----KHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
               +H KGC CK+S C   YCEC Q    C+E C C DC N
Sbjct: 438 SSNLRHIKGCTCKRSKCSNNYCECHQHGAKCTELCSCTDCNN 479


>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
           CCMP2712]
          Length = 125

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 73/136 (53%), Gaps = 21/136 (15%)

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLE 164
           +E+   K++ C C  ++CLK YC+CF + I C   C C +CHN+ ++E  R EA+     
Sbjct: 3   EESQQGKERGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRM 62

Query: 165 RNPNAFR-------PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
            NP+AF+        +  ++P G +             K  +GC CK+S C KKYCECF 
Sbjct: 63  NNPSAFKGTALELEDQEVTTPKGGK-------------KTVRGCRCKRSKCQKKYCECFS 109

Query: 218 ANILCSENCRCMDCKN 233
           A I C+ NC C DC N
Sbjct: 110 AGIPCTSNCVCTDCAN 125



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 33/49 (67%), Gaps = 2/49 (4%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER--QALFH 245
           +GC C ++ CLK+YC+CF+ +I C+ +C C DC N    EE+  +A+ H
Sbjct: 11  RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRH 59


>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
          Length = 204

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/124 (41%), Positives = 65/124 (52%), Gaps = 18/124 (14%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K C+CK SRCLKLYC C+ASG +C + C C  CHN     A                  P
Sbjct: 78  KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------P 123

Query: 173 KIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
            +   P  T +  +E  E L+   +    + C CKKSGCLKKYC+C+Q    CS NC+C 
Sbjct: 124 VLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCD 183

Query: 230 DCKN 233
           DC+N
Sbjct: 184 DCRN 187


>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/129 (37%), Positives = 63/129 (48%), Gaps = 16/129 (12%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC  S CLK YC CF SG  C D C C +C N  +    R EA+    ++         
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------- 201

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                  RD K    E+L L + + GC CK +GC KKYCECF+   +C E C C  C N 
Sbjct: 202 -----CHRDKKVPVNELLSL-QMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNI 255

Query: 235 EGSEERQAL 243
             ++ ++ L
Sbjct: 256 PSAQNQKEL 264


>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 932

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 61/117 (52%), Gaps = 15/117 (12%)

Query: 124 LKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           LK YC+CFA+G +C   C C  C N+ +N+    E VE          R + ++S   + 
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQ----EQVE----------RARQSASQRSST 864

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
             +   GE     + ++GC CK++GCLK+YCECFQA   C+  C C  C N  G  E
Sbjct: 865 SGEALNGEPSASERSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           +  + C CK + CLK YCECF +G  C   C C  C N
Sbjct: 878 RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLN 915


>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
          Length = 341

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 75/169 (44%), Gaps = 23/169 (13%)

Query: 64  PKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---------- 113
           P  + PP PPP    P + +   V + + E    RS  N       PK+Q          
Sbjct: 5   PISMKPPAPPPVTLEPIELSTHAVGLTELE----RSSMNQLAIAPDPKRQRVEEAADGNG 60

Query: 114 -KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPN 168
            K C CK SRCLKLYC CFA G YC D C C  C     N+ A  E V+ T   L     
Sbjct: 61  CKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCF----NKEAFSETVQTTRKVLLSRQK 116

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
               KI   P    +  E+           +GC+CKKS CLKKYC+C+Q
Sbjct: 117 RMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 32/67 (47%), Gaps = 4/67 (5%)

Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
           ++A +P   R   EE  +    G   K C CKKS CLK YC CF     CS+ C C  C 
Sbjct: 40  QLAIAPDPKRQRVEEAAD----GNGCKHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCF 95

Query: 233 NFEGSEE 239
           N E   E
Sbjct: 96  NKEAFSE 102


>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
          Length = 377

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 34/70 (48%), Positives = 48/70 (68%), Gaps = 3/70 (4%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVEN 151
           P+S +S     +E +    + +K C+C  SRCLKLYC+CF+SG+ C  CNC+NCHNN EN
Sbjct: 309 PQSSESLRTLRLECES---RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAEN 365

Query: 152 EAARREAVEA 161
           E+ R EA++ 
Sbjct: 366 ESQRHEAIKV 375



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 26/35 (74%), Gaps = 1/35 (2%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           K CHC KS CLK YC+CF + ++CS +C C++C N
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHN 361


>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
 gi|194696492|gb|ACF82330.1| unknown [Zea mays]
          Length = 278

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%)

Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
           +N  ++ E   + R+ +E+   RNP AF PK+  +     + ++ + +     +H +GC+
Sbjct: 1   MNNQSHTETVLSTRQQIES---RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCN 57

Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           CKKS CLKKYCECFQ  + CS +CRC  CKN  G  E  A+ 
Sbjct: 58  CKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 99



 Score = 45.4 bits (106), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 51  RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 88


>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
 gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 84

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 43/95 (45%), Positives = 55/95 (57%), Gaps = 12/95 (12%)

Query: 123 CLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           CLKLYC+CFA+G +C  C+C +C N  E     ++  E    R+P AF  KI  +P G  
Sbjct: 1   CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAPGGG- 59

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
                       GKH +GC+CK+S CLKKYCECFQ
Sbjct: 60  -----------GGKHKRGCNCKRSHCLKKYCECFQ 83


>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
 gi|238005872|gb|ACR33971.1| unknown [Zea mays]
          Length = 272

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 55/96 (57%), Gaps = 3/96 (3%)

Query: 149 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
           +E   + R+ +E+   RNP AF PK+  +     D  + + +     +H +GC+CKKS C
Sbjct: 1   METVLSTRQQIES---RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSC 57

Query: 209 LKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
           LKKYCECFQ  + CS +CRC  CKN  G  E  ++ 
Sbjct: 58  LKKYCECFQGGVGCSVSCRCEGCKNAFGKREGASVL 93



 Score = 45.4 bits (106), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           + ++ CNCK S CLK YCECF  G+ C   C C  C N
Sbjct: 45  RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 82


>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
          Length = 330

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 62/121 (51%), Gaps = 30/121 (24%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C CK ++CLK YC+CF  G +C + C C++C NN +N    +E +E T        R +I
Sbjct: 216 CICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQN----KELIEQT--------RSQI 263

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                    N+    EV         C CKKS CLKKYCEC+ A   C  +C+C  C N+
Sbjct: 264 ---------NRNLNDEV--------KCFCKKSKCLKKYCECYNAGKKCGVDCKCEGCMNY 306

Query: 235 E 235
           E
Sbjct: 307 E 307



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 28/41 (68%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           +GC CKK+ CLK YC+CFQ    C E C C+DC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254


>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
          Length = 391

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 80/191 (41%), Gaps = 25/191 (13%)

Query: 54  ANSHPPPQPQPKPV-------VPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVK 106
           A + P P+P    +           P P            V  P P      S      K
Sbjct: 9   ATNQPWPEPDDAHLSPPPPPPPASAPTPAAEAAAPTASAAVSPPAPAEANGNSDRKRRRK 68

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLER 165
            +     K C+CK SRCLKLYC C+ASG +C + C C  CHN     A            
Sbjct: 69  ADDGDGCKACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQGAPWTA-------- 120

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILC 222
                 P +   P  T +  +E  E L+   +    + C CKKSGCLKKYC+C+Q    C
Sbjct: 121 ------PVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGC 174

Query: 223 SENCRCMDCKN 233
           S NC+C DC+N
Sbjct: 175 SINCKCDDCRN 185


>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 69/136 (50%), Gaps = 20/136 (14%)

Query: 103 VEVKENTPK----KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARRE 157
           + VKE +PK    K + C+C  + CLK+YC CF +G  C + C C +C N  E    R E
Sbjct: 65  IVVKEESPKLESNKWQPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHE 124

Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
           AVE             I    H    NK+ T E L   K   GC+CKK+ CLK+YCECF 
Sbjct: 125 AVEY------------IKKKAHR---NKKVTQEKLFETKEVWGCNCKKTRCLKRYCECFI 169

Query: 218 ANILCSENCRCMDCKN 233
               C+ +C C  C+N
Sbjct: 170 RQKTCTVDCNCNYCEN 185



 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           C C K+ CLK YC CF     C E C+C DCKN +
Sbjct: 82  CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVD 116


>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|219884557|gb|ACL52653.1| unknown [Zea mays]
 gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
          Length = 394

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 62/125 (49%), Gaps = 19/125 (15%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
           K C+CK SRCLKLYC CFASG++C + C C  CHN  + +   R A    L         
Sbjct: 87  KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL--------- 137

Query: 173 KIASSPHGTRDNKEETGEVLM-LGKHNKGCH---CKKSGCLKKYCECFQANILCSENCRC 228
           K   +    +D  E+     M L +    C    C     LKKYC+C+Q    CS NC+C
Sbjct: 138 KPVQTFEAGQDIVEQVIRSPMDLIRRKCTCKKSGC-----LKKYCDCYQGGAGCSINCKC 192

Query: 229 MDCKN 233
            DC+N
Sbjct: 193 DDCRN 197


>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
          Length = 484

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/97 (47%), Positives = 64/97 (65%), Gaps = 11/97 (11%)

Query: 124 LKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD 183
           ++ YC+CFASG  C+ CNC NC NN+ +E  R +A++A L+RNP AF+PKI   P    D
Sbjct: 295 VRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIG--PGSLGD 352

Query: 184 NKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
            K          +HN+GC+CK+SGCLK YCEC++  +
Sbjct: 353 AKP---------RHNRGCNCKRSGCLKNYCECYELAL 380


>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 51/119 (42%), Gaps = 23/119 (19%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC  S C K+YCEC A G  C   C C NC N   N+       E  L +N    + K 
Sbjct: 72  CNCSKSECRKMYCECLAKGRLCSSACRCENCQNRTSNKKVLHVIKELDLNKNLKKIKSK- 130

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                                +   GC CKKS CLKKYCECF +   C   C C +C+N
Sbjct: 131 ---------------------RFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C CK S CLK YCECF SG  C  GCNC NC N + +EA +   +    ++  + F  K 
Sbjct: 136 CTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDEAQKN--ICTPNQKKTSKFSSKF 193

Query: 175 ASSPHGTRDNKEETGEVL 192
            S+   TR   E  G VL
Sbjct: 194 ESANTNTRS--EYYGMVL 209


>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
          Length = 710

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 67/140 (47%), Gaps = 40/140 (28%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECF +G++C + C+C  C N   N+E   + RE +E+   RNP A
Sbjct: 439 KRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLA 495

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           F PK+          + E G+ L  G                        + CS +CRC 
Sbjct: 496 FAPKVI---------RTEPGQELAGG------------------------VGCSVSCRCE 522

Query: 230 DCKNFEGSEERQALFHGDHA 249
            CKN  G  E  AL   + A
Sbjct: 523 GCKNAFGRREGVALLGIEEA 542



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 26/45 (57%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           G   K C CKKS CLK YCECF A + CSE C C  C N   + E
Sbjct: 435 GTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNME 479


>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
          Length = 303

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 60/119 (50%), Gaps = 18/119 (15%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC  S CLKLYC+CF     C + C C++C N   +   R+ A+E          + K+
Sbjct: 183 CNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALE----------KIKM 232

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            S      ++  +  +V        GC C+KS C K YCECF+ N  C+ +CRC DC N
Sbjct: 233 KSQRQKHDNDLFDLSKVW-------GCKCQKSQCQKNYCECFRRNQKCNSSCRCKDCAN 284


>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
 gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
          Length = 141

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 58/117 (49%), Gaps = 12/117 (10%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C  SRC+KLYC+CF  G  C+  CNC +C N  E    R   ++  L+ NP AF    
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAFSED- 93

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
            S    T+  +   G          GC C  S CLKKYC CF A+  C++ C C DC
Sbjct: 94  -SDKFNTKRQRMSRGN---------GCACPSSHCLKKYCSCFGADAGCTDKCSCNDC 140



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN-FEGSEERQ 241
           C C KS C+K YC+CF    LC+  C C DCKN  E  EER+
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREERE 76


>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/132 (36%), Positives = 65/132 (49%), Gaps = 16/132 (12%)

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEA 161
           VE  +N   K + C+C  + CLK+YC CF +G  C + C C +C N   N   R +AVE 
Sbjct: 69  VESPKNEIAKWQPCSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEY 128

Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
                       I    H    NK+ + E L   K   GC+CKK+ CLK+YCEC+     
Sbjct: 129 ------------IKKKAHR---NKKVSQETLFETKDIWGCNCKKTRCLKRYCECYIRQKA 173

Query: 222 CSENCRCMDCKN 233
           C+  C C  C+N
Sbjct: 174 CTVECNCTHCEN 185



 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 8/79 (10%)

Query: 86  PVRVPKPESPKSRSRSNVEVKENTPKKQKQ---CNCKHSRCLKLYCECFASGIYCD-GCN 141
           P R    E  K ++  N +V + T  + K    CNCK +RCLK YCEC+     C   CN
Sbjct: 120 PQRDKAVEYIKKKAHRNKKVSQETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECN 179

Query: 142 CVNCHN----NVENEAARR 156
           C +C N    ++ NE  R+
Sbjct: 180 CTHCENGKDEDLYNEIRRQ 198


>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
 gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
          Length = 163

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 66/140 (47%), Gaps = 15/140 (10%)

Query: 48  QSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKE 107
           +SQA  A      Q       P       P   QAT     +P  E+      + +  + 
Sbjct: 7   ESQAAVAAGSSSYQSMSTMQTPRDSLHSIPFEQQATLCEGTMPSSENADIIELNQMSPER 66

Query: 108 NTPKKQ--------KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAA---R 155
           N+ K +        ++CNCK SRCLKLYCECFA+G YC D C+C NC N  E   A    
Sbjct: 67  NSKKAKYSTESEGCRRCNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHM 126

Query: 156 REAVEATLERNPNAFRPKIA 175
           R+ +EA   RNP AF PK+ 
Sbjct: 127 RQQIEA---RNPVAFAPKVV 143



 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFH 245
           + C+CK+S CLK YCECF A   C + C C +C  F  SE   A+ H
Sbjct: 81  RRCNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLH 125


>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 255

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 54/119 (45%), Gaps = 18/119 (15%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C  S CLKLYC CF     C D C C  C N  E+   R  A+E            K+
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALE------------KV 183

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
               H     +++  + L       GC C+KS C K YCECF  N  CS +CRC DC N
Sbjct: 184 KQKLH-----RQKNDDDLFDRSKIWGCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237


>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
          Length = 275

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 85/213 (39%), Gaps = 26/213 (12%)

Query: 33  FTGGFVGSVILPEHP------QSQAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQATQPP 86
           +   FV   ++P+ P          +  N H      P  +      P P    Q +Q  
Sbjct: 60  YKSWFVYHPVMPDDPFLSYLSYVSNIVQNPHQQYPDNPIKIENDQDIPQPQFQNQTSQIN 119

Query: 87  VRVPKPESPKSRSRSNVEVKENT----PKKQKQCNCKHSRCLKLYCECFASGIYC-DGCN 141
           +     +   S+ ++ +++K  T      + + CNC  S CLK YC CF SG  C D C 
Sbjct: 120 LNKTIQQQKHSQPKNQIKIKTITNLYKASEIQPCNCSQSSCLKRYCTCFHSGRMCMDECQ 179

Query: 142 CVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGC 201
           C +C N       R  A+    ++                RD      E+L L + + GC
Sbjct: 180 CKDCKNCDLFSEEREAAINYVFKK--------------CNRDKNVPANELLSL-QISYGC 224

Query: 202 HCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
            CK +GC KKYCEC +    C + C C DC N 
Sbjct: 225 KCKSTGCQKKYCECLKRGQACGDLCSCEDCLNI 257



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 31/43 (72%), Gaps = 1/43 (2%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE-GSEERQA 242
           C+C +S CLK+YC CF +  +C + C+C DCKN +  SEER+A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195


>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 56/119 (47%), Gaps = 18/119 (15%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C  S CLKLYC+CF     C + C C  C N  ++   R+ A+E          + KI
Sbjct: 201 CKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKI 250

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            S      D+  +   V        GC C+KS C K YCEC+  N  CS +CRC DC N
Sbjct: 251 KSQRQKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302


>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 319

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 11/81 (13%)

Query: 155 RREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCE 214
           R +A+ + LERNP AF+PK+ +       N  ET  +     H +GC+C+KS CLK YCE
Sbjct: 20  RSQAIASILERNPLAFQPKVVA-------NGSETNNL----HHLRGCNCRKSNCLKNYCE 68

Query: 215 CFQANILCSENCRCMDCKNFE 235
           C QA + C+  C C  C N E
Sbjct: 69  CHQARVSCTNRCACHKCCNTE 89



 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV 159
           N     + CNC+ S CLK YCEC  + + C + C C  C N  E  +A++  +
Sbjct: 47  NNLHHLRGCNCRKSNCLKNYCECHQARVSCTNRCACHKCCNTEEFHSAKKMLI 99


>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
          Length = 372

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 54/111 (48%), Gaps = 18/111 (16%)

Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           YC C+ASG +C + C C  CHN     A                  P +   P  T +  
Sbjct: 69  YCVCYASGSHCTELCGCEPCHNKPPQGAPWTA--------------PVLPLKPVQTLEAG 114

Query: 186 EETGEVLM---LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           +E  E L+   +    + C CKKSGCLKKYC+C+Q    CS NC+C DC+N
Sbjct: 115 QEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 165


>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
 gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
          Length = 916

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 61/139 (43%), Gaps = 14/139 (10%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEAT--------- 162
           ++K CNC  S+CLKLYC+CFA+G +C  C C +C NN++ E  R  A+  +         
Sbjct: 753 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIPWSVIPAHLRCR 812

Query: 163 -LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
            +E  P+     I +        KE  G  L   + N+         ++    C  + I+
Sbjct: 813 NMEDRPDVDMDSIDNFADVANKRKELNGTYL---QDNRSNVYLTDDVIEATIMCMISRIV 869

Query: 222 CSENCR-CMDCKNFEGSEE 239
             E  R C D    E  EE
Sbjct: 870 MHEKQRMCKDSTEREVMEE 888



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 4/49 (8%)

Query: 195 GKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN---FEGSEER 240
           G   K C+C KS CLK YC+CF     C ++C C DC N   +E   ER
Sbjct: 751 GSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKRER 798


>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
          Length = 524

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/59 (47%), Positives = 42/59 (71%), Gaps = 3/59 (5%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           K++ CNC  S+CLKLYC+CFA+G +C  CNC +CHN  +++ AR++A+     RN + F
Sbjct: 325 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIRC---RNTDQF 380



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           C+C KS CLK YC+CF     C  NC C DC N
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHN 360


>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
          Length = 96

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 39/84 (46%), Positives = 47/84 (55%), Gaps = 9/84 (10%)

Query: 158 AVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQ 217
           AV     RNPNAF  K         + K++   V+    H KGC CKKS CLKKYCECF 
Sbjct: 2   AVNDAKNRNPNAFNIKFQ-----IIEEKQDDKIVI----HKKGCTCKKSNCLKKYCECFN 52

Query: 218 ANILCSENCRCMDCKNFEGSEERQ 241
           A +LC++NC C  CKN +  E  Q
Sbjct: 53  AGVLCNQNCVCDSCKNMQKEENNQ 76



 Score = 44.7 bits (104), Expect = 0.13,   Method: Composition-based stats.
 Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARRE 157
           +K C CK S CLK YCECF +G+ C+  C C +C N  + E  +++
Sbjct: 33  KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKNMQKEENNQQQ 78


>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 56/119 (47%), Gaps = 18/119 (15%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C  S CLKLYC CF   I C + C C +CHN  +    R +A+E          + K+
Sbjct: 225 CKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKV 274

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
                   D+  +   V        GC C+KS C K YCECF  N  CS  C+C +C+N
Sbjct: 275 KQQRRKHDDDLFDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 26/43 (60%), Gaps = 3/43 (6%)

Query: 194 LGKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           L +HN     C C KS CLK YC CF  NI CSE C+C DC N
Sbjct: 215 LDEHNLNNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257


>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 65/131 (49%), Gaps = 18/131 (13%)

Query: 104 EVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT 162
           ++ ENT  K + C+C  + CLK+YC CF +G  C + C C +C N  E    R  AVE  
Sbjct: 72  QILENT--KWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEY- 128

Query: 163 LERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILC 222
                      I    H    NK+ T E L   K   GC+CKK+ C K+YCECF     C
Sbjct: 129 -----------IRKKAHR---NKKVTQEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTC 174

Query: 223 SENCRCMDCKN 233
           + +C C  C+N
Sbjct: 175 TVDCNCNHCEN 185


>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
          Length = 298

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/115 (39%), Positives = 55/115 (47%), Gaps = 8/115 (6%)

Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEAT---LERNPNAFRPKIASSP 178
           C+  YC CFA G YC D C C  C N    + A  E V+ T   L         KI   P
Sbjct: 13  CICSYCPCFAGGGYCSDKCGCQPCFN----KEAFSETVQTTRKVLLSRQKRMSMKINRRP 68

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
               +  E+           +GC+CKKS CLKKYC+C+Q    CS  CRC DC+N
Sbjct: 69  EANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 123



 Score = 38.9 bits (89), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           ++ CNCK S CLK YC+C+  G  C   C C +C N
Sbjct: 88  RRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 123


>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
 gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
          Length = 421

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 55/111 (49%), Gaps = 18/111 (16%)

Query: 127 YCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNK 185
           YC CFASG +C + C C  C+N   +   R                P +      T +  
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRST--------------PALPLKAVQTLETG 173

Query: 186 EETGEVLMLGKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           +++ E L+    + G   C CKKSGCLKKYC+C+Q    CS NC+C DCKN
Sbjct: 174 QDSVEQLIRSPMDFGRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKN 224


>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
          Length = 246

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 49/125 (39%), Positives = 60/125 (48%), Gaps = 17/125 (13%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVN-CHNNVENEAARREAVEATLERNPN 168
           K Q+ CNCK S C+K YC CF SG  C   C C + C N   N   R EA++     N  
Sbjct: 131 KCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEK 187

Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
            +R            N++   E L       GC C KS C K+YCECF  N  C+E C+C
Sbjct: 188 CYR------------NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKC 235

Query: 229 MDCKN 233
            DC N
Sbjct: 236 FDCCN 240


>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 381

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 60/118 (50%), Gaps = 31/118 (26%)

Query: 98  RSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC--DGCNCVNCHNNVE-NEAA 154
           RSR    + E  PK  K C CK++ CLKLYC CF  G +C  D C C++C+N  E NE  
Sbjct: 264 RSR---RLFEQVPKTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETG 320

Query: 155 --RREAVEATLERNPNAF--RPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGC 208
             R+EAV   L R  +AF  RP            K++TGE         GC CKK+ C
Sbjct: 321 GKRQEAVSEILLRRIDAFESRP------------KKKTGE---------GCACKKNRC 357



 Score = 45.4 bits (106), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKNFEGSEE 239
           K C CK + CLK YC CFQ    C  + C+C+DC N     E
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNE 318


>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 73/150 (48%), Gaps = 22/150 (14%)

Query: 89  VPKPESPKSRSRSNVEVKENTP----KKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNC 142
           + K +  + ++    ++ +N P    K Q+Q CNCK+S CLK YC CF SG  C   C C
Sbjct: 116 LQKKQENEEQAFKYFQILQNDPSEIQKLQQQPCNCKNSGCLKRYCRCFHSGRMCLKECQC 175

Query: 143 V-NCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGC 201
             +C NN +++  R  A+   +  +   +R            N+    + L       GC
Sbjct: 176 SEDCQNNEQHQEQRNNAI---IHVDQKCYR------------NRRMPRDALFKLDVIYGC 220

Query: 202 HCKKSGCLKKYCECFQANILCSENCRCMDC 231
            C KS C K+YCEC+  N  C++ C+C DC
Sbjct: 221 SCTKSKCRKRYCECYLRNQKCTDKCKCFDC 250


>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 56/129 (43%), Gaps = 23/129 (17%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           CNC  S C K+YCEC A G  C   C C NCHN   N+       E              
Sbjct: 72  CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHNKTSNKLVLNVIEEL------------- 118

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
                    ++    +     +   GC C+KS CLKKYCECF +   C   C+C++C+N+
Sbjct: 119 ---------DQNRNRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENY 169

Query: 235 EGSEERQAL 243
              E  + +
Sbjct: 170 VLDEVEKII 178



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 3/68 (4%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C+ S CLK YCECF SG  C  GC CVNC N V +E  +   + +  ++  + F  K 
Sbjct: 136 CTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENYVLDEVEK--IICSPNQKKTSKFSSKF 193

Query: 175 ASSPHGTR 182
            S+   TR
Sbjct: 194 ESANTNTR 201


>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 16/127 (12%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
           N   K + C+C  + CLK+YC CF +G  C + C C +C N   +   R EAVE      
Sbjct: 74  NESSKWQPCSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEY----- 128

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
                  I    H    NK+   E L   K   GC+CKK+ CLK+YCEC+     C+  C
Sbjct: 129 -------IKKKAHR---NKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVEC 178

Query: 227 RCMDCKN 233
            C  C+N
Sbjct: 179 NCNHCEN 185


>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
          Length = 123

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 64/120 (53%), Gaps = 18/120 (15%)

Query: 115 QCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPK 173
           +C C  + CLKLYCEC  +G  C D C CV+C N  +    R++A++   + NP AF   
Sbjct: 14  RCTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKF-PERKKALKHIKKTNPKAFLR- 71

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
             S+  G  D++               C C+KS C KKYC C+Q+   C+  C+C+DCKN
Sbjct: 72  -VSNLKGHEDSR--------------CCACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116



 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEER-QALFHGDHANNMAYIQ 256
           C C K+ CLK YCEC +A   C++ C C+DC N     ER +AL H    N  A+++
Sbjct: 15  CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71


>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 73/171 (42%), Gaps = 55/171 (32%)

Query: 69  PPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYC 128
           P    P  P P Q  Q   ++ + E P   + +             +C C  S+C K YC
Sbjct: 46  PKFFSPIRPFPIQ--QKITKIKQIEKPILNTIT-------------KCKCMKSKCQKSYC 90

Query: 129 ECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
           ECFA+G  C+  CNC  C+N + +++ +                             K +
Sbjct: 91  ECFAAGKACNMDCNCQGCNNTICSQSIK-----------------------------KVQ 121

Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
           TG          GC+CKK+GCLKKYCEC+     C+  C C+DC N E SE
Sbjct: 122 TG----------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSE 162



 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAAR 155
           CNCK + CLK YCEC+     C   CNC++C N  ++E+ +
Sbjct: 125 CNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSESEK 165


>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
          Length = 326

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/66 (50%), Positives = 43/66 (65%), Gaps = 7/66 (10%)

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           FRPK+ S     + N E+  +     +H KGC+CKKS CLKKYCEC+Q  + CSE C+C 
Sbjct: 244 FRPKVES-----KSNSEDEQD--HKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296

Query: 230 DCKNFE 235
           +CKN E
Sbjct: 297 ECKNCE 302



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 2/55 (3%)

Query: 95  PKSRSRSNVE-VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
           PK  S+SN E  +++ P+  K CNCK S CLK YCEC+  G+ C + C C  C N
Sbjct: 246 PKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKN 300


>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 22/122 (18%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLE---RNPNAFR 171
           C+C+ + CLK+YC CF +G +C   C C  C N  E +  R +AV+   +   RN    +
Sbjct: 89  CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKKAHRNKKVPK 148

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
            KI            ET E+        GC+C K+ C+KKYCECF     C+  C C  C
Sbjct: 149 EKIF-----------ETVEIW-------GCNCSKTRCVKKYCECFIRGKKCTVECNCDHC 190

Query: 232 KN 233
            N
Sbjct: 191 DN 192



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 23/35 (65%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           C C+K+ CLK YC CF     C ++CRC +C+N E
Sbjct: 89  CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTE 123


>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
          Length = 523

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 51/64 (79%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
           E+T + +K CNC  S CLKLYC+CFA+G +C+ CNC NC+NN+E+E  R++A++A L+RN
Sbjct: 451 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 510

Query: 167 PNAF 170
           P AF
Sbjct: 511 PEAF 514


>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
 gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
          Length = 459

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 29/36 (80%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           KGC+CKK+GCLK+YCECFQ N  C+  CRC  CKN+
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231



 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 33/72 (45%), Gaps = 4/72 (5%)

Query: 80  GQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD- 138
           G   +PP    KP   K + +   +  +N  K    CNCK + CLK YCECF +   C  
Sbjct: 165 GDQEEPPTTRAKPIELKLQHQQQPKFVDNNAKG---CNCKKTGCLKRYCECFQNNKRCTI 221

Query: 139 GCNCVNCHNNVE 150
            C C  C N  E
Sbjct: 222 KCRCQGCKNYHE 233


>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
          Length = 285

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 63/128 (49%), Gaps = 17/128 (13%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVN-CHNNVENEAARREAVEATLER 165
           N+  +Q+ C C++S CLK YC CF SG  C   C CV  C NN ++   R +A++   E+
Sbjct: 152 NSELEQQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEK 211

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
                             NK    + L   K   GC+CKK+ C   YCECF     C+ +
Sbjct: 212 ---------------CHRNKNVPKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMD 256

Query: 226 CRCMDCKN 233
           C+C +C+N
Sbjct: 257 CQCDNCEN 264


>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
          Length = 259

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 72/151 (47%), Gaps = 17/151 (11%)

Query: 84  QPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNC 142
           + P+   +     S++  NV+       + K C+C  + CL+LYC CF +   C   C C
Sbjct: 111 EEPIVNQRTRRKASKANYNVDDSPTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKC 170

Query: 143 VNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCH 202
             C N+ ++     +A+E          + K   + H   D+ + T +V        GC 
Sbjct: 171 SECFNDGQHSDEVLKAIEQ--------IKIKEQRASHHDLDSFD-TRQVW-------GCK 214

Query: 203 CKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           CKK+ C+K YCECF  +  CS +C+C +C+N
Sbjct: 215 CKKTQCIKGYCECFIRHKKCSSHCQCSNCQN 245


>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 78

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/58 (50%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 123 CLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPH 179
           CLKLYC+CFA  + C D C C NC+N+ E+   RR AV   L R+P+AF  K  + PH
Sbjct: 2   CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59



 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 2/38 (5%)

Query: 208 CLKKYCECFQANILCSENCRCMDCKNF--EGSEERQAL 243
           CLK YC+CF   ++CS+ CRC +C N    G E R A+
Sbjct: 2   CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAV 39


>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 55/122 (45%), Gaps = 22/122 (18%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV---EATLERNPNAFR 171
           C+C  S CL++YC CF +G  C   C C  C N  +N   R +AV   +   +RN     
Sbjct: 89  CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPE 148

Query: 172 PKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDC 231
            KI            ET E+        GC+C K+ C+K YCECF     CS  C C  C
Sbjct: 149 GKIF-----------ETTEIW-------GCNCSKTRCVKNYCECFIRGKKCSVECNCQHC 190

Query: 232 KN 233
            N
Sbjct: 191 DN 192



 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGS 237
            C C KS CL+ YC CF    +C ++C+C +C+N E +
Sbjct: 88  SCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDN 125


>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 379

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 84/169 (49%), Gaps = 26/169 (15%)

Query: 80  GQATQPPVRVPKPESPKSRSR--SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC 137
            QA+Q P ++   +   S  +  S  +V  +T     +CNC+ ++C+K YC C+A+  +C
Sbjct: 214 SQASQTPTKLISLQEQTSSQQISSFTKVHVDT-----ECNCQTTQCIKKYCSCYANNNFC 268

Query: 138 --DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRD---NKEETGEVL 192
               C C +C N  +      ++  +T ++     +PK+ SS          + +    L
Sbjct: 269 VPSICKCQSCMNTFQ------QSYNSTSQK-----KPKLISSQKSLSQVILYQIQNINQL 317

Query: 193 MLGKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
            + K N    C C+KS C K+YC+C+++   CS  C+C +C N +G +E
Sbjct: 318 KIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFN-QGQKE 365



 Score = 39.3 bits (90), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARR 156
           C C+ S C K YC+C+ SG  C   C C NC N  + E  +R
Sbjct: 328 CTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFNQGQKEVVQR 369


>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 40/133 (30%), Positives = 65/133 (48%), Gaps = 21/133 (15%)

Query: 103 VEVKENTPKKQKQ-CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVE 160
           ++++ N  ++ K+ C C+ ++CLKLYC+CF  G  C   C C  C N++ +   R +A+ 
Sbjct: 8   LQIQGNDSEETKEGCQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNKAIG 67

Query: 161 ATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANI 220
             + +  + ++ K++                   G   +GC CKKS C   YCECF    
Sbjct: 68  KIIHKYKDDYKQKLS-------------------GDKGQGCCCKKSKCKLNYCECFIKGR 108

Query: 221 LCSENCRCMDCKN 233
            C E C C  C+N
Sbjct: 109 SCGEQCHCKKCEN 121


>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
          Length = 262

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 37/155 (23%)

Query: 83  TQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQ--CNCKHSRCLKLYCECFASGIYCDG- 139
           +QP + + +    + + ++ +++ +    K+K+  C+C   +C   YC C   G  C   
Sbjct: 96  SQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTKRKCSSKYCFCAKVGQSCSNL 155

Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
           C CV+C NN   ++   + ++ T     NA                          +  K
Sbjct: 156 CKCVDCSNN---KSFMSQQIQVT-----NA--------------------------QEQK 181

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           GC CKK+ CLK YC+CF   + CS NC+C+ CKN 
Sbjct: 182 GCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKNM 216


>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
          Length = 278

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 58/124 (46%), Gaps = 17/124 (13%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVN-CHNNVENEAARREAVEATLERNPNA 169
           +Q+ C C++S CLK YC CF SG  C   C CV  C NN ++   R  A++   E+    
Sbjct: 149 EQQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEK---- 204

Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
                         NK    + L   K   GC CKKS C   YCECF     C+ +C C 
Sbjct: 205 -----------CHRNKNIPKDALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDCECQ 253

Query: 230 DCKN 233
           +C+N
Sbjct: 254 NCEN 257


>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 61/132 (46%), Gaps = 26/132 (19%)

Query: 103 VEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEA 161
           +EVK+  P  Q +C C  S+C + YCECFA G  C   C C NC N   N+      +E 
Sbjct: 86  IEVKDLQP--QIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQNKQMNKK-----LEK 138

Query: 162 TLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANIL 221
             ++     R K  +S                  ++ KGC C KS C  K+C C Q    
Sbjct: 139 IQKKLIKQKRIKQRTS------------------RYLKGCTCGKSKCQNKFCSCHQDGKY 180

Query: 222 CSENCRCMDCKN 233
           C+  CRC+DCKN
Sbjct: 181 CNSGCRCVDCKN 192


>gi|38637223|dbj|BAD03489.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|38637276|dbj|BAD03540.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 141

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 11/85 (12%)

Query: 428 SDGSDGQKA--RPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVY 485
           +D  D QK   RP+SPGT ALMCDEQD +F  + +P+         ++     + + +++
Sbjct: 57  ADPFDIQKGNKRPLSPGTQALMCDEQDAVFQDSKAPD---------ATPSTTEENLVDIF 107

Query: 486 AEQERIVLTKFRDCLNRLITFGEIK 510
            EQE+ VLT FRD L +L T G+++
Sbjct: 108 KEQEKRVLTNFRDYLCKLATCGKLQ 132


>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
          Length = 254

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 35/117 (29%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN-NVENEAARREAVEATLERNPNAFRPK 173
           C CK ++C+K YCECF  G  C   C C  C N  V+  + + + V  T+E         
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCNKQVKQSSGKMQLVSPTIEF-------- 195

Query: 174 IASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
                                    +GC C+K  C KKYCEC + N  C++ C+C+D
Sbjct: 196 -------------------------QGCKCQKQNCSKKYCECLKRNQRCTDLCKCID 227



 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           C CKK+ C+K YCECFQ    CS  C C  C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176


>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 103

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 29/34 (85%)

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
           +H KGC+CK+SGCLK YCEC++A I CS NC+C+
Sbjct: 60  RHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCI 93



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 27/61 (44%), Positives = 33/61 (54%), Gaps = 4/61 (6%)

Query: 92  PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVN-CHNNV 149
           P  PK  + S  E  + T +  K CNCK S CLK YCEC+ + I C G C C+  CH   
Sbjct: 43  PVQPKIGATSAAE--DATRRHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCIGECHAER 100

Query: 150 E 150
           E
Sbjct: 101 E 101


>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
 gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
          Length = 113

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/65 (49%), Positives = 44/65 (67%), Gaps = 7/65 (10%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
           K+C+CK S+CLKLYCECFAS +YC + C+C  C N+  +E    + R  +E+   RNP A
Sbjct: 19  KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIES---RNPLA 75

Query: 170 FRPKI 174
           F PK+
Sbjct: 76  FAPKV 80



 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 19  KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59


>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
          Length = 677

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 199 KGCHCKKSGCLKKY--CECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQ 256
           K C+C KS CLK +       A I+CS  C+C+ CKNFE S ER+ L H   A  +   Q
Sbjct: 523 KPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQ 582

Query: 257 QAA 259
           Q A
Sbjct: 583 QTA 585


>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 957

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 41/78 (52%), Gaps = 14/78 (17%)

Query: 106 KENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG--CNCVNCHNNVENEAARR------- 156
           K N  K  K C CK ++CLKLYC CF+  I CD   C+C  C  N ++E AR        
Sbjct: 308 KSNKKKTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERC-ANTQDELARDGDNGMAM 366

Query: 157 ----EAVEATLERNPNAF 170
                A ++ LERNP AF
Sbjct: 367 SAVVTARKSILERNPRAF 384



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 27/40 (67%)

Query: 194 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           L K  + C CKKS CLKKYCECF  ++ C+  C+C +C N
Sbjct: 652 LTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           K+ + C CK S+CLK YCECFA  + C   C C NC N  
Sbjct: 654 KQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGNQA 693



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 196 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKN 233
           K +KGC CKK+ CLK YC CF  +I C  + C C  C N
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCAN 351


>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
 gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
          Length = 590

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 71/176 (40%), Gaps = 45/176 (25%)

Query: 67  VVPPVPPPPPPL--PGQATQPP----VRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKH 120
           +VP + P        G +T P     +++P P S    + S V+ K     ++  C   +
Sbjct: 422 LVPEIRPQTRNFRESGNSTIPDKDDRLQMPGPYSLFEITSSQVDKKIKCSCQKNSCTQAY 481

Query: 121 SRCLKLYCECFASGIYCD--GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSP 178
                  C+C  +G  CD   C+C +C N +EN+  R E  E   ++             
Sbjct: 482 -------CDCLKNGQACDPSSCSCASCLNTIENQNLRLEIQEKKQKQ------------- 521

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCMDCKN 233
                           G+  +GC CK S C K+YCECFQ    C    C+C DCKN
Sbjct: 522 ----------------GQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKN 561



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 11/55 (20%)

Query: 103 VEVKENTPKKQKQ------CNCKHSRCLKLYCECFASGIYCD--GCNCVNCHNNV 149
           +E++E   KKQKQ      C+CK+S+C K YCECF +G  CD   C C +C NN+
Sbjct: 512 LEIQE---KKQKQGQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNL 563


>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
          Length = 192

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/181 (29%), Positives = 76/181 (41%), Gaps = 57/181 (31%)

Query: 59  PPQPQPKPVVPPVPPPPPPLPGQATQP-PVR--VPKPESPKSRSRSNVEVKENTPKKQKQ 115
           P +  P P++    P   P    A +P P++  + KP+          E++++  K    
Sbjct: 28  PQKESPMPILKFFDPDESPNILSAKKPFPIQKNIVKPK----------EIEKSIVK---- 73

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C  S+C K YCECFA G  C   CNC+ C+N                           
Sbjct: 74  CKCLKSKCQKSYCECFAVGRSCSIDCNCLGCNN--------------------------- 106

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
             S    +  K + G          GC+CKK+GC+KKYCEC+     C+  C C+DC N 
Sbjct: 107 --SNCSQKIKKSQCG----------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQ 154

Query: 235 E 235
           E
Sbjct: 155 E 155


>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
 gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
          Length = 296

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 3/80 (3%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQA 258
           KGC CK++ C+K YC+C+Q+  +C E C+C+DCKN   S ER  +      N+      A
Sbjct: 38  KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN---SVERPLVTDNLPKNSRRQRASA 94

Query: 259 ANAAITGAIGSSGYASPPVS 278
             A +   +   G    P +
Sbjct: 95  VAAKVEAGVLKVGLKPAPTT 114



 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 11/61 (18%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           +P +PK+   S +          K C CK + C+K YC+C+ S   C + C CV+C N+V
Sbjct: 25  QPRTPKTPKESTI----------KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKNSV 74

Query: 150 E 150
           E
Sbjct: 75  E 75


>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
 gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
          Length = 296

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 3/80 (3%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQA 258
           KGC CK++ C+K YC+C+Q+  +C E C+C+DCKN   S ER  +      N+      A
Sbjct: 38  KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN---SVERPLVTDNLPKNSRRQRASA 94

Query: 259 ANAAITGAIGSSGYASPPVS 278
             A +   +   G    P +
Sbjct: 95  VAAKVEAGVLKVGLKPAPTT 114



 Score = 45.4 bits (106), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 11/61 (18%)

Query: 91  KPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNV 149
           +P +PK+   S +          K C CK + C+K YC+C+ S   C + C CV+C N+V
Sbjct: 25  QPRTPKTPKESTI----------KGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKNSV 74

Query: 150 E 150
           E
Sbjct: 75  E 75


>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
          Length = 754

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 79/177 (44%), Gaps = 33/177 (18%)

Query: 20  GDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVVPP--VPPP--- 74
           G IP     RQ+        SV+    P S+   +  HP  Q Q +   P   V PP   
Sbjct: 149 GQIPHHAYQRQMS-----SASVVQGPGPSSRGFRS-YHPIEQFQQRQAPPSHYVYPPYVH 202

Query: 75  --PPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFA 132
             P  +P   TQ P   P+P+    R RS  +         K C+C  ++CLKLYC CFA
Sbjct: 203 VAPTVVPTVHTQKP---PQPQVLTLRQRSASD---------KGCSCSKTQCLKLYCSCFA 250

Query: 133 SGIYC--DGC---NCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDN 184
           + + C  D C   +C NC      + A  +A  + L +NP AF+ K  S   GTR++
Sbjct: 251 NNLLCNKDVCICSDCKNCEAESTEDGAILKARRSILRKNPKAFKSKFQS---GTREH 304



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)

Query: 190 EVLMLGKH---NKGCHCKKSGCLKKYCECFQANILCSEN-CRCMDCKNFEG 236
           +VL L +    +KGC C K+ CLK YC CF  N+LC+++ C C DCKN E 
Sbjct: 220 QVLTLRQRSASDKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEA 270



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 24/37 (64%)

Query: 197 HNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           H  GC C+KS CLKKYCECF   + C  +C C +C N
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENE 152
           C C+ SRCLK YCECF++G+ C   C+C NC N    E
Sbjct: 425 CKCRKSRCLKKYCECFSNGVKCGASCSCENCGNQPAQE 462


>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
          Length = 151

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 65/141 (46%), Gaps = 20/141 (14%)

Query: 108 NTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERN 166
           N+ + ++ C C+ ++CLKLYC+CF  G  C + C+C  C N++ +   R +A+   + + 
Sbjct: 19  NSEETKEGCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHKY 78

Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
            +  + K +                   G    GC CKKS C   YCECF     C + C
Sbjct: 79  KDESKLKFS-------------------GDKGSGCCCKKSKCQLNYCECFIKGRSCGDQC 119

Query: 227 RCMDCKNFEGSEERQALFHGD 247
            C  C+N +   ++    H D
Sbjct: 120 HCKKCQNGKKQNQKICANHED 140



 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 27/55 (49%), Gaps = 9/55 (16%)

Query: 179 HGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           H    N EET E         GC C+K+ CLK YC+CF     C E C C  CKN
Sbjct: 14  HSYAMNSEETKE---------GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKN 59


>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
 gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
          Length = 324

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 30/37 (81%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N E
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRNTE 139



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 101 SNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           +N ++        K C CK S+C+K YC+C+ S + C   C CV C N
Sbjct: 90  ANAQIPVQCQATMKGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN 137


>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
 gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
          Length = 250

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 34/43 (79%), Gaps = 3/43 (6%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQ 241
           KGC CK+S C+K YC+C+Q+  +CS++CRC+ C+N   ++ER+
Sbjct: 24  KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRN---TQERK 63



 Score = 42.7 bits (99), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENE 152
           K C+CK S+C+K YC+C+ S   C   C C+ C N  E +
Sbjct: 24  KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRNTQERK 63


>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 79/183 (43%), Gaps = 40/183 (21%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
           +N PK  K C C  S CLK YC+CFA+G  C   C C  CHN  +    R EA+     +
Sbjct: 50  QNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQ 108

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           N +  +      P G     +++                    LK+YC+CFQ N  CSE+
Sbjct: 109 NQSIEKE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCSES 146

Query: 226 CRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQE 285
           C+C +C N     + + +  GD        Q+    +I+    +S  A  P+   R+G  
Sbjct: 147 CQCNNCSN-----QVEKVEQGD--------QRLMMNSISQFDENSRLAKSPIVYNRQG-- 191

Query: 286 LFF 288
            FF
Sbjct: 192 -FF 193


>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
          Length = 244

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 54/126 (42%), Gaps = 24/126 (19%)

Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNP 167
           +PK Q +C C  S+C + YCECFA G  C   C C NC N   N+               
Sbjct: 90  SPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQNMQMNKKL------------- 136

Query: 168 NAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCR 227
                KI       +  K+ T   L      KGC C KS C   +C C Q    C+  C+
Sbjct: 137 ----GKIQKRLVKQKRIKQRTPRYL------KGCTCGKSKCQNNFCSCHQDGKYCNSECK 186

Query: 228 CMDCKN 233
           C+DCKN
Sbjct: 187 CVDCKN 192


>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 622

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 54/116 (46%), Gaps = 14/116 (12%)

Query: 64  PKPVVPPVPPPPPPL-PGQATQPPVRVPKPESPKSRSRSNVEVKE-------NTPKKQKQ 115
           P+ ++P        +   QA      + K + P ++ +S    KE        TP     
Sbjct: 274 PQQILPNFDQSSLSINANQAIVINNNMQKKQQPNNKLQSIDSSKEGEILRVSKTP----- 328

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           C C+ S CLKLYCECFA G  C   C C+ C N+  +   R EA++    +NP+AF
Sbjct: 329 CKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384


>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
          Length = 289

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/233 (24%), Positives = 98/233 (42%), Gaps = 35/233 (15%)

Query: 14  SQSESGGDIPAKKLA-----RQLDFTGGFVGSVILPEHPQSQAVTANSHPPPQPQPKPVV 68
           S SE  GD+P  +L        LD   G+    +  E     A++   + P Q Q   + 
Sbjct: 50  SSSELIGDVPVPRLTLFVSQMSLDENQGYYKKQMSLE-----ALSPTKYSPQQKQENQIR 104

Query: 69  PPVPPPPPPLPGQATQPPVR----VPKPESPKSRSRSNVEVKENTPKKQKQ---CNCKHS 121
                       +  Q  ++    V + ++ +  +++N  V +N+P ++ +   C+C  +
Sbjct: 105 QDAQEIKQLEIQEIKQVDIQEVQNVQQIQTRRKVNKANYNV-DNSPVEEDEMKPCHCTKT 163

Query: 122 RCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHG 180
            CL+LYC CF +   C   C C +C+N+ ++E    +AVE    +   A    + S    
Sbjct: 164 HCLQLYCSCFHNRRQCMSECKCNDCYNDGQHEEDVLKAVEQIKMKEQRASHHDLDSF--- 220

Query: 181 TRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
             D K+  G      K  KG           YCECF     C+ +C+C +C+N
Sbjct: 221 --DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 260


>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
 gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
          Length = 237

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 29/37 (78%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           KGC CK+S C+K YC+C+Q+  +C+  CRC+DC+N E
Sbjct: 26  KGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNTE 62



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 4/58 (6%)

Query: 94  SPKSRSRSNVE---VKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           +PK +S    E    K   P   K C CK S+C+K YC+C+ S   C   C C++C N
Sbjct: 3   TPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRN 60


>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
          Length = 514

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 43/114 (37%), Gaps = 44/114 (38%)

Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           L  YC+C A+G+YC D C C NC N  ENE   +   E    R+P AF P+I        
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
                                              A I CS+ CRC DC+N  G
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFG 314


>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 24/128 (18%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLER 165
           +N PK  K C C  S CLK YC+CFA+G  C   C C  CHN  + +  R EA+     +
Sbjct: 50  QNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQ 108

Query: 166 NPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSEN 225
           N +  +      P G     +++  +                  K+YC+CFQ N  C E+
Sbjct: 109 NQSIEKE----VPVGISCKCKKSKCL------------------KRYCDCFQNNQKCHES 146

Query: 226 CRCMDCKN 233
           C+C +C N
Sbjct: 147 CQCSNCSN 154



 Score = 42.7 bits (99), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 24/39 (61%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           C C KS CLK+YC+CF    +C+  C+C  C N E  +E
Sbjct: 58  CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKE 96


>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
 gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
          Length = 239

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 29/37 (78%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFE 235
           KGC CK+S C+K YC+C+Q+  +C++ CRC+ C+N E
Sbjct: 26  KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 7/55 (12%)

Query: 100 RSNVEVKENT------PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHN 147
           + NVE  E+       P   K C CK S+C+K YC+C+ S   C   C C+ C N
Sbjct: 6   KKNVERTESRRGKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRN 60


>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
 gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
          Length = 225

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   +EER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 7/77 (9%)

Query: 96  KSRSRSNVEVKENTPKKQ--KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENE 152
           +S +  N E  +  PK+Q  K C CK S+C+K YC+C+ S   C   C C+ C N  E  
Sbjct: 5   RSGNYFNKEQSKRQPKQQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRNTEERL 64

Query: 153 AA----RREAVEATLER 165
           A     ++ A+ A  ER
Sbjct: 65  AVATPPKKTALAAKRER 81


>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 636

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
           GC C++S CLKKYCECFQ    C  NC+C +C+N 
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 113 QKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHN 147
           Q  C C+ S CLK YCECF +   C   C C NC N
Sbjct: 243 QFGCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQN 278


>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
 gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
          Length = 225

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           +GC CK+S C+K YC+C+Q+  +CS +C+C+ C+N   +E+R A+
Sbjct: 22  RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63



 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAA----RREAVEATLER 165
           C CK S+C+K YC+C+ S   C   C C+ C N  +  A      + A+ A  ER
Sbjct: 24  CCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRNTEQRMAVTTPPLKTAIAAKRER 78


>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
          Length = 97

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 112 KQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAF 170
           + + C C  S C+KLYC CF++  YC   C C NC N    E    E V+    +NP AF
Sbjct: 5   QHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAF 64

Query: 171 RPKIA 175
            PKI 
Sbjct: 65  DPKIV 69



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 29/47 (61%)

Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           M    ++ C C KS C+K YC CF A   CS+NCRC +CKN E  E+
Sbjct: 1   MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47


>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
 gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
          Length = 234

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQAL 243
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   ++ER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 11/80 (13%)

Query: 97  SRSRSNVEVKENTPKKQ------KQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV 149
           S+ RS + + +  PK+Q      K C CK S+C+K YC+C+ S   C   C CV C N  
Sbjct: 2   SQVRSGIYLSKEHPKRQPKPQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRNTQ 61

Query: 150 ENEAA----RREAVEATLER 165
           E  A     ++ A+ A  ER
Sbjct: 62  ERLAVATPPKKTALAAKRER 81


>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
 gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
          Length = 240

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 27/35 (77%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           KGC CK+S C+K YC+C+Q+  +C++ CRC  C+N
Sbjct: 23  KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 114 KQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFR 171
           K C+CK S+C+K YC+C+ S   C   C C  C N V  E     +V     +N NA +
Sbjct: 23  KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRNTVVRELVDPNSV----AKNSNAVK 77


>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
          Length = 264

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 61/138 (44%), Gaps = 17/138 (12%)

Query: 97  SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAAR 155
           S++  NV+       + K C+C  + CL+LYC CF +   C   C C +C+N+ ++    
Sbjct: 129 SKANYNVDESPTLDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEV 188

Query: 156 REAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCEC 215
            +AVE    +   A    + S      D ++  G      K  KG           YCEC
Sbjct: 189 LKAVEQIKIKEQRASHHDLDSF-----DTRQVWGCKCKKTKCVKG-----------YCEC 232

Query: 216 FQANILCSENCRCMDCKN 233
           F  N  CS +C C DC+N
Sbjct: 233 FIRNKKCSSHCHCSDCQN 250


>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
          Length = 280

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 17/119 (14%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C+C  + CL+LYC CF +   C   C C +C+N+ ++E   ++AVE    +   A    +
Sbjct: 151 CHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEEEVQKAVEQIKIKEQRASHHDL 210

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
            S      D K+  G      K  KG           YCECF     C+ +C+C +C+N
Sbjct: 211 DSF-----DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 253


>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
 gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 47/114 (41%), Gaps = 33/114 (28%)

Query: 124 LKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTR 182
           L  YCECFA+G+YC + C+C++C NN  +E    E       RNP AF PK+        
Sbjct: 8   LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIR------ 61

Query: 183 DNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEG 236
            N +   E                         F+  + CS  CRC  CKN  G
Sbjct: 62  -NLDSASE-------------------------FRGGVGCSSYCRCEGCKNTFG 89


>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
 gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
 gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
          Length = 243

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 28/38 (73%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSE 238
           C CK+S C+K YC+C+Q+  +C++ CRC+ C+N E  E
Sbjct: 28  CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65


>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 75/161 (46%), Gaps = 21/161 (13%)

Query: 77  PLPGQATQPPVRVPKPESPKSRSRSNVEVKENTPKKQ---KQCNCKHSRCLKLYCECFAS 133
           P+  +  +    +P  ++ +  +++N  V +++P ++   K C+C  + CL+LYC CF +
Sbjct: 85  PIEMKQVEQVEEIPHIQTRRKANKTNYNV-DDSPTEEDDFKACHCSKTHCLQLYCSCFHN 143

Query: 134 GIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVL 192
              C + C C +C N+ ++E    +AVE    +   A    + S      D K+  G   
Sbjct: 144 RRPCTNECKCNDCFNDGKHEDEVIKAVEQIKLKEQRASHHDLDSF-----DTKQVWGCKC 198

Query: 193 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
              K  KG           YCECF     C+ +C+C +C+N
Sbjct: 199 KKTKCVKG-----------YCECFIRGKKCTSHCQCTECEN 228


>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
          Length = 177

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 48/109 (44%), Gaps = 3/109 (2%)

Query: 88  RVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKL--YCECFASGIYC-DGCNCVN 144
           R+  P + K  S + V +K +  +K   C+ K     K   YC CF++  YC   C C N
Sbjct: 59  RLLSPAASKQGSTAGVALKRDEKEKGGICHFKKRARGKQESYCLCFSAYGYCSQNCRCTN 118

Query: 145 CHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLM 193
           C N    E    E V+    +NP AF PKI      +    + +  VLM
Sbjct: 119 CKNREYYEDFVEERVDMIKMKNPRAFDPKIVRVQDASEIEPQSSNAVLM 167



 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 31/60 (51%), Gaps = 12/60 (20%)

Query: 182 RDNKEETGEVLMLGKHNKGCHCKKS--GCLKKYCECFQANILCSENCRCMDCKNFEGSEE 239
           RD KE+ G           CH KK   G  + YC CF A   CS+NCRC +CKN E  E+
Sbjct: 78  RDEKEKGGI----------CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127


>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
          Length = 226

 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 61/155 (39%), Gaps = 43/155 (27%)

Query: 81  QATQPPVRVPKPESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCD-G 139
           +  Q  V V K   P      N+   EN       C C+ S+C++ YC C A+   C   
Sbjct: 99  ELNQNNVIVNKLVKPSDDELFNLSRDENIV-----CKCRKSKCIQNYCVCSANNQECSFK 153

Query: 140 CNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNK 199
           C C NC N                       +PK A S       K+   E L       
Sbjct: 154 CECYNCSNK----------------------QPKSAIS-------KQSGSEYL------- 177

Query: 200 GCHCKKSGCLKKYCECFQANILCSENCRCMD-CKN 233
           GC+C+K  C K YCEC +  I C+  CRC D CKN
Sbjct: 178 GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212


>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 54/121 (44%), Gaps = 17/121 (14%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C+C  + CL+LYC CF +   C + C C +C+N+ ++E    +AVE    +   A    +
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAVEQIKLKEQRASHHDL 208

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNF 234
            S      D K+  G      K  KG           YCECF     C+ +C+C + K  
Sbjct: 209 DSF-----DTKQVWGCKCKKTKCVKG-----------YCECFIRGKKCTSHCQCTELKTI 252

Query: 235 E 235
           E
Sbjct: 253 E 253


>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
          Length = 289

 Score = 46.6 bits (109), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 23/33 (69%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           C CK+S C+KKYCECFQ +  C+  C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190



 Score = 38.5 bits (88), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 1/35 (2%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNV 149
           C CK SRC+K YCECF +   C   C C +C N +
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGNTI 192


>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 45.4 bits (106), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 51/119 (42%), Gaps = 17/119 (14%)

Query: 116 CNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C+C  ++CL+LYC CF     C   C C  C N+  ++    E V+A  +     +R  +
Sbjct: 147 CHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKV---EMVKALQKVKLKEYR--V 201

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           + S   + D ++  G         KG           YCECF     C+  C+C +C N
Sbjct: 202 SQSDLDSFDTRQVLGCKCKKTHCKKG-----------YCECFIRGRKCTSQCKCCECNN 249



 Score = 42.4 bits (98), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query: 199 KGCHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           K CHC K+ CL+ YC CF  N  CS  C+C  C N
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFN 179


>gi|145483717|ref|XP_001427881.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394964|emb|CAK60483.1| unnamed protein product [Paramecium tetraurelia]
          Length = 219

 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 51/122 (41%), Gaps = 38/122 (31%)

Query: 116 CNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C+ S+C++ YC C A+   C   C C NC N                 ++P +  PK+
Sbjct: 122 CKCRKSKCIQNYCVCSANNQECKSNCECYNCSN-----------------KSPKSAPPKM 164

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM-DCKN 233
           +S                       GC+C+K  C K+YCEC +    C+  C C  DC+N
Sbjct: 165 SSIEFN-------------------GCNCRKQNCSKRYCECQKRMTRCTSRCNCCEDCEN 205

Query: 234 FE 235
            E
Sbjct: 206 QE 207


>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
          Length = 235

 Score = 45.1 bits (105), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 47/116 (40%), Gaps = 37/116 (31%)

Query: 116 CNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNVENEAARREAVEATLERNPNAFRPKI 174
           C C+ S+C++ YC C A+   C   C C NC N                 + P +   ++
Sbjct: 138 CKCRRSKCIQNYCVCSANNQECKLNCECYNCSN-----------------KQPKSTPSQL 180

Query: 175 ASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMD 230
           +S                       GC+C+K  C K+YCEC + NI C+  C C +
Sbjct: 181 SSIEFN-------------------GCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217


>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
          Length = 261

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAV 159
           +N PK  + C C  S CLK YC+CFA+G  C + C C  CHN+ E    R EA+
Sbjct: 82  QNVPKSLR-CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHNSEEYMEEREEAI 134



 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 201 CHCKKSGCLKKYCECFQANILCSENCRCMDCKN 233
           C C KS CLK+YC+CF    +C+  C+C  C N
Sbjct: 90  CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122


>gi|221487871|gb|EEE26103.1| Radical SAM domain-containing protein, putative [Toxoplasma gondii
           GT1]
          Length = 1083

 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)

Query: 363 SEAAKSVAVQRNTTEKQAEDRAGT--SVASSAQGRLQSHKEPDVDRTVTDDCSSANQADK 420
           SE AK++A+     EK+  D  G   +  +  QG +QS + P  +   +   S++  AD+
Sbjct: 412 SEPAKAIAL--TGVEKEGADLLGEDGAAGTDMQGTIQSREIPVFNNIPSTGGSASRTADR 469

Query: 421 VGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQG 480
             P  S +  S      P   GT      E+ T   A SSP+ L   G   + Q P    
Sbjct: 470 --PLQSRTSASA-----PRETGTTGTTVREEPTSETANSSPSPLIPFGFGRAPQPPTEMA 522

Query: 481 MTEVYAEQERI-------VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNGT 533
           ++ + A    +       VL ++R+ L+RL +F E   ++  +  R E G+    LSN  
Sbjct: 523 LSRLLAPTSPLPVMGRPRVLGEYRNGLHRLQSFAEPLTLRGPATLRVENGAALLFLSNAW 582

Query: 534 ASSRTGT 540
            S+ + T
Sbjct: 583 TSAESAT 589


>gi|123410061|ref|XP_001303597.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884989|gb|EAX90667.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 162

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 105 VKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATL 163
           +++  PK    CNC+   CL   C CF    YC DGC C  C N  E E  R  + E  L
Sbjct: 1   MEQTEPKCASCCNCE-GNCLSTTCPCFLHSKYCCDGCKCQKCRNKKEYEQERVASFEQHL 59

Query: 164 ERNPNAF 170
             NP AF
Sbjct: 60  LENPLAF 66


>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
          Length = 176

 Score = 42.4 bits (98), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 16/22 (72%), Positives = 17/22 (77%)

Query: 113 QKQCNCKHSRCLKLYCECFASG 134
           +K CNC  S CLKLYCECFA G
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQG 165


>gi|237830953|ref|XP_002364774.1| radical SAM domain-containing protein [Toxoplasma gondii ME49]
 gi|211962438|gb|EEA97633.1| radical SAM domain-containing protein [Toxoplasma gondii ME49]
 gi|221507654|gb|EEE33258.1| Radical SAM domain-containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 1083

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 81/188 (43%), Gaps = 20/188 (10%)

Query: 363 SEAAKSVA---VQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQAD 419
           SE AK++A   VQ+   +   ED A     +  QG +QS + P  +   +   S++  AD
Sbjct: 412 SEPAKAIALTGVQKEGADLLGEDGAA---GTDMQGTIQSREIPVFNNIPSTGGSASRTAD 468

Query: 420 KVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQ 479
           +  P  S +  S      P   GT      E+ T   A SSP+ L   G   + Q P   
Sbjct: 469 R--PLQSRTSASA-----PRETGTTGTTVREEPTSETANSSPSPLIPFGFGRAPQPPTEM 521

Query: 480 GMTEVYAEQERI-------VLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKDALSNG 532
            ++ + A    +       VL ++R+ L+RL +F E   ++  +  R E G     LSN 
Sbjct: 522 ALSRLPAPTSPLPVMGRPRVLGEYRNGLHRLQSFAEPLTLRGPATLRVENGGALLFLSNA 581

Query: 533 TASSRTGT 540
             S+ + T
Sbjct: 582 WTSAASAT 589


>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
           C-169]
          Length = 556

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 44/70 (62%), Gaps = 8/70 (11%)

Query: 164 ERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCS 223
           +RNP AF+ KI    +G     + +G   + G H +GCHCKKS C KKYCECFQA + C 
Sbjct: 162 QRNPQAFQEKIEKQANG-----QGSG---VSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213

Query: 224 ENCRCMDCKN 233
            +C+C +C N
Sbjct: 214 VHCKCDECHN 223


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.127    0.376 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,784,014,009
Number of Sequences: 23463169
Number of extensions: 434168873
Number of successful extensions: 4556111
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 7289
Number of HSP's successfully gapped in prelim test: 11037
Number of HSP's that attempted gapping in prelim test: 3852991
Number of HSP's gapped (non-prelim): 396011
length of query: 602
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 453
effective length of database: 8,863,183,186
effective search space: 4015021983258
effective search space used: 4015021983258
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 80 (35.4 bits)