BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 008269
         (571 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
 gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
          Length = 629

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 313/594 (52%), Positives = 381/594 (64%), Gaps = 74/594 (12%)

Query: 1   MRSPSQAMPQ-WQARPQHVRM---VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKD 56
           + +P +  PQ  QARPQHV +   + R+PHPV KL L TL   KQ+SP SRPR N + KD
Sbjct: 79  LETPPKQQPQATQARPQHVMVPPAMRRIPHPVQKLSLHTLQLIKQQSPSSRPRNNIEAKD 138

Query: 57  --GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
             GTPKK K CNC+NSRCLKL                             YCECFAAG++
Sbjct: 139 NSGTPKKAKHCNCKNSRCLKL-----------------------------YCECFAAGVH 169

Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
           C GCNC NCHNNVE+E +RQEAVGA LERNP AF+PKIASSPHG++DA+EDA E QL  K
Sbjct: 170 CTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGK 229

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 234
           HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +HNGI +MQ+
Sbjct: 230 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNHNGIVYMQQ 289

Query: 235 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 294
           AANAAI GA+GSSGYGT +TSNKRKSEE LFG   +DQ   R  ++QQ N+ +N A   S
Sbjct: 290 AANAAICGAIGSSGYGTPLTSNKRKSEE-LFGSTNRDQLGRRVAKTQQENHQRNPAASAS 348

Query: 295 Q-SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM 353
             SVPV    NA+ LGS+K   +SPLA ILQPQD+K++CSLLV++S EA K    K+  M
Sbjct: 349 PLSVPVPH--NATALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARKAYAGKMD-M 405

Query: 354 QTEREGSR---SSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDML 410
           Q   + +R   SS +S I+E ED   G+ VH  +P+D  +GN  + D +S SG DGG + 
Sbjct: 406 QPGGDDNRKLESSSSSSIREREDGLMGNNVHKTVPNDHVNGNTAEKD-SSDSGKDGGGLE 464

Query: 411 NERPLSPGTRALMCDEEDSAFMQAGPQTGLA--SKSTVQK-SNAYASTRIYAEQERLILT 467
           N RPLSP    LMC E++  F +    TG+A  S++ +QK S+A   +  YAEQE+LILT
Sbjct: 465 NGRPLSPEID-LMCHEQEMVFREGESATGMAGLSQNEIQKSSSAQECSEAYAEQEKLILT 523

Query: 468 RFRDFLNRLITCGSIK---------------ETMCSPLVKSETRSQQKAVEDGTSRSGTE 512
            FRDFLNRLITCGSIK               ET CSPL +S+T S Q+  +    ++G E
Sbjct: 524 GFRDFLNRLITCGSIKGKLLWSETFLCSNFHETTCSPLARSKTGSHQEPADGEMIKAGIE 583

Query: 513 VRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNGQRVVDSE 566
                     G  KS V  T           S  N DLP++  L  G   V+ E
Sbjct: 584 TLNDKNAYASGIVKSPVLAT-----------SASNVDLPIKALLPIGNGEVNPE 626


>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
 gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
          Length = 601

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 282/557 (50%), Positives = 348/557 (62%), Gaps = 41/557 (7%)

Query: 6   QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPG------KQESPRSRPRANTDGKDGTP 59
           Q  P   A PQ   +  + P  V     P LPP       K ESP+S+PR N + KDGTP
Sbjct: 58  QQPPATVALPQEPVVAPQQPQAVAIPEAPLLPPQPPVRAIKPESPKSKPRPNAELKDGTP 117

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K+QKQCNC++SRCLKL                             YCECFA+G YCDGCN
Sbjct: 118 KRQKQCNCKHSRCLKL-----------------------------YCECFASGTYCDGCN 148

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+  E  +  KHNKGC
Sbjct: 149 CVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREENGEGLILGKHNKGC 208

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANA 238
           HCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANA
Sbjct: 209 HCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHTNNMAYIQQAANA 268

Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSV- 297
           AI GA+GSSGY +   S KRK +ELLFG   KD S  R    QQ  +++ S    S S  
Sbjct: 269 AITGAIGSSGYVSPPISKKRKGQELLFGPTTKDPSFHRLGHFQQAVHIRPSTSSSSLSSN 328

Query: 298 PVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTER 357
           P+++  +++ LG +K   RS LADI+QPQD+KELCS+LV++S EA KTL  +  A +   
Sbjct: 329 PIARAGSSATLGPSKFTYRSLLADIIQPQDLKELCSVLVVLSGEAAKTLAGQRNATENWV 388

Query: 358 EG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
           E  +  S+AS  QE   S+KG    N I  D  S NQ D  G   S SDG D+   RP+S
Sbjct: 389 EDQTEPSLASSPQERLQSQKGAGAGNSIAHDCSSANQADKTGHGSSSSDGVDVPKGRPMS 448

Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASK---STVQKSNAYASTRIYAEQERLILTRFRDFL 473
           PGT ALMCDEED+ FM A    GL  +   +T Q       T IY EQER++LT+FRD L
Sbjct: 449 PGTLALMCDEEDTMFMTAASPNGLTGRGCSTTSQFPCGQGMTEIYTEQERIVLTKFRDCL 508

Query: 474 NRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTL 533
           NRLIT G IKET CS   ++E  + +  V +GT+ +  EV     P   G  +  +  T 
Sbjct: 509 NRLITFGEIKETKCSSFARTELGNPKDPVSNGTTNARAEVGNQQVPYSNGVGRHVIPPTS 568

Query: 534 EPSCTTTSVSSRQNNDL 550
           + S    S  +  +NDL
Sbjct: 569 KTSQMVGSTVATSSNDL 585


>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
          Length = 592

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 280/537 (52%), Positives = 347/537 (64%), Gaps = 42/537 (7%)

Query: 26  HPVHKLPLP--TLPPG----KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           HPV  +P+P  T  P     K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKL    
Sbjct: 65  HPV-TMPVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKL---- 119

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YCECFA+GIYCDGCNCVNCHNNVE+E AR+EAV  
Sbjct: 120 -------------------------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEV 154

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
           TLERNPNAFRPKIASSPHGA+D+RE++ EA +  KHNKGCHCKKSGCLKKYCECFQANIL
Sbjct: 155 TLERNPNAFRPKIASSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANIL 214

Query: 200 CSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 258
           CSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSSG+G+   S KR
Sbjct: 215 CSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKR 274

Query: 259 KSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPCSQSVPVSQTANASVLGSTKSALRS 317
           K +EL +G  +KD S+ R  Q QQ +  + +       S PVS+  N++  G++K   RS
Sbjct: 275 KGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRS 334

Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQT-EREGSRSSVASPIQEGEDSRK 376
            LADI+QPQD+KELCS+LV+VS EA +T   K   ++    E   +S AS  Q+   S+K
Sbjct: 335 LLADIIQPQDLKELCSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQK 394

Query: 377 GHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGP 436
             ++   + DD  S NQ D  G   S SDGGD+   RP+SPGT ALMCDE+D+ FM A  
Sbjct: 395 ESDIEKAVADDCSSRNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAAS 454

Query: 437 QTGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKS 493
              L   S   + Q       T  YAEQER+ILT+FRD LNRLIT G IKET CS L ++
Sbjct: 455 PNVLMGHSCNTSSQLPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLART 514

Query: 494 ETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDL 550
           E+  Q +   +G + + T+      P   G +K       + S T T+ ++  NN L
Sbjct: 515 ESGGQNEPFSNGPANARTDTGNQQVPVSNGVSKPPFPPATKMSQTVTAAAASPNNTL 571


>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
          Length = 524

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 274/523 (52%), Positives = 341/523 (65%), Gaps = 35/523 (6%)

Query: 34  PTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAH 93
           P++   K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKL                  
Sbjct: 10  PSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKL------------------ 51

Query: 94  LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
                      YCECFA+GIYCDGCNCVNCHNNVE+E AR+EAV  TLERNPNAFRPKIA
Sbjct: 52  -----------YCECFASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIA 100

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           SSPHGA+D+RE++ EA +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFE
Sbjct: 101 SSPHGARDSREESGEALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFE 160

Query: 214 GSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQ 272
           GSEER+ALFHGDH N +A++Q+AANAAI GA+GSSG+G+   S KRK +EL +G  +KD 
Sbjct: 161 GSEERQALFHGDHANSMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDP 220

Query: 273 SVIRNPQSQQGNNVQ-NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKEL 331
           S+ R  Q QQ +  + +       S PVS+  N++  G++K   RS LADI+QPQD+KEL
Sbjct: 221 SLHRLAQFQQASLTKASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKEL 280

Query: 332 CSLLVMVSSEATKTLTKKVGAMQT-EREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
           CS+LV+VS EA +T   K   ++    E   +S AS  Q+   S+K  ++   + DD  S
Sbjct: 281 CSVLVVVSEEAARTFADKRNLVEKPAEEQVETSRASSTQDRLQSQKESDIEKAVADDCSS 340

Query: 391 GNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS---TVQ 447
            NQ D  G   S SDGGD+   RP+SPGT ALMCDE+D+ FM A     L   S   + Q
Sbjct: 341 RNQADKLGPDDSSSDGGDVPKGRPMSPGTLALMCDEQDTMFMAAASPNVLMGHSCNTSSQ 400

Query: 448 KSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTS 507
                  T  YAEQER+ILT+FRD LNRLIT G IKET CS L ++E+  Q +   +G +
Sbjct: 401 LPYGQGITEAYAEQERIILTKFRDCLNRLITFGEIKETKCSSLARTESGGQNEPFSNGPA 460

Query: 508 RSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDL 550
            + T+      P   G +K       + S T T+ ++  NN L
Sbjct: 461 NARTDTGNQQVPVSNGVSKPPFPPATKMSQTVTAAAASPNNTL 503


>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
          Length = 676

 Score =  500 bits (1287), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 354/527 (67%), Gaps = 46/527 (8%)

Query: 18  VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
           V +V R+ +P      PTL   KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKL  
Sbjct: 151 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKL-- 204

Query: 78  VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
                                      YCECFA+G+YC+ C+C NC NNV +E AR+EAV
Sbjct: 205 ---------------------------YCECFASGVYCNNCHCSNCQNNVVNEAARKEAV 237

Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
           G+TL+RNPNAFRPKIA+SPHG+ D  E+A +  L  KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 238 GSTLDRNPNAFRPKIANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQAN 297

Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 257
           ILCSENC+C+DCKNFEGSEERRALFH + N +A++Q+AANAAI GA+GSSGYGT + S K
Sbjct: 298 ILCSENCKCMDCKNFEGSEERRALFHDNGNYMAYIQQAANAAISGAIGSSGYGTAVASKK 357

Query: 258 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 317
           RK +++ F   A +QS  RN Q QQ N+++ SAV  S S+ V+ T NA+VLGS+K   RS
Sbjct: 358 RK-QQVSFCTTANNQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRS 416

Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKG 377
           PLADILQ QD+K+LCSLLV+VS EA KTLT      + ER+   +S+ S  QE E  +  
Sbjct: 417 PLADILQLQDVKDLCSLLVVVSQEAGKTLT------EAERDHIETSIISSTQEREHLQTD 470

Query: 378 HEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQ 437
            +V   +PD   S NQ  V+       D  D+ N R +SPGT ALMCDE+DS FM  G  
Sbjct: 471 -DVQKPLPDACLSENQ--VERVQTDELDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSP 527

Query: 438 TGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
            G+   S   TV+ S+  A T +Y EQERL+LT+F DFLNRLITCGSIKETMCSPL KS 
Sbjct: 528 DGVVGSSQIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSH 587

Query: 495 TRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTS 541
           T +QQ+ +ED  ++ GT    + EP G    K + + T + S T  +
Sbjct: 588 TGNQQEPLEDDATKVGTGKGNHKEPNGNSIVKPSFAATTQMSQTIAA 634


>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
          Length = 559

 Score =  499 bits (1285), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 266/496 (53%), Positives = 325/496 (65%), Gaps = 41/496 (8%)

Query: 27  PVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWI 82
           PV  LPLP   P    GK ESP+SR R N + KD TPKKQKQCNC++S+CLKL       
Sbjct: 52  PVAVLPLPLQAPPARVGKPESPKSRSRPNFEMKDATPKKQKQCNCKHSKCLKL------- 104

Query: 83  NIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 142
                                 YCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLE
Sbjct: 105 ----------------------YCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLE 142

Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
           RNPNAFRPKIASSPHG +D+REDA E  +  KHNKGCHCKKSGCLKKYCECFQANILCSE
Sbjct: 143 RNPNAFRPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSE 202

Query: 203 NCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 262
           NC+C+DCKNFEGSEER+ALFHGD N +A++Q+AANAAI GA+GSSGY +   S KRK +E
Sbjct: 203 NCKCMDCKNFEGSEERQALFHGDQNNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQE 262

Query: 263 LLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQT-ANASVLGSTKSALRSPLAD 321
           L F    KD S+ +    QQ N+V+  A P S   PVS      + LG +K   RS LAD
Sbjct: 263 LFFWPTTKDPSISK--LGQQVNHVRGPA-PSSSLSPVSGARVGTATLGPSKFMYRSLLAD 319

Query: 322 ILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEV 380
           I+QPQ +KELCS+LV+VS +A KTLT +   M+ + E  + +S+AS  QE   S+K   +
Sbjct: 320 IIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETSLASSSQEQLPSQKEGHI 379

Query: 381 HNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFM-QAGPQTG 439
              + DD  S NQ D      S SDG D+   RP+SPGT ALMCDE+D+ FM  A P   
Sbjct: 380 EKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALMCDEQDTMFMTTASPIAP 439

Query: 440 LASKSTVQKSNAYAS--TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRS 497
           +A          Y    T +Y EQER++LT+FRDFLNR+IT G I ET CS L ++E  S
Sbjct: 440 MAHACNTSSQFPYGQGMTEVYPEQERIVLTKFRDFLNRVITMGEINETKCSSLTRNELES 499

Query: 498 QQKAVEDGTSRSGTEV 513
           Q+  + +G   + +E+
Sbjct: 500 QKDPIINGIGNASSEI 515


>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
          Length = 559

 Score =  496 bits (1277), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 273/530 (51%), Positives = 337/530 (63%), Gaps = 42/530 (7%)

Query: 11  WQARPQHVRMVNRVPHPVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCN 66
           +   P+H ++      PV  LPL    P    GK ESP+SR R N + KD TPKKQKQCN
Sbjct: 34  FTGAPEHPQLSQPPQLPVAVLPLQPQAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCN 93

Query: 67  CRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNN 126
           C++S+CLKL                             YCECFA+GIYCDGCNCVNC NN
Sbjct: 94  CKHSKCLKL-----------------------------YCECFASGIYCDGCNCVNCFNN 124

Query: 127 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 186
           VE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+A E  +  KHNKGCHCKKSGC
Sbjct: 125 VENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREEAGEVLILGKHNKGCHCKKSGC 184

Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAAIIGAVG 245
           LKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N +A++Q+AANAAI GA+G
Sbjct: 185 LKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNMAYIQQAANAAITGAIG 244

Query: 246 SSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQT-AN 304
           SSGY +   S KRK +EL F    KD S+ +    QQ N+V+  A P S   PVS     
Sbjct: 245 SSGYSSPPVSKKRKGQELFFWPTTKDPSISK--LGQQVNHVRGPA-PSSSLSPVSGARVG 301

Query: 305 ASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSS 363
            + LG +K   RS LADI+QPQ +KELCS+LV+VS +A KTLT +   M+ + E  + +S
Sbjct: 302 TATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKILMEKQAEDQTETS 361

Query: 364 VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALM 423
           +AS  QE   S+K   V   + DD  S NQ D      S SDG D+   RP+SPGT ALM
Sbjct: 362 LASSSQEQLPSQKEGRVEKTVADDCSSANQTDKISPDNSSSDGADVPKGRPMSPGTLALM 421

Query: 424 CDEEDSAFMQAG-PQTGLASKSTVQKSNAYAS--TRIYAEQERLILTRFRDFLNRLITCG 480
           CDE+D+ FM A  P   +A          Y    T +YAEQER++LT+FRDFLNR+IT G
Sbjct: 422 CDEQDTMFMTAASPIAPMAHACNTSSQFPYGQEMTEVYAEQERIVLTKFRDFLNRVITMG 481

Query: 481 SIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVS 530
            I ET CS L ++E  SQ+  + +G   + +E+         G  K AVS
Sbjct: 482 EINETKCSSLARNELESQKDPIINGIGNTSSEIVLQHGTTSNGVTKPAVS 531


>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
          Length = 561

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 272/513 (53%), Positives = 328/513 (63%), Gaps = 40/513 (7%)

Query: 25  PHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIW 81
           P P   LPLP   ++  GK ESP+SRPR   + K+ TPKKQKQCNC++S+CLKL      
Sbjct: 52  PQPPVILPLPPSASVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKL------ 105

Query: 82  INIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 141
                                  YCECFA+GIYCDGCNCVNC+NNVE+E AR+EAV ATL
Sbjct: 106 -----------------------YCECFASGIYCDGCNCVNCYNNVENEAARREAVEATL 142

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           ERNPNAFRPKIASSPHG +D+REDA E  +  KHNKGCHCKKSGCLKKYCECFQANILCS
Sbjct: 143 ERNPNAFRPKIASSPHGTRDSREDAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCS 202

Query: 202 ENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKS 260
           ENC+C+DCKNFEGSEER+ALFHGD +N + ++Q+AANAAI GA+GSSGY +   S KRK 
Sbjct: 203 ENCKCMDCKNFEGSEERQALFHGDQNNNMVYIQQAANAAITGAIGSSGYSSPPISKKRKG 262

Query: 261 EELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLA 320
           +ELLFG   KD SV R  Q QQ NNV+  A   S S         + LG +K   RS LA
Sbjct: 263 QELLFGPTVKDPSVGR--QGQQANNVRVPAPSSSLSPIPGARVGPATLGPSKLMYRSLLA 320

Query: 321 DILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHE 379
           DI+QPQ +KELCS+LV+VS +A K  T     M    E  + +S AS  QE   S+K   
Sbjct: 321 DIIQPQHLKELCSVLVLVSGQAAKMFTDHKNFMDKHAEDQTETSRASSTQEQLPSQKEAN 380

Query: 380 VHNGIPDDR-QSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQT 438
           V   + DD   S NQ D      S SDG D+   RP+SPGT ALMCDE+D+ FM A    
Sbjct: 381 VEKAMADDDCSSANQTDNISPDNSCSDGADVPKGRPMSPGTLALMCDEQDTMFMTAASTV 440

Query: 439 G---LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
           G    A  +++Q       T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE 
Sbjct: 441 GSRAHACNTSLQPPYGQVMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSEL 500

Query: 496 RSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSA 528
            SQ+    + T  + TE +        G AK+A
Sbjct: 501 ESQKDPNINCTGNASTETQHQQGATNNGVAKTA 533


>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
          Length = 579

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 280/527 (53%), Positives = 347/527 (65%), Gaps = 62/527 (11%)

Query: 18  VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
           V +V R+ +P      PTL   KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKL  
Sbjct: 70  VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKL-- 123

Query: 78  VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
                                      YCECFA+G+YC+ C+C NC NNV +E AR+EAV
Sbjct: 124 ---------------------------YCECFASGVYCNNCHCSNCQNNVVNEAARKEAV 156

Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
           G+TL+RNPNAFRPKIA+SPHG+ D  E+A +  L  KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 157 GSTLDRNPNAFRPKIANSPHGSHDGGEEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQAN 216

Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 257
           ILCSENC+C+DCKNFEGSEERRALFH + N +A++Q+AANAAI GA+GSSGYGT + S K
Sbjct: 217 ILCSENCKCMDCKNFEGSEERRALFHDNGNYMAYIQQAANAAISGAIGSSGYGTAVASKK 276

Query: 258 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 317
           RK +++ F   A +QS  RN Q QQ N+++ SAV  S S+ V+ T NA+VLGS+K   RS
Sbjct: 277 RK-QQVSFCTTANNQSNHRNTQFQQDNHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRS 335

Query: 318 PLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKG 377
           PLADILQ QD+K+LCSLLV+VS EA KTL        TERE  ++               
Sbjct: 336 PLADILQLQDVKDLCSLLVVVSQEAGKTL--------TEREHLQTD-------------- 373

Query: 378 HEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQ 437
            +V   +PD   S NQ  V+       D  D+ N R +SPGT ALMCDE+DS FM  G  
Sbjct: 374 -DVQKPLPDACLSENQ--VERVQTDELDQADVQNGRLMSPGTLALMCDEQDSIFMVRGSP 430

Query: 438 TGLASKS---TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
            G+   S   TV+ S+  A T +Y EQERL+LT+F DFLNRLITCGSIKETMCSPL KS 
Sbjct: 431 DGVVGSSQIPTVKSSHGQACTEVYVEQERLVLTKFWDFLNRLITCGSIKETMCSPLAKSH 490

Query: 495 TRSQQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTS 541
           T +QQ+ +ED  ++ GT    + EP G    K + + T + S T  +
Sbjct: 491 TGNQQEPLEDDATKVGTGKGNHKEPNGNSIVKPSFAATTQMSQTIAA 537


>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
 gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
          Length = 682

 Score =  470 bits (1210), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 272/521 (52%), Positives = 339/521 (65%), Gaps = 63/521 (12%)

Query: 11  WQARP--QH--VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCN 66
           +Q RP  QH  V +  R+PHPVHK+    +P GKQ+SP SR  AN    DGTPKK KQCN
Sbjct: 60  YQERPVSQHEAVAVAQRIPHPVHKIS--AMPRGKQDSPGSR-LAN----DGTPKKPKQCN 112

Query: 67  CRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNN 126
           C+NS CLKL                             YCECFAAGI+C+GCNC+NC NN
Sbjct: 113 CKNSMCLKL-----------------------------YCECFAAGIHCNGCNCLNCRNN 143

Query: 127 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 186
           VE+E  R+EAV  TL+RNPNAFRPKIASSPHG++D  EDA+E Q+  KHNKGC+CK+SGC
Sbjct: 144 VENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQMLGKHNKGCNCKRSGC 203

Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 246
           LKKYCECFQ NI CSENC+CL+CKNFEGS+ERRALFHG  +  A+ Q+A NAAI GA+GS
Sbjct: 204 LKKYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHATAYAQQATNAAISGAIGS 263

Query: 247 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ-SVPVSQTANA 305
           SGYGT + S KRKSEE LFG+A KDQ      +  QGN ++NSA   S  SVPVS+TAN 
Sbjct: 264 SGYGTTLASKKRKSEE-LFGIATKDQPAHSTAKFSQGNTLRNSAAFSSPLSVPVSRTANG 322

Query: 306 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSS 363
           +VLGSTK   RSPLA +L+PQ +KE+C+LLV +S +AT+ L  KVG M  Q+ERE   + 
Sbjct: 323 AVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEALAGKVGKMDTQSERENIETY 382

Query: 364 VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALM 423
            ASP QE E S KG      +PDD  +GN+ + DG+     D GD+ N RP SP    LM
Sbjct: 383 SASPTQEKESSLKGWNAIEHMPDDCLNGNKAEKDGS----IDSGDVENARPSSPDID-LM 437

Query: 424 CDEEDSAFMQAGPQTGLAS--KSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCG 480
           C EE+  FM+ G   G+A   +S  QK  + Y  ++++AEQE++ILT F D LN + T G
Sbjct: 438 CHEEEMMFMEMGSPIGVARLCQSKTQKPDDGYECSQLHAEQEKIILTSFLDILNTVSTYG 497

Query: 481 SIKETMC------SPLVKSETRSQQKAVEDGTSRSGTEVRG 515
           SIK  +C       P V+   ++    +E    R+G   RG
Sbjct: 498 SIK--VCQSGNQKEPAVRDTVKT---GIETVNHRNGYHNRG 533


>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
 gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
          Length = 519

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 250/451 (55%), Positives = 303/451 (67%), Gaps = 39/451 (8%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           K ESP+SRPR N + KDGTPKKQKQCNC++SRCLKL                        
Sbjct: 90  KPESPKSRPRLNAELKDGTPKKQKQCNCKHSRCLKL------------------------ 125

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG 
Sbjct: 126 -----YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGT 180

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D+RE+  E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+
Sbjct: 181 RDSREETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQ 240

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           ALFHGDH N +A++Q+AANAAI GA+GSSGY +L  S KRK +EL FG   KD S  R  
Sbjct: 241 ALFHGDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLG 300

Query: 279 QSQQGNNVQNSAVPCS--QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLV 336
             QQ ++++  A P S   S PV++  N + LGS+K   RS LADI+Q QD+KELCS+LV
Sbjct: 301 HFQQASHIR-PAAPSSPLPSNPVARAGNTTTLGSSKITYRSLLADIIQSQDLKELCSVLV 359

Query: 337 MVSSEATKTLTKKVGAMQTEREGSRSS-VASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD 395
           ++S EA KTL+ +  +M+   E  R + +AS  QE   S+K  +    + +D  S N   
Sbjct: 360 VLSGEAAKTLSDQRNSMKKRVEDQRDTFLASSTQERLQSQKEIDADKIVSNDCSSANHE- 418

Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASK---STVQKSNAY 452
             G   S  DG DM   RP+SPGT ALMCDE+D+ FM A    GL      S+ Q     
Sbjct: 419 -GGLDDSSLDGVDMPKGRPMSPGTMALMCDEQDTIFMAAASPNGLMGHGCDSSSQLPCGQ 477

Query: 453 ASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
             T  +AEQER++L +FRD LNRLIT G IK
Sbjct: 478 GMTEAHAEQERIVLMKFRDCLNRLITFGEIK 508


>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
 gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
          Length = 545

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 248/468 (52%), Positives = 301/468 (64%), Gaps = 39/468 (8%)

Query: 33  LPTLPPG--KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSP 90
           L  LPP   K ESP+S P  NT+ KDGTPKKQ+QCNC++SRCLKL               
Sbjct: 102 LRLLPPMFRKPESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKL--------------- 146

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
                         YCECFA+G YCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRP
Sbjct: 147 --------------YCECFASGTYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRP 192

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KIASSPHG +D+RE+  +  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCK
Sbjct: 193 KIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 252

Query: 211 NFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAA 269
           NFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSSGY +   S KRK +EL FG   
Sbjct: 253 NFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGSSGYASPPVSRKRKGQELFFGHTV 312

Query: 270 KDQSVIRNPQSQQGNNVQN---SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQ 326
           KDQS  R    QQ N       +      S P+++  NA  LG +K   RS LADI+QPQ
Sbjct: 313 KDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIARAGNAITLGPSKITYRSLLADIIQPQ 372

Query: 327 DMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSS-VASPIQEGEDSRKGHEVHNGIP 385
           D+KELCS+LV++S EA KT + +  +M+   E  R + +AS  QE   S K  +    + 
Sbjct: 373 DLKELCSVLVVLSGEAAKTFSDQRNSMEKRVEDQRETLLASSTQERLQSHKESDADKIVA 432

Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLAS--- 442
           +D  S N  D  G   S SDG DM   RP+SPGT  LMCDE+D+  M A   +GL     
Sbjct: 433 NDCSSANHADKVGPDDSSSDGADMPKGRPMSPGTLELMCDEQDTMLMAAASPSGLMGHGC 492

Query: 443 KSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL 490
            ++ Q          +AEQER++LT+FRD LNRLIT G IK     P 
Sbjct: 493 NTSSQLPCGQGMAEAHAEQERIVLTKFRDCLNRLITFGEIKGKFLFPF 540


>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
          Length = 602

 Score =  457 bits (1175), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 257/537 (47%), Positives = 329/537 (61%), Gaps = 57/537 (10%)

Query: 30  KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLDFVEIWINIYS 86
           K PLP  P  K +SP+SR R+N + KD TPKKQKQC   NC++SRCLKL           
Sbjct: 94  KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKL----------- 140

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YCECFA+G+YCDGCNC NCHNNVEHE +R+EAV  TLERNPN
Sbjct: 141 ------------------YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPN 182

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AFRPKIA+SPHG +++R++  E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 183 AFRPKIANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 242

Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           +DCKNFEGSEER+ALFHGDH N IA++Q+AANAAI GA+GSSGY  L  S KRK  EL F
Sbjct: 243 MDCKNFEGSEERQALFHGDHANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGF 302

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
           G   KD  +   PQ  Q NN+ +S +  S   P +   +    G +K + RS LAD++QP
Sbjct: 303 GPVGKDSPLNSIPQLHQANNIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQP 362

Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
            D+KELCS+LV+++SE  K        +Q E+      +  P Q    S  G  V +   
Sbjct: 363 NDLKELCSVLVVLTSEVAK--------IQAEQRSDEKQINDPQQTSRASSSGDGVQHQKA 414

Query: 386 DDRQ------SGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
           +++       S NQ D      S S+  DM   RP+SPGT ALMCDE+D+ FM AG   G
Sbjct: 415 EEKAADGECGSSNQSDRSVHDSSNSNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADG 474

Query: 440 LASKSTVQKSNA--YASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRS 497
            A+      S+    + + +Y EQER++LT+FRD LN+LIT G IKET  +   +SE  +
Sbjct: 475 SAAHDCNTSSHVPDRSLSEVYIEQERIVLTKFRDCLNKLITLGEIKETKLTSRSESEVGN 534

Query: 498 QQKAVEDGTSRSGTEVRGYMEPCGYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRV 554
           Q ++  + TS SG + R          A SA  +   P+CT    +   NNDLP+++
Sbjct: 535 QNQS-NNFTSNSGCQQRSISNGVVKNVALSAPRIP--PACT---AAHHPNNDLPLKI 585


>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
 gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
 gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
 gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
          Length = 629

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 270/562 (48%), Positives = 342/562 (60%), Gaps = 70/562 (12%)

Query: 6   QAMPQWQARPQHVRMVNRVPHPVHKLPLP-----------TLPPGKQESPRSRPRANTDG 54
           Q+ PQ Q   Q   MV + P  V  LP+P           ++  GKQESP+ R R N + 
Sbjct: 61  QSKPQQQVVTQGSVMVQK-PVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPRSRPNFEV 119

Query: 55  KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
           K+GTPKKQ+QCNC++S+CLKL                             YCECFA+GIY
Sbjct: 120 KEGTPKKQRQCNCKHSKCLKL-----------------------------YCECFASGIY 150

Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR---------ED 165
           CDGCNCVNC NNV++E AR+EAV ATLERNPNAFRPKIASSP GA+D+R         E+
Sbjct: 151 CDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPQGARDSRVSRSSILWQEE 210

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
           A E  +  KH+KGCHCKKSGCLKKYCECFQAN+LCSENCRC+DCKNFEGSEER+ALF GD
Sbjct: 211 AGEGLILIKHHKGCHCKKSGCLKKYCECFQANVLCSENCRCMDCKNFEGSEERQALFRGD 270

Query: 226 HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ------ 279
            N   ++Q+AANAAI GA+GS G+ +   S KRK +EL     AKD S+ +  Q      
Sbjct: 271 QNNNVYLQQAANAAITGAIGSYGFSSPPASKKRKGQELFLWPTAKDPSISKPGQQVKFAY 330

Query: 280 --SQQG---NNVQNSAVPCSQSVPVSQTANASVLGSTKSAL--RSPLADILQPQDMKELC 332
             S+ G   N V+  A   S S   S       LG + S L  RS L+D++QP  +KELC
Sbjct: 331 CTSRTGTGVNLVKGPAPSSSASPVSSARGTNPTLGQSPSKLKYRSLLSDVVQPHHLKELC 390

Query: 333 SLLVMVSSEATKTLT--KKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
           S+LV+VS +A KTL   KK    +TE + + +S+AS  QE   S+K  +V   + DD  S
Sbjct: 391 SVLVLVSGQAAKTLADQKKTVEKRTEDQ-TETSLASSTQEQLLSQKEVDVEKAMDDDCSS 449

Query: 391 GNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGP---QTGLASKSTVQ 447
            NQ D      S SDG D + +RP+SPGT ALMCDE+DS FM A     QT  A  ++ Q
Sbjct: 450 ANQTDKISPGNSCSDGAD-VPKRPMSPGTLALMCDEQDSMFMTAASPIGQTTHACNTSSQ 508

Query: 448 KSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTS 507
             +    T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE  +++  + + T 
Sbjct: 509 FPDGQGVTEVYAEQERIVLTQFRDFLNRVITMGEINETKCSSLARSELENKKDLINNETG 568

Query: 508 RSGTEVRGYMEPCGYGSAKSAV 529
            + TE     E    G AK+A+
Sbjct: 569 NASTETVHQQEATSNGDAKAAI 590


>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
          Length = 551

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 259/467 (55%), Positives = 307/467 (65%), Gaps = 37/467 (7%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           GK ESP+SRPR   + K+ TPKKQKQCNC++S+CLKL                       
Sbjct: 71  GKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKL----------------------- 107

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPH 
Sbjct: 108 ------YCECFASGIYCDGCNCVNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHR 161

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
            +D REDA E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 162 TRDIREDAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 221

Query: 219 RALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
           +ALFHGD +N + ++Q+AANAAI GA+GSSGY +   S KRK +ELLFG   KD SV R 
Sbjct: 222 QALFHGDQNNNLVYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR- 280

Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
            Q QQ NNV+  A   S S         + LG +K   RS LADI+QPQ +KELCS+LV+
Sbjct: 281 -QGQQANNVRVPAPSSSLSPIPGARVGPATLGPSKLMYRSLLADIIQPQHLKELCSVLVL 339

Query: 338 VSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDR-QSGNQMD 395
           VS +A KT T +   M    E  + +S AS  QE   S+K   V   + DD   S NQ D
Sbjct: 340 VSGQAAKTFTDQKNLMDKHAEDQTETSRASSTQEQLPSQKEANVDKAMADDDCSSANQTD 399

Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGL---ASKSTVQKSNAY 452
                 S SDG D    RP+SPGT ALMCDE+D+ FM A    G    A  +++Q     
Sbjct: 400 NISLDNSCSDGADAPKGRPMSPGTLALMCDEQDTMFMTAASPVGSRAHACNTSLQPPYGQ 459

Query: 453 ASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQ 499
             T +YAEQER++LT+FRDFLNR+IT G I ET CS L +SE  SQ+
Sbjct: 460 VMTEVYAEQERIVLTKFRDFLNRVITMGEINETKCSSLARSELESQK 506


>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
           [Cucumis sativus]
          Length = 556

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 237/468 (50%), Positives = 296/468 (63%), Gaps = 51/468 (10%)

Query: 30  KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLDFVEIWINIYS 86
           K PLP  P  K +SP+SR R+N + KD TPKKQKQC   NC++SRCLKL           
Sbjct: 94  KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKL----------- 140

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YCECFA+G+YCDGCNC NCHNNVEHE +R+EAV  TLERNPN
Sbjct: 141 ------------------YCECFASGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPN 182

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AFRPK A+SPHG +++R++  E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 183 AFRPKXANSPHGTRESRDEIGELVMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 242

Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           +DCKNFEGSEER+ALFHGDH N IA++Q+AANAAI GA+GSSGY  L  S KRK  EL F
Sbjct: 243 MDCKNFEGSEERQALFHGDHANNIAYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGF 302

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
           G   KD  +   PQ  Q NN+ +S +  S   P +   +    G +K + RS LAD++QP
Sbjct: 303 GPVGKDSPLNSIPQLHQANNIMSSGMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQP 362

Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
            D+KELCS+LV+++SE  K        +Q E+      +  P Q    S  G  V +   
Sbjct: 363 NDLKELCSVLVVLTSEVAK--------IQAEQRSDEKQINDPQQTSRASSSGDGVQHQKA 414

Query: 386 DDRQ------SGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
           +++       S NQ D      S S+  DM   RP+SPGT ALMCDE+D+ FM AG   G
Sbjct: 415 EEKAADGECGSSNQSDRSVHDSSNSNSSDMTRARPMSPGTLALMCDEQDTMFMGAGLADG 474

Query: 440 LASKSTVQKSNA--YASTRIYAEQERLILTRFRDFLNRLITCGSIKET 485
            A+      S+    + + +Y EQER++LT+FRD LN+LIT G IKET
Sbjct: 475 SAAHDCNTSSHVPDRSLSEVYIEQERIVLTKFRDCLNKLITLGEIKET 522


>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
          Length = 515

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 241/480 (50%), Positives = 303/480 (63%), Gaps = 51/480 (10%)

Query: 11  WQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNS 70
           +  RP+ V  V R+PHPV K P+  L   K ESPRS+PR N + KD TPKKQKQCNC+NS
Sbjct: 79  YSPRPKLVSPVRRIPHPVVKFPVKVLQVVKPESPRSQPRCNVELKDTTPKKQKQCNCKNS 138

Query: 71  RCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHE 130
           RCLKL                             YCECFAAGIYCDGCNCVNCHNNV++E
Sbjct: 139 RCLKL-----------------------------YCECFAAGIYCDGCNCVNCHNNVDNE 169

Query: 131 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 190
            ARQEAVG TLERNPNAFRPKIASSP   +D++E   E ++  KHNKGCHCKKSGCLKKY
Sbjct: 170 AARQEAVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHNKGCHCKKSGCLKKY 227

Query: 191 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYG 250
           CECFQANILCSENC+C+DCKNFEGS+ER A+FH D+N +  M++AANA I GAVGSSGYG
Sbjct: 228 CECFQANILCSENCKCMDCKNFEGSDERIAIFHKDYN-LVHMKQAANATISGAVGSSGYG 286

Query: 251 TLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGS 310
           T +T  KRK++E+  G +A DQ+V    Q  Q  +   S+        +S  +N  + GS
Sbjct: 287 THITPKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSDSFISDPSNTRISGS 346

Query: 311 TKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSSVASPI 368
           ++S  RS LAD+L+P ++K LCSL V++S  A KT  +  G +  QT+     +SVAS  
Sbjct: 347 SRSTYRSVLADVLKPNNVKNLCSLFVVLSRVAAKTNAEMRGKVDQQTKTGNFEASVASSA 406

Query: 369 QEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEED 428
           +  +++R   +VH    DD  + ++ +    +            RPLSP T ALMCDE+D
Sbjct: 407 KSLQETR---DVHQPACDDHVNKDEAEAVDVAHYN---------RPLSPETLALMCDEQD 454

Query: 429 SAFMQAGPQTGLASKST----VQKS-NAYASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
                     G+A  ST    +QKS N+   T +Y EQERLILT+F D L  L+T GSIK
Sbjct: 455 DMLFGNCSADGVACNSTFQNMIQKSFNSDGCTDVYREQERLILTKFLDILRGLVTHGSIK 514


>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
 gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
 gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
 gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
 gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
 gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 241/485 (49%), Positives = 300/485 (61%), Gaps = 51/485 (10%)

Query: 42  ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           ESP++R PR N +G+DGTP+K+KQCNC++SRCLKL                         
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKL------------------------- 144

Query: 101 NLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
               YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG +
Sbjct: 145 ----YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGR 200

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           D RED  E  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER+A
Sbjct: 201 DKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEERQA 260

Query: 221 LFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN-P 278
           LFHG+H N +A++Q+AANAAI GAVGSSG+       +RK +E+LF  A KD S + + P
Sbjct: 261 LFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQAIKDSSRLSHFP 320

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q   G          +   PVS+    +    +K   RS LADI+QP D++ LCS+LV V
Sbjct: 321 QVNNGRT--GGPTSGTSPSPVSRAGGNASSVPSKFVYRSLLADIIQPHDVRALCSVLVTV 378

Query: 339 SSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
           + EA KT T K   ++   +  + +S+AS  Q   D  +G+     +       NQ D  
Sbjct: 379 AGEAAKTSTDKRNEIENRVDDQTETSLASSAQ---DQPQGNNNAADVEMVATDHNQADKS 435

Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
           G   S SDG D     PLSP T ALMCDE+D+ FM A P    +      + N+   + I
Sbjct: 436 GPEESNSDGVDASKVTPLSPATLALMCDEQDTIFMVAAPSPNGSVDPNGCRPNSQGQSEI 495

Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-VED 504
           YAEQERL+LT+FRD LNRLI+   IKE+ C  L            VK+E   QQ+  + +
Sbjct: 496 YAEQERLVLTKFRDCLNRLISYAEIKESKCLSLARMHIQPPAIATVKTENGIQQQVPIVN 555

Query: 505 GTSRS 509
           G SR+
Sbjct: 556 GASRT 560


>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 601

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 246/490 (50%), Positives = 303/490 (61%), Gaps = 51/490 (10%)

Query: 40  KQESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           K ESP++R PR   +G+DGTP+K+KQCNC++SRCLKL                       
Sbjct: 104 KPESPKARGPRPIVEGRDGTPQKKKQCNCKHSRCLKL----------------------- 140

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG
Sbjct: 141 ------YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHG 194

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
            +D RED  E  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER
Sbjct: 195 VRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEER 254

Query: 219 RALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVI-R 276
           +ALFHG+H N +A++Q+AANAAI GAVGSSG+       +RK +E+LF  A KD S + +
Sbjct: 255 QALFHGEHANHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQATKDSSRLGQ 314

Query: 277 NPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLV 336
            PQ   G          S S       NAS   S K   RS LADI+QP D++ LCS+LV
Sbjct: 315 FPQVNSG-RASGPTSGSSPSPVSRAGGNASSAPS-KFVYRSLLADIIQPHDVRALCSVLV 372

Query: 337 MVSSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD 395
            V+ EA KT T K   ++   E  + +S+AS  Q   D  +G      +      GNQ D
Sbjct: 373 AVAGEAAKTSTDKRNEIENRVEDQTETSLASSAQ---DQPQGDNDAADMEMVATDGNQAD 429

Query: 396 VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYAST 455
             G   S SDG D     PLSP T ALMCDE+D+ FM A P    A     +++N+   +
Sbjct: 430 KSGAEESNSDGADASKGNPLSPATLALMCDEQDTIFMVAAPSPNGAVDPGGRRTNSQGQS 489

Query: 456 RIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-V 502
            IYAEQERL+LT+FRD L+RLI+   IKE+ C  L            VK+E   QQ+  +
Sbjct: 490 EIYAEQERLVLTKFRDCLSRLISYAEIKESKCLSLARMHIQPSATATVKTENGVQQQVPI 549

Query: 503 EDGTSRSGTE 512
            +G SR+ ++
Sbjct: 550 VNGASRTNSQ 559


>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
          Length = 656

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 238/491 (48%), Positives = 302/491 (61%), Gaps = 53/491 (10%)

Query: 31  LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSP 90
           L    +   K +SPRS+P+ N + KD TPKKQKQCNC+NSRCLKL               
Sbjct: 170 LNFSAIKIVKPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKL--------------- 214

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
                         YCECFAAGIYCDGCNCVNCHNNV++E ARQEAVG TLERNPNAFRP
Sbjct: 215 --------------YCECFAAGIYCDGCNCVNCHNNVDNEAARQEAVGITLERNPNAFRP 260

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KIASSP   +D++E   E Q+  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCK
Sbjct: 261 KIASSPLERRDSKE--CEIQVIGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 318

Query: 211 NFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAK 270
           NFEGS+ERRA+FH D+N +  +++AANA I    GSSGYGT +T  KRK  E+  G +A 
Sbjct: 319 NFEGSDERRAIFHKDYN-LVHIKQAANATISRVAGSSGYGTHLTPKKRKIHEMFPGKSAM 377

Query: 271 DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKE 330
           DQ+     Q QQ  +   S+        VS  +N  +  S++S  RS LAD  QPQ++K 
Sbjct: 378 DQTDNITAQYQQEIDPMASSPSFLSDSFVSDPSNTRISASSRSTYRSVLADFFQPQNVKN 437

Query: 331 LCSLLVMVSSEATKTLTKKVGAM--QTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR 388
           LCSL V++S  A KT  + +G +  QT+     +SVAS  +  +++R   +VH  + DD 
Sbjct: 438 LCSLFVVLSQVAAKTNAEMIGKVDQQTKTGNFEASVASSSKSLQETR---DVHQLVCDDH 494

Query: 389 QSGNQMDVDGTSGSGSDGGDMLN-ERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQ 447
            + ++          +D  D+ N  RPLSP T ALMCDE+D          G+A  ST+Q
Sbjct: 495 VNKDE----------ADAVDIANYNRPLSPETLALMCDEQDDMIFGNSSANGVACNSTLQ 544

Query: 448 K-----SNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAV 502
                 SN+   T +  EQERLILT+F D L  L+T GSIKETMCS   K   RS+++  
Sbjct: 545 NMIQKSSNSDGCTDVCREQERLILTKFLDVLRGLVTHGSIKETMCSSSTKKGERSKKEPA 604

Query: 503 EDGTSRSGTEV 513
           ++    + T V
Sbjct: 605 DNANIGAETNV 615


>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
 gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
 gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
 gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 571

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 217/450 (48%), Positives = 282/450 (62%), Gaps = 66/450 (14%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 96  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450

Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS 488
           AEQE+++LT+FRD LNR+I+CG +KE+ CS
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVKESNCS 480


>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
          Length = 536

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/485 (46%), Positives = 295/485 (60%), Gaps = 77/485 (15%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 61  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 97  -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 322

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 323 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 361

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 362 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 415

Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS--------PL---VKSETRSQQKAVEDGTS 507
           AEQE+++LT+FRD LNR+I+CG +KE+ CS        P+   V+ +   QQ  V +G S
Sbjct: 416 AEQEKVVLTKFRDCLNRIISCGEVKESNCSMSRMDLDTPVQTTVRIDPVVQQAPVANGVS 475

Query: 508 RSGTE 512
           ++  +
Sbjct: 476 QTAKQ 480


>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 568

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 223/474 (47%), Positives = 291/474 (61%), Gaps = 74/474 (15%)

Query: 53  DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAG 112
           + +DGTP+K+KQCNC++SRCLKL                             YCECFA+G
Sbjct: 107 EARDGTPQKKKQCNCKHSRCLKL-----------------------------YCECFASG 137

Query: 113 IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
            YCDGCNCVNCHNNVE+E  R+EA+G+TLERNPNAFRPKIA+SPHG +D RE+  E  L 
Sbjct: 138 SYCDGCNCVNCHNNVENEPERREAIGSTLERNPNAFRPKIAASPHGGRDNREEVGEVVLL 197

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAF 231
            +HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H + +A+
Sbjct: 198 GRHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHNVAY 257

Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV 291
           +Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  Q+  G        
Sbjct: 258 LQH-ANAAITGAIGSSGFASAPAPKRRKGQEIFFNQGPKDSSTHRLGQANSGRTT----- 311

Query: 292 PCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG 351
             S         NAS L  +K   RS LADI++P+D+K LCS+LV V+ EA KTLT++  
Sbjct: 312 --SPKTGSHAGGNAS-LRPSKVVYRSLLADIIKPKDVKALCSVLVSVAGEAAKTLTEERL 368

Query: 352 AMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR-QSGNQMDVDGTSGSGSDGGDML 410
           A Q E     +SVAS +Q+     +GH  + G   ++  SGNQ +  G  GS        
Sbjct: 369 ADQKE-----TSVASSVQD-----QGHSNNKGTEGEKAASGNQAEKSGPDGSKG------ 412

Query: 411 NERPLSPGTRALMCDEEDSAFM-QAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRF 469
             R LSP T ALMCDE D+  M  A P   +   S +         ++YAEQE+++LT+F
Sbjct: 413 --RSLSPETLALMCDERDTMLMVAASPNCSVEPPSQLPN----GQDQVYAEQEKVVLTKF 466

Query: 470 RDFLNRLITCGSIKETMC--------SPL---VKSETRSQQKAVEDGTSRSGTE 512
           RD LNR+I+CG +KE+ C        +P+   VK +    Q+ V +G S++  +
Sbjct: 467 RDCLNRIISCGEVKESNCLMSRMDLDTPVHTAVKIDPVVDQRPVANGVSQTAKQ 520


>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 578

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 214/445 (48%), Positives = 278/445 (62%), Gaps = 66/445 (14%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 96  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450

Query: 459 AEQERLILTRFRDFLNRLITCGSIK 483
           AEQE+++LT+FRD LNR+I+CG +K
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVK 475


>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
          Length = 532

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 224/485 (46%), Positives = 294/485 (60%), Gaps = 81/485 (16%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 61  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 97  -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +KS     LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKSL----LANIIKPMDVKALCSVLVAV 318

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 319 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 357

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 358 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 411

Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS--------PL---VKSETRSQQKAVEDGTS 507
           AEQE+++LT+FRD LNR+I+CG +KE+ CS        P+   V+ +   QQ  V +G S
Sbjct: 412 AEQEKVVLTKFRDCLNRIISCGEVKESNCSMSRMDLDTPVQTTVRIDPVVQQAPVANGVS 471

Query: 508 RSGTE 512
           ++  +
Sbjct: 472 QTAKQ 476


>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
          Length = 543

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 214/445 (48%), Positives = 277/445 (62%), Gaps = 66/445 (14%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 61  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 96

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 97  -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 151

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC CLDCKNFEGSE R+
Sbjct: 152 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCXCLDCKNFEGSEVRQ 211

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 212 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 270

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V
Sbjct: 271 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 322

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 323 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 361

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 362 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 415

Query: 459 AEQERLILTRFRDFLNRLITCGSIK 483
           AEQE+++LT+FRD LNR+I+CG +K
Sbjct: 416 AEQEKVVLTKFRDCLNRIISCGEVK 440


>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
          Length = 628

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 209/378 (55%), Positives = 259/378 (68%), Gaps = 12/378 (3%)

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           E+A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274

Query: 224 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 283
            + N +A++Q+AANAAI GA+GSSGYGT + S KRK +E+ F   A +QS  RN Q QQ 
Sbjct: 275 DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQD 334

Query: 284 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 343
           N+++ SAV  S S+ V+ T NA+VLGS+K   RSPLADILQPQD+K+LCSLLV+VS EA 
Sbjct: 335 NHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEAG 394

Query: 344 KTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSG 403
           KTLT      + ER+   +S+ S   E E  +   +V   +PD   S NQ  V+      
Sbjct: 395 KTLT------EAERDHIETSIISSTHEREHLQTD-DVQKPLPDACLSENQ--VERVQTDE 445

Query: 404 SDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS---TVQKSNAYASTRIYAE 460
            D  D+ N R +SPGT ALMCDE+DS FM  G   G+   S   TV+ S+  A T +Y E
Sbjct: 446 LDQADVQNGRLMSPGTLALMCDEQDSIFMARGSPDGVVGSSQIPTVKSSHGQACTEVYVE 505

Query: 461 QERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYMEPC 520
           QERL+LT+F DFLNRLITCGSIKETMCSPL KSET +QQ+ +ED  ++  T    + EP 
Sbjct: 506 QERLVLTKFWDFLNRLITCGSIKETMCSPLAKSETGNQQEPLEDDATKVRTGKGNHKEPN 565

Query: 521 GYGSAKSAVSVTLEPSCT 538
           G    K + + T + S T
Sbjct: 566 GNSIVKPSFAATTQMSQT 583


>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
          Length = 576

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 215/475 (45%), Positives = 282/475 (59%), Gaps = 60/475 (12%)

Query: 40  KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKL                       
Sbjct: 89  KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 124

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH 
Sbjct: 125 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 178

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
            ++  E A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 179 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 238

Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +ALF GDH     MQ+AANAA+ GA+G++ + +  TS KRK  +     +AK+    R  
Sbjct: 239 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSTSRKRKHIDPSLDHSAKENGAHRTN 298

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
              Q    Q + +P    +P++Q  +   LG  K   R  LADI+QP D+KELC LLV+V
Sbjct: 299 HVPQV-FYQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 357

Query: 339 SSEATKT----------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDD 387
           S  A K           + +K   M  +REG    S+AS   + E++ K         DD
Sbjct: 358 SGNAAKAYAGKKSQEDKVVEKEDEMGGKREGEIGGSLASTNHDREENNKDPNNQKASADD 417

Query: 388 RQSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASK 443
             +G     +  ++ +    +D  D  + RP+SPGT ALMCDE+D+ FM         S+
Sbjct: 418 PSTGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQ 467

Query: 444 STVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
           + V +  A  +   + +YAEQER +LT FRD L++L+T G +KE   S  +KSE 
Sbjct: 468 NVVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 522


>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 615

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/475 (46%), Positives = 283/475 (59%), Gaps = 48/475 (10%)

Query: 27  PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           PV + P+    P K ESP+ R R   DGKDGTP K+K CNCR+S+CLKL           
Sbjct: 128 PVQRPPVAV--PLKPESPKPRGRL-YDGKDGTPTKKKCCNCRHSKCLKL----------- 173

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YCECFA+GIYCDGCNC NC NNVE+EVAR+EA+ ATLERNP+
Sbjct: 174 ------------------YCECFASGIYCDGCNCTNCFNNVENEVARREAIEATLERNPD 215

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AFRPKI SSPH  ++  E A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C
Sbjct: 216 AFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKYCECFQANILCSENCKC 275

Query: 207 LDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFG 266
            DCKNF+GSE+R+ L   DH  +  MQ+AANAA+ GA+G++ + +  TS KRK  +    
Sbjct: 276 RDCKNFDGSEDRKHL-SLDHKNVVHMQQAANAAVNGAIGATAFSSPSTSKKRKHIDSSLD 334

Query: 267 VAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQ 326
           ++ K+   +RN Q  Q    Q +A+    S+PV Q+ +  ++G  K   R  LADI+QP 
Sbjct: 335 LSTKEHLAMRNGQLSQ-VIYQKNAMASDGSLPVGQSVHPHMMGPFKVTYRPLLADIIQPG 393

Query: 327 DMKELCSLLVMVSSEATKTLTKKVGAMQTER-----EGSRSSVASPIQEGEDSRKGHEVH 381
           D+KELC LLV VS +A K  T +   +Q E+     +G+  S+AS   + E   +     
Sbjct: 394 DVKELCKLLVTVSGQAAKAYTGR--KIQEEKVAEKEDGTGGSLASTNHDREGKTQDQNHR 451

Query: 382 NGIPDDRQS-GNQMDVDGTSGSGSD-GGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
               DDR S G  M       S  D   D  + RP+SPGT ALMCDE+D+ F  A  Q  
Sbjct: 452 KTSIDDRSSVGTDMGKASLDDSRPDCNDDQKSSRPMSPGTLALMCDEQDAMF--ATSQNA 509

Query: 440 LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSE 494
           +A  +     N    + +YAEQER +LT FRD L +L+  G +KE   S  ++SE
Sbjct: 510 VAPPAVAADQN---RSELYAEQERCVLTEFRDCLRKLVQYGRMKEEKYSMAIRSE 561


>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
 gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
          Length = 550

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 233/535 (43%), Positives = 295/535 (55%), Gaps = 108/535 (20%)

Query: 14  RPQHVRMVNRVPHPVHKLPL----PTL--------------PPGKQESPRSRPRANTDGK 55
           RP  V  V  VP P H +P+    P L               P K ESP+ R R   +GK
Sbjct: 48  RPHQV--VGHVPLPRHAVPMAVPVPQLRPMPPQPVQRPPVAVPLKPESPKPRARL-YEGK 104

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           D TP K+K CNCRNSRCLKL                             YCECFA+G +C
Sbjct: 105 DSTPTKKKCCNCRNSRCLKL-----------------------------YCECFASGTHC 135

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           DGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH  ++  E + +  L  KH
Sbjct: 136 DGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKH 194

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
           NKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH     MQ+A
Sbjct: 195 NKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQA 254

Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 295
            NAA+ GA+G++G+ +  TS KRK  +     + K+    RN    Q +  Q +AVP   
Sbjct: 255 TNAAVNGAIGATGFPSPSTSRKRKHIDPSLDNSNKEHVAHRNYHISQ-SVYQKNAVP-DG 312

Query: 296 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT------KTLTKK 349
            +P+SQ+A+   LG  K   R  LADI+Q +D+KELC LLV+VS EA       KT+ ++
Sbjct: 313 PIPISQSAHPPNLGPFKVTYRPLLADIVQEEDIKELCKLLVVVSGEAAKTYAGRKTMEER 372

Query: 350 VGAMQTEREGSRSSVASPIQEGEDSRKG--------HEVHNGIPDDRQSGNQMDVDGTSG 401
           VG  + ER G +          ED R G         E +N  PD + S     +D  S 
Sbjct: 373 VGEKEDERGGQK----------EDDRAGSLALTNHDREGNNQDPDQKAS-----IDDHSS 417

Query: 402 SGSDGG-------------DMLNERPLSPGTRALMCDEE-------DSAFMQAGPQTGLA 441
            G+  G             D  + RP+SPGT AL+CDEE       D+ F  +  Q  + 
Sbjct: 418 RGTHTGKAILEESRPNCADDQKSNRPMSPGTLALLCDEEVKMCDEQDTVFTTS--QNAVP 475

Query: 442 SKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPL-VKSET 495
            ++     N  A   +YAEQER++LT FRD L + +  G  KE   S + ++SET
Sbjct: 476 QQTITVNQNQSA---LYAEQERVVLTEFRDCLRKFVERGRNKERRYSSMAIESET 527


>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
 gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 619

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 213/474 (44%), Positives = 279/474 (58%), Gaps = 63/474 (13%)

Query: 40  KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKL                       
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 172

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH 
Sbjct: 173 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 226

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
            ++  E A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 227 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 286

Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +ALF GDH     MQ+AANAA+ GA+G++ + +   S KRK  +     +AK+    R  
Sbjct: 287 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTN 346

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
              Q N      +P    +P++Q  +   LG  K   R  LADI+QP D+KELC LLV+V
Sbjct: 347 HVPQKN-----GMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 401

Query: 339 SSEATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDR 388
           S  A K          + +K   M  +REG    S+AS   + E++           DD 
Sbjct: 402 SGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDP 461

Query: 389 QSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
            +G     +  ++ +    +D  D  + RP+SPGT ALMCDE+D+ FM         S++
Sbjct: 462 STGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 511

Query: 445 TVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
            V +  A  +   + +YAEQER +LT FRD L++L+T G +KE   S  +KSE 
Sbjct: 512 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 565


>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
          Length = 529

 Score =  361 bits (926), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 213/474 (44%), Positives = 280/474 (59%), Gaps = 59/474 (12%)

Query: 40  KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKL                       
Sbjct: 43  KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 78

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH 
Sbjct: 79  ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 132

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
            ++  E A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER
Sbjct: 133 NRNNMEAAGDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEER 192

Query: 219 RALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +ALF GDH     MQ+AANAA+ GA+G++ + +   S KRK  +     +AK+    R  
Sbjct: 193 KALFQGDHKNSIHMQQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTN 252

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
              Q    Q + +P    +P++Q  +   LG  K   R  LADI+QP D+KELC LLV+V
Sbjct: 253 HVPQV-FYQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVV 311

Query: 339 SSEATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDR 388
           S  A K          + +K   M  +REG    S+AS   + E++           DD 
Sbjct: 312 SGNAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDP 371

Query: 389 QSG----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
            +G     +  ++ +    +D  D  + RP+SPGT ALMCDE+D+ FM         S++
Sbjct: 372 STGGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQN 421

Query: 445 TVQKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
            V +  A  +   + +YAEQER +LT FRD L++L+T G +KE   S  +KSE 
Sbjct: 422 VVPQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 475


>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
 gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
          Length = 646

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 232/576 (40%), Positives = 307/576 (53%), Gaps = 117/576 (20%)

Query: 7   AMPQWQA-RPQHVRMVNRVPHPVHKLPLPTLPP-GKQESPRSRPRANTDGKDGTPKKQKQ 64
            MP +Q+ RPQ V  V+R+P+   KLP+       +  SPRS+ +     KD T KKQK+
Sbjct: 90  VMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKR 149

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+NS+CLKL                             YCEC+AAGIYCDGCNC NCH
Sbjct: 150 CNCKNSKCLKL-----------------------------YCECYAAGIYCDGCNCQNCH 180

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSP---------------------------- 156
           NN+ +E AR+EA+G TLE+NPNAFRPKIASSP                            
Sbjct: 181 NNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFE 240

Query: 157 ---------HGA--------------QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
                    +GA              +  +E+  E QL  +HNKGCHCKK GCLKKYCEC
Sbjct: 241 LKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCEC 299

Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAV--GSSGYGT 251
           F AN+LCSENC+C+DCKNFEGS+  R +   +   +  +++A NAAI GAV  G S  GT
Sbjct: 300 FHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATNAAINGAVGFGPSISGT 358

Query: 252 LMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGST 311
            +T  KRK +E   G +  DQ V    Q Q+  +   S+     +  VS TA   +   +
Sbjct: 359 HITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASFVSDTAYKRI---S 415

Query: 312 KSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG--AMQTEREGSRSSVASPIQ 369
           +   RS LAD+LQ Q++K LCSLLV++S EA KT  +  G  A + + E   +S+AS  Q
Sbjct: 416 RPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIASSSQ 475

Query: 370 EGEDSR---KGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDE 426
             +DSR   +  E H     ++   + +D+D            ++ RPLSP T  LMCDE
Sbjct: 476 LLQDSRNVVRASENH----ANKDVADAVDID------------IHNRPLSPETLKLMCDE 519

Query: 427 EDSAFMQAGPQTGL----ASKSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCGS 481
            D  F   G   G+    A ++ +QK SN+   T +YAEQERLILT+FRD L  LI  GS
Sbjct: 520 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 579

Query: 482 IKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYM 517
           IKETM S  VK +   ++    +G   SG E +G +
Sbjct: 580 IKETMHSSSVKKDVSIEKTPKNNGD--SGAETKGIL 613


>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
 gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
          Length = 649

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 231/576 (40%), Positives = 306/576 (53%), Gaps = 114/576 (19%)

Query: 7   AMPQWQA-RPQHVRMVNRVPHPVHKLPLPTLPP-GKQESPRSRPRANTDGKDGTPKKQKQ 64
            MP +Q+ RPQ V  V+R+P+   KLP+       +  SPRS+ +     KD T KKQK+
Sbjct: 90  VMPHFQSPRPQLVSPVHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKR 149

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+NS   K DF                          +YCEC+AAGIYCDGCNC NCH
Sbjct: 150 CNCKNS---KSDFAT-----------------------ARYCECYAAGIYCDGCNCQNCH 183

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSP---------------------------- 156
           NN+ +E AR+EA+G TLE+NPNAFRPKIASSP                            
Sbjct: 184 NNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSIQLPHVFIVGVLHQSFE 243

Query: 157 ---------HGA--------------QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
                    +GA              +  +E+  E QL  +HNKGCHCKK GCLKKYCEC
Sbjct: 244 LKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNKGCHCKK-GCLKKYCEC 302

Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAV--GSSGYGT 251
           F AN+LCSENC+C+DCKNFEGS+  R +   +   +  +++A NAAI GAV  G S  GT
Sbjct: 303 FHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATNAAINGAVGFGPSISGT 361

Query: 252 LMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGST 311
            +T  KRK +E   G +  DQ V    Q Q+  +   S+     +  VS TA   +   +
Sbjct: 362 HITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLSASFVSDTAYKRI---S 418

Query: 312 KSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG--AMQTEREGSRSSVASPIQ 369
           +   RS LAD+LQ Q++K LCSLLV++S EA KT  +  G  A + + E   +S+AS  Q
Sbjct: 419 RPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIASSSQ 478

Query: 370 EGEDSR---KGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDE 426
             +DSR   +  E H     ++   + +D+D            ++ RPLSP T  LMCDE
Sbjct: 479 LLQDSRNVVRASENH----ANKDVADAVDID------------IHNRPLSPETLKLMCDE 522

Query: 427 EDSAFMQAGPQTGL----ASKSTVQK-SNAYASTRIYAEQERLILTRFRDFLNRLITCGS 481
            D  F   G   G+    A ++ +QK SN+   T +YAEQERLILT+FRD L  LI  GS
Sbjct: 523 LDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILGS 582

Query: 482 IKETMCSPLVKSETRSQQKAVEDGTSRSGTEVRGYM 517
           IKETM S  VK +   ++    +G   SG E +G +
Sbjct: 583 IKETMHSSSVKKDVSIEKTPKNNGD--SGAETKGIL 616


>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
 gi|224030123|gb|ACN34137.1| unknown [Zea mays]
 gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
          Length = 607

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 214/491 (43%), Positives = 269/491 (54%), Gaps = 89/491 (18%)

Query: 38  PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
           P K ESP+ R R   +GKD TP K+K  NCRNS+CLKL                      
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKL---------------------- 162

Query: 98  FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                  YCECFA+G YCDGCNC NC NN ++EVAR+EAV A L+RNP+AFRPKI SSPH
Sbjct: 163 -------YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPH 215

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
             ++  E   +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 216 MHRN-NEVPSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 274

Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
           RR LF GDH     MQ+  NAA+ GA+G++G+ +  TS KRK  +       K+    +N
Sbjct: 275 RRNLFQGDHKNAINMQQVTNAAVNGAIGATGFPSPSTSRKRKHNDPSLDHFNKEHIAHKN 334

Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
               Q N     AVP   S+P+SQ+A+   LG      +  LAD +Q +D+KELC LLV+
Sbjct: 335 YHIPQKN-----AVP-DGSIPISQSAHPPTLGPF--TYKPLLADTVQAEDIKELCKLLVV 386

Query: 338 VSSEATKTLT------KKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSG 391
           VS EA KT        ++VG  + E  G +          ED R G         DR+  
Sbjct: 387 VSGEAAKTYAGQKTKEERVGEKEDEIGGQK----------EDDRVGSLALTN--HDREGN 434

Query: 392 NQ-----MDVDGTSGSGSDGGDMLNE-------------RPLSPGTRALMCDEE------ 427
           NQ       +D  S  G+  G  + E             RP+SPGT ALMCDEE      
Sbjct: 435 NQDQDQKAPIDDHSSRGTHTGKAIIEESRPNCVDDHKTHRPMSPGTLALMCDEEAFEKMC 494

Query: 428 ---DSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
              D+ F  +  Q     ++ +   N  A   +YAEQE ++LT FRD L + +  G +KE
Sbjct: 495 DEQDTMFTTS--QNAAPQQTVIVNHNQSA---LYAEQENVVLTEFRDCLRKFVRYGRMKE 549

Query: 485 TMCSPLVKSET 495
              S  +KSET
Sbjct: 550 ERYSMAIKSET 560


>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
 gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 459

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 184/437 (42%), Positives = 245/437 (56%), Gaps = 68/437 (15%)

Query: 41  QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           Q+   SR     + K+ TP K+QK CNC+NS+CLKL                        
Sbjct: 72  QDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKL------------------------ 107

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YC+GCNCVNCHN +E+E +RQ A+   LERNP+AF+PKIA SPHG 
Sbjct: 108 -----YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGM 162

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D +E+ ++  L  KH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+
Sbjct: 163 KDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERK 222

Query: 220 ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ 279
           AL HG      ++Q+  NAA+  A+  S Y     S KRKS+++   V +         Q
Sbjct: 223 ALLHGSQVSDTYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQ 277

Query: 280 SQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVS 339
            Q+ N+V+ +      S+P     N +V GST SA RS  ++  QP  ++ELCSLLV  S
Sbjct: 278 YQRANHVRRNGENSLFSLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNS 333

Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
            +    L+ K           R +   P       R  +E+ N  PD     N+MD    
Sbjct: 334 VDVANKLSDK----------GRKNDKGPSSLDGAQRVANEI-NDSPDCVLDANRMD---- 378

Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
                       E+P+SP TRALMCDEE    + +  +T    K++ +K +   S+ IY 
Sbjct: 379 ------------EKPISPATRALMCDEEHE--IGSEKETSARVKTSQEKEDTDTSSGIYL 424

Query: 460 EQERLILTRFRDFLNRL 476
           EQER IL+ FRD+L +L
Sbjct: 425 EQERQILSIFRDYLIQL 441


>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 412

 Score =  307 bits (787), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 163/308 (52%), Positives = 200/308 (64%), Gaps = 37/308 (12%)

Query: 38  PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
           P K +SP+   R   +GKD TP K+K CNCRNSRCLKL                      
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKL---------------------- 167

Query: 98  FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                  YCECFA+G +CDGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 168 -------YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPH 220

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
             ++  E + +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 221 TNRN-NEVSSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 279

Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
           RR LF GDH     +Q+A NAA+ GA+G++G+ +  TS KRK  +     + K+ +  RN
Sbjct: 280 RRNLFQGDHKNAINIQQATNAAVNGAIGTTGFPSPSTSRKRKHIDPSLDHSNKEHAAHRN 339

Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
               Q N V +       S+P+SQ+A+   LG  K   R  LADI+Q +D+KELC LLV+
Sbjct: 340 CHIPQKNAVPDG------SIPISQSAHPPTLGPFKVTYRPLLADIVQAEDVKELCKLLVV 393

Query: 338 VSSEATKT 345
           VS EA KT
Sbjct: 394 VSGEAAKT 401


>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 205/447 (45%), Positives = 257/447 (57%), Gaps = 71/447 (15%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           K ESPR R R+  D KDGTPKK KQCNC+NSRCLKL                        
Sbjct: 4   KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKL------------------------ 38

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YC+GCNCVNC NNVE+E+ RQEAV ATLERNPNAFRPKI SSP G 
Sbjct: 39  -----YCECFASGTYCEGCNCVNCCNNVENEIVRQEAVEATLERNPNAFRPKIFSSP-GI 92

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D+ ED  E  LA KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNF+GSEERR
Sbjct: 93  RDSGEDVGEHPLAGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCVDCKNFDGSEERR 152

Query: 220 ALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTS---NKRKSEELLF-GVAAKDQSV 274
           ALF  DHN  + FMQ    + ++    S+    L  S    KR++ EL+F G   K+Q  
Sbjct: 153 ALFQADHNMSVNFMQPLGGSGVL----SNSPAYLSPSPLLKKRRTHELVFSGPGLKEQLT 208

Query: 275 IRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSL 334
                +   +   +S +    S P SQ  + +V    K   +S LA ++Q + + ELC L
Sbjct: 209 SGGGTAPSLSLALSSTI---ASTPSSQGVSPAV-PPAKPMHKSLLAGVVQIEAIHELCKL 264

Query: 335 LVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQM 394
           LV+VSSE  K +   +G + ++   S S++ +  Q  +  R    V    PDD       
Sbjct: 265 LVVVSSETQKEI---LGMLLSDVANS-STLRAFSQSFQCDRA---VTTTGPDDE------ 311

Query: 395 DVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQ-AGPQTGLASKSTVQKSNAYA 453
           D D              +RP+SPGT ALMCDE++  F     P  GLAS+  +  S +  
Sbjct: 312 DADKP------------QRPMSPGTLALMCDEKEPLFTAPPSPIGGLASEE-LSTSGSPQ 358

Query: 454 STRIYAEQERLILTRFRDFLNRLITCG 480
              ++AEQER IL  FRD L R+ + G
Sbjct: 359 IALLHAEQERAILLEFRDCLRRITSVG 385


>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
 gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
          Length = 497

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 181/459 (39%), Positives = 246/459 (53%), Gaps = 46/459 (10%)

Query: 28  VHKLPLPTLPPGKQESPR-SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           +H LP   L   K ESPR S P+A T   +GTPK  +QC+C+ S+CLKL           
Sbjct: 57  LHVLPRSPLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKL----------- 105

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YCECFA+G YCD C+C NCHNNVE+E  R+EA    LERNPN
Sbjct: 106 ------------------YCECFASGSYCDECSCANCHNNVENEDVRREAAECILERNPN 147

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AF+PKI  SP    D  + A++  + AKH KGCHCK++GCLKKYCECFQANILCSENC+C
Sbjct: 148 AFKPKITGSPCTPPDDGDAAKDVLVMAKHIKGCHCKRTGCLKKYCECFQANILCSENCKC 207

Query: 207 LDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           + CKN EG E    +   +H    A++Q+    +    V SS   +   S KR   E LF
Sbjct: 208 VSCKNLEGRELGMVVTSENHCKTKAYIQQENANSSSAVVSSSHIFS-QESRKRTFRE-LF 265

Query: 266 GVAAKDQSVIRNPQSQQGNNV-QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQ 324
               +D  +    +    N V  + ++P  ++  V Q A+++ LGS+K   RS LAD + 
Sbjct: 266 DSNMRDTEIHEFTKHHAVNPVGVHVSIPTLRADCVCQIASSAALGSSKLTYRSLLADAIH 325

Query: 325 PQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGI 384
           P D +ELCSLLV+V SEA K L  +    Q  +E     +A  ++  +D +K  +VH  +
Sbjct: 326 PLDTRELCSLLVVV-SEAAKFLADEHCKAQI-KEVKEYQMACDLE--KDCKKEPDVHQVV 381

Query: 385 PDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKS 444
           P ++  G++ D+     S S   DM +  P S     LMCD++D  F +        S S
Sbjct: 382 PHNQFPGSRTDITVMDNSRSCDIDMQDGGPFSSEEDYLMCDKKDKLFTR--------SAS 433

Query: 445 TVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIK 483
             Q  N   +     E ERL+LT  RD L +LI  G++K
Sbjct: 434 LDQNLNHGCNENFTKEHERLVLTCLRDCLGKLIAFGTMK 472


>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 912

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 197/477 (41%), Positives = 253/477 (53%), Gaps = 96/477 (20%)

Query: 37  PPGKQESPR---SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAH 93
           P  K  SPR   SR R+  DGKDGTPKK KQCNC+NSRCLKL                  
Sbjct: 440 PSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKL------------------ 481

Query: 94  LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
                      YCECFA+G YCDGCNCVNC NN EHEV RQEAV ATLERNPNAFRPKIA
Sbjct: 482 -----------YCECFASGTYCDGCNCVNCCNNSEHEVVRQEAVEATLERNPNAFRPKIA 530

Query: 154 SSPHGAQDAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           +SP    D +ED  +    AAKHNKGCHCKKSGCLKKYCEC+QANILCSENC+C+DC+NF
Sbjct: 531 NSPRILHDGKEDVVDRNGSAAKHNKGCHCKKSGCLKKYCECYQANILCSENCKCVDCRNF 590

Query: 213 EGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKD 271
             S+ERRALF GD   G++F+QR +  A       SG+ + + S KRK++++ +      
Sbjct: 591 ITSDERRALFPGDPALGLSFVQRPSGCA-------SGHMSPL-SRKRKTQDVEY------ 636

Query: 272 QSVIRNPQSQQGN--NVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMK 329
             ++   Q+ + N   V+N A    +    + +A        K   RS LAD+L    ++
Sbjct: 637 --IMPGLQTNKSNLVKVENHAATTPKMSIYTASAEHIPEPMVKVVNRSLLADVLHTDVIQ 694

Query: 330 ELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR- 388
           +LC LLV+VS++    ++   G    +   + +S   P    ED   G E       DR 
Sbjct: 695 DLCKLLVIVSADVQHEVS---GIPAVDNGTNNASTVDPTNT-EDCPSGKEPSAPAAPDRL 750

Query: 389 ---QSGNQM------------DVDGTSGSGSDGGDMLN----------ERPLSPGTRALM 423
               +G  M             V     + S     L+          ERP+SPGT ALM
Sbjct: 751 QAESTGTTMAEYVRNDPTSTPPVAAVRTAPSPSNQALDVVAKREETRKERPMSPGTLALM 810

Query: 424 CDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCG 480
           CDE+D                T   S A+   +++  QER+IL+ FRD L R+I+ G
Sbjct: 811 CDEKDPLL-------------TAPSSPAH-ENQLHVVQERVILSEFRDCLRRIISVG 853


>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
 gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
          Length = 220

 Score =  274 bits (701), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 135/210 (64%), Positives = 152/210 (72%), Gaps = 35/210 (16%)

Query: 24  VPHPVHKLPLPTLPP------GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDF 77
            P P H  P PTLPP      GK ESP+S+ R N + KD TPKKQKQCNC++SRCLKL  
Sbjct: 38  APLPEHPQPSPTLPPLPATKLGKPESPKSKSRPNFETKDATPKKQKQCNCKHSRCLKL-- 95

Query: 78  VEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 137
                                      YCECFA+GIYCDGCNCVNC NNV++E AR+EAV
Sbjct: 96  ---------------------------YCECFASGIYCDGCNCVNCFNNVDNEAARREAV 128

Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
            ATLERNPNAFRPKIASSPHG +D +E+  E ++  KHNKGCHCKKSGCLKKYCECFQAN
Sbjct: 129 EATLERNPNAFRPKIASSPHGTRDNKEETGEVKVLVKHNKGCHCKKSGCLKKYCECFQAN 188

Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGDHN 227
           ILCSENC+C+DCKNFEGSEER+ALFHGD N
Sbjct: 189 ILCSENCKCMDCKNFEGSEERQALFHGDQN 218


>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
 gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
          Length = 473

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 178/478 (37%), Positives = 247/478 (51%), Gaps = 60/478 (12%)

Query: 23  RVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWI 82
           ++P P    P+  +PP      +S P         + K++K CNC+NS+CLKL    I  
Sbjct: 51  QMPTP-RPWPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLACSSIL- 99

Query: 83  NIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 142
                      L T   D L  YCECFAAG+YCDGC+C  C N V +E ARQEA+ +T +
Sbjct: 100 ----------DLFTGQIDQLYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQ 149

Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
           RNP AF+PKI +  +     ++DA       KHNKGCHCKKSGCLKKYCECFQANILCS+
Sbjct: 150 RNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSK 209

Query: 203 NCRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAANAAIIGAVGSSGYGTLMTSNKRKSE 261
           NC+C DCKNFEGSEE R +  GD++     +Q AAN A+ GA+GSSGY       KR  E
Sbjct: 210 NCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPE 269

Query: 262 ELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLAD 321
           + L+     + S ++  Q Q+ N+V +S +  S  +   +   ++    +    RS LA+
Sbjct: 270 DHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALAN 325

Query: 322 ILQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGH 378
            + P D   L   LV+V   +++A  T  +  G M+ ERE                    
Sbjct: 326 TISPTDATGLAKHLVIVCRKAADAFLTTAENKGEMEVERE-------------------- 365

Query: 379 EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQT 438
                I  D      MD       G     + + RP SPGT+ALMCDE+DS F   G   
Sbjct: 366 -----IHTDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDY 417

Query: 439 GLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETR 496
            ++    +   +   ++ + A+QE+ +LT FRD+L  +IT G I  T  + +  S  R
Sbjct: 418 RISFPVALHDQD---TSELNAQQEKAVLTGFRDYLRTVITRGKINGTEFAKIFLSLKR 472


>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
           distachyon]
          Length = 487

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 186/490 (37%), Positives = 248/490 (50%), Gaps = 74/490 (15%)

Query: 24  VPHPVHKLPLPTLPPGKQES----PRSRPRANT------DGKDGTPKKQKQCNCRNSRCL 73
           VP     +  P + P +Q+     PR  P A T      + K+ TPKK+K CNC+NSRCL
Sbjct: 19  VPAAAGGMKAPVVAPQQQKPAAPVPRPWPVAFTPMKPAAEVKNVTPKKRKHCNCKNSRCL 78

Query: 74  KLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVAR 133
           KL                             YCECFAA + CDGCNC NC NN E+E  R
Sbjct: 79  KL-----------------------------YCECFAARVSCDGCNCSNCGNNSENEKIR 109

Query: 134 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 193
           + A+ ATL RNP AF+PKI + P      ++++    L  KHNKGCHCKKSGCLKKYCEC
Sbjct: 110 KGAIEATLLRNPLAFQPKIENGPSTVNVRKDNSGAVPLRPKHNKGCHCKKSGCLKKYCEC 169

Query: 194 FQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANAAIIGAVGSSGYGTL 252
           FQANILCS+NCRC+DCKNFEGSEERRAL  GD+      +Q+AAN A+ GA+GSSGY   
Sbjct: 170 FQANILCSKNCRCMDCKNFEGSEERRALVQGDNTSDRNNIQQAANTALNGAIGSSGYKCS 229

Query: 253 MTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTK 312
               K   E+ L  +           Q QQ N    S +  S +         S   S+K
Sbjct: 230 PARRKISQEDSLSEM-----------QFQQVNLSDVSQLTPSCTGFGGHNGCYSQSNSSK 278

Query: 313 SALRSPLADILQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQ 369
              RSPLA+ +   D+ +L   +V+    ++EA  T+    G            V   ++
Sbjct: 279 MIYRSPLANTIHLSDVNDLVKHVVIACRKAAEAFPTIADNKG------------VEVEVE 326

Query: 370 EGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDS 429
           +   +  G   +     D +  + MD+   + +     D    RP SPGT+ALMCDE+D+
Sbjct: 327 KEFHANYGLNNNRSKVQDLKEASPMDIHSKACTDQHNVD---PRPASPGTQALMCDEQDT 383

Query: 430 AFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKETMCSP 489
           +F     ++  A+ S  +       + I A QE L+LT FRD+L  LIT G I E   S 
Sbjct: 384 SFGNDIYRSSFAATSCDKN-----ISEINALQENLVLTGFRDYLRVLITRGKINEAKSSS 438

Query: 490 LVKSETRSQQ 499
               E  +Q+
Sbjct: 439 EAAMELNAQR 448


>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
 gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
          Length = 448

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 171/465 (36%), Positives = 236/465 (50%), Gaps = 80/465 (17%)

Query: 24  VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWIN 83
            P P    P+  +PP      +S P         + K++K CNC+NS+CLKL        
Sbjct: 3   TPRP---WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKL-------- 42

Query: 84  IYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 143
                                YCECFAAG+YCDGC+C  C N V +E ARQEA+ +T +R
Sbjct: 43  ---------------------YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQR 81

Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
           NP AF+PKI +  +     ++DA       KHNKGCHCKKSGCLKKYCECFQANILCS+N
Sbjct: 82  NPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKN 141

Query: 204 CRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 262
           C+C DCKNFEGSEE R +  GD++     +Q AAN A+ GA+GSSGY       KR  E+
Sbjct: 142 CKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPED 201

Query: 263 LLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADI 322
            L+     + S ++  Q Q+ N+V +S +  S  +   +   ++    +    RS LA+ 
Sbjct: 202 HLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANT 257

Query: 323 LQPQDMKELCSLLVMV---SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHE 379
           + P D   L   LV+V   +++A  T  +  G M+ ERE                     
Sbjct: 258 ISPTDATGLAKHLVIVCRKAADAFLTTAENKGEMEVERE--------------------- 296

Query: 380 VHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTG 439
               I  D      MD       G     + + RP SPGT+ALMCDE+DS F   G    
Sbjct: 297 ----IHTDSDGATNMDQQNGGDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDYR 349

Query: 440 LASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
           ++    +   +   ++ + A+QE+ +LT FRD+L  +IT G I E
Sbjct: 350 ISFPVALHDQD---TSELNAQQEKAVLTGFRDYLRTVITRGKINE 391


>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
 gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
          Length = 460

 Score =  261 bits (667), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 190/490 (38%), Positives = 251/490 (51%), Gaps = 120/490 (24%)

Query: 5   SQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ 64
           SQA+P  Q  P       R   PV +L   ++ P        RP    + K+   KK KQ
Sbjct: 51  SQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELKEVAAKKCKQ 99

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+N+RCLKL                             YCECFA+G YCDGCNC+NC 
Sbjct: 100 CNCKNTRCLKL-----------------------------YCECFASGTYCDGCNCLNCC 130

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           NNVE+E  RQEAV  TLERNPNAFRPKIA++ H  +D+ +D R      KHNKGC+CKKS
Sbjct: 131 NNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKS 186

Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANA 238
           GCLKKYCECFQANILCS+NC+C+DCKN++ S+ERRAL+  D      H G      +A  
Sbjct: 187 GCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATG 246

Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQS 296
           ++   V SS     M + KR++ +L+F G + K  Q + R  Q  Q   +  S +P + S
Sbjct: 247 SL-ACVSSSP----MPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSS 301

Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTE 356
           +            S K+   S LA ++QP+ + +LC+LLVM S+EA K +          
Sbjct: 302 L------------SAKTIQTSLLAGVVQPETVSDLCNLLVMASAEAAKAM---------- 339

Query: 357 REGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
                      I  G  S++  E  NG                +GSG    D+  +RP+S
Sbjct: 340 -----------IASGPVSKRPAE--NG--------------QEAGSGQGDEDIREKRPMS 372

Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRL 476
           PGT ALMCDE+D  F  A              ++      + AEQERLIL  FRD L ++
Sbjct: 373 PGTLALMCDEQDPLFTPA--------------THGDYDKELAAEQERLILEEFRDCLRKI 418

Query: 477 ITCGSIKETM 486
           IT G  + T+
Sbjct: 419 ITVGRKRGTL 428


>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
 gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
          Length = 461

 Score =  261 bits (666), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 190/490 (38%), Positives = 251/490 (51%), Gaps = 120/490 (24%)

Query: 5   SQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ 64
           SQA+P  Q  P       R   PV +L   ++ P        RP    + K+   KK KQ
Sbjct: 51  SQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELKEVAAKKCKQ 99

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+N+RCLKL                             YCECFA+G YCDGCNC+NC 
Sbjct: 100 CNCKNTRCLKL-----------------------------YCECFASGTYCDGCNCLNCC 130

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           NNVE+E  RQEAV  TLERNPNAFRPKIA++ H  +D+ +D R      KHNKGC+CKKS
Sbjct: 131 NNVENEAVRQEAVEVTLERNPNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKS 186

Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANA 238
           GCLKKYCECFQANILCS+NC+C+DCKN++ S+ERRAL+  D      H G      +A  
Sbjct: 187 GCLKKYCECFQANILCSDNCKCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATG 246

Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQS 296
           ++   V SS     M + KR++ +L+F G + K  Q + R  Q  Q   +  S +P + S
Sbjct: 247 SL-ACVSSSP----MPAKKRRTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSS 301

Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTE 356
           +            S K+   S LA ++QP+ + +LC+LLVM S+EA K +          
Sbjct: 302 L------------SAKTIQTSLLAGVVQPETVSDLCNLLVMASAEAAKAM---------- 339

Query: 357 REGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLS 416
                      I  G  S++  E  NG                +GSG    D+  +RP+S
Sbjct: 340 -----------IASGPVSKRPAE--NG--------------QEAGSGQGDEDIREKRPMS 372

Query: 417 PGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRL 476
           PGT ALMCDE+D  F  A              ++      + AEQERLIL  FRD L ++
Sbjct: 373 PGTLALMCDEQDPLFTPA--------------THGDYDKELAAEQERLILEEFRDCLRKI 418

Query: 477 ITCGSIKETM 486
           IT G  + T+
Sbjct: 419 ITVGRKRGTL 428


>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
          Length = 356

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 137/263 (52%), Positives = 165/263 (62%), Gaps = 35/263 (13%)

Query: 38  PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
           P K ESP+ R R   +GKD TP K+K CNCRNS+CLKL                      
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKL---------------------- 162

Query: 98  FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                  YCECFA+G YCDGCNC NC NN ++EVAR+EAV A L+RNP+AFRPKI SSPH
Sbjct: 163 -------YCECFASGTYCDGCNCTNCFNNPDNEVARREAVEAILDRNPDAFRPKIGSSPH 215

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
             ++  E   +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEE
Sbjct: 216 MHRN-NEVPSDLPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 274

Query: 218 RRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
           RR LF GDH     MQ+  NAA+ GA+G++G+ +  TS KRK  +       K+    +N
Sbjct: 275 RRNLFQGDHKNAINMQQVTNAAVNGAIGATGFPSPSTSRKRKHNDPSLDHFNKEHIAHKN 334

Query: 278 PQSQQGNNVQNSAVPCSQSVPVS 300
               Q    + + VPC  +  VS
Sbjct: 335 YHIPQ----EQAEVPCYITRSVS 353


>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
          Length = 437

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 156/397 (39%), Positives = 215/397 (54%), Gaps = 39/397 (9%)

Query: 104 KYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           KYCECFAAG+YCDGC+C  C N V +E ARQEA+ +T +RNP AF+PKI +  +     +
Sbjct: 75  KYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRK 134

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +  
Sbjct: 135 DDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQ 194

Query: 224 GDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 282
           GD++     +Q AAN A+ GA+GSSGY       KR  E+ L+     + S ++  Q Q+
Sbjct: 195 GDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQE 253

Query: 283 GNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV---S 339
            N+V +S +  S  +   +   ++    +    RS LA+ + P D   L   LV+V   +
Sbjct: 254 ANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKA 310

Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
           ++A  T  +  G M+ ERE                         I  D      MD    
Sbjct: 311 ADAFLTTAENKGEMEVERE-------------------------IHTDSDGATNMDQQNG 345

Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
              G     + + RP SPGT+ALMCDE+DS F   G    ++    +   +   ++ + A
Sbjct: 346 GDFGPCCNSLEDSRPASPGTQALMCDEQDSTF---GTDYRISFPVALHDQD---TSELNA 399

Query: 460 EQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETR 496
           +QE+ +LT FRD+L  +IT G I  T  + +  S  R
Sbjct: 400 QQEKAVLTGFRDYLRTVITRGKINGTEFAKIFLSLKR 436


>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
 gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
          Length = 509

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 157/386 (40%), Positives = 219/386 (56%), Gaps = 23/386 (5%)

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
           L  YCECFAAG YCDGCNC  C N VE+E  RQ+A+  T  RNPNAF+PKI + P     
Sbjct: 81  LKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQPKIENGPIPPS- 139

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
            R+DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE R+ 
Sbjct: 140 VRKDAGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRST 199

Query: 222 FHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 280
             GD++     +Q+AAN A+ GA+GSSGY       KR +E+  +     + S+++  Q 
Sbjct: 200 IQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKRPAEDPHYQRLNVEGSMMQT-QF 258

Query: 281 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 340
           Q+ NNV  S +  S        +  +  G +K   RSPLA+ +   D+ +L + LV+V  
Sbjct: 259 QEANNVDVSQITSSTG---RDGSTGNFQGRSKLVYRSPLANTIHVTDVNDLANHLVIVCR 315

Query: 341 EATKTLTKKV---GAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
           +A +  T  V     M+ +RE   ++      E +D +K      G      + +Q  VD
Sbjct: 316 KAAERFTTIVDNKAEMEIDREICTNTYQK-FDENKDVQKDALSQLG---KVTNIDQQIVD 371

Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKS-NAYASTR 456
            +    SD  +  + RP SPGT+AL+CDE+D  F       G A +S +  + +    + 
Sbjct: 372 DSILYCSDTQE--DARPASPGTQALLCDEQDLTF-------GTAHRSPIPIALHDQDISE 422

Query: 457 IYAEQERLILTRFRDFLNRLITCGSI 482
           ++A QE  +L  FR++L  +I  G I
Sbjct: 423 LHAAQENAVLKEFRNYLRLIIARGKI 448


>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
 gi|223943031|gb|ACN25599.1| unknown [Zea mays]
 gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
          Length = 530

 Score =  248 bits (632), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 179/521 (34%), Positives = 255/521 (48%), Gaps = 66/521 (12%)

Query: 5   SQAMPQWQARPQHVRMVNRVPHPV--HKLPLPTLPPGKQESPRSRP------RANTDGKD 56
           +Q  PQ  + P  +   + +  P    + P P  P      PR  P      +++T+ K 
Sbjct: 6   NQQHPQQLSSPPVLAAASEIKAPAVPSQQPKPAAP--VMSVPRQWPIAFNPLKSSTEEKS 63

Query: 57  GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD 116
            TPKK+K CNCRNS+CLK+                             YCECF    YCD
Sbjct: 64  VTPKKKKHCNCRNSKCLKM-----------------------------YCECFQELQYCD 94

Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
           GCNC NC N V +E AR EA+ A  +RNP+AF+PKI +  +     ++++    L AKH+
Sbjct: 95  GCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHH 154

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRA 235
           KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN EG+++       DH +    +++A
Sbjct: 155 KGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQA 214

Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCS 294
           AN A  G+VGSSGY  + + ++++S E    V    +  +   Q QQGN+   S + PCS
Sbjct: 215 ANIAFNGSVGSSGY--ICSLSRKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCS 272

Query: 295 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQ 354
                   AN           RSPLA+ +   ++ +L   LVMV   A  T+      + 
Sbjct: 273 TGFDGHNAANTKPYNPN---YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIADDETDLI 329

Query: 355 TEREGSRSSVASPIQEGEDSRKGH-EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
            E +GS+S+    +  G   ++   E  + +       +Q ++D T    S   D    R
Sbjct: 330 AEEKGSQSNNG--LSNGNCKQQDFKEASSMVILSEGCSDQPNIDETGSHRSKTLDF--SR 385

Query: 414 PLSPGTRALMCDEEDSAFMQ--AGPQTG--LASKSTVQKSNAYAS-----------TRIY 458
           P SP T+ALMCDE+D+ F    A P T   +  +      N Y S           + I 
Sbjct: 386 PASPTTQALMCDEQDATFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEIN 445

Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQ 499
           A QE L+LT  R++L  +IT G I E   S     E  +QQ
Sbjct: 446 AAQENLVLTGLREYLRVIITRGKINEHKSSLEAAMELDAQQ 486


>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
          Length = 485

 Score =  244 bits (624), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 174/504 (34%), Positives = 250/504 (49%), Gaps = 66/504 (13%)

Query: 5   SQAMPQWQARPQHVRMVNRVPHPV--HKLPLPTLPPGKQESPRSRP------RANTDGKD 56
           +Q  PQ  + P  +   + +  P    + P P  P      PR  P      +++T+ K 
Sbjct: 6   NQQHPQQLSSPPVLAAASEIKAPAVPSQQPKPAAP--VMSVPRQWPIAFNPLKSSTEEKS 63

Query: 57  GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD 116
            TPKK+K CNCRNS+CLK+                             YCECF    YCD
Sbjct: 64  VTPKKKKHCNCRNSKCLKM-----------------------------YCECFQELQYCD 94

Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
           GCNC NC N V +E AR EA+ A  +RNP+AF+PKI +  +     ++++    L AKH+
Sbjct: 95  GCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHH 154

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRA 235
           KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN EG+++       DH +    +++A
Sbjct: 155 KGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQA 214

Query: 236 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCS 294
           AN A  G+VGSSGY  + + ++++S E    V    +  +   Q QQGN+   S + PCS
Sbjct: 215 ANIAFNGSVGSSGY--ICSLSRKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCS 272

Query: 295 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQ 354
                   AN           RSPLA+ +   ++ +L   LVMV   A  T+      + 
Sbjct: 273 TGFDGHNAANTKPYNPN---YRSPLANTIHLSEVNDLVKQLVMVCRMAAATIADDETDLI 329

Query: 355 TEREGSRSSVASPIQEGEDSRKGH-EVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
            E +GS+S+  + +  G   ++   E  + +       +Q ++D T    S   D    R
Sbjct: 330 AEEKGSQSN--NGLSNGNCKQQDFKEASSMVILSEGCSDQPNIDETGSHRSKTLDF--SR 385

Query: 414 PLSPGTRALMCDEEDSAFMQ--AGP--QTGLASKSTVQKSNAYAS-----------TRIY 458
           P SP T+ALMCDE+D+ F    A P  Q  +  +      N Y S           + I 
Sbjct: 386 PASPTTQALMCDEQDATFGNEPASPTTQALMCEEQNTTFGNDYRSSFPLVSCDQDISEIN 445

Query: 459 AEQERLILTRFRDFLNRLITCGSI 482
           A QE L+LT  R++L  +IT G I
Sbjct: 446 AAQENLVLTGLREYLRVIITRGKI 469


>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
 gi|223944643|gb|ACN26405.1| unknown [Zea mays]
 gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
          Length = 499

 Score =  234 bits (598), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 154/389 (39%), Positives = 215/389 (55%), Gaps = 28/389 (7%)

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
           L  YCECFAAG YCDGCNC  C N VE+E  RQEA+  T  RNPNAF+PKI + P  +  
Sbjct: 83  LKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIPS-S 141

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
             +DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE R+ 
Sbjct: 142 VWKDAGALPSHPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEELRST 201

Query: 222 FHGDH----NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 277
             GD+    N I   Q+AAN A+ GA+GSSGY       KR  E+        + S+++ 
Sbjct: 202 TQGDNSCDRNNI---QQAANVALNGAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT 258

Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 337
            Q ++ NNV  S +  S        +  +  G++K   RSPLA+ +   D+ +L + LV+
Sbjct: 259 -QFRETNNVDVSQMTSSTG---RDGSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVI 314

Query: 338 VSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMD-- 395
              +A +  T  V   + E E  +    +  Q+ ++++KG  V             +D  
Sbjct: 315 ACRKAAERFTTIVDN-KVEMEIDKEICTNTDQKFDENKKG--VQKAALSQLGKVTDIDQQ 371

Query: 396 -VDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKS-NAYA 453
            VD +    SD  D  + RP SPGT+AL+CDE+D  F       G   +S++  + +   
Sbjct: 372 IVDDSRLHCSDTQD--DARPASPGTQALLCDEQDLTF-------GTVHRSSIPVAMHDQV 422

Query: 454 STRIYAEQERLILTRFRDFLNRLITCGSI 482
            + ++A QE  +L   R+ L  +I  G +
Sbjct: 423 ISELHAAQESAVLKELRNNLRLIIARGKL 451


>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
           distachyon]
          Length = 540

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 160/458 (34%), Positives = 224/458 (48%), Gaps = 87/458 (18%)

Query: 51  NTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFA 110
           +++  +GTP++ K C C+NS+CLKL                             YC+CFA
Sbjct: 138 SSESINGTPRR-KNCKCKNSKCLKL-----------------------------YCDCFA 167

Query: 111 AGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP-HGAQDAREDARE 168
           +G YC D CNC NC N+V HE ARQ+A+ A +ERNP AF PKI + P H AQ+    A E
Sbjct: 168 SGRYCNDDCNCKNCCNDVSHETARQDAINAVMERNPVAFMPKIGNIPRHAAQNREYRAAE 227

Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
                KH KGC CK+S CLKKYCECFQ+N+LCSENC+C DCKN+E SE+ + +       
Sbjct: 228 GPRVGKHMKGCQCKRSECLKKYCECFQSNVLCSENCKCTDCKNYESSEDMKEMRRMTQQH 287

Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
             ++    N A+ G +G S    ++     K   L      +DQ +  N  SQ       
Sbjct: 288 GVYVHHVQNLALKGMIGPSA---VLPRAAEKFSGLSVASLGRDQPINNNDSSQ------- 337

Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
              P   SVP   T ++  LG      R+ LADI+Q +D+  LC +LV+VS         
Sbjct: 338 VLSPLLTSVPTEDTESSVRLGRHGVTYRTLLADIIQIEDVNVLCKVLVLVS--------- 388

Query: 349 KVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPD---DRQS------------GNQ 393
                   R+ + + + S  +E  + +K     + IP    DR++            GN 
Sbjct: 389 --------RQAAGAFLDSGFKENTNIKKLDRAESHIPSTNHDRKAVQKQHDEQLCSLGNS 440

Query: 394 MDV----DGTSGSG-SDGGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTV 446
           +      +G +G   SD  D    N R +SPGT+ALMC+E+D  F+      G    +T 
Sbjct: 441 LIAVPLSEGRAGMPRSDPSDTWKDNRRSVSPGTQALMCNEQD-MFVHTSSVAGAILSTT- 498

Query: 447 QKSNAYASTRIYAEQERLILTRFRDFLNRLITCGSIKE 484
            K N    + IY EQE+ +LT   DFL  L   G ++E
Sbjct: 499 -KENL---SEIYIEQEKRVLTNLLDFLRELAKSGRLQE 532


>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
           sativa Japonica Group]
          Length = 508

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 157/438 (35%), Positives = 222/438 (50%), Gaps = 65/438 (14%)

Query: 55  KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
           K+ +PKK+K CNC+NS                                  YCECFAA +Y
Sbjct: 115 KNASPKKRKHCNCKNS----------------------------------YCECFAARVY 140

Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
           CDGC+C  C N +E+E  R+EA+   L RNP AF+PKI +SP+     ++++       K
Sbjct: 141 CDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTVTVRKDNSEAIPPIPK 200

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGIA 230
           HNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL       D N I 
Sbjct: 201 HNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI- 259

Query: 231 FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSA 290
             Q AAN A+ GA GS GY       K+  E  L       + ++   Q Q  ++   S 
Sbjct: 260 --QEAANFALNGATGSLGYKNSPVRRKKYQENSL------GEQILSEAQFQPTDHADVSQ 311

Query: 291 VPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKV 350
           +  S +      A      S++   RSPLA+ +   ++ +L    ++   +A +    K 
Sbjct: 312 LASSCTGFGGDIAINYQSKSSEMIYRSPLANTIPLIEVNDLVKNALVACRKAPEAFLTKA 371

Query: 351 G---AMQTEREGSRSSVASPIQEGEDSRKG---HEVHNGIPDDRQSGNQMDVDGTSGSGS 404
           G    MQ E++   +   +  +  E + K     ++ N    DRQ     +++GT    +
Sbjct: 372 GNKVEMQAEKQYQTNDEINNDKTKEQNLKEASLKDIQNKACIDRQ-----NINGTEPHLA 426

Query: 405 DGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERL 464
           +     + RP SPGT+ALMCDE D+ F     ++     S  Q       + + A+QER+
Sbjct: 427 NS--FKDSRPASPGTQALMCDEMDTTFGNEDYRSPFVVPSRDQD-----ISELNADQERI 479

Query: 465 ILTRFRDFLNRLITCGSI 482
           +LT  RD+L  LIT G+I
Sbjct: 480 VLTGLRDYLRVLITRGNI 497


>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
 gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
          Length = 497

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 162/487 (33%), Positives = 241/487 (49%), Gaps = 90/487 (18%)

Query: 34  PTLPPGKQESPRSRP-RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           P +P  +Q      P + +T+ K  TPKK+K CNCRNS+CLK+                 
Sbjct: 38  PVMPVPRQWPMAFNPLKPSTEVKSVTPKKKKHCNCRNSKCLKM----------------- 80

Query: 93  HLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
                       YCECF    YCDGCNC NC N V +E AR EA+ A  +RNP+AF+PKI
Sbjct: 81  ------------YCECFQELQYCDGCNCSNCGNIVGNENARNEAIEAIRQRNPSAFQPKI 128

Query: 153 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
            + P+     ++++    + AKH+KGCHCKKSGCLKKYCEC+QAN+ CS+NCRC+ CKN 
Sbjct: 129 ENGPNTLNVRKDNSGAVPVVAKHHKGCHCKKSGCLKKYCECYQANVFCSKNCRCMHCKNS 188

Query: 213 EGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKD 271
           EG+E+       DH +  + +Q+AAN A  G VGSSGY  + + ++++S E   GV  K 
Sbjct: 189 EGNEDTETSSQRDHASDRSHIQQAANVAFNGTVGSSGY--ICSLSRKRSHEDALGVRIKS 246

Query: 272 QSVIRNPQSQ--QGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDM 328
           +  +   Q+Q  QGN+   S + PCS        AN+    S     RSPLA+ +  +++
Sbjct: 247 EGSMPETQTQYQQGNHADVSPLAPCSTGFDRHNAANSK---SNNPIYRSPLANTIHLREV 303

Query: 329 KELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDR 388
            +L   LVMV   A   + ++ G        S + +++   + +D ++   +     D  
Sbjct: 304 NDLVKHLVMVCRMAEAKIAEEKGFQ------SNNGLSNGNCKQQDFKETSSI-----DIL 352

Query: 389 QSG--NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQ--AGPQTG----- 439
             G  +Q +++      S+  ++   RP SP T++LMCDE+++ F    A P T      
Sbjct: 353 SKGCSHQPNINEMGFHWSETSNV--SRPASPTTQSLMCDEQNTTFGNEPASPTTQALMCE 410

Query: 440 ------------------LASKSTVQKSNAYAS-----------TRIYAEQERLILTRFR 470
                             +  +      N Y S           + I A QE L+LT  R
Sbjct: 411 EQNTTFGNEPASPTTQALMCEEQNTTFGNDYKSSFPLVSRGQDISEINAVQENLVLTGLR 470

Query: 471 DFLNRLI 477
            +L  +I
Sbjct: 471 QYLCTII 477


>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
          Length = 612

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 159/476 (33%), Positives = 231/476 (48%), Gaps = 63/476 (13%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           ++ KD T  ++  C+C+ S+CLKL                             YCECF  
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191

Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
           G YC GCNC NC NNV HE  RQ+A+   LERNP AF PK+ +S   A +    A E  +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGDI 249

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
             KH KGC CK++  LKKYCECF+A++ CSENCRC  CKNF+ +E+R   +   H  H  
Sbjct: 250 VGKHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAHH-- 307

Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL----FGVAAKDQSVIRNPQSQQGN 284
                   N A  G VG S        N              +      VI    + Q +
Sbjct: 308 ------VQNPASSGMVGQSVIIFHAAENDSSLSLAASVSDHSINNNTSHVILLQHTFQIS 361

Query: 285 NVQNSAVPCSQSVPVSQTANASVLGST--KSALRSPLADILQPQDMKELCSLLVMVSSEA 342
             Q  A+ CS +  V++    S++     +   R  LADI+Q +D+ ELC+LL++VS +A
Sbjct: 362 PFQ--ALSCSLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQA 419

Query: 343 TKTLTKKVGAMQTERE--GSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG-T 399
             TL +      T+R+      S  S      ++ +          ++ S   +  +G  
Sbjct: 420 GATLLESGMKENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRL 479

Query: 400 SGSGSDGGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
               +D  D+   N+RPLSPGT+ALMCDE+D+ F  +  +   A+ ST +++       I
Sbjct: 480 ELPRADPFDIQKGNKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDI 533

Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEV 513
           + EQE+ +LT FRD+L +L TCG ++E   S  + S   S+Q   + G + S T V
Sbjct: 534 FKEQEKRVLTNFRDYLCKLATCGKLQELKLS--LASTNCSEQAFADHGNNSSITRV 587


>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
          Length = 586

 Score =  191 bits (485), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 155/470 (32%), Positives = 227/470 (48%), Gaps = 77/470 (16%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           ++ KD T  ++  C+C+ S+CLKL                             YCECF  
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191

Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
           G YC GCNC NC NNV HE ARQ+A+   LERNP AF PK+++S   A +    A E  +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNS--TAHNCESKAAEGDI 249

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
             KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R   +   H  H  
Sbjct: 250 VGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH-- 307

Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
                   N A  G VG S               ++F  A  D S+     +   +    
Sbjct: 308 ------VQNPASSGMVGQSV--------------IIFHAAENDSSLSLAASALSCSL--- 344

Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
                + SV      +       +   R  LADI+Q +D+ ELC+LL++VS +A  TL +
Sbjct: 345 -----TSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATLLE 399

Query: 349 KVGAMQTERE--GSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG-TSGSGSD 405
                 T+R+      S  S      ++ +          ++ S   +  +G      +D
Sbjct: 400 SGMKENTKRKKLNRVDSCLSSTNHDREAIQKEPAKQMCSLEKSSNEVLIAEGRLELPRAD 459

Query: 406 GGDML--NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQER 463
             D+   N+RPLSPGT+ALMCDE+D+ F  +  +   A+ ST +++       I+ EQE+
Sbjct: 460 PFDIQKGNKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDIFKEQEK 513

Query: 464 LILTRFRDFLNRLITCGSIKETMCSPLVKSETRSQQKAVEDGTSRSGTEV 513
            +LT FRD+L +L TCG ++E   S  + S   S+Q   + G + S T V
Sbjct: 514 RVLTNFRDYLCKLATCGKLQELKLS--LASTNCSEQAFADHGNNSSITRV 561


>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
          Length = 702

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 95/169 (56%), Positives = 107/169 (63%), Gaps = 40/169 (23%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC+NSRCLKL                             YCECFA+G YCDGCNC+
Sbjct: 153 KKHCNCKNSRCLKL-----------------------------YCECFASGRYCDGCNCL 183

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC+NN EHE  RQ AV A LERNPNAFRPKIA    GA + R         A+HNKGC+C
Sbjct: 184 NCYNNREHEATRQSAVEAILERNPNAFRPKIAP---GADNRRAGG------ARHNKGCNC 234

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF--HGDHNG 228
           KKSGCLKKYCECFQA I+CSE C+CLDCKN+EGSE R AL    G+  G
Sbjct: 235 KKSGCLKKYCECFQAGIVCSEICKCLDCKNYEGSEARDALVSPQGERQG 283


>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
 gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
          Length = 154

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 94/152 (61%), Positives = 114/152 (75%), Gaps = 4/152 (2%)

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           E+  E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFH
Sbjct: 1   EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60

Query: 224 GDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 282
           GDH N +A++Q+AANAAI GA+GSSGY +L  S KRK +EL FG   KD S  R    QQ
Sbjct: 61  GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120

Query: 283 GNNVQNSAVPCS--QSVPVSQTANASVLGSTK 312
            ++++  A P S   S PV++  N + LGS+K
Sbjct: 121 ASHIR-PAAPSSPLPSNPVARAGNTTTLGSSK 151


>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
          Length = 426

 Score =  181 bits (460), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 131/383 (34%), Positives = 186/383 (48%), Gaps = 44/383 (11%)

Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           CNC N        +  QEA+           + KI +  +     ++DA       KHNK
Sbjct: 83  CNCKN-----SQCLKLQEAITVRSNGIQKPSKLKIENGSNALNLRKDDAGAPASLPKHNK 137

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA-FMQRAA 236
           GCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +  GD++     +Q AA
Sbjct: 138 GCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAA 197

Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 296
           N A+ GA+GSSGY       KR  E+ L+     + S ++  Q Q+ N+V +S +  S  
Sbjct: 198 NVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTG 256

Query: 297 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV---SSEATKTLTKKVGAM 353
           +   +   ++    +    RS LA+ + P D   L   LV+V   +++A  T  +  G M
Sbjct: 257 L---EGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLTTAENKGEM 313

Query: 354 QTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNER 413
           + ERE                         I  D      MD       G     + + R
Sbjct: 314 EVERE-------------------------IHTDSHGATNMDQQNGGDFGPCCNSLEDSR 348

Query: 414 PLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFL 473
           P SPGT+ALMCDE+ S F   G    ++    +   +   ++ + A+QE+ +LT FRD+L
Sbjct: 349 PASPGTQALMCDEQGSTF---GTDYRISFPVALHDQD---TSELNAQQEKAVLTGFRDYL 402

Query: 474 NRLITCGSIKETMCSPLVKSETR 496
             +IT G I  T  + +  S  R
Sbjct: 403 RTVITRGKINGTEFAKIFLSLKR 425


>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
          Length = 274

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/169 (55%), Positives = 113/169 (66%), Gaps = 9/169 (5%)

Query: 102 LVK--YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
           LVK  YCECFAA +YCDGC+C  C N +E+E  R+EA+   L RNP AF+PKI +SP+  
Sbjct: 83  LVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENSPNTV 142

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
              ++++       KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+
Sbjct: 143 TVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERK 202

Query: 220 ALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 264
           AL       D N I   Q AAN A+ GA GS GY +     K+  E  L
Sbjct: 203 ALVQMKNASDRNHI---QEAANFALSGATGSLGYKSSPVRRKKYQENSL 248


>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
          Length = 321

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 94/185 (50%), Positives = 111/185 (60%), Gaps = 44/185 (23%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNC 120
           +K CNCRNSRCLKL                             YCECFA+G YC   CNC
Sbjct: 20  RKTCNCRNSRCLKL-----------------------------YCECFASGQYCLQNCNC 50

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
           V C NNVE+ V RQEAV ATLERNPNAFRPK+  + +G +             KHNKGCH
Sbjct: 51  VECKNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE----------KHNKGCH 100

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAI 240
           CKKSGCLKKYCECFQA+I CS+ CRCLDCKNFEGS+  +A+         FM   ++A++
Sbjct: 101 CKKSGCLKKYCECFQASIFCSDMCRCLDCKNFEGSDAHKAIASQQE----FMPAGSDASL 156

Query: 241 IGAVG 245
           + A G
Sbjct: 157 LVAPG 161


>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 255

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 84/159 (52%), Positives = 99/159 (62%), Gaps = 35/159 (22%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K++K CNCRNSRCLKL                             YCECFA+G++CD CN
Sbjct: 4   KRKKYCNCRNSRCLKL-----------------------------YCECFASGLHCDSCN 34

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CVNC NN+     R  AV +TLERNPNAFRPKIA      +  RE A      ++HN+GC
Sbjct: 35  CVNCSNNLASAAIRCTAVESTLERNPNAFRPKIAPGVGPLEKRREGA------SRHNRGC 88

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           HCKKS CLKKYCECFQANI CS+ CRC++CKNF GS +R
Sbjct: 89  HCKKSSCLKKYCECFQANIFCSDICRCMECKNFLGSSQR 127



 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           + +K  K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 1   MPSKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40


>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
 gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
          Length = 116

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 78/116 (67%), Positives = 86/116 (74%), Gaps = 9/116 (7%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--- 161
           YCECFA+G+YCDGCNC+NC NN E+E  RQEAV   LERNPNAFRPKI S      D   
Sbjct: 1   YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60

Query: 162 -----AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                 +E  RE Q   A+HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61  CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116


>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
 gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
          Length = 1133

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 68/109 (62%), Positives = 78/109 (71%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFA+G YC+ CNCV C NN EHE  RQ AV A LERNPNAFRPKI S+      A  
Sbjct: 144 YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNPNAFRPKIQSNEQAPAAAAA 203

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
               A    +H KGC+CKKS CLKKYCECFQA I CS+NC+C++CKN+E
Sbjct: 204 VVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYE 252



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 27/67 (40%), Gaps = 30/67 (44%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K CNC+ S CLK                             KYCECF AGIYC D C CV
Sbjct: 216 KGCNCKKSSCLK-----------------------------KYCECFQAGIYCSDNCKCV 246

Query: 122 NCHNNVE 128
            C N  E
Sbjct: 247 ECKNYEE 253


>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
 gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
          Length = 766

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 90/156 (57%), Gaps = 40/156 (25%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG--CNC 120
           K CNC+NSRCLKL                             YCECFAAG  C    C C
Sbjct: 16  KGCNCKNSRCLKL-----------------------------YCECFAAGALCASARCRC 46

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
            NC N VEH   R  A+   L+RNPNAFRPKIA+    ++D         +A +HN+GCH
Sbjct: 47  ANCMNTVEHAGERTAAIETVLDRNPNAFRPKIAAVASPSRD---------VALRHNRGCH 97

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           CK+SGCLKKYCECFQA I C+E CRC+ C+N+EGS+
Sbjct: 98  CKRSGCLKKYCECFQAAIYCAETCRCVSCENYEGSD 133



 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 40/75 (53%), Gaps = 4/75 (5%)

Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILC-SENCRCLDCKN-FEGSEERRALFHG-- 224
           AQ+     KGC+CK S CLK YCECF A  LC S  CRC +C N  E + ER A      
Sbjct: 8   AQIQRGDGKGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGERTAAIETVL 67

Query: 225 DHNGIAFMQRAANAA 239
           D N  AF  + A  A
Sbjct: 68  DRNPNAFRPKIAAVA 82


>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
          Length = 695

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 103/192 (53%), Gaps = 39/192 (20%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           ES    P +N +     P+K   CNC  S+CLKL                          
Sbjct: 464 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKL-------------------------- 495

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C  CNC+NC NN+E+E  RQ A+   LERNPNAFRPKI  +   A D
Sbjct: 496 ---YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGD 552

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           +           KH KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E S E+++L
Sbjct: 553 SS--------IRKHTKGCNCKRSGCLKNYCECYEAKIACSNNCKCMGCRNIEDSMEKKSL 604

Query: 222 FHGDHNGIAFMQ 233
                +G+  +Q
Sbjct: 605 RASIEDGLHNLQ 616


>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
          Length = 553

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 173/352 (49%), Gaps = 45/352 (12%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGIAFMQ 233
           K C+CK S CLK YCECF + + C + C C +C  N +    RR       + N  AF  
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218

Query: 234 R-------------AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 280
           +             AANAA+ GA+G++ + +   S KRK  +     +AK+    R    
Sbjct: 219 KIGSSPHANRNNMQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTNHV 278

Query: 281 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 340
            Q    Q + +P    +P++Q  +   LG  K   R  LADI+QP D+KELC LLV+VS 
Sbjct: 279 PQVF-YQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSG 337

Query: 341 EATKT---------LTKKVGAMQTEREGS-RSSVASPIQEGEDSRKGHEVHNGIPDDRQS 390
            A K          + +K   M  +REG    S+AS   + E++           DD  +
Sbjct: 338 NAAKAYAGKKSQDKVVEKEDEMGGKREGEIGGSLASTNHDREENNNDPNNQKASADDPST 397

Query: 391 G----NQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTV 446
           G     +  ++ +    +D  D  + RP+SPGT ALMCDE+D+ FM         S++ V
Sbjct: 398 GGTRAGKASLEESKHDCND--DQKSNRPMSPGTLALMCDEQDTMFM--------TSQNVV 447

Query: 447 QKSNAYAS---TRIYAEQERLILTRFRDFLNRLITCGSIKETMCSPLVKSET 495
            +  A  +   + +YAEQER +LT FRD L++L+T G +KE   S  +KSE 
Sbjct: 448 PQQPAPVNQNQSELYAEQERCVLTEFRDCLHKLVTFGRMKEEKFSMAIKSEV 499



 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 66/131 (50%), Positives = 82/131 (62%), Gaps = 31/131 (23%)

Query: 40  KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKL                       
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKL----------------------- 172

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YCECFA+G+YCDGCNC NC NNV++E AR+EA+ ATLERNP+AFRPKI SSPH 
Sbjct: 173 ------YCECFASGVYCDGCNCSNCFNNVKNETARREAIDATLERNPDAFRPKIGSSPHA 226

Query: 159 AQDAREDAREA 169
            ++  + A  A
Sbjct: 227 NRNNMQAANAA 237


>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
          Length = 752

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 103/192 (53%), Gaps = 39/192 (20%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           ES    P +N +     P+K   CNC  S+CLKL                          
Sbjct: 521 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKL-------------------------- 552

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C  CNC+NC NN+E+E  RQ A+   LERNPNAFRPKI  +   A D
Sbjct: 553 ---YCDCFANGEFCYMCNCMNCFNNLENEDHRQRAIKTCLERNPNAFRPKIGKAKDVAGD 609

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           +           KH KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E S E+++L
Sbjct: 610 SS--------IRKHTKGCNCKRSGCLKNYCECYEAKIACSNNCKCMGCRNIEDSMEKKSL 661

Query: 222 FHGDHNGIAFMQ 233
                +G+  +Q
Sbjct: 662 RASIEDGLHNLQ 673


>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 151

 Score =  148 bits (373), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 99/170 (58%), Gaps = 31/170 (18%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           SPRS     +  +D +  ++K+CNC+NS+CLKL                           
Sbjct: 12  SPRSIAARASTPRD-SASQRKRCNCKNSKCLKL--------------------------- 43

Query: 103 VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS-SPHGAQD 161
             YCECFA+G YCD CNC  C NN EH   RQ A+  TLERNPNAFRPKI + + +G  D
Sbjct: 44  --YCECFASGKYCDACNCAGCKNNDEHAHERQSAIEQTLERNPNAFRPKIINQTTNGTLD 101

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                  A   A+HN+GCHCK+SGCLKKYCECFQA I C E CRC +CKN
Sbjct: 102 GAGAGDGATTEARHNRGCHCKRSGCLKKYCECFQAAIYCVERCRCAECKN 151



 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 44/90 (48%), Gaps = 4/90 (4%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF-EGSEERRALFHG--DHNG 228
           +A   K C+CK S CLK YCECF +   C + C C  CKN  E + ER++      + N 
Sbjct: 26  SASQRKRCNCKNSKCLKLYCECFASGKYC-DACNCAGCKNNDEHAHERQSAIEQTLERNP 84

Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKR 258
            AF  +  N    G +  +G G   T+  R
Sbjct: 85  NAFRPKIINQTTNGTLDGAGAGDGATTEAR 114


>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
           CCMP2712]
          Length = 138

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 80/161 (49%), Positives = 101/161 (62%), Gaps = 33/161 (20%)

Query: 55  KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
           + GTP K+K CNC+NS+CLKL                             YCECFA+G+Y
Sbjct: 7   RPGTPAKRKPCNCKNSKCLKL-----------------------------YCECFASGLY 37

Query: 115 CD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS-PHGAQDA-REDAREAQL 171
           C+  CNC +CHNNV  E  RQ AV +TLERNP+AFRPKI  +   G  D+  + AR    
Sbjct: 38  CEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRPKITQTGQQGIVDSPAKGARWLMF 97

Query: 172 A-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
             A+H KGCHCKKSGCLKKYCECFQA ++CS+ C+C++C+N
Sbjct: 98  VKARHMKGCHCKKSGCLKKYCECFQAAVMCSDTCKCIECRN 138



 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 34/77 (44%), Gaps = 3/77 (3%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFHGDHNGIAFMQ 233
           K C+CK S CLK YCECF + + C   C C  C N   FE   +   +   + N  AF  
Sbjct: 15  KPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERNPHAFRP 74

Query: 234 RAANAAIIGAVGSSGYG 250
           +       G V S   G
Sbjct: 75  KITQTGQQGIVDSPAKG 91


>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
          Length = 750

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/205 (38%), Positives = 107/205 (52%), Gaps = 52/205 (25%)

Query: 46  SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
           ++P++ +D      K +K CNC  S+CLKL                             Y
Sbjct: 525 TKPKSTSDVN----KTKKPCNCTRSQCLKL-----------------------------Y 551

Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           C+CFA G +C  CNC +C+NN+EHE  R +A+ + L+RNPNAFRPKI S           
Sbjct: 552 CDCFANGEFCYQCNCNSCYNNMEHEEDRAQAIRSVLDRNPNAFRPKIKS----------- 600

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
               Q   +H KGC+CK+SGCLK YCECF+A I CS+NC+C+ C+N E  ++    F GD
Sbjct: 601 -FVGQAERQHTKGCNCKRSGCLKNYCECFEARIACSQNCKCIGCRNME--DDSHTSF-GD 656

Query: 226 HNGIA----FMQRAANAAIIGAVGS 246
           H G         R  +A  + + GS
Sbjct: 657 HPGSLSVDNLKSRTKSANFVTSFGS 681


>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
 gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
          Length = 757

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/175 (42%), Positives = 97/175 (55%), Gaps = 40/175 (22%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           G ++  R  P   T   D   KK + CNC  S+CLKL                       
Sbjct: 467 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKL----------------------- 502

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YC+CFA G +C  CNC +C+NN+++E  RQ+A+ ATLERNP+AF+PKI     G
Sbjct: 503 ------YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFKPKI-----G 551

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           A  A EDA        H KGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 552 AVSADEDALRL-----HTKGCNCKRSGCLKNYCECYEAKIACSANCKCIGCRNTE 601



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 37/77 (48%), Gaps = 10/77 (12%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGI 229
           A   + C+C KS CLK YC+CF     C  NC C DC  N +  EER+       + N  
Sbjct: 486 AYKKRPCNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEERQKAIRATLERNPS 544

Query: 230 AFMQRAANAAIIGAVGS 246
           AF  +      IGAV +
Sbjct: 545 AFKPK------IGAVSA 555


>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
          Length = 916

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/187 (44%), Positives = 102/187 (54%), Gaps = 46/187 (24%)

Query: 56  DGTP-----KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFA 110
           +GTP     + +K CNC  S+CLKL                             YC+CFA
Sbjct: 652 NGTPCEDNSRPRKPCNCTKSQCLKL-----------------------------YCDCFA 682

Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
            G +C  CNC NC NN+EHE  RQ+A+GA LERNPNAFRPKI     G  + R       
Sbjct: 683 NGEFCHSCNCNNCFNNLEHEEERQKAIGACLERNPNAFRPKIGKGKEGDHERR------- 735

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGI 229
               H KGC+CK+SGCLK YCEC++A ILCS  C+C+ CKNFE S ER+ L    D   +
Sbjct: 736 ----HTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEV 791

Query: 230 AFMQRAA 236
              Q+AA
Sbjct: 792 RVQQQAA 798


>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
          Length = 596

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 87/193 (45%), Positives = 100/193 (51%), Gaps = 57/193 (29%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC+NS+CLKL                             YCECFA+G YC  CNC 
Sbjct: 33  KKNCNCKNSKCLKL-----------------------------YCECFASGEYCKNCNCN 63

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIAS----SPHGAQDA--------------- 162
            C NN+E+E  R+E +   LERNPNAFRPKIAS    SP   Q                 
Sbjct: 64  GCCNNIENESIRKETIAIILERNPNAFRPKIASLPPQSPQITQKVVGVTGGPINNPSQIN 123

Query: 163 -------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
                       +   A KH KGC CKKSGCLKKYCECFQA I CS+NC+C DCKNF+GS
Sbjct: 124 IGLSTPIMTGVNDGVAAGKHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKNFDGS 183

Query: 216 EERRALFHGDHNG 228
            +RR L   D NG
Sbjct: 184 LDRRILL--DSNG 194


>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
 gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
 gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 368

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)

Query: 27  PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKL           
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YC+CFA+G+ C  C+CV+CHNN E   AR+ A+   L RNPN
Sbjct: 79  ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120

Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
           AF  K   S    Q  A  D +   L+    +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176

Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           C++CKN   SE  +      HN     +R  +  +                       LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
            +   D S+          N   S + CS  +P  +    S++G T + L          
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257

Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
            D+ +LCSLLV  S  AT T                          ED  +        P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288

Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
           DD+++     ++ +S S S     + E   S G    + D     EEDS           
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331

Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
            +K++V +      T IY EQER +L  FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362


>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
          Length = 1772

 Score =  140 bits (352), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 72/175 (41%), Positives = 90/175 (51%), Gaps = 49/175 (28%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           T  K    K+ K CNCRNSRCLKL                             YC+CFAA
Sbjct: 320 TAAKASATKRFKPCNCRNSRCLKL-----------------------------YCDCFAA 350

Query: 112 GIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
           G +C+  C CV+CHN+  H   R +A+ ATLE+NP AFR K+ +                
Sbjct: 351 GRFCEESCKCVDCHNDQAHARDRDDAIKATLEKNPKAFRAKVDAE--------------- 395

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE---RRALF 222
            A  H  GCHCKK+ CLKKYCECF+A I C   C+C DC+N+ GS +   RR L+
Sbjct: 396 -ATTHQNGCHCKKTKCLKKYCECFEAGITCGAKCKCADCENYPGSLQLAARRGLY 449


>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
          Length = 369

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)

Query: 27  PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKL           
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YC+CFA+G+ C  C+CV+CHNN E   AR+ A+   L RNPN
Sbjct: 79  ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120

Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
           AF  K   S    Q  A  D +   L+    +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176

Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           C++CKN   SE  +      HN     +R  +  +                       LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
            +   D S+          N   S + CS  +P  +    S++G T + L          
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257

Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
            D+ +LCSLLV  S  AT T                          ED  +        P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288

Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
           DD+++     ++ +S S S     + E   S G    + D     EEDS           
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331

Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
            +K++V +      T IY EQER +L  FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362


>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 134/469 (28%), Positives = 193/469 (41%), Gaps = 116/469 (24%)

Query: 33  LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           +P  PP  +E   ++   + +G       +K+C C+ S+CLKL                 
Sbjct: 38  IPPFPPISREHSETKDTTDQEGITW----RKRCRCKQSKCLKL----------------- 76

Query: 93  HLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
                       YC+CFA+G+ C  C+CV+CHNN ++  AR  AV   L RNPNAF  K+
Sbjct: 77  ------------YCDCFASGVLCTDCDCVDCHNNSDNYDARDAAVVNVLGRNPNAFNEKL 124

Query: 153 ASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            SS +  Q  A  D R   L+    +GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN
Sbjct: 125 FSSINDKQCKAAPDTRPGLLS----RGCKCKRTKCLKKYCECFQANVLCSDNCKCINCKN 180

Query: 212 ---------FEGSEERRALFHGDHNGIAFMQRAANAAIIG----AVGSSGYGTLMTSNKR 258
                    F      R L+ G  N     +   +  II     +VG + + T    N  
Sbjct: 181 VSEAFQPSVFAWGLNSRKLYEGFDNPEK--KFVCDLGIISPSEDSVGFNSHNTAGCMNYA 238

Query: 259 KSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSA---- 314
                  G +A +   +   +  Q     NS      S+P +   NA  LGS  S+    
Sbjct: 239 P------GFSAHNSPQVYRRRRHQELPEWNSCPAPLFSIPDNSIQNA--LGSPMSSSPKL 290

Query: 315 ----LRSPL--ADILQPQDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPI 368
                + PL  A  + P D+ ++CSLL+  S  AT              + +R  +    
Sbjct: 291 PYRKKKPPLGYASTVVP-DLGDICSLLLAASETAT----------ANAEDQNRICIK--- 336

Query: 369 QEGEDSRKGHEVHNGIPDDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCDEED 428
                           PDD+      +V   S SG+   ++ + R L     A    EE 
Sbjct: 337 ----------------PDDKVDNVYTEVLSESESGNVEEEIQSFRRLIELIDAQYNGEE- 379

Query: 429 SAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
                          S  +   +   T IY EQER +L  FRD L   I
Sbjct: 380 --------------HSKCKTETSVHETDIYMEQERAVLETFRDCLQTFI 414


>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 491

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 92/155 (59%), Gaps = 42/155 (27%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNC 120
           +K CNC+NSRCLKL                             YCECFA+G YCD  C C
Sbjct: 137 KKTCNCKNSRCLKL-----------------------------YCECFASGQYCDPSCKC 167

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
           V+C NN  ++ ARQ+AV ATLERNP AFRPKI S+          A E+   A+H+KGCH
Sbjct: 168 VSCSNNESNQEARQQAVDATLERNPAAFRPKINSA----------ATESW--ARHSKGCH 215

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
           C+KS C K+YCEC+QA I C+ +C+C+DCKN   S
Sbjct: 216 CRKSNCQKRYCECYQAGIQCTAHCKCVDCKNHSHS 250



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 28/43 (65%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C+CK S CLK YCECF +   C  +C+C+ C N E ++E R
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEAR 180


>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
 gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
          Length = 952

 Score =  138 bits (348), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 698 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 753

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 754 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 788

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 789 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 836

Query: 200 CSENCRCLDCKNFE 213
           CS  C+C+ C+N E
Sbjct: 837 CSSICKCVGCRNME 850


>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
           protein of 120 kDa
 gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
          Length = 950

 Score =  138 bits (347), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834

Query: 200 CSENCRCLDCKNFE 213
           CS  C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848


>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
          Length = 950

 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSRSQCLKL---- 751

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834

Query: 200 CSENCRCLDCKNFE 213
           CS  C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848


>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
           occidentalis]
          Length = 609

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 106/221 (47%), Gaps = 53/221 (23%)

Query: 6   QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP-RANTDGKD---GTPKK 61
           QA PQ Q  PQ++ +    P   H   +        +   S P +A+ D  D        
Sbjct: 341 QASPQSQ-HPQYLVVKQTTPQGQHSFAV--------QPQNSEPAQADYDAYDKQLSHTSS 391

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S+CLKL                             YC+CFA G +C  CNCV
Sbjct: 392 KKPCNCTKSQCLKL-----------------------------YCDCFANGEFCQNCNCV 422

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C+NN+EHE  R  AV   LERNPNAF PKI     G ++ R           H KGC+C
Sbjct: 423 QCYNNLEHEEERSLAVKLCLERNPNAFHPKIGKYKPGDKERR-----------HTKGCNC 471

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           K+SGCLK YCEC++A ILCS  CRC+ C N E + +  +L 
Sbjct: 472 KRSGCLKNYCECYEARILCSGVCRCVGCHNIEENMDAESLL 512


>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
 gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
          Length = 949

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 99/194 (51%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 695 LPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCNCSKSQCLKL---- 750

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 751 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 785

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 786 CLDRNPSAFKPKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIP 833

Query: 200 CSENCRCLDCKNFE 213
           C+  C+C+ C+N E
Sbjct: 834 CTSICKCVGCRNME 847


>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
           magnipapillata]
          Length = 530

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/200 (39%), Positives = 104/200 (52%), Gaps = 42/200 (21%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           +S +S  +AN      +   +K CNC  S+CLKL                          
Sbjct: 298 DSNKSASKANVGPITESKILKKPCNCTKSQCLKL-------------------------- 331

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YCECFA G +C+ CNC  C NN+ HEV R +A+ + L+RNP AF PKI     G   
Sbjct: 332 ---YCECFANGEFCNNCNCRICFNNIAHEVERSKAIKSCLDRNPYAFHPKI-----GKGK 383

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
            + D        +H KGC+C++SGCLK YCEC++A ILCS  C+C  CKNFE S +R+ L
Sbjct: 384 IKGDTER-----RHTKGCNCRRSGCLKNYCECYEAKILCSTLCKCSGCKNFEESADRKTL 438

Query: 222 FH-GDHNGIAFMQRAANAAI 240
               D   +  +Q+  NAAI
Sbjct: 439 MQLADAAEVRVLQQ--NAAI 456


>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
 gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
          Length = 944

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/210 (36%), Positives = 107/210 (50%), Gaps = 45/210 (21%)

Query: 6   QAMPQWQARPQHVRMVNRVP-HPVHKLPLPTLPPGK-QESPRSRPRANTDGKDGTPKKQK 63
           Q +P  Q+ P  V    ++P  P    P+PT P     + P +   A  D   G   ++K
Sbjct: 676 QQLPTEQSAPTKVE--PKLPTQPPGFNPVPTKPLFDILKQPATGAAATVDALGGMTSRRK 733

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNC 123
            CNC  S+CLKL                             YC+CFA G +C  C C +C
Sbjct: 734 HCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTCKDC 764

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+
Sbjct: 765 FNNMDYEVERERAIRSCLDRNPSAFQPKITAPNSG--DMRL----------HNKGCNCKR 812

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFE 213
           SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 813 SGCLKNYCECYEAKIPCTSICKCVGCRNME 842


>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
 gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
          Length = 433

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 136/302 (45%), Gaps = 51/302 (16%)

Query: 49  RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
           R  T  K   P ++K CNC  S+CLKL                             YC+C
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 189

Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
           FA G +C  CNC +CHNN++++  R +A+  +LERNPNAF+PKI  +  G  D       
Sbjct: 190 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTAD------- 242

Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
             +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G    
Sbjct: 243 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKCCQNTEAYRMTRFKNSGSLPS 300

Query: 229 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 288
              +    NA+   A   SG G++MT             A  D   +  PQ  +      
Sbjct: 301 TNALMSLTNASST-ATPDSGPGSVMTDE-----------AGDDYDDMMLPQKPKAEMDPR 348

Query: 289 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTK 348
                  +  V + A   ++   + AL +  ++ +Q +D K + ++  +V  E  + L +
Sbjct: 349 RFPWYYMTDEVVEAATMCMVAQAEEALNASFSEKMQTEDDK-IINMEKLVLREFGRCLEQ 407

Query: 349 KV 350
            +
Sbjct: 408 MI 409


>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
          Length = 407

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 104/207 (50%), Gaps = 40/207 (19%)

Query: 49  RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
           R  T  K   P ++K CNC  S+CLKL                             YC+C
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 164

Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
           FA G +C  CNC +CHNN++++  R +A+  +LERNPNAF+PKI  +  G  D       
Sbjct: 165 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIA-RGTAD------- 216

Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG-DHN 227
             +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G    
Sbjct: 217 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRFKNSGVSVP 274

Query: 228 GIAFMQRAANAAIIGAVGSSGYGTLMT 254
           G   +    NAA   A   SG G++MT
Sbjct: 275 GTNALISLTNAASSSATPDSGPGSVMT 301


>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
          Length = 437

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 77/207 (37%), Positives = 103/207 (49%), Gaps = 39/207 (18%)

Query: 49  RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
           R  T  K   P ++K CNC  S+CLKL                             YC+C
Sbjct: 161 RMKTKKKVYAPGQRKPCNCTKSQCLKL-----------------------------YCDC 191

Query: 109 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
           FA G +C  CNC +CHNN++++  R +A+  +LERNPNAF+PKI  +  G  D       
Sbjct: 192 FANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGTAD------- 244

Query: 169 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN- 227
             +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G    
Sbjct: 245 --IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRFKNSGGPGA 302

Query: 228 GIAFMQRAANAAIIGAVGSSGYGTLMT 254
           G   +    NA    A   SG G++MT
Sbjct: 303 GANALMSLTNANSSTATPDSGPGSVMT 329


>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
          Length = 605

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 67/154 (43%), Positives = 83/154 (53%), Gaps = 42/154 (27%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K +K CNC  S CLKL                             YCECFA G +CD CN
Sbjct: 375 KIRKPCNCTKSMCLKL-----------------------------YCECFANGHFCDSCN 405

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C+NCHNN+E +  R +A+ + LERNP AFRPKI       +DA            H KGC
Sbjct: 406 CINCHNNLEFDTDRSKAIKSCLERNPMAFRPKIGR----GRDANRT---------HQKGC 452

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           +CK+SGCLK YCEC++A I C+  C+C+ CKN E
Sbjct: 453 NCKRSGCLKNYCECYEARIPCTSKCKCIGCKNLE 486


>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
          Length = 356

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 144/320 (45%), Gaps = 77/320 (24%)

Query: 27  PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKL           
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YC+CFA+G+ C  C+CV+CHNN E   AR+ A+   L RNPN
Sbjct: 79  ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120

Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
           AF  K   S    Q  A  D +   L+    +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176

Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           C++CKN   SE  +      HN     +R  +  +                       LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
            +   D S+          N   S + CS  +P  +    S++G T + L          
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257

Query: 326 QDMKELCSLLVMVSSEATKT 345
            D+ +LCSLLV  S  AT T
Sbjct: 258 PDLGDLCSLLVAASESATTT 277


>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
 gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
          Length = 950

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 98/194 (50%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG +    ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 696 LPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCCNNLDYEVERERAIRS 786

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834

Query: 200 CSENCRCLDCKNFE 213
           C+  C+C+ C+N E
Sbjct: 835 CTSICKCVGCRNME 848


>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
 gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
          Length = 978

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 92/172 (53%), Gaps = 41/172 (23%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           +SP+      TD   G   ++K CNC  S+CLKL                          
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKL-------------------------- 778

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C  C C +C NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D
Sbjct: 779 ---YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--D 833

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
            R           HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C+N E
Sbjct: 834 MR----------LHNKGCNCKRSGCLKNYCECYEAKIPCSSICKCVGCRNME 875



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           +I  SP  A     DA      A   K C+C KS CLK YC+CF     C ++C C DC 
Sbjct: 742 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 798

Query: 211 N---FEGSEERRALFHGDHNGIAF 231
           N   +E   ER      D N  AF
Sbjct: 799 NNLDYEVERERAIRSCLDRNPSAF 822


>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 546

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 66/149 (44%), Positives = 86/149 (57%), Gaps = 33/149 (22%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K+CNC+ S+C+KL                             YCEC+AAG+YC+GC+CV+
Sbjct: 69  KKCNCKQSQCIKL-----------------------------YCECYAAGVYCNGCHCVS 99

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN + E  R +A+  TL +NP AF PKI+     A    +         KHNKGC CK
Sbjct: 100 CKNNDQSEGWRNQAIQNTLVKNPTAFLPKISQKQETASKVVDPLN----LGKHNKGCACK 155

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
           +SGC+KKYCECFQA + CSENC+C DC+N
Sbjct: 156 RSGCMKKYCECFQAGVPCSENCKCTDCQN 184



 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C+CK+S C+K YCEC+ A + C+  C C+ CKN + SE  R
Sbjct: 69  KKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSEGWR 110


>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
 gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
           protein 54
 gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
          Length = 435

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)

Query: 46  SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
           S  R  T  K   P ++K CNC  S+CLKL                             Y
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 188

Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           C+CFA G +C  CNC +CHNN+E++  R +A+  +LERNPNAF+PKI  +  G  D    
Sbjct: 189 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 244

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
                +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G 
Sbjct: 245 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 299

Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
              + N +  +  A++     A   SG G+++T
Sbjct: 300 AVSNTNALMSLTNASST----ATPDSGPGSVVT 328


>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
 gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
          Length = 429

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)

Query: 46  SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
           S  R  T  K   P ++K CNC  S+CLKL                             Y
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 182

Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           C+CFA G +C  CNC +CHNN+E++  R +A+  +LERNPNAF+PKI  +  G  D    
Sbjct: 183 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 238

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
                +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G 
Sbjct: 239 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 293

Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
              + N +  +  A++     A   SG G+++T
Sbjct: 294 AVSNTNALMSLTNASST----ATPDSGPGSVVT 322


>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
 gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
          Length = 725

 Score =  134 bits (338), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 71/185 (38%), Positives = 96/185 (51%), Gaps = 44/185 (23%)

Query: 42  ESPRSRPRANTDGKDGTPK---KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           ++ +  PRA T    G P    K++ CNC  S+CLKL                       
Sbjct: 432 DTSKDSPRALTTTVYGFPDEAYKKRPCNCTKSQCLKL----------------------- 468

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                 YC+CFA G YC  CNC +C N  +H+  RQ+A+ +TLERNPNAF+PKI     G
Sbjct: 469 ------YCDCFANGEYCYNCNCKDCFNTFDHDNERQKAIRSTLERNPNAFKPKI-----G 517

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           +  + +D         H KGC+CK+SGCLK YCEC++  I CS NC+C+   +F   +  
Sbjct: 518 SIGSTDDGTRL-----HTKGCNCKRSGCLKNYCECYEGKIACSSNCKCVGTPSFHLLQSH 572

Query: 219 RALFH 223
             LFH
Sbjct: 573 --LFH 575



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERRALFHGDHNGIAF 231
           A   + C+C KS CLK YC+CF     C  NC C DC N F+   ER+          + 
Sbjct: 452 AYKKRPCNCTKSQCLKLYCDCFANGEYC-YNCNCKDCFNTFDHDNERQKAIR------ST 504

Query: 232 MQRAANA--AIIGAVGSSGYGT 251
           ++R  NA    IG++GS+  GT
Sbjct: 505 LERNPNAFKPKIGSIGSTDDGT 526


>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
 gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
          Length = 879

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 86/153 (56%), Gaps = 41/153 (26%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
           ++K CNC  S+CLKL                             YC+CFA G +C  C C
Sbjct: 663 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTC 693

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
            +C NN+++EV R+EA+ + LERNP+AF+PKI +   G  D R           HNKGC+
Sbjct: 694 KDCFNNLDYEVKREEAIRSCLERNPSAFKPKITAPNSG--DIRL----------HNKGCN 741

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           CK+SGCLK YCEC++A I CS  C+C+ C+N E
Sbjct: 742 CKRSGCLKNYCECYEAKIPCSTMCKCVGCRNME 774



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C+C KS CLK YC+CF     C ++C C DC N    E +R
Sbjct: 665 KHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKR 706


>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
          Length = 999

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 99/215 (46%), Gaps = 73/215 (33%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           K+  CNC+ S+CLKL                             YCECFA+G YCD  CN
Sbjct: 563 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 593

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-----------GAQDAREDARE 168
           C++C N    E  RQ+A+ A LE+NPNAF+PKI ++P            GA      +  
Sbjct: 594 CLDCSNTPATEEVRQQAIAARLEKNPNAFKPKIGATPGMTPGSARRLSVGASGGSHASPG 653

Query: 169 AQLAAK---------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCL 207
             L+                       H  GCHCKKS C KKYCECFQA + C ENCRC+
Sbjct: 654 TFLSPSGRLNLQQQHQLLSAGLLSTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCI 713

Query: 208 DCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIG 242
           DCKN       +A      NG+A     A +A++G
Sbjct: 714 DCKN-------QAPCVAHANGVA----TAGSAVVG 737


>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
          Length = 705

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/171 (45%), Positives = 96/171 (56%), Gaps = 42/171 (24%)

Query: 53  DGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAG 112
           DG    P+K   CNC  S+CLKL                             YC+CFA G
Sbjct: 475 DGSGSRPRKP--CNCTKSQCLKL-----------------------------YCDCFANG 503

Query: 113 IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
            +C  CNC NC NN++HE  R +AV A L+RNP+AF PKI    HG+Q  R         
Sbjct: 504 EFCRNCNCNNCLNNLDHEDERTKAVKACLDRNPHAFHPKIGKG-HGSQTNR--------- 553

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
            +HNKGC+CK+SGCLK YCEC++A ILCS  C+C+ CKNFE S ER+ L H
Sbjct: 554 -RHNKGCNCKRSGCLKNYCECYEAKILCSNFCKCVGCKNFEESPERKTLMH 603


>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
          Length = 1001

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 100/199 (50%), Gaps = 39/199 (19%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ R +  +   DG     K+CNC+ S+CLKL                         
Sbjct: 547 QPSPKKRRKKASSTNDG--DGCKRCNCKKSKCLKL------------------------- 579

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YC+CFAAGIYC +GC CV C N  E++    E       RNP AF PKI    +G+
Sbjct: 580 ----YCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGS 635

Query: 160 Q-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
             ++ ED R    +A+H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  CKN  G +E 
Sbjct: 636 PINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEE 695

Query: 219 RALFHGDHNGIAFMQRAAN 237
              F         M R AN
Sbjct: 696 YGAFK------EMMSRRAN 708


>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
           CCMP2712]
          Length = 118

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 77/153 (50%), Positives = 92/153 (60%), Gaps = 38/153 (24%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           K+KQCNC+NS+CLKL                             YCECFA+G YC+  CN
Sbjct: 2   KRKQCNCKNSKCLKL-----------------------------YCECFASGSYCNLSCN 32

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS-PHGAQDAREDAREAQLAAKHNKG 178
           C  C NN  +E  R+ A+ ATLERNP AF+PKIASS P G +D+    R       H KG
Sbjct: 33  CQLCQNNENYEKQRKSAIDATLERNPLAFQPKIASSAPPGIEDSPVQGR-------HTKG 85

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           CHCKKSGCLKKYCECFQA +LCS  C+C +C+N
Sbjct: 86  CHCKKSGCLKKYCECFQAGVLCSAQCKCHECRN 118



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 3/69 (4%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE-ERRALFHG--DHNGIAFMQ 233
           K C+CK S CLK YCECF +   C+ +C C  C+N E  E +R++      + N +AF  
Sbjct: 4   KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQP 63

Query: 234 RAANAAIIG 242
           + A++A  G
Sbjct: 64  KIASSAPPG 72


>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 983

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/181 (39%), Positives = 89/181 (49%), Gaps = 53/181 (29%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           K+  CNC+ S+CLKL                             YCECFA G YCD  CN
Sbjct: 583 KKAPCNCKKSKCLKL-----------------------------YCECFANGGYCDENCN 613

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK----- 174
           C++C N +E    RQ AV + LE+NPNAF+PKI  S   A  +  +A  +  +       
Sbjct: 614 CLDCSNTLERIDERQAAVASRLEKNPNAFKPKILPSQSSAVKSLYEALSSTKSTSSGVFP 673

Query: 175 ------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
                             H  GCHCKKS C KKYCECFQA +LC +NCRC+DC+N E  +
Sbjct: 674 MRRSTADGHRTPTGKKRMHKDGCHCKKSACQKKYCECFQAGVLCGDNCRCIDCRNVEKKK 733

Query: 217 E 217
           E
Sbjct: 734 E 734


>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 89/182 (48%), Gaps = 60/182 (32%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           K+  CNC+ S+CLKL                             YCECFA+G YCD  CN
Sbjct: 560 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 590

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-------GAQDAREDAR----- 167
           C++C N    E  RQ+A+ + LE+NPNAF+PKI ++         GA+    D R     
Sbjct: 591 CLDCANTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTSGGARRLSADRRASTGT 650

Query: 168 ----------------EAQLAAK--HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
                            A +A    H  GCHCKKS C KKYCECFQA + C ENCRC+DC
Sbjct: 651 FLSPPGQLSLQQRQLLSAGMATTKMHKHGCHCKKSACQKKYCECFQAGVSCGENCRCIDC 710

Query: 210 KN 211
           KN
Sbjct: 711 KN 712



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 30/49 (61%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           + A     C+CKKS CLK YCECF +   C E+C CLDC N   +E+ R
Sbjct: 556 IPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVR 604


>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
          Length = 709

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
 gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
          Length = 980

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 63/153 (41%), Positives = 85/153 (55%), Gaps = 41/153 (26%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
           ++K CNC  S+CLKL                             YC+CFA G +C  C C
Sbjct: 765 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTC 795

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
            +C NN+++EV R+ A+ + LERNP+AF+PKI +   G  D R           HNKGC+
Sbjct: 796 KDCFNNLDYEVKRERAIRSCLERNPSAFKPKITAPNSG--DMRL----------HNKGCN 843

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           CK+SGCLK YCEC++A I CS  C+C+ C+N E
Sbjct: 844 CKRSGCLKNYCECYEAKIPCSAMCKCVGCRNME 876



 Score = 45.4 bits (106), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           L     K C+C KS CLK YC+CF     C ++C C DC N    E +R
Sbjct: 761 LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCFNNLDYEVKR 808


>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
 gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
          Length = 950

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 76/220 (34%), Positives = 108/220 (49%), Gaps = 54/220 (24%)

Query: 6   QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANT------------D 53
           QA  + + R Q +      P  V    LPTLP G + +  ++P  +             D
Sbjct: 671 QAQAKQRIRQQQLPTEQNAPTKVEP-KLPTLPTGLKANAPAKPLFDILKPPATGAAAAVD 729

Query: 54  GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
              G   ++K CNC  S+CLKL                             YC+CFA G 
Sbjct: 730 ALGGMTSRRKHCNCSKSQCLKL-----------------------------YCDCFANGE 760

Query: 114 YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
           +C  C C +C NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D R          
Sbjct: 761 FCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL--------- 809

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
            HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 810 -HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848


>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
 gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
 gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
 gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
 gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
          Length = 749

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 516 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 546

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H
Sbjct: 547 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 595

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 596 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 655

Query: 235 AA 236
            A
Sbjct: 656 TA 657


>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 89/182 (48%), Gaps = 60/182 (32%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           K+  CNC+ S+CLKL                             YCECFA+G YCD  CN
Sbjct: 560 KKAPCNCKKSKCLKL-----------------------------YCECFASGGYCDESCN 590

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH-------GAQDAREDAR----- 167
           C++C N    E  RQ+A+ + LE+NPNAF+PKI ++         GA+    D R     
Sbjct: 591 CLDCTNTTAAEDVRQQAIASRLEKNPNAFKPKIGATSTMVTVTSGGARRLSADRRASTGT 650

Query: 168 ----------------EAQLA--AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
                            A +A    H  GCHCKKS C KKYCECFQA + C ENCRC+DC
Sbjct: 651 FLSPPGQLSLQQRQLLSAGMANTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDC 710

Query: 210 KN 211
           KN
Sbjct: 711 KN 712



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 30/49 (61%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           + A     C+CKKS CLK YCECF +   C E+C CLDC N   +E+ R
Sbjct: 556 IPAPKKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVR 604


>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
          Length = 460

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/194 (36%), Positives = 96/194 (49%), Gaps = 40/194 (20%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K ++ CNC  S+CLKL                             YCECFA G +C  C 
Sbjct: 262 KSKRPCNCTKSQCLKL-----------------------------YCECFANGEFCSSCK 292

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC NN EH   R +AV A L+RNP AFRPKI S            ++  +   H KGC
Sbjct: 293 CTNCFNNTEHVFERSQAVKACLDRNPGAFRPKIGSR-----------KQGNVKGCHTKGC 341

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 239
           +CK+SGCLK YCEC++A I+C+  C+C+ C+N++   + +     D + I F  R   + 
Sbjct: 342 NCKRSGCLKNYCECYEAKIMCTSTCKCVGCRNYDEGCDMKKSSEQDRHDIYFPTRCPVSV 401

Query: 240 IIGAVGSSGYGTLM 253
           I   V  +  G L+
Sbjct: 402 ITRDVVEATCGCLL 415


>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
          Length = 793

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 560 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 590

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H
Sbjct: 591 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 639

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 640 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 699

Query: 235 AA 236
            A
Sbjct: 700 TA 701


>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
          Length = 749

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
          Length = 749

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
 gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
 gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
 gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
           domain-containing protein 1
 gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
 gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
 gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
 gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
 gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
 gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
 gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
          Length = 749

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
          Length = 749

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
           gorilla]
          Length = 749

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
          Length = 448

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/163 (41%), Positives = 87/163 (53%), Gaps = 40/163 (24%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S CLKL                             YC+CFA G +C  CNC 
Sbjct: 221 KKPCNCTKSMCLKL-----------------------------YCDCFANGEFCKDCNCK 251

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +CHNN+++E  R  A+ A+LERNPNAF+PKI  +  G  D+            H KGCHC
Sbjct: 252 DCHNNLDNEAERSRAIKASLERNPNAFKPKIGVASRGRVDSER---------LHQKGCHC 302

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 224
           KKS CLK YCEC++A + C+E C+C  C+N E   +R + F G
Sbjct: 303 KKSNCLKNYCECYEAKVPCTERCKCSSCRNTES--DRASKFRG 343



 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 46/92 (50%), Gaps = 11/92 (11%)

Query: 137 VGATLERNPNAFRPKIAS-------SPH-GAQDAREDAREAQLAAKHNKGCHCKKSGCLK 188
           V AT  R P AFR   A+       +P   A+   +  R++ +  K  K C+C KS CLK
Sbjct: 176 VQATEPRVPIAFRTAEAAYQQNRLLTPFLAARMVHQKKRQSGIGIK--KPCNCTKSMCLK 233

Query: 189 KYCECFQANILCSENCRCLDCKNFEGSEERRA 220
            YC+CF     C ++C C DC N   +E  R+
Sbjct: 234 LYCDCFANGEFC-KDCNCKDCHNNLDNEAERS 264


>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
          Length = 749

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
          Length = 749

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
          Length = 807

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 77/190 (40%), Positives = 100/190 (52%), Gaps = 41/190 (21%)

Query: 48  PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
           P   T   +  P+ +K CNC  S+CLKL                             YC+
Sbjct: 566 PLNGTSTTETAPRPRKPCNCTKSQCLKL-----------------------------YCD 596

Query: 108 CFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
           CFA G +C+ CNC NC NN+EHE  R +A+   L+RNP AF+PKI     G  D R    
Sbjct: 597 CFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR---- 652

Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDH 226
                  H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D 
Sbjct: 653 -------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADA 705

Query: 227 NGIAFMQRAA 236
             +   Q+ A
Sbjct: 706 AEVRVQQQTA 715


>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
          Length = 749

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
          Length = 748

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 519 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 549

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 550 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 598

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 599 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656


>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
 gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
 gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
          Length = 748

 Score =  132 bits (331), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 519 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 549

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 550 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 598

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 599 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 656


>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
          Length = 284

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 61/117 (52%), Positives = 83/117 (70%), Gaps = 11/117 (9%)

Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC+CF +G+YC   C CV C+NN +H+V R+ A+ + LE+NP+AF+PK+A+     +
Sbjct: 23  LKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQPKVAT----FR 78

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           D+        L  KHNKGC+C+K+ CLKKYCECFQ+ +LCSE C+C  CKNFEGS E
Sbjct: 79  DS------GGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKNFEGSLE 129



 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           KGC C KS CLK YC+CF + + C  +C C+ C N E  +  R
Sbjct: 13  KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVER 55


>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
          Length = 748

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 749

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
 gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
          Length = 749

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
 gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
          Length = 587

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 78/227 (34%), Positives = 111/227 (48%), Gaps = 63/227 (27%)

Query: 5   SQAMPQWQARPQHVRMVNR------------------VPHPVHKLPLPTLPPGKQESPRS 46
           +QA  + +AR QH++ + +                  VP P +K     L P    S  S
Sbjct: 294 AQAQIRQRARQQHLQQLQKAKPQLKGSQLNQSANVKPVPAPANKPNFEILKP--PSSAAS 351

Query: 47  RPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
            P    D +     ++K CNC  S+CLKL                             YC
Sbjct: 352 TPAV--DAQLNLASRRKHCNCSKSQCLKL-----------------------------YC 380

Query: 107 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
           +CFA G +C  C C +C NN+++EV R++A+ + L+RNP+AF+PKI +   G  D R   
Sbjct: 381 DCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNPSAFKPKITAPASG--DMRL-- 436

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                   HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C+N E
Sbjct: 437 --------HNKGCNCKRSGCLKNYCECYEAKIPCSSMCKCVGCRNME 475



 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 36/74 (48%), Gaps = 5/74 (6%)

Query: 162 AREDAREAQL-AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERR 219
           A   A +AQL  A   K C+C KS CLK YC+CF     C ++C C DC  N +   ER 
Sbjct: 350 ASTPAVDAQLNLASRRKHCNCSKSQCLKLYCDCFANGEFC-KDCTCKDCFNNLDNEVERE 408

Query: 220 ALFHG--DHNGIAF 231
                  D N  AF
Sbjct: 409 KAIRSCLDRNPSAF 422


>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
          Length = 749

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
 gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
 gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
          Length = 741

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 101/182 (55%), Gaps = 43/182 (23%)

Query: 45  RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           +++PR   +G    +   + +K CNC  S CLKL                          
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKL-------------------------- 527

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D
Sbjct: 528 ---YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESD 584

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L
Sbjct: 585 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 633

Query: 222 FH 223
            H
Sbjct: 634 MH 635


>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
 gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
 gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
          Length = 749

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
 gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
          Length = 749

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
          Length = 749

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
           familiaris]
          Length = 749

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
          Length = 749

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQRAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
          Length = 813

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 584 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 614

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 615 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 663

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 664 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 721


>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
          Length = 827

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 598 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 628

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 629 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 677

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 678 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 735


>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
 gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
          Length = 921

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 85/153 (55%), Gaps = 41/153 (26%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
           ++K CNC  S+CLKL                             YC+CFA G +C  C C
Sbjct: 706 RRKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQNCTC 736

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
            +C NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D R           HNKGC+
Sbjct: 737 KDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL----------HNKGCN 784

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 785 CKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 817



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 32/64 (50%), Gaps = 4/64 (6%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFHGDHN 227
           L A   K C+C KS CLK YC+CF     C +NC C DC N   +E   ER      D N
Sbjct: 702 LGASRRKHCNCSKSQCLKLYCDCFANGEFC-QNCTCKDCFNNLDYEVERERAIRSCLDRN 760

Query: 228 GIAF 231
             AF
Sbjct: 761 PSAF 764


>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
 gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
           gorilla]
 gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
          Length = 660

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
          Length = 660

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
          Length = 659

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 430 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 460

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 461 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 509

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 510 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 567


>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
          Length = 1094

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/177 (41%), Positives = 91/177 (51%), Gaps = 37/177 (20%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YC+CFAAGIYC +GC CV
Sbjct: 574 KRCNCKKSKCLKL-----------------------------YCDCFAAGIYCAEGCACV 604

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCH 180
            C N  E++    E       RNP AF PKI    +G+  ++ ED R    +A+H +GC+
Sbjct: 605 GCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINSGEDGRSTPSSARHKRGCN 664

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
           CKKS CLKKYCEC+QAN+ CS  CRC  CKN  G +E    F         M R AN
Sbjct: 665 CKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEEYGAFK------EMMSRRAN 715



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 47/106 (44%), Gaps = 21/106 (19%)

Query: 134 QEAVGATLERNPN----AFRPKIASSPHGAQDAREDAREAQLA---AKHN---------- 176
           + + G    RN N    A     + S HG Q  +    +A L+     HN          
Sbjct: 510 ENSAGTEDRRNENKASIAMSSATSQSSHGIQPTKPKKEKASLSFHVISHNFDFSASSTND 569

Query: 177 ----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
               K C+CKKS CLK YC+CF A I C+E C C+ C N    ++R
Sbjct: 570 GDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDR 615


>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
          Length = 660

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
          Length = 660

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
          Length = 763

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 534 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 564

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 565 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 613

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 614 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 671


>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 660

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
          Length = 660

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
          Length = 408

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/191 (39%), Positives = 99/191 (51%), Gaps = 48/191 (25%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           T+PP   +  +S+P++  +     P+K   CNC+NS+CLKL                   
Sbjct: 155 TIPPPSHQ--QSKPKSGANLASNKPRKP--CNCKNSQCLKL------------------- 191

Query: 95  NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
                     YC+CFA G +C D CNC NC N+ +HE  R  AV A LERNPNAF+PK+ 
Sbjct: 192 ----------YCDCFANGEFCRDSCNCQNCKNSFQHEGDRSRAVKACLERNPNAFKPKVG 241

Query: 154 SSPHG-AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
              HG   D R          +H KGC+CKKS CLK YCEC++A I CS  CRC+ C+N 
Sbjct: 242 ---HGRVGDER----------RHIKGCNCKKSSCLKNYCECYEAKIPCSHLCRCVGCQNL 288

Query: 213 EGSEERRALFH 223
               E + L  
Sbjct: 289 ADRPEGKGLMQ 299



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 151 KIASSPHGAQDAREDAREAQLAA-KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           K A+ P  +    +    A LA+ K  K C+CK S CLK YC+CF     C ++C C +C
Sbjct: 152 KTATIPPPSHQQSKPKSGANLASNKPRKPCNCKNSQCLKLYCDCFANGEFCRDSCNCQNC 211

Query: 210 KN---FEGSEERRALFHGDHNGIAFMQRAANAAI 240
           KN    EG   R      + N  AF  +  +  +
Sbjct: 212 KNSFQHEGDRSRAVKACLERNPNAFKPKVGHGRV 245


>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
           familiaris]
          Length = 660

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
          Length = 660

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 431 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 461

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 462 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 510

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 511 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
          Length = 621

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 392 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 422

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 423 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 471

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 472 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 529


>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 560

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 331 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 361

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 362 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 410

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 411 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 468


>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
          Length = 515

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 282 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 312

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H
Sbjct: 313 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 361

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 362 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 421

Query: 235 AA 236
            A
Sbjct: 422 TA 423


>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
          Length = 804

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 575 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 605

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 606 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 654

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 655 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 712


>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
          Length = 756

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 527 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 557

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 558 CNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 606

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 607 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 664


>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
          Length = 752

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 523 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 553

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 554 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 602

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 603 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 660


>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
          Length = 756

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 527 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 557

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 558 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 606

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 607 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 664


>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
           carolinensis]
          Length = 749

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CANCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
          Length = 751

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 522 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 552

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 553 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 601

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 602 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 645


>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
          Length = 757

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 528 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 558

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 559 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 607

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 608 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 651


>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
          Length = 821

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 75/182 (41%), Positives = 98/182 (53%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S+CLKL                             YC+CFA G +C
Sbjct: 588 ETTSRPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFC 618

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC NN+EHE  R +A+   L+RNP AF+PKI     G  D R           H
Sbjct: 619 NMCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR-----------H 667

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 668 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 727

Query: 235 AA 236
            A
Sbjct: 728 TA 729


>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
          Length = 528

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
 gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
 gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
 gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
 gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
 gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
 gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
 gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
 gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
 gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
 gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
 gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
          Length = 528

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
          Length = 528

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
           garnettii]
          Length = 544

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 64/161 (39%), Positives = 83/161 (51%), Gaps = 40/161 (24%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K C C  S CLKLD                             C+CFA G + + CNC +
Sbjct: 345 KPCECVISLCLKLD-----------------------------CDCFANGEFYNNCNCTS 375

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
            +N++EHE  RQ+A+ A L+RNP AF+PK               +E      H+KGC+CK
Sbjct: 376 YYNSLEHENERQKAIKACLDRNPEAFKPKTG-----------KGKEEDSDXHHSKGCNCK 424

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
            SGCLK YCEC++A I+CS  C+C  CKNFE   ER+ L H
Sbjct: 425 XSGCLKNYCECYEAKIMCSSICKCKGCKNFEEXPERKTLMH 465


>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
          Length = 528

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 521

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 292 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 322

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 323 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 371

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 372 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 429


>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 528

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
          Length = 775

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/190 (40%), Positives = 99/190 (52%), Gaps = 41/190 (21%)

Query: 48  PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
           P       + T + +K CNC  S+CLKL                             YC+
Sbjct: 534 PLNGLSAAETTSRPRKPCNCTKSQCLKL-----------------------------YCD 564

Query: 108 CFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
           CFA G +C+ CNC NC NN+EHE  R +A+   L+RNP AF+PKI     G  D R    
Sbjct: 565 CFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESDRR---- 620

Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDH 226
                  H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D 
Sbjct: 621 -------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADA 673

Query: 227 NGIAFMQRAA 236
             +   Q+ A
Sbjct: 674 AEVRVQQQTA 683


>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
          Length = 528

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
          Length = 527

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 298 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 328

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 329 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 377

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 378 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 435


>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
           familiaris]
 gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
          Length = 528

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 902

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 69/166 (41%), Positives = 87/166 (52%), Gaps = 40/166 (24%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K+C+C+NS+CLK+                             YCECFAA + C+GC C  
Sbjct: 360 KKCHCKNSKCLKM-----------------------------YCECFAAKVLCNGCLCFG 390

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN  +    + A   TLERNP+AF PKI  S               +  KH+KGCHC+
Sbjct: 391 CQNNEANIEKVERARFQTLERNPDAFNPKIKPS-------NSKINNQIIIDKHSKGCHCR 443

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 228
           KS CLKKYCECFQA+I C+ENC+C DCKN    +    L H D+ G
Sbjct: 444 KSSCLKKYCECFQASIPCNENCKCYDCKN----QAHHHLHHPDNGG 485


>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Saccoglossus kowalevskii]
          Length = 946

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 95/182 (52%), Gaps = 53/182 (29%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           E+P +RPR             K CNC  S+CLKL                          
Sbjct: 495 EAPGTRPR-------------KPCNCTKSQCLKL-------------------------- 515

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C  CNC NC NN+EHE  R +A+ + LERNP AF PKI     G  D
Sbjct: 516 ---YCDCFANGEFCSNCNCTNCFNNLEHESERAKAIKSCLERNPLAFHPKIGKGKEGQAD 572

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
            R           HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S +R+ L
Sbjct: 573 RR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPDRKTL 621

Query: 222 FH 223
            H
Sbjct: 622 MH 623


>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
          Length = 667

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/164 (43%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 438 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 468

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN++HE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 469 CTNCYNNLDHENDRQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 517

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 518 NCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMH 561


>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
          Length = 503

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 57/124 (45%), Positives = 82/124 (66%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ C+  NC NN+ HE+ R +A+ A L+RNP AF+PKI +   G
Sbjct: 296 KLTLAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGTGNLG 355

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN++ S ER
Sbjct: 356 -----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDESPER 404

Query: 219 RALF 222
           + L 
Sbjct: 405 KTLM 408


>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
          Length = 494

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 265 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 295

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 296 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 344

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 345 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 402


>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
 gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
 gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
          Length = 771

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S+CLKL                             YC+CFA G +C+ CN
Sbjct: 543 RPRKPCNCTRSQCLKL-----------------------------YCDCFANGEFCNNCN 573

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CVNC NN++HE  R +A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 574 CVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRR-----------HSKGC 622

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CKKSGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 623 NCKKSGCLKNYCECYEAKIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680


>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
           [Cucumis sativus]
          Length = 581

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/161 (40%), Positives = 86/161 (53%), Gaps = 30/161 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ SRCLKL                             YCECFAAG+YC + C+C 
Sbjct: 287 KRCNCKKSRCLKL-----------------------------YCECFAAGVYCIEPCSCQ 317

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    E       RNP AF PK+  +     + R+D+ +   +A+H +GC+C
Sbjct: 318 DCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELRDDSNKTPASARHKRGCNC 377

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           KKSGCLKKYCEC+Q  + CS NCRC  CKN  G ++  AL 
Sbjct: 378 KKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALL 418


>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
          Length = 495

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 266 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 296

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 297 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 345

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 346 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 389


>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
 gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
          Length = 602

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/161 (40%), Positives = 86/161 (53%), Gaps = 30/161 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ SRCLKL                             YCECFAAG+YC + C+C 
Sbjct: 308 KRCNCKKSRCLKL-----------------------------YCECFAAGVYCIEPCSCQ 338

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    E       RNP AF PK+  +     + R+D+ +   +A+H +GC+C
Sbjct: 339 DCFNKPIHEAMVLETRRQIESRNPLAFAPKVIMNCDLISELRDDSNKTPASARHKRGCNC 398

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           KKSGCLKKYCEC+Q  + CS NCRC  CKN  G ++  AL 
Sbjct: 399 KKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALL 439


>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
 gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
 gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
          Length = 419

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 186 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 216

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H
Sbjct: 217 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 265

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 266 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 325

Query: 235 AA 236
            A
Sbjct: 326 TA 327


>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 684

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/164 (42%), Positives = 89/164 (54%), Gaps = 38/164 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 390 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 420

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  S     +A +DA +   +A+H +
Sbjct: 421 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRSADSIMEAGDDASKTPASARHKR 476

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G +E   L
Sbjct: 477 GCNCKKSNCLKKYCECYQGGVGCSVNCRCEGCKNVFGRKEGSLL 520


>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
 gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
 gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
 gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
          Length = 475

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 369 NYEESPERKMLMSTPH 384


>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
          Length = 438

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 209 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 239

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 240 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 288

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 289 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 332


>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
          Length = 475

 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 369 NYEESPERKMLMSTPH 384


>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
          Length = 614

 Score =  125 bits (314), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 73/169 (43%), Positives = 93/169 (55%), Gaps = 39/169 (23%)

Query: 55  KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
           ++   + +K CNC  S+CLKL                             YC+CFA G +
Sbjct: 386 QNAAARPRKACNCTKSQCLKL-----------------------------YCDCFANGEF 416

Query: 115 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
           C  CNCVNC NN+EHE  R  ++   LERNP AF PKI  S  G  D R          +
Sbjct: 417 CSNCNCVNCFNNLEHEEERSRSIKTCLERNPQAFHPKIGKSKAGQGDIR----------R 466

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           HNKGC+CK+SGCLK YCEC++A ILC+  C+C+ CKNFE S +R+ L H
Sbjct: 467 HNKGCNCKRSGCLKNYCECYEAKILCTSLCKCVGCKNFEESPDRKTLMH 515



 Score = 38.9 bits (89), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 19/30 (63%)

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECF 194
            A E   AA+  K C+C KS CLK YC+CF
Sbjct: 382 SAAEQNAAARPRKACNCTKSQCLKLYCDCF 411


>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1852

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 105/220 (47%), Gaps = 57/220 (25%)

Query: 24   VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWIN 83
             P  + K PL    PGK+         N +G       + +CNC+ S+CLKL        
Sbjct: 1481 FPSSIVKTPLAV--PGKE---------NPNGITPMSTPRNRCNCKKSQCLKL-------- 1521

Query: 84   IYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 143
                                 YCECF+A + C+GCNC +C N       R +A+   L +
Sbjct: 1522 ---------------------YCECFSAEVMCEGCNCTDCRNLPAFAAVRAKAMTDCLNK 1560

Query: 144  NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
            N +AF+P+IA++P GA D +           H+ GC CKKS CLKKYCECFQA +LC   
Sbjct: 1561 NAHAFKPRIAAAP-GAADLQ----------GHSMGCKCKKSECLKKYCECFQAGVLCGLK 1609

Query: 204  CRCLDCKNFEGS----EERRALFHGDHNGIAFMQRAANAA 239
            C+C  CKN  GS    ++RR +   D +G  F  R +  A
Sbjct: 1610 CKCESCKNIAGSQKLIDKRRQM--KDTSGAEFAMRVSQEA 1647


>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
           both male and female reproductiv [Ectocarpus
           siliculosus]
          Length = 735

 Score =  125 bits (313), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 65/155 (41%), Positives = 78/155 (50%), Gaps = 43/155 (27%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+ S+CLKL                             YCECF    YC+GCNCV C 
Sbjct: 440 CNCKKSKCLKL-----------------------------YCECFQRQQYCNGCNCVECL 470

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           N    E  RQ A+  T+ERNP AF  K         + R   R       HN GC+CKKS
Sbjct: 471 NTERTEDLRQLAIQGTIERNPQAFVSKF--------ERRAGKRS------HNAGCNCKKS 516

Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            CLKKYCECFQA + C  NC+C++CKN+EG+   R
Sbjct: 517 ACLKKYCECFQAGVACGTNCKCVNCKNYEGAAGGR 551



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/43 (53%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           C+CKKS CLK YCECFQ    C+  C C++C N E +E+ R L
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCN-GCNCVECLNTERTEDLRQL 481


>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
          Length = 419

 Score =  125 bits (313), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 190 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 220

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 221 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 269

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 270 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 313


>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
 gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
          Length = 818

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 91/168 (54%), Gaps = 41/168 (24%)

Query: 46  SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
           SR   N+D     P+K   CNC  S+CLKL                             Y
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKL-----------------------------Y 611

Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           C+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI       +D R  
Sbjct: 612 CDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR-- 669

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                   +HNKGC+CK+SGCLK YCEC++A I CSENC+C+ C+N E
Sbjct: 670 --------RHNKGCNCKRSGCLKNYCECYEAKIPCSENCKCIGCRNVE 709


>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 542

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 100/206 (48%), Gaps = 43/206 (20%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K++K C C+NS CLKL                             YCECFAAG  C  C+
Sbjct: 228 KERKSCKCKNSMCLKL-----------------------------YCECFAAGQLCSNCS 258

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC N+ +HE    EA  A L RNP AF PK+ + P      +ED     LA KH KGC
Sbjct: 259 CQNCLNDEDHEKEVSEARNAILLRNPAAFEPKMTAVP------KEDG---ALAIKHQKGC 309

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE--GSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGC K YCECF A + CS  C+C DC N E   SE+   L H    +GI    R A
Sbjct: 310 NCKRSGCQKNYCECFHAGVFCSNICKCNDCGNREDDNSEKSTNLSHILSDSGIGI--RGA 367

Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEE 262
            + + G      +   M S +++  E
Sbjct: 368 LSPLRGETNGYQHLLFMESPRKRWRE 393


>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 707

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 92/203 (45%), Gaps = 74/203 (36%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC+NS+CLKL                             YC+CFAAG YC+GC+
Sbjct: 99  RPRKPCNCKNSKCLKL-----------------------------YCDCFAAGTYCNGCH 129

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------------------------- 152
           C NC N   ++  RQ A+  +LERNP+AFR K+                           
Sbjct: 130 CTNCLNLPAYDTLRQNAIRTSLERNPHAFRSKVVTLEYDTSSKDTASRQNCAATAAAAKS 189

Query: 153 -------ASSPHGAQDAREDAREAQLA-----------AKHNKGCHCKKSGCLKKYCECF 194
                  +S P   Q   E+A                 A+H KGC C +S CLKKYCECF
Sbjct: 190 ADVAPEDSSEPVQEQVPNEEASSTSTQPRTTGTTTQRHAQHIKGCSCLRSLCLKKYCECF 249

Query: 195 QANILCSENCRCLDCKNFEGSEE 217
           Q  + CS +CRC +C+NFEGS E
Sbjct: 250 QNGVYCSASCRCSNCRNFEGSAE 272



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           +A E     +  K C+CK S CLK YC+CF A   C+  C C +C N 
Sbjct: 90  NALEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCHCTNCLNL 136



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 4/44 (9%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHN---NVEHEVARQEAVGATL 141
           L KYCECF  G+YC   C C NC N   + E   AR+  V A L
Sbjct: 242 LKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARERLVLAEL 285


>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
 gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
          Length = 458

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 100/187 (53%), Gaps = 44/187 (23%)

Query: 41  QESPRSRPRANTDGKDGT----PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNT 96
           Q +  ++PR   +G   T     + +K CNC  S+CLKL                     
Sbjct: 210 QATTATQPRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKL--------------------- 248

Query: 97  QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
                   YC+CFA G +C+ CNC NC NN+EHE  R +++ A LERNP AF PKI    
Sbjct: 249 --------YCDCFANGEFCNNCNCNNCFNNLEHEAERAKSIKACLERNPYAFHPKIGKGK 300

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
            G  D R           HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S 
Sbjct: 301 EGEGDRR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESP 349

Query: 217 ERRALFH 223
           ER+ L H
Sbjct: 350 ERKTLMH 356


>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
 gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific
          Length = 475

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 368

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 369 NYEESPERKMLMSTPH 384


>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
           norvegicus]
          Length = 467

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 256 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 311

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CK
Sbjct: 312 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 360

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 361 NYEESPERKMLMSTPH 376


>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
          Length = 528

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+C A G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCCANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
          Length = 764

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/144 (47%), Positives = 88/144 (61%), Gaps = 14/144 (9%)

Query: 94  LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
           L+   K N   YC+CFA G +C  CNC NC NN++HE  R  A+ + L+RNP AF PKI 
Sbjct: 520 LSANLKQN--GYCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIG 577

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
               G +D R           HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE
Sbjct: 578 KGRDGERDRR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFE 626

Query: 214 GSEERRALFH-GDHNGIAFMQRAA 236
            S ER+ L H  D   +   Q+ A
Sbjct: 627 ESPERKTLMHLADAAEVRVQQQTA 650


>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
          Length = 597

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 80/150 (53%), Gaps = 41/150 (27%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K C+C+NS+CLKL                             YCECFA  + C+GC+C 
Sbjct: 341 RKGCHCKNSKCLKL-----------------------------YCECFANKLLCNGCHCF 371

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            CHNN  +    Q +    LERNP AF PK           +E+ ++     KH KGCHC
Sbjct: 372 GCHNNDANIEVVQRSRSNILERNPEAFNPKFKQ--------KEETKQH----KHTKGCHC 419

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           +KS CLKKYCECFQA I C E+C+C+DC+N
Sbjct: 420 RKSECLKKYCECFQAGIPCGEHCKCIDCRN 449



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KGCHCK S CLK YCECF   +LC+  C C  C N + + E
Sbjct: 342 KGCHCKNSKCLKLYCECFANKLLCN-GCHCFGCHNNDANIE 381



 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 35/92 (38%), Gaps = 33/92 (35%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGC 118
           K  K C+CR S CLK                             KYCECF AGI C + C
Sbjct: 412 KHTKGCHCRKSECLK-----------------------------KYCECFQAGIPCGEHC 442

Query: 119 NCVNCHNNVEHEVARQEAVGATL---ERNPNA 147
            C++C N  E    R +     L   ER P A
Sbjct: 443 KCIDCRNTGEDIKERSKPKEQPLKKEERKPTA 474


>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
          Length = 305

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 64/172 (37%), Positives = 86/172 (50%), Gaps = 42/172 (24%)

Query: 50  ANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECF 109
           +++DG    P     CNCR SRCLKL                             YCECF
Sbjct: 174 SSSDGSPHDPNHSHTCNCRKSRCLKL-----------------------------YCECF 204

Query: 110 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 169
           A+ ++C+GCNCVNC NN EH   R+E   A ++R+P+ F P +               + 
Sbjct: 205 ASDVFCNGCNCVNCKNNPEHLKEREEVKQALIQRDPHVFEPHV-------------VWDT 251

Query: 170 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
                + +GC C+K+ C+KKYC+CFQ  I C  NCRC+DCKN E +   RAL
Sbjct: 252 PATRTNLRGCRCQKTRCVKKYCDCFQNGICCGPNCRCVDCKNTEEAALNRAL 303



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 6/56 (10%)

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-----FEGSEERRALFHGD 225
           H+  C+C+KS CLK YCECF +++ C+  C C++CKN      E  E ++AL   D
Sbjct: 185 HSHTCNCRKSRCLKLYCECFASDVFCN-GCNCVNCKNNPEHLKEREEVKQALIQRD 239


>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
 gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
          Length = 369

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 107/194 (55%), Gaps = 39/194 (20%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           ++ KD T  ++  C+C+ S+CLKL                             YCECF  
Sbjct: 161 SETKDVTLTRKTNCSCKYSKCLKL-----------------------------YCECFEK 191

Query: 112 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 171
           G YC GCNC NC NNV HE ARQ+A+   LERNP AF PK+ +S   A +    A E  +
Sbjct: 192 GRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGDI 249

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHNG 228
             KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R   +   H  H  
Sbjct: 250 VGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH-- 307

Query: 229 IAFMQRAANAAIIG 242
              +Q  A++ ++G
Sbjct: 308 ---VQNPASSGMVG 318


>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1223

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 81/156 (51%), Gaps = 30/156 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           K+CNC+ S+CLKL                             YCECFAA  +C G C C 
Sbjct: 737 KRCNCKKSKCLKL-----------------------------YCECFAAREFCVGSCACR 767

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N  E+E            RNP AF PKI  +P  +    ++A +   +A+H +GC+C
Sbjct: 768 NCFNKPEYEATVLNTRQQIESRNPLAFAPKIVQAPESSPIPGDEALDTPASARHKRGCNC 827

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KKS CLKKYCEC+QA + CSE CRC  C+N  G +E
Sbjct: 828 KKSLCLKKYCECYQAGVGCSEGCRCEGCRNMYGRKE 863



 Score = 38.9 bits (89), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 29/117 (24%), Positives = 46/117 (39%), Gaps = 39/117 (33%)

Query: 38  PGKQESPRSRPRANTDGKD--GTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLN 95
           P   ++P S P    +  D   + + ++ CNC+ S CLK                     
Sbjct: 796 PKIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLK--------------------- 834

Query: 96  TQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNV-------EHEVARQEAVGATLERN 144
                   KYCEC+ AG+ C +GC C  C N         E E   ++ V  T++++
Sbjct: 835 --------KYCECYQAGVGCSEGCRCEGCRNMYGRKEGPEEGEAKEKDQVNLTVDKS 883


>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
 gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
           AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
 gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
          Length = 674

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/184 (38%), Positives = 93/184 (50%), Gaps = 35/184 (19%)

Query: 38  PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           P  QE   S P   R   D  +G  +  K+CNC+ S+CLKL                   
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKL------------------- 387

Query: 95  NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
                     YCECFAAG+YC + C+C++C N   HE            RNP AF PK+ 
Sbjct: 388 ----------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVI 437

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
            +    Q+  +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  
Sbjct: 438 RNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAF 497

Query: 214 GSEE 217
           G ++
Sbjct: 498 GRKD 501


>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
          Length = 360

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 149 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 204

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 205 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 253

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 254 NYEESPERKMLMSTPH 269


>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 64/151 (42%), Positives = 89/151 (58%), Gaps = 38/151 (25%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           K+CNC+NS+C+KL                             YCEC+   ++CD  C+C 
Sbjct: 16  KKCNCKNSKCIKL-----------------------------YCECYQNKVFCDNNCHCH 46

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN  +   RQ+A+  TLE+NP+AF+PKI +S     D++ D        KHNKGC C
Sbjct: 47  NCFNNSSNAKQRQKAMQYTLEKNPSAFQPKITTS-----DSKPDPLNF---GKHNKGCQC 98

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           KKSGC+KKYCECFQA + CS+ C+C++C+N+
Sbjct: 99  KKSGCMKKYCECFQAKVPCSDQCKCIECRNY 129



 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 55/132 (41%), Gaps = 15/132 (11%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 236
           K C+CK S C+K YCEC+Q  + C  NC C +C N   + ++R             Q+A 
Sbjct: 16  KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQR-------------QKAM 62

Query: 237 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 296
              +      S +   +T++  K + L FG   K     ++   ++      + VPCS  
Sbjct: 63  QYTL--EKNPSAFQPKITTSDSKPDPLNFGKHNKGCQCKKSGCMKKYCECFQAKVPCSDQ 120

Query: 297 VPVSQTANASVL 308
               +  N   L
Sbjct: 121 CKCIECRNYGFL 132


>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
          Length = 495

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 102/217 (47%), Gaps = 61/217 (28%)

Query: 3   SPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ 62
           SP  A    QA  Q     NRV  PV    L         +P+ RP  +          +
Sbjct: 228 SPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLGS----------K 264

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K C+C  S CLKL                             YC+CFA G +C+ C+C +
Sbjct: 265 KPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCKD 295

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N +E+E+ R  A+  +LERNPNAF+PKI  + +         R+ +    H KGCHCK
Sbjct: 296 CKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCK 346

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           KS CLK YCEC++A + C++ C+C+ C+N E     R
Sbjct: 347 KSNCLKNYCECYEAKVPCTDRCKCVCCRNTESDRAAR 383


>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
          Length = 348

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 73/164 (44%), Positives = 93/164 (56%), Gaps = 40/164 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 119 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 149

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 150 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 198

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 199 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 242


>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
          Length = 209

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 68/152 (44%), Positives = 83/152 (54%), Gaps = 41/152 (26%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K +K CNC  S+CLKL                             YC+CFA G +C GCN
Sbjct: 20  KPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFCHGCN 50

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CV C NN+EHE  R  A+ + L+RNP+AF+PKI     G     E  R       HNKGC
Sbjct: 51  CVCCANNLEHEELRLRAIRSCLDRNPHAFKPKI-----GVGWGPEPRR-------HNKGC 98

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           HCK+SGCLK YCEC++A I CS  C+C+ CKN
Sbjct: 99  HCKRSGCLKNYCECYEAKIACSAICKCIGCKN 130


>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
 gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
          Length = 469

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 102/217 (47%), Gaps = 61/217 (28%)

Query: 3   SPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ 62
           SP  A    QA  Q     NRV  PV    L         +P+ RP  +          +
Sbjct: 202 SPPLAYRAAQASVQR----NRVFTPVLAAKL---------APKKRPLGS----------K 238

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K C+C  S CLKL                             YC+CFA G +C+ C+C +
Sbjct: 239 KPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCKD 269

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N +E+E+ R  A+  +LERNPNAF+PKI  + +         R+ +    H KGCHCK
Sbjct: 270 CKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCK 320

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           KS CLK YCEC++A + C++ C+C+ C+N E     R
Sbjct: 321 KSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 357


>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
          Length = 452

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 69/210 (32%), Positives = 107/210 (50%), Gaps = 48/210 (22%)

Query: 59  PKK-----QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
           PKK     +K C+C  S CLKL                             YC+CFA G 
Sbjct: 213 PKKRPLGSKKPCHCTRSMCLKL-----------------------------YCDCFANGE 243

Query: 114 YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
           +C+ C+C +C N +E+E+ R  A+  +LERNPNAF+PKI  + +         R+ +   
Sbjct: 244 FCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPER 294

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 233
            H KGCHCKKS CLK YCEC++A + C++ C+C+ C+N   +E  RA    +      + 
Sbjct: 295 LHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCVCCRN---TESDRAARLANTKSATALT 351

Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
              +AA +G+  +  + T + S++   +E+
Sbjct: 352 DIRSAATMGS--ADFHKTTLYSDEDSDDEV 379



 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 15/99 (15%)

Query: 136 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
           A  A+++RN   F P +A        A+   ++  L +K  K CHC +S CLK YC+CF 
Sbjct: 192 AAQASIQRN-RVFTPVLA--------AKLAPKKRPLGSK--KPCHCTRSMCLKLYCDCFA 240

Query: 196 ANILCSENCRCLDCKN---FEGSEERRALFHGDHNGIAF 231
               C+ +C C DCKN   +E    R      + N  AF
Sbjct: 241 NGEFCN-DCDCKDCKNTIEYEIERTRAIRLSLERNPNAF 278


>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
          Length = 380

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 57/128 (44%), Positives = 80/128 (62%), Gaps = 15/128 (11%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C  C+C    NN+ HE+ R +A+ A ++RNP AF+PK+     G
Sbjct: 177 KIALAGYCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLG 232

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
               R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 233 VSKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPER 281

Query: 219 RALFHGDH 226
           + L +  H
Sbjct: 282 KMLMNTPH 289


>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 623

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/208 (35%), Positives = 104/208 (50%), Gaps = 38/208 (18%)

Query: 19  RMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFV 78
           +++  VP     + L  + P K+     R ++   G+ G+    K+CNC+ S+CLKL   
Sbjct: 302 KVIEVVPKSSGLVELTPISPKKK-----RRKSEQSGEGGS--SCKRCNCKKSKCLKL--- 351

Query: 79  EIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 137
                                     YCECFAAG+YC + C+C++C N   HE       
Sbjct: 352 --------------------------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATR 385

Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
                RNP AF PK+  +     +  EDA +   +A+H +GC+CKKS CLKKYCEC+Q  
Sbjct: 386 KQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECYQGG 445

Query: 198 ILCSENCRCLDCKNFEGSEERRALFHGD 225
           + CS NCRC  CKN  G ++  +LF  D
Sbjct: 446 VGCSINCRCEGCKNAFGRKD-GSLFEQD 472


>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 995

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 70/154 (45%), Positives = 90/154 (58%), Gaps = 40/154 (25%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S+CLKL                             YCECFA G +C  CN
Sbjct: 779 RMKKPCNCTRSQCLKL-----------------------------YCECFANGEFCFQCN 809

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE ARQ+++ + LERNP AFRPKI        +  ED R      +HNKGC
Sbjct: 810 CNNCYNNIEHEEARQKSIKSCLERNPCAFRPKIQ-----IGETEEDER------RHNKGC 858

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           HCK+SGCLK YCEC++A I CS++C+C+ C+N E
Sbjct: 859 HCKRSGCLKNYCECYEAKITCSKSCKCIGCRNTE 892


>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 798

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/130 (49%), Positives = 84/130 (64%), Gaps = 11/130 (8%)

Query: 94  LNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
           + +Q +  L  YC+CFA G +C+ CNC NC NN+EHE  R +A+   L+RNP AF+PKI 
Sbjct: 579 VQSQARMPLNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIG 638

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
               G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE
Sbjct: 639 KGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFE 687

Query: 214 GSEERRALFH 223
            S ER+ L H
Sbjct: 688 ESPERKTLMH 697


>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
 gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
 gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
 gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
 gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
 gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
          Length = 295

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 84  GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 139

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 140 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 188

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 189 NYEESPERKMLMSTPH 204


>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
 gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
          Length = 695

 Score =  122 bits (306), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  +     +A +DA +   +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           GC+CKKS C+KKYCEC+Q  + CS NCRC  C N  G ++   L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531


>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
 gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
 gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 695

 Score =  122 bits (306), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  +     +A +DA +   +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           GC+CKKS C+KKYCEC+Q  + CS NCRC  C N  G ++   L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531


>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
          Length = 772

 Score =  122 bits (305), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 69/183 (37%), Positives = 92/183 (50%), Gaps = 33/183 (18%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 443 QGSPKKK-RHKFDSGDGT--SCKRCSCKKSKCLKL------------------------- 474

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YCECFAAG+YC + C+C+ C NN  H             RNP AF PK+  +    
Sbjct: 475 ----YCECFAAGVYCSEPCSCIGCMNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSEPG 530

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            + R+ + +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CKN  G  E  
Sbjct: 531 LELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGA 590

Query: 220 ALF 222
           A+ 
Sbjct: 591 AVL 593


>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
          Length = 391

 Score =  122 bits (305), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 84/158 (53%), Gaps = 38/158 (24%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K C+C  S CLKL                             YC+CFA G +C+ C+C 
Sbjct: 234 KKPCHCTRSMCLKL-----------------------------YCDCFANGEFCNDCDCK 264

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N +E+E+ R  A+  +LERNPNAF+PKI  + +         R+ +    H KGCHC
Sbjct: 265 DCKNTIEYEIERTRAIRLSLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHC 315

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           KKS CLK YCEC++A + C++ C+C+ C+N E     R
Sbjct: 316 KKSNCLKNYCECYEAKVPCTDRCKCICCRNTESDRAAR 353



 Score = 45.8 bits (107), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 34/68 (50%), Gaps = 8/68 (11%)

Query: 171 LAAKH----NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERRALFH 223
           LAAK      K CHC +S CLK YC+CF     C+ +C C DCKN   +E    R     
Sbjct: 225 LAAKRPLGSKKPCHCTRSMCLKLYCDCFANGEFCN-DCDCKDCKNTIEYEIERTRAIRLS 283

Query: 224 GDHNGIAF 231
            + N  AF
Sbjct: 284 LERNPNAF 291


>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
           AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
 gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
 gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
 gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 609

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   H+            RNP AF PK+  +     +  EDA +   +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
           KKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)

Query: 89  SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
           S G HLNT     KD  V     F+  I     + +    + +H++ R+    +  G  +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           E  P +    +  +P   +  R  + ++       K C+CKKS CLK YCECF A   C 
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352

Query: 202 ENCRCLDCKN 211
           E C C++C N
Sbjct: 353 EPCSCINCFN 362


>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
          Length = 609

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   H+            RNP AF PK+  +     +  EDA +   +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
           KKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)

Query: 89  SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
           S G HLNT     KD  V     F+  I     + +    + +H++ R+    +  G  +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           E  P +    +  +P   +  R  + ++       K C+CKKS CLK YCECF A   C 
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352

Query: 202 ENCRCLDCKN 211
           E C C++C N
Sbjct: 353 EPCSCINCFN 362


>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 235

 Score =  121 bits (304), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 64/120 (53%), Positives = 74/120 (61%), Gaps = 30/120 (25%)

Query: 38  PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQ 97
           P K +SP+   R   +GKD TP K+K CNCRNSRCLKL                      
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKL---------------------- 167

Query: 98  FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                  YCECFA+G +CDGCNC NC NN E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 168 -------YCECFASGTHCDGCNCTNCFNNPENEVARREAIEATLERNPDAFRPKIGSSPH 220



 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C+C+ S CLK YCECF +   C + C C +C N   +E  R
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFNNPENEVAR 196


>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
 gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
          Length = 729

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 81/155 (52%), Gaps = 30/155 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YC+CFAAG+YC + C+C 
Sbjct: 450 KRCNCKRSKCLKL-----------------------------YCDCFAAGLYCIEPCSCQ 480

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    E       RNP AF PK+  S     D  ++  +   +A+H +GC+C
Sbjct: 481 DCFNKPAHEDTVLETRKQIESRNPLAFAPKVIRSTDYVSDFGDETNKTPASARHKRGCNC 540

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           KKS CLKKYCECFQ  + CS NCRC  CKN  GS+
Sbjct: 541 KKSNCLKKYCECFQGGVGCSSNCRCEGCKNSFGSK 575


>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1174

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 94/203 (46%), Gaps = 54/203 (26%)

Query: 54   GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
            GK+ + + +  C C+NS+CLKL                             YC+CF+A  
Sbjct: 852  GKENS-ENESTCKCKNSKCLKL-----------------------------YCDCFSAEK 881

Query: 114  YCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
            YC GC C++C N    E  R +A+  T  RNPNAF+ K+ +S                  
Sbjct: 882  YCVGCKCIDCQNTPTFESIRAKAIADTRARNPNAFKTKLNAS----------------VT 925

Query: 174  KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS----EERRALFHGDHNGI 229
            +H  GC CKKS CLKKYCECF   ++C E+C+C +CKN+ GS    E RR L   D  G 
Sbjct: 926  EHATGCKCKKSECLKKYCECFANLVVCGESCKCSNCKNYVGSQALIERRRKL--KDAKGT 983

Query: 230  AFMQRAANAAIIGAVGSS--GYG 250
                R       G++  S   YG
Sbjct: 984  ELAMRPVARGWKGSMSDSKVAYG 1006


>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
          Length = 804

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 620

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 621 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 670

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 671 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 719


>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
          Length = 360

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 97/178 (54%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 131 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 161

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 162 CNNCFNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 210

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 211 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 268


>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
          Length = 835

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 628

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 629 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 678

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 727


>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
          Length = 829

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 622

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 623 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 672

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 673 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 721


>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
          Length = 827

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 620

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 621 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 670

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 671 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 719


>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
          Length = 829

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 622

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 623 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 672

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 673 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 721


>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
          Length = 835

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 93/179 (51%), Gaps = 39/179 (21%)

Query: 35  TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           TL  G     +S  R+ T  +    + +K CNC  S+CLKL                   
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL------------------- 628

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
                     YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI  
Sbjct: 629 ----------YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGK 678

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                 D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 GRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 727


>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
          Length = 348

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/119 (53%), Positives = 82/119 (68%), Gaps = 11/119 (9%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R 
Sbjct: 135 YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR- 193

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
                     H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 194 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 242


>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/160 (40%), Positives = 87/160 (54%), Gaps = 38/160 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 232 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 262

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  +     +A +DA +   +A+H +
Sbjct: 263 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 318

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           GC+CKKS C+KKYCEC+Q  + CS NCRC  C N  G ++
Sbjct: 319 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKD 358


>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/160 (40%), Positives = 87/160 (54%), Gaps = 38/160 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 232 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 262

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  +     +A +DA +   +A+H +
Sbjct: 263 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 318

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           GC+CKKS C+KKYCEC+Q  + CS NCRC  C N  G ++
Sbjct: 319 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKD 358


>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
          Length = 802

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 90/169 (53%), Gaps = 39/169 (23%)

Query: 45  RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           +S  R+ T  +    + +K CNC  S+CLKL                             
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL----------------------------- 619

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI        D R 
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR- 678

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                    +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 ---------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 718


>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 782

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/171 (37%), Positives = 86/171 (50%), Gaps = 30/171 (17%)

Query: 48  PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
           P+      +G  +  K+C+C+ S+CLKL                             YCE
Sbjct: 458 PKKRRKSDNGDDESCKRCSCKKSKCLKL-----------------------------YCE 488

Query: 108 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
           CFAAG+YC + C+C  C N   HE            RNP AF PK+       Q+A+ED 
Sbjct: 489 CFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKHIEFRNPLAFAPKVIRMSEAGQEAQEDP 548

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           +    +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G+ +
Sbjct: 549 KNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTFGTRD 599


>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
          Length = 826

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 90/169 (53%), Gaps = 39/169 (23%)

Query: 45  RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           +S  R+ T  +    + +K CNC  S+CLKL                             
Sbjct: 589 KSSQRSYTSVEPNGIRPRKPCNCTKSQCLKL----------------------------- 619

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRPKI        D R 
Sbjct: 620 YCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR- 678

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                    +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 679 ---------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRNVE 718


>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
          Length = 800

 Score =  119 bits (299), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 87/177 (49%), Gaps = 32/177 (18%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           G+  SP+ + R   +G DG  +  K+C+C+ S+CLKL                       
Sbjct: 460 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKL----------------------- 494

Query: 99  KDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                 YCECFAAG+YC + C+C  C N   HE            RNP AF PK+     
Sbjct: 495 ------YCECFAAGVYCSEPCSCQGCMNKPIHEETVLSTRKQIEFRNPLAFAPKVIRMSD 548

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
              +  ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 549 AGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNAFG 605


>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  119 bits (299), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/238 (33%), Positives = 112/238 (47%), Gaps = 53/238 (22%)

Query: 1   MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTP 59
            RSP ++  +   R   ++ +N V          TL  P K  S +   R  T  K   P
Sbjct: 135 FRSPPKSPHRKLERATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKP 184

Query: 60  KKQKQ-----CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
           + +++     CNC+ S+CLKL                             YC+CFAAG+ 
Sbjct: 185 QTEQKPKIIVCNCKKSKCLKL-----------------------------YCDCFAAGVP 215

Query: 115 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 173
           C   CNC +CHNN +H   R+  +   +ERNP AFRPK+ S  +   +  +  R      
Sbjct: 216 CGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEIDQKPR------ 269

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 231
            H KGC+CKKS CLKKYCEC+Q  + CSE C+C +CKN E   ++ +     H   +F
Sbjct: 270 -HFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNCEMPVKKESRKRVKHTHSSF 326


>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
          Length = 822

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 67/151 (44%), Positives = 82/151 (54%), Gaps = 39/151 (25%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K CNC  S+CLKL                             YC+CFA G +C  CNC N
Sbjct: 603 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 633

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK
Sbjct: 634 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCK 683

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           +SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 684 RSGCLKNYCECYEAKIPCSANCKCIGCRNIE 714


>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
          Length = 429

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 79/150 (52%), Gaps = 39/150 (26%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           + CNC+NS+CLKL                             YCECFA G YC   CNCV
Sbjct: 216 RTCNCKNSKCLKL-----------------------------YCECFANGEYCGPNCNCV 246

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N    E  R++A+  TL +NP+AFRPK      GA  +R+ A  +     H KGC C
Sbjct: 247 SCGNREATETLRKQAIEVTLAKNPDAFRPKF-----GANKSRQSAAPS----AHVKGCRC 297

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
            +S C+K+YCECFQA I C+  C+C DC N
Sbjct: 298 ARSKCIKRYCECFQAGIPCTSACQCRDCHN 327



 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 26/47 (55%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           A   + C+CK S CLK YCECF     C  NC C+ C N E +E  R
Sbjct: 212 ASGGRTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLR 258


>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
 gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
          Length = 873

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 65/162 (40%), Positives = 83/162 (51%), Gaps = 31/162 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ ++CLKL                             YC+CFAAGIYC D C C 
Sbjct: 453 KRCNCKKTKCLKL-----------------------------YCDCFAAGIYCADPCACQ 483

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA-AKHNKGCH 180
           +C N  E+E    E       RNP AF PKI         +RED   +  + ++H +GC+
Sbjct: 484 DCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHAKEFAASREDRSSSMPSLSRHKRGCN 543

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           CKKS CLKKYCEC+QAN+ CS  CRC  CKN  G +E   + 
Sbjct: 544 CKKSMCLKKYCECYQANVGCSSECRCEGCKNGYGRKEEYGII 585



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 30/51 (58%)

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           D  E+ +   +     K C+CKK+ CLK YC+CF A I C++ C C DC N
Sbjct: 437 DNFEEKKSLSIDGDGCKRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFN 487


>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
          Length = 825

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 67/151 (44%), Positives = 82/151 (54%), Gaps = 39/151 (25%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K CNC  S+CLKL                             YC+CFA G +C  CNC N
Sbjct: 606 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 636

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK
Sbjct: 637 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCK 686

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           +SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 687 RSGCLKNYCECYEAKIPCSANCKCIGCRNIE 717


>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 107/221 (48%), Gaps = 55/221 (24%)

Query: 1   MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTP 59
            RSP ++  +   R   ++ +N V          TL  P K  S +   R  T  K   P
Sbjct: 144 FRSPPKSPHRKLERATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKP 193

Query: 60  KKQKQ-----CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIY 114
             +++     CNC+ S+CLKL                             YC+CFAAG+ 
Sbjct: 194 LTEQKPKIIVCNCKKSKCLKL-----------------------------YCDCFAAGVP 224

Query: 115 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLA 172
           C   CNC +CHNN +H   R+  +   +ERNP AFRPK+        D+R ++  E    
Sbjct: 225 CGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKV--------DSRSNSEDEIDHK 276

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
            +H KGC+CKKS CLKKYCEC+Q  + CSE C+C DCKN E
Sbjct: 277 PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNCE 317



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+CKKS CLK YC+CF A + C ++C C  C N +   + R
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKER 244


>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
           distachyon]
          Length = 787

 Score =  119 bits (298), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 65/172 (37%), Positives = 87/172 (50%), Gaps = 33/172 (19%)

Query: 44  PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLV 103
           P+ + R + +G D   +  K+C+C+ S+CLKL                            
Sbjct: 459 PQKKRRKSDNGDD---ESCKRCSCKKSKCLKL---------------------------- 487

Query: 104 KYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
            YCECFAAG+YC + C+C+ C N   HE            RNP AF PK+       Q+ 
Sbjct: 488 -YCECFAAGVYCSEPCSCLGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRLSDAGQET 546

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 547 QEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCESCKNTFG 598



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/41 (60%), Positives = 26/41 (63%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF A + CSE C CL C N    EE
Sbjct: 475 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEE 515


>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
 gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
          Length = 587

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 68/178 (38%), Positives = 89/178 (50%), Gaps = 33/178 (18%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 441 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 472

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YCECFAAG+YC + C+C+ C NN  H             RNP AF PK+  +    
Sbjct: 473 ----YCECFAAGVYCSEPCSCIGCQNNQSHMETVLSTRQQIESRNPLAFAPKVIHTSEPG 528

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
            D  + + +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CKN  G  E
Sbjct: 529 MDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKNAFGKRE 586


>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
 gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
          Length = 776

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 87/177 (49%), Gaps = 32/177 (18%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           G+  SP+ + R   +G DG  +  K+C+C+ S+CLKL                       
Sbjct: 442 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKL----------------------- 476

Query: 99  KDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                 YCECFAAG+YC + C+C  C N   HE            RNP AF PK+     
Sbjct: 477 ------YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSD 530

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
              +  ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 531 AGLETGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSTNCRCESCKNTFG 587


>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
 gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
          Length = 767

 Score =  119 bits (297), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 68/183 (37%), Positives = 91/183 (49%), Gaps = 33/183 (18%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 438 QGSPKKK-RQKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 469

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YCECFAAG+YC + C+C+ C NN  H             RNP AF PK+  +    
Sbjct: 470 ----YCECFAAGVYCSEPCSCIGCLNNQSHTETVLSTRQQIESRNPLAFAPKVIHTSESG 525

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            +  + + +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CKN  G  E  
Sbjct: 526 LELGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGKREGA 585

Query: 220 ALF 222
           A+ 
Sbjct: 586 AVL 588


>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
 gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
          Length = 226

 Score =  118 bits (296), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 75/162 (46%), Positives = 90/162 (55%), Gaps = 40/162 (24%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S+CLKL                             YC+CFA G +C  CNCV
Sbjct: 4   RKPCNCTKSQCLKL-----------------------------YCDCFAQGEFCSNCNCV 34

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN+EHE  R +A+ A LERNP+AF PKI     G  + R           HNKGCHC
Sbjct: 35  NCSNNIEHERERSKAIKACLERNPHAFHPKIGKGKVGESERR-----------HNKGCHC 83

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           K+SGCLK YCEC++A ILC+  C+C  CKNFE S ER+ L H
Sbjct: 84  KRSGCLKNYCECYEAKILCTSLCKCTGCKNFEESPERKTLMH 125


>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
          Length = 717

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/183 (37%), Positives = 90/183 (49%), Gaps = 32/183 (17%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R    G  G  +  K+CNC+ S+CLKL                         
Sbjct: 403 QSSPKKKRRRLEHG--GETEGCKRCNCKKSKCLKL------------------------- 435

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YCECFAAG+YC + C+C  C N   HE            RNP AF PK+  S    
Sbjct: 436 ----YCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSL 491

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            +  ++  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  
Sbjct: 492 PEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGS 551

Query: 220 ALF 222
           AL 
Sbjct: 552 ALI 554


>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
 gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
 gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
          Length = 782

 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           SP+ + R + +G DG  +  K+C+C+ S+CLKL                           
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 480

Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
             YCECFAAG+YC + C+C  C N   HE            RNP AF PK+       QD
Sbjct: 481 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 538

Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
              ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 539 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 592



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 25/41 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF A + CSE C C  C N    EE
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 508


>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
          Length = 834

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/150 (44%), Positives = 82/150 (54%), Gaps = 39/150 (26%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K CNC  S+CLKL                             YC+CFA G +C  CNC N
Sbjct: 615 KPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCNCNN 645

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN+ +E  RQ A+ + LERNPNAFRPKI       +D R          +HNKGC+CK
Sbjct: 646 CSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGCNCK 695

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNF 212
           +SGCLK YCEC++A I CS NC+C+ C+N 
Sbjct: 696 RSGCLKNYCECYEAKIPCSGNCKCIGCRNI 725


>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
          Length = 859

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/153 (43%), Positives = 84/153 (54%), Gaps = 39/153 (25%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S+CLKL                             YC+CFA G +C  CN
Sbjct: 637 RPRKPCNCTKSQCLKL-----------------------------YCDCFANGEFCHMCN 667

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC NN+ +E  RQ A+ + LERNPNAFRPKI       +D R          +HNKGC
Sbjct: 668 CNNCSNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGC 717

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           +CK+SGCLK YCEC++A I CS NC+C+ C+N 
Sbjct: 718 NCKRSGCLKNYCECYEAKIPCSGNCKCIGCRNI 750


>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
          Length = 782

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           SP+ + R + +G DG  +  K+C+C+ S+CLKL                           
Sbjct: 438 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 468

Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
             YCECFAAG+YC + C+C  C N   HE            RNP AF PK+       QD
Sbjct: 469 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 526

Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
              ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 527 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 580



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 25/41 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF A + CSE C C  C N    EE
Sbjct: 456 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 496


>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
          Length = 794

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 87/174 (50%), Gaps = 33/174 (18%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           SP+ + R + +G DG  +  K+C+C+ S+CLKL                           
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKL--------------------------- 480

Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
             YCECFAAG+YC + C+C  C N   HE            RNP AF PK+       QD
Sbjct: 481 --YCECFAAGVYCSEPCSCQGCLNKPIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGQD 538

Query: 162 AR-EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
              ED      +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 539 ITGEDPNNTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSSNCRCEGCKNTFG 592



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 25/41 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF A + CSE C C  C N    EE
Sbjct: 468 KRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEE 508


>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
          Length = 658

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 53/114 (46%), Positives = 71/114 (62%), Gaps = 1/114 (0%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFAAG+YC + C+C++C N   HE            RNP AF PK+  +    Q+  
Sbjct: 372 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETG 431

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++
Sbjct: 432 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 485



 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 14/24 (58%), Positives = 16/24 (66%)

Query: 188 KKYCECFQANILCSENCRCLDCKN 211
           K YCECF A + C E C C+DC N
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFN 393


>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
 gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
          Length = 897

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 63/157 (40%), Positives = 81/157 (51%), Gaps = 31/157 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YC+CFAAG++C D C+C 
Sbjct: 509 KRCNCKKSKCLKL-----------------------------YCDCFAAGVFCLDPCSCQ 539

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
           +C N  E+     E       RNP AF PKI  S   A    ED       +A+H +GC+
Sbjct: 540 DCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCN 599

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           CK+S CLKKYCEC+Q+N+ CS  CRC  CKN  G +E
Sbjct: 600 CKRSMCLKKYCECYQSNVGCSSGCRCEGCKNVYGKKE 636


>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
          Length = 821

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 63/157 (40%), Positives = 81/157 (51%), Gaps = 31/157 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YC+CFAAG++C D C+C 
Sbjct: 433 KRCNCKKSKCLKL-----------------------------YCDCFAAGVFCLDPCSCQ 463

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
           +C N  E+     E       RNP AF PKI  S   A    ED       +A+H +GC+
Sbjct: 464 DCFNKPEYGEKVLETRQQIESRNPLAFAPKIVKSATNAPSNMEDVNLTTPSSARHTRGCN 523

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           CK+S CLKKYCEC+Q+N+ CS  CRC  CKN  G +E
Sbjct: 524 CKRSMCLKKYCECYQSNVGCSSGCRCEGCKNVYGKKE 560


>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
          Length = 738

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 81/161 (50%), Gaps = 30/161 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 444 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCVEPCSCQ 474

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N   HE            RNP AF PK+  S     +  ++  +   +A+H +GC+C
Sbjct: 475 ECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNC 534

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           KKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  AL 
Sbjct: 535 KKSSCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 575


>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
 gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
          Length = 808

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 88/170 (51%), Gaps = 22/170 (12%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL  +++ I +         L   F      YCECFAAG+YC + C+C 
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVL-------ELTILF---FFSYCECFAAGVYCIEPCSCQ 537

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N   HE    +       RNP AF PK+  S     +   D  +   +A+H +GC+C
Sbjct: 538 ECFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNC 597

Query: 182 KKSGCLKKYCECFQANIL-----------CSENCRCLDCKNFEGSEERRA 220
           KKS CLKKYCEC+Q  IL           CS +CRC  CKN  G ++  A
Sbjct: 598 KKSNCLKKYCECYQVLILTISCLPQGGVGCSISCRCEGCKNAFGRKDGSA 647


>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
          Length = 806

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 503 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 533

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    +       RNP AF PK+  S     +  +++ +   +A+H +GC+C
Sbjct: 534 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 593

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KKS CLKKYCECFQ  + CS +CRC  CKN  G ++
Sbjct: 594 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 629



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)

Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           HG+   +   R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
          Length = 774

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 91/181 (50%), Gaps = 32/181 (17%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SPR + R +     G  +  K+CNC+ S+CLKL                         
Sbjct: 466 QNSPRMKRRKSE--TPGDTEGCKRCNCKKSKCLKL------------------------- 498

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
               YCECFAAG+YC + C+C +C N   HE    +       RNP AF PK+  +    
Sbjct: 499 ----YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSV 554

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            +  +D  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CKN  G ++  
Sbjct: 555 PEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGS 614

Query: 220 A 220
           A
Sbjct: 615 A 615


>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
          Length = 829

 Score =  116 bits (290), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 503 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 533

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    +       RNP AF PK+  S     +  +++ +   +A+H +GC+C
Sbjct: 534 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 593

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KKS CLKKYCECFQ  + CS +CRC  CKN  G ++
Sbjct: 594 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 629



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)

Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           HG+   +   R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 483 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
          Length = 774

 Score =  116 bits (290), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 68/192 (35%), Positives = 94/192 (48%), Gaps = 46/192 (23%)

Query: 46  SRPRANTDGKD---GTPKKQ-------------KQCNCRNSRCLKLDFVEIWINIYSNTS 89
           S+P A   G+D    +PKK+             K+CNC+ S+CLKL              
Sbjct: 453 SQPSAYMAGEDFNQNSPKKKRRKSETPGETEGCKRCNCKKSKCLKL-------------- 498

Query: 90  PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
                          YCECFAAG+YC + C+C +C N   HE    +       RNP AF
Sbjct: 499 ---------------YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAF 543

Query: 149 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
            PK+  +     +  +D  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  
Sbjct: 544 APKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEG 603

Query: 209 CKNFEGSEERRA 220
           CKN  G ++  A
Sbjct: 604 CKNAFGRKDGSA 615


>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
          Length = 777

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 30/156 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 474 KRCNCKKSKCLKL-----------------------------YCECFAAGLYCVEPCSCQ 504

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE    +       RNP AF PK+  S     +  +++ +   +A+H +GC+C
Sbjct: 505 DCFNKPVHEDTVLQTRKQIESRNPLAFAPKVIRSSDYVPEFGDESNKTPASARHKRGCNC 564

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KKS CLKKYCECFQ  + CS +CRC  CKN  G ++
Sbjct: 565 KKSSCLKKYCECFQGGVGCSISCRCEGCKNTFGRKD 600



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 4/57 (7%)

Query: 157 HGAQDAREDAREAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           HG+   +   R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 454 HGS--PKRKRRRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508


>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 651

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 52/111 (46%), Positives = 68/111 (61%), Gaps = 1/111 (0%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFAAG+YC + C+C++C N   HE            RNP AF PK+  +     +  
Sbjct: 365 YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLETG 424

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 425 DDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFG 475



 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 6/91 (6%)

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA-QDAREDAREAQLAAKHNKGCH 180
           +C  NV H+ +    +   L+ + +  +  +  + + A +DA +D        + N    
Sbjct: 301 DCKTNVSHDYSCSGKIQVGLQSSISTMQDTLDQTENEAREDADQDVPVEPALQELNLSSP 360

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            KKS     YCECF A + C E C C+DC N
Sbjct: 361 KKKS-----YCECFAAGVYCIEPCSCIDCFN 386


>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
          Length = 764

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 94/196 (47%), Gaps = 43/196 (21%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQ-------------KQCNCRNSRCLKLDFVEIWINIY 85
           G Q +  S   A+ D    +PKK+             K+CNC+ S+CLKL          
Sbjct: 439 GVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKL---------- 488

Query: 86  SNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERN 144
                              YCECFAAG+YC + C+C +C N   HE    +       RN
Sbjct: 489 -------------------YCECFAAGVYCIEPCSCQDCFNKPIHEDTVLQTRKQIESRN 529

Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 204
           P AF PK+  +     +  +D  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +C
Sbjct: 530 PLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISC 589

Query: 205 RCLDCKNFEGSEERRA 220
           RC  CKN  G ++  A
Sbjct: 590 RCEGCKNAFGRKDGSA 605


>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
 gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
          Length = 727

 Score =  115 bits (287), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/159 (38%), Positives = 80/159 (50%), Gaps = 30/159 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 435 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 465

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N   HE            RNP AF PK+  S     +  ++  +   +A+H +GC+C
Sbjct: 466 ECFNKPIHEDTVLATRKQIESRNPLAFAPKVIRSSEPVTEIGDEFSKTPASARHKRGCNC 525

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           KKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  A
Sbjct: 526 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSA 564


>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
          Length = 728

 Score =  115 bits (287), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 86/132 (65%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G++L    K  L  YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 513 GSNLQGPAKITLAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQP 572

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 573 KIGKGKLG-----------DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 621

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 622 NYEESPERKTLL 633


>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
          Length = 243

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 87/132 (65%), Gaps = 11/132 (8%)

Query: 98  FKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
            +D   +YC+CFA+G +C+ CNC NC+NN  HE  R +A+ A L RNP AF+PKI     
Sbjct: 45  LEDGGFRYCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKL 104

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           G           ++  +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +
Sbjct: 105 G-----------EVKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQ 153

Query: 218 RRALFHGDHNGI 229
           RR L++G H  +
Sbjct: 154 RRILWNGPHPAV 165


>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1084

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/147 (40%), Positives = 78/147 (53%), Gaps = 30/147 (20%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           CNC+ ++CLKL                             YCECFAA   C+GC C  C 
Sbjct: 533 CNCKRTKCLKL-----------------------------YCECFAASRMCEGCTCQGCF 563

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           N  E E  R+EA  A LERN +AF PKI S     Q   E+  + +    H+KGC+CKKS
Sbjct: 564 NKPEFEAMRKEARQAILERNNSAFDPKIDSQSKLKQLCNEEGVDNEQKV-HSKGCNCKKS 622

Query: 185 GCLKKYCECFQANILCSENCRCLDCKN 211
            CLKKYCEC+Q  + C++ C+C +C+N
Sbjct: 623 NCLKKYCECYQLGVKCTKLCKCDNCRN 649



 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 34/58 (58%), Gaps = 8/58 (13%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEG--SEERRALFHGDHNGIAF 231
           C+CK++ CLK YCECF A+ +C E C C  C N   FE    E R+A+   + N  AF
Sbjct: 533 CNCKRTKCLKLYCECFAASRMC-EGCTCQGCFNKPEFEAMRKEARQAIL--ERNNSAF 587



 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 34/87 (39%), Gaps = 31/87 (35%)

Query: 51  NTDGKDGTPK-KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECF 109
           N +G D   K   K CNC+ S CLK                             KYCEC+
Sbjct: 602 NEEGVDNEQKVHSKGCNCKKSNCLK-----------------------------KYCECY 632

Query: 110 AAGIYCDG-CNCVNCHNNVEHEVARQE 135
             G+ C   C C NC NN+ H  + +E
Sbjct: 633 QLGVKCTKLCKCDNCRNNINHPHSHEE 659


>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 638

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 78/153 (50%), Gaps = 42/153 (27%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           ++C+C  S CLKL                             YCECFAAG++C  C+CV 
Sbjct: 163 RRCSCSRSFCLKL-----------------------------YCECFAAGVFCSDCSCVG 193

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C+N  +HE  RQ+A+   + R P+AF+ KIA S              + A  H +GC+CK
Sbjct: 194 CYNLAQHENYRQKAIMRIVNRKPDAFQSKIAHS-------------LENATVHARGCNCK 240

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
           +SGCLK YCEC++A + C+  CRC  C N   +
Sbjct: 241 RSGCLKNYCECYEARVRCTSRCRCQYCYNMPAA 273



 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C C +S CLK YCECF A + CS+ C C+ C N    E  R
Sbjct: 165 CSCSRSFCLKLYCECFAAGVFCSD-CSCVGCYNLAQHENYR 204


>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
          Length = 774

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 32/179 (17%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           +Q SP+ + R      +G  +  K+CNC+ S+CLKL                        
Sbjct: 467 QQNSPKKKRRKLEPAGEG--ESCKRCNCKKSKCLKL------------------------ 500

Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                YCECFAAG+YC + C C +C N   H     +       RNP AF PK+  S   
Sbjct: 501 -----YCECFAAGVYCIEPCACRDCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDS 555

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
             +  +D  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CKN  G ++
Sbjct: 556 VSEIGDDLNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614


>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
          Length = 509

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 87/133 (65%), Gaps = 11/133 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 294 GSALPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 353

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 354 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 402

Query: 211 NFEGSEERRALFH 223
           N+E S ER++L +
Sbjct: 403 NYEESPERKSLIN 415


>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
 gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
          Length = 749

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/151 (41%), Positives = 85/151 (56%), Gaps = 5/151 (3%)

Query: 76  DFVEIWINIYSNTSPGA--HLNTQFK--DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHE 130
           D V  ++  +S TSP    H+  + +    L  YCECFAAG+YC + C C +C N   HE
Sbjct: 437 DNVVPFVEDFSQTSPKKKRHVGRRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHE 496

Query: 131 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 190
                       RNP AF PK+  S   A +  +++ +   +A+H +GC+CKKS CLKKY
Sbjct: 497 DTVLATRKQIESRNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKY 556

Query: 191 CECFQANILCSENCRCLDCKNFEGSEERRAL 221
           CEC+Q  + CS NCRC  CKN  G ++  AL
Sbjct: 557 CECYQGGVGCSLNCRCEGCKNAFGRKDGSAL 587


>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
 gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
          Length = 921

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/178 (37%), Positives = 84/178 (47%), Gaps = 40/178 (22%)

Query: 45  RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           R + + +TD   G     K CNC+ S+CLKL                             
Sbjct: 506 RKKKKTSTDDASGP----KTCNCKKSKCLKL----------------------------- 532

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CF AGI+C DGC C  C N VE +    E       RNP AF PKI   P  A    
Sbjct: 533 YCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAPKIV--PCAADVPP 590

Query: 164 EDAREAQL----AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
            +  +  +    +A+H +GC+CK+S C KKYCECFQAN+ CS  CRC  C N  G  E
Sbjct: 591 NNMEDVNMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSTGCRCDGCMNAFGKRE 648


>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 611

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/153 (39%), Positives = 80/153 (52%), Gaps = 30/153 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+C C+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 431 KRCKCKKSQCLKL-----------------------------YCECFAAGLYCVEPCSCQ 461

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   HE    ++      RNP AF PK+ S+   A D   +  +   +A+H +GC+C
Sbjct: 462 NCFNKPIHEDLVMKSREVIEARNPLAFAPKVVSTSDTAIDLWVENSKTPASARHTRGCNC 521

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +KSGCLKKYCEC    + CS +CRC+ CKN  G
Sbjct: 522 RKSGCLKKYCECNLMGVRCSSSCRCIGCKNVFG 554


>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
          Length = 511

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 86/133 (64%), Gaps = 11/133 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 296 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 355

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 356 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 404

Query: 211 NFEGSEERRALFH 223
           N+E S ER+ L +
Sbjct: 405 NYEESPERKTLMN 417


>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
           [Cucumis sativus]
          Length = 788

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 83/161 (51%), Gaps = 30/161 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 493 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 523

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE            RNP AF PK+  +     +  +++ +   +A+H +GC+C
Sbjct: 524 DCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNC 583

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           KKS CLKKYCEC+Q  + CS +CRC  CKN  G ++  ++ 
Sbjct: 584 KKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624


>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
          Length = 512

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/124 (48%), Positives = 82/124 (66%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ CNC NC N++ HE+ R +A+ A L+RNP AF+PKI     G
Sbjct: 305 KITLAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 364

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 365 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 413

Query: 219 RALF 222
           + L 
Sbjct: 414 KTLM 417


>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
           distachyon]
          Length = 771

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 96/186 (51%), Gaps = 39/186 (20%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 446 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 477

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
               YCECF AG++C + C+C  C N   N+E  ++ +E + +   RNP AF PK+  + 
Sbjct: 478 ----YCECFHAGVFCSEPCSCQGCLNKPSNMETVLSTREQIES---RNPLAFAPKVIRTS 530

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
              Q+  E + +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CKN  G  
Sbjct: 531 EPGQELGEQSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGRR 590

Query: 217 ERRALF 222
           E   L 
Sbjct: 591 EGVVLL 596


>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
          Length = 788

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 83/161 (51%), Gaps = 30/161 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C 
Sbjct: 493 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCQ 523

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   HE            RNP AF PK+  +     +  +++ +   +A+H +GC+C
Sbjct: 524 DCFNKPIHEDTVLATRKQIESRNPLAFAPKVIRNSDSLPEPGDESNKTPASARHKRGCNC 583

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           KKS CLKKYCEC+Q  + CS +CRC  CKN  G ++  ++ 
Sbjct: 584 KKSSCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSSIL 624


>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
          Length = 780

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 99/186 (53%), Gaps = 40/186 (21%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 458 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 489

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
               YCECF AG++C + C+C  C N   N+E  ++ +E + +   RNP AF PK+  + 
Sbjct: 490 ----YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTE 542

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
            G Q+  +D+ +   +++H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CKN  G  
Sbjct: 543 PG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNAFGRR 601

Query: 217 ERRALF 222
           E  AL 
Sbjct: 602 EGVALL 607


>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
          Length = 548

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 86/132 (65%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G++L    K  L  YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 333 GSNLQGPAKITLAGYCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQP 392

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 393 KIGKGKLG-----------DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 441

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 442 NYEESPERKTLL 453


>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
          Length = 502

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 89/148 (60%), Gaps = 15/148 (10%)

Query: 86  SNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 145
           S    G+ L    K +L  YC+CFA+G +C  CNC NC NN+ HEV R +A+ A L+RNP
Sbjct: 283 STVPSGSTLPGPPKISLAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNP 342

Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
            AF+PKI              R   +  +HN+GC+C++SGCLK YCEC++A I+CS  C+
Sbjct: 343 EAFQPKIGK-----------GRPGDVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICK 391

Query: 206 CLDCKNFEGSEERRAL----FHGDHNGI 229
           C+ CKN++   ER+ L    +H D  G 
Sbjct: 392 CIGCKNYDEGPERKTLMSMPYHPDAGGF 419


>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
 gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
          Length = 896

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/162 (38%), Positives = 77/162 (47%), Gaps = 31/162 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YC+CFAAG YC D C C 
Sbjct: 474 KRCNCKKSKCLKL-----------------------------YCDCFAAGTYCTDPCACQ 504

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR-EAQLAAKHNKGCH 180
            C N  E+     E       RNP AF PKI           +D       +A+H +GC+
Sbjct: 505 GCLNRPEYVETVVETKQQIESRNPIAFAPKIVQPTTDISSHMDDENLTTPSSARHKRGCN 564

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           CK+S CLKKYCEC+QAN+ CS  CRC  CKN  G +E    F
Sbjct: 565 CKRSMCLKKYCECYQANVGCSSGCRCEGCKNVHGKKEDYVAF 606


>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
 gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
          Length = 620

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 53/109 (48%), Positives = 63/109 (57%), Gaps = 13/109 (11%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFAAG++C  C+C NC N  ++    Q        RNPNAF  KI +S  G      
Sbjct: 278 YCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG----- 332

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                    +H KGCHCKKS CLKKYCECFQA +LC + C+C  CKN E
Sbjct: 333 --------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373


>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
 gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
          Length = 140

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/95 (63%), Positives = 65/95 (68%), Gaps = 7/95 (7%)

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGI 229
           KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL       D N I
Sbjct: 23  KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82

Query: 230 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 264
              Q AAN A+ GA GS GY       K+  E  L
Sbjct: 83  ---QEAANFALNGATGSLGYKNSPVRRKKYQENSL 114


>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
 gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
 gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
 gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 639

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/153 (38%), Positives = 79/153 (51%), Gaps = 30/153 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+C CR S+CLKL                             YCECF+AG++C + C+C 
Sbjct: 453 KRCKCRKSQCLKL-----------------------------YCECFSAGLFCGEPCSCQ 483

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   HE    ++      RNP AF PK+ S+     D   +  +   +A+H +GC+C
Sbjct: 484 NCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLWVENSKTPASARHKRGCNC 543

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +KSGC KKYCECF   + CS NCRC+ CKN  G
Sbjct: 544 RKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 576


>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
          Length = 665

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/174 (38%), Positives = 88/174 (50%), Gaps = 39/174 (22%)

Query: 49  RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
           +A+T   D T K+   CNC+ ++CLKL                             YC+C
Sbjct: 476 KASTSTDDDTTKR---CNCKKTKCLKL-----------------------------YCDC 503

Query: 109 FAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI----ASSPHGAQDAR 163
           FAAG YC + C C  C N  E+E    E       RNP AF PK+     +SP   +   
Sbjct: 504 FAAGFYCGESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKVMQTVTASPTTGKVVE 563

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           + +  +  +A+H +GC+CKKS CLKKYCEC+QAN+ CSE CRC  CKN  G  E
Sbjct: 564 KLSTPS--SARHKRGCNCKKSMCLKKYCECYQANVGCSEGCRCEGCKNIYGVRE 615


>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 760

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 99/186 (53%), Gaps = 40/186 (21%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 438 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKL------------------------- 469

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
               YCECF AG++C + C+C  C N   N+E  ++ +E + +   RNP AF PK+  + 
Sbjct: 470 ----YCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTE 522

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
            G Q+  +D+ +   +++H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CKN  G  
Sbjct: 523 PG-QELADDSNKTPASSRHKRGCNCKKSSCLKKYCECYQGGVGCSVSCRCEGCKNAFGRR 581

Query: 217 ERRALF 222
           E  AL 
Sbjct: 582 EGVALL 587


>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
          Length = 517

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 62/124 (50%), Positives = 81/124 (65%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L +YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI     G
Sbjct: 314 KIALARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG 373

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
               R           HNKGC C++SGCLK YCEC++A I+CS  C+C+DCKN+E S ER
Sbjct: 374 DVKPR-----------HNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPER 422

Query: 219 RALF 222
           + L 
Sbjct: 423 KTLM 426


>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
 gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
          Length = 414

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/111 (47%), Positives = 67/111 (60%), Gaps = 1/111 (0%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFAAG+YC + C+C+ C NN  HE    E       RNP AF PK+  +     +  
Sbjct: 141 YCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVSEFG 200

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           E+  +   +A+H +GC+CKKS CLKKYCECFQ  + CS  CRC  CKN  G
Sbjct: 201 EETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251



 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           +K YCECF A + C E C CL+C N    E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168


>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 510

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 53/107 (49%), Positives = 63/107 (58%), Gaps = 13/107 (12%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFAAG +C  C+C  C N  E+E    +      +RNP AF  KI +      DA +
Sbjct: 208 YCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA------DAGD 261

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           DAR       H KGCHCKKS CLKKYCECFQA + C + C+C  CKN
Sbjct: 262 DAR-------HTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301



 Score = 38.5 bits (88), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 16/26 (61%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNN 126
           L KYCECF AG+ C D C C  C NN
Sbjct: 277 LKKYCECFQAGVKCQDYCKCEGCKNN 302


>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
 gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
          Length = 510

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 63/140 (45%), Positives = 87/140 (62%), Gaps = 22/140 (15%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G
Sbjct: 303 KITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLG 362

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 363 -----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 411

Query: 219 RAL-----------FHGDHN 227
           + L           F G+H+
Sbjct: 412 KTLMNMPNYMEIGGFEGNHH 431


>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 1042

 Score =  112 bits (280), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 63/161 (39%), Positives = 79/161 (49%), Gaps = 49/161 (30%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           +D K   P K   CNC+ S+CLKL                             YC+CFA 
Sbjct: 642 SDNKSTQPLKTT-CNCKKSKCLKL-----------------------------YCDCFAY 671

Query: 112 GIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
           G+ C   CNC +C  N E    R++A+    ERNPNAF+PKI                 Q
Sbjct: 672 GLGCSPDCNCADC-ANTEGNEERKQAMDQITERNPNAFKPKI-----------------Q 713

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
               H+KGCHCKKSGCLKKYCEC+Q+ ++C+  C C  CKN
Sbjct: 714 EKGFHSKGCHCKKSGCLKKYCECYQSGVVCTNLCACEGCKN 754



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+CKKS CLK YC+CF   + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694


>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
          Length = 760

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/153 (38%), Positives = 77/153 (50%), Gaps = 30/153 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C C 
Sbjct: 475 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCACH 505

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N   H     +       RNP AF PK+  S     +  +D  +   +A+H +GC+C
Sbjct: 506 DCFNKPIHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDPNKTPASARHKRGCNC 565

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           KKS CLKKYCEC+Q  + CS +CRC  CKN  G
Sbjct: 566 KKSSCLKKYCECYQGGVGCSISCRCEGCKNTYG 598


>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
          Length = 383

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 86/133 (64%), Gaps = 11/133 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 168 GSALPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 227

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 228 KIGKGKLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 276

Query: 211 NFEGSEERRALFH 223
           N+E S ER+ L +
Sbjct: 277 NYEESPERKTLMN 289


>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
 gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
          Length = 348

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 76/153 (49%), Gaps = 30/153 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC D C C 
Sbjct: 20  KRCNCKRSKCLKL-----------------------------YCECFAAGVYCLDSCACE 50

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N  E+E    +       RNP AF PK+      +    E       +A+H +GC+C
Sbjct: 51  DCFNRPEYEDTVLDTRQQIEARNPLAFAPKVVKQATNSPANEEGNWTTPSSARHKRGCNC 110

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           KKS CLKKYCEC+QA + CS  CRC  CKN  G
Sbjct: 111 KKSKCLKKYCECYQAGVGCSSGCRCEGCKNSYG 143


>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 444

 Score =  112 bits (279), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 61/148 (41%), Positives = 78/148 (52%), Gaps = 34/148 (22%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
           CNC+ S+CLKL                             YCECFA+G YC   C C  C
Sbjct: 318 CNCKRSQCLKL-----------------------------YCECFASGSYCTSKCKCNGC 348

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            NN ++    + A   TLERNP AF PKI+ S     +  +  R     A H++GC+CK+
Sbjct: 349 KNNGDNANHVKTARERTLERNPRAFSPKISESTAAVTEEGQVLR----VAAHHRGCNCKR 404

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
           S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 405 SNCRKKYCECFQAGVPCGDNCKCVDCKN 432


>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
 gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
          Length = 1987

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 61/168 (36%), Positives = 82/168 (48%), Gaps = 54/168 (32%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K C C+ S+CLKL                             YC+CFAAG YC  C+C++
Sbjct: 788 KSCRCKKSQCLKL-----------------------------YCDCFAAGQYCGSCSCIS 818

Query: 123 CHNNVEHE---VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CHN  EH    + R+E + A   R+P AF  KI  +P+G               KH +GC
Sbjct: 819 CHNRPEHADRVLQRREDIAA---RDPQAFTRKIQLAPNGN-------------GKHKRGC 862

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNF------EGSEERRAL 221
           +C+KS CLKKYCEC+Q  + C   C C++C+N       EG+  R AL
Sbjct: 863 NCRKSHCLKKYCECYQGGVKCGIQCTCMECENMDVGSSQEGAGARGAL 910



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 138 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
           G     NPNA R   A             R  + +   +K C CKKS CLK YC+CF A 
Sbjct: 749 GINANANPNAHRSSGAGGGLTQIQGGGPNRGRRTSENSSKSCRCKKSQCLKLYCDCFAAG 808

Query: 198 ILCSENCRCLDCKN 211
             C  +C C+ C N
Sbjct: 809 QYCG-SCSCISCHN 821


>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
          Length = 515

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 300 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 359

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 360 KIGKGQLG-----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 408

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 409 NYEESPERKTLM 420


>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
          Length = 511

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 81/124 (65%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G
Sbjct: 304 KITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 363

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 364 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 412

Query: 219 RALF 222
           + L 
Sbjct: 413 KTLM 416


>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
 gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
          Length = 511

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 81/124 (65%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G
Sbjct: 304 KITLAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG 363

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 364 -----------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 412

Query: 219 RALF 222
           + L 
Sbjct: 413 KTLM 416


>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
          Length = 844

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/159 (43%), Positives = 91/159 (57%), Gaps = 38/159 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S+CLKL                             YC+CFA G +C+ CN
Sbjct: 626 RPRKACNCTKSQCLKL-----------------------------YCDCFANGEFCNRCN 656

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NCHNN+E+E  RQ+A+   L+RNPNAFRPKI  S  G           ++  +HNKGC
Sbjct: 657 CNNCHNNLENEELRQKAIRGCLDRNPNAFRPKIGKSKAGG---------PEIIRRHNKGC 707

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           +CK+SGCLK YCEC++A I CS  C+C+ C+N E + ER
Sbjct: 708 NCKRSGCLKNYCECYEAKIACSSICKCVGCRNVEETLER 746


>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Oryctolagus cuniculus]
          Length = 450

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/137 (48%), Positives = 86/137 (62%), Gaps = 16/137 (11%)

Query: 86  SNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 145
           S+T PG       K  L  YC+CFA+G +C+ CNC NC NN  HE+ R +A+ A L+RNP
Sbjct: 236 SSTLPGPP-----KIALSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNP 290

Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
            AF+PKI     G+   R           HNKGC+CK+SGCLK YCEC++A I+CS  C+
Sbjct: 291 EAFQPKIGKGRPGSAKPR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICK 339

Query: 206 CLDCKNFEGSEERRALF 222
           C+ CKN+E S ER+ L 
Sbjct: 340 CVGCKNYEESPERKTLM 356


>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 71/111 (63%), Gaps = 11/111 (9%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YCECF  G YC   C+C  CHN  ++   RQ+A+ + L+RN +AF+PK+A S     
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                 ++     KH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 32/87 (36%)

Query: 47  RPRANTDGK--DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           +P+    GK  +   K  + CNC+ S CLK                             K
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLK-----------------------------K 283

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHE 130
           YCEC+ AG+ C D CNCV C N  +H+
Sbjct: 284 YCECYQAGVPCNDICNCVECKNRDDHD 310


>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
 gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
          Length = 561

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/190 (37%), Positives = 95/190 (50%), Gaps = 45/190 (23%)

Query: 41  QESPRSRPR----ANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNT 96
           Q SP+ + +    ++TDG DG     K+CNC+ +RCLK                      
Sbjct: 405 QSSPKKKRQDHKVSSTDG-DGC----KRCNCKKTRCLK---------------------- 437

Query: 97  QFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 155
                  +YC+CFAAGIYC + C C  C N +E+E    EA       NP AF PKI   
Sbjct: 438 -------RYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEARQQKESHNPLAFAPKIVQ- 489

Query: 156 PHGAQDAREDAREAQL----AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            H  +    +  +A L    + +H  GC+CK+S C+KKYCEC+QAN+ CS  CRC  CKN
Sbjct: 490 -HVTEFHAINVEDASLFTPSSGRHKTGCNCKRSMCVKKYCECYQANVGCSNACRCEGCKN 548

Query: 212 FEGSEERRAL 221
             G +E   L
Sbjct: 549 IHGRKEGEKL 558



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN---FEGSEERR 219
           R+D + +       K C+CKK+ CLK+YC+CF A I C+E C C  C N   +E +    
Sbjct: 412 RQDHKVSSTDGDGCKRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCFNRLEYEDTVLEA 471

Query: 220 ALFHGDHNGIAF 231
                 HN +AF
Sbjct: 472 RQQKESHNPLAF 483


>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
          Length = 508

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 353 KIGKGQLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 401

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 402 NYEESPERKTLM 413


>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
          Length = 662

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 100/219 (45%), Gaps = 63/219 (28%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQ-----------CNCRNSRCLKLDFVEIWINIYSNTSPG 91
           S ++ P A+  G+    KK+++           CNCR S+CLKL                
Sbjct: 334 SQKTMPNADKSGQQIHKKKRRKLQKDDGESCRHCNCRRSKCLKL---------------- 377

Query: 92  AHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER----NPN 146
                        YC CFAA +YC   C+C  C NN  HE    E V  T +R    NP 
Sbjct: 378 -------------YCPCFAAKVYCSSLCSCQGCSNNHAHE----ELVSCTRKRTESHNPL 420

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AF P +  +     +  +D+ +   +A+H +GC+C+KS CLKKYCECFQ+ + C  +CRC
Sbjct: 421 AFAPTVTHTYGTVLEFGDDSNKTPASARHKRGCNCRKSSCLKKYCECFQSGVGCCISCRC 480

Query: 207 LDCKNFEGSEERRALF--------------HGDHNGIAF 231
           + CKN  G  E   L               HG+ + +AF
Sbjct: 481 MSCKNSFGKREGVLLLTTEKLEKGGKAKGTHGEEDKLAF 519


>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
          Length = 516

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 90/148 (60%), Gaps = 22/148 (14%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 301 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQP 360

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 211 NFEGSEERRAL-----------FHGDHN 227
           N+E S E++ L           F G+H+
Sbjct: 410 NYEESPEQKTLMNMPNYMEIGGFEGNHH 437


>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
          Length = 601

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/112 (44%), Positives = 69/112 (61%), Gaps = 1/112 (0%)

Query: 104 KYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
           +YCECF+AG++C + C+C NC N   HE    ++      RNP AF PK+ S+     D 
Sbjct: 427 RYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDL 486

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +  +   +A+H +GC+C+KSGC KKYCECF   + CS NCRC+ CKN  G
Sbjct: 487 WVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 538


>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 70/111 (63%), Gaps = 11/111 (9%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YCECF  G YC   C+C  CHN  ++   RQ+A+ + L+RN +AF+PK+A S     
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                 ++     KH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 32/87 (36%)

Query: 47  RPRANTDGK--DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           +P+    GK  +   K  + CNC+ S CLK                             K
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLK-----------------------------K 283

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHE 130
           YCEC+ AG+ C D CNCV C N  +H+
Sbjct: 284 YCECYQAGVPCNDICNCVGCKNRDDHD 310


>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
           fascicularis]
          Length = 490

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+P
Sbjct: 275 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQP 334

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 335 KIGKGQLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCK 383

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 384 NYEESPERKTLM 395


>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
          Length = 263

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 85/132 (64%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN++HE+ R +A+ A L RNP AF+P
Sbjct: 39  GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQP 98

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI +S  G    R           H+KGC C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 99  KIGTSQLGDVKPR-----------HSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCK 147

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 148 NYEESPERKTLM 159


>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
          Length = 584

 Score =  109 bits (272), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 58/140 (41%), Positives = 74/140 (52%), Gaps = 41/140 (29%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           + CNC  SRCLKL                             YCEC+AAG YC   C C 
Sbjct: 407 RSCNCGKSRCLKL-----------------------------YCECYAAGTYCGPQCKCT 437

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN+E+E  RQ AV +TL++NP+AFRPKI       Q+   + +     A H +GC C
Sbjct: 438 NCQNNLENEGRRQLAVLSTLDKNPDAFRPKIK------QETTTNRK-----ANHLRGCKC 486

Query: 182 KKSGCLKKYCECFQANILCS 201
            KS CL++YCECFQA + C+
Sbjct: 487 TKSKCLQRYCECFQAGVPCT 506



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 28/45 (62%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           + C+C KS CLK YCEC+ A   C   C+C +C+N   +E RR L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451


>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 805

 Score =  109 bits (272), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 94/177 (53%), Gaps = 36/177 (20%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           +G+ KK + CNC+ ++CLKL                             YC+CFAAG +C
Sbjct: 471 NGSNKKTRICNCKKTKCLKL-----------------------------YCDCFAAGEFC 501

Query: 116 DG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
              CNC  C N V ++ AR + V   LE+NP AF  K       +Q + +  ++  LA+K
Sbjct: 502 GAECNCCECSNTVANKEARNKIVEGLLEKNPFAFNVKDIEIEEPSQLSLKAQKK--LASK 559

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 231
             KGC+C++SGCLKKYC+C+Q  + C E+C+C  C+N E  +  + LF  + N ++ 
Sbjct: 560 --KGCNCRRSGCLKKYCQCYQDGLQCGEHCKCNGCENCE--KPAKKLFLAEQNSLSL 612


>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
          Length = 391

 Score =  109 bits (272), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 60/125 (48%), Positives = 81/125 (64%), Gaps = 11/125 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ CNC NC NN+ HE+ R  A+   L+RNP AF+PKI      
Sbjct: 180 KITLAGYCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGK---- 235

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                   R   +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 236 -------GRSGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPER 288

Query: 219 RALFH 223
           + L +
Sbjct: 289 KTLMN 293


>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
           [Homo sapiens]
          Length = 529

 Score =  108 bits (271), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 314 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 373

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 374 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 422

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 423 NYEESPERKTLM 434


>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
          Length = 671

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 53/108 (49%), Positives = 64/108 (59%), Gaps = 15/108 (13%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNV-EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CFAAG++C  C+C +C N   + EV RQ        RNPNAF  KI           
Sbjct: 359 YCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRY-QIESRNPNAFANKIVD--------- 408

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
               +  + AKH KGCHCKKS CLKKYCECFQAN+ C + C+C  CKN
Sbjct: 409 ----DDSVDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452


>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
          Length = 613

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 84/168 (50%), Gaps = 34/168 (20%)

Query: 58  TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-D 116
           T   +K CNC+ ++CLKL                             YC+C AAG+YC D
Sbjct: 282 TESDRKHCNCKRTQCLKL-----------------------------YCDCLAAGVYCTD 312

Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--AREDAREAQ-LAA 173
            C C NC N  E+E   Q        R+P AF P+I +      D   +ED       +A
Sbjct: 313 SCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSA 372

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 220
           +H +GC+CKKS C KKYCEC+QA I CS+ CRC DC+N F    +RRA
Sbjct: 373 RHKRGCNCKKSMCQKKYCECYQAGIGCSDGCRCEDCRNSFGIKADRRA 420


>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
 gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
 gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
           [Homo sapiens]
          Length = 508

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 402 NYEESPERKTLM 413


>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
          Length = 508

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 402 NYEESPERKTLM 413


>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
          Length = 283

 Score =  108 bits (269), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 83/125 (66%), Gaps = 11/125 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ C+C NC NN+ HE+ R  A+ A L+RNP AF+PKI  S   
Sbjct: 71  KITLAGYCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKS--- 127

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                   R   +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+DC N+E S ER
Sbjct: 128 --------RLGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPER 179

Query: 219 RALFH 223
           + L +
Sbjct: 180 KTLMN 184


>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
           [Homo sapiens]
          Length = 463

 Score =  108 bits (269), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 248 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 307

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 308 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 356

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 357 NYEESPERKTLM 368


>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 510

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/151 (40%), Positives = 72/151 (47%), Gaps = 44/151 (29%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ SRCLKL                             YC+CFAA  +C   CNC NC
Sbjct: 225 CNCKRSRCLKL-----------------------------YCQCFAAQAFCIPSCNCQNC 255

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            N       R EA+   +ERNPNAF+ K    P+G     EDA        H  GC C+K
Sbjct: 256 LNTAGQNHLRAEAIRQIMERNPNAFQTKF--RPNG-----EDA-------VHKMGCKCRK 301

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           S CLKKYCECF A   CS+ C C+ C+N  G
Sbjct: 302 SACLKKYCECFNAGARCSDKCSCVGCQNVAG 332



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           GC+CK+S CLK YC+CF A   C  +C C +C N  G    RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266


>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 766

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)

Query: 41  QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           QE+P R R +   D  D      K+C+C+ S+CLKL                        
Sbjct: 452 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 483

Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                YCECFA+ +YC + C+C  C N+  HE            RNP AF PK+  +   
Sbjct: 484 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 538

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +  ED+     +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 539 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 594



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511


>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
          Length = 377

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/139 (45%), Positives = 85/139 (61%), Gaps = 21/139 (15%)

Query: 103 VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
           V YC+CFA G +C  CNC NC+NN +HE+ R  A+ A L+RNP AF PKI     G    
Sbjct: 174 VGYCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLG---- 229

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
                   +  +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 230 -------DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLI 282

Query: 223 ----------HGDHNGIAF 231
                     H +++ ++F
Sbjct: 283 NMPNDMEIKSHEENDAVSF 301


>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 757

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)

Query: 41  QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           QE+P R R +   D  D      K+C+C+ S+CLKL                        
Sbjct: 443 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 474

Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                YCECFA+ +YC + C+C  C N+  HE            RNP AF PK+  +   
Sbjct: 475 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 529

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +  ED+     +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 530 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 585



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502


>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
          Length = 792

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 77/172 (44%), Gaps = 48/172 (27%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
           CNC+ S+CLKL                             YC+CFA  + C   CNC +C
Sbjct: 26  CNCKRSKCLKL-----------------------------YCQCFAMQVVCTASCNCHHC 56

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
           +NN   E  R +A+   LERNP AF            DA+    E   A  H  GC C+K
Sbjct: 57  YNNANQESMRLQAINGVLERNPAAF------------DAKFRETEKHDAVVHKTGCKCRK 104

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
           S CLKKYCECF   + CSE C C+ C+N        AL+     G A  +RA
Sbjct: 105 SACLKKYCECFNKGVACSEKCNCVGCRNTV------ALYSEYRGGGAAAERA 150



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           A    GC+CK+S CLK YC+CF   ++C+ +C C  C N    E  R
Sbjct: 20  APTTDGCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMR 66


>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 511

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/151 (40%), Positives = 72/151 (47%), Gaps = 44/151 (29%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ SRCLKL                             YC+CFAA  +C   CNC NC
Sbjct: 225 CNCKRSRCLKL-----------------------------YCQCFAAQAFCIPSCNCQNC 255

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            N       R EA+   +ERNPNAF+ K    P+G     EDA        H  GC C+K
Sbjct: 256 LNTAGQNHLRAEAIRQIMERNPNAFQTKF--RPNG-----EDA-------VHKMGCKCRK 301

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           S CLKKYCECF A   CS+ C C+ C+N  G
Sbjct: 302 SACLKKYCECFNAGARCSDKCSCVGCQNVAG 332



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           GC+CK+S CLK YC+CF A   C  +C C +C N  G    RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266


>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
          Length = 644

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 88/176 (50%), Gaps = 35/176 (19%)

Query: 41  QESP-RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           QE+P R R +   D  D      K+C+C+ S+CLKL                        
Sbjct: 330 QENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKL------------------------ 361

Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                YCECFA+ +YC + C+C  C N+  HE            RNP AF PK+  +   
Sbjct: 362 -----YCECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGP 416

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +  ED+     +++H +GC+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 417 GLEFGEDSNATPASSRHKRGCNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 472



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389


>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
           gaditana CCMP526]
          Length = 365

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 75/213 (35%), Positives = 96/213 (45%), Gaps = 73/213 (34%)

Query: 41  QESPRSRPRANTDGKDGTP-----------KKQKQCNCRNSRCLKLDFVEIWINIYSNTS 89
           QE+ RS P   + G  GTP           +  K+CNCR S+CLKL              
Sbjct: 129 QEARRSPPLNGSGG--GTPQGRGTSGLAATRSSKKCNCRKSKCLKL-------------- 172

Query: 90  PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
                          YCECFA G YC D C C++C+N +  E  R +AV + L RNP+AF
Sbjct: 173 ---------------YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAF 217

Query: 149 RPKI------ASSP-----------HGAQDAREDAREAQL-------------AAKHNKG 178
             KI      A+ P            G Q +  D  E  +              A H +G
Sbjct: 218 TEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRG 277

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C+CKKS C+KKYCECF A + C ENC+C  C+N
Sbjct: 278 CNCKKSLCIKKYCECFFAGVYCGENCQCEGCQN 310



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 30/48 (62%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           A + +K C+C+KS CLK YCECF     C ++C CLDC N    E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202



 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVE 128
           + KYCECF AG+YC + C C  C NN E
Sbjct: 286 IKKYCECFFAGVYCGENCQCEGCQNNPE 313


>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
          Length = 508

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K NL  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKINLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+
Sbjct: 353 KIGKGQLG-----------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCE 401

Query: 211 NFEGSEERRALF 222
           N E S ER+ L 
Sbjct: 402 NCEESPERKTLM 413


>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
          Length = 508

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K NL  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKINLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+
Sbjct: 353 KIGKGQLG-----------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCE 401

Query: 211 NFEGSEERRALF 222
           N E S ER+ L 
Sbjct: 402 NCEESPERKTLM 413


>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 450

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S CLKL                             YCECFA G  C  CNC 
Sbjct: 144 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCH 174

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN+ +E  R  A+  TLERNP AF PKI           E  R      KH KGC+C
Sbjct: 175 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 220

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           K+SGCLK YCEC++A I CS+ CRC  C+N E S ERR+L 
Sbjct: 221 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 261


>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
 gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
          Length = 316

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 82/133 (61%), Gaps = 13/133 (9%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFA G +C  CNC NCHNN+EHE  R  A+ A LERNP+AFRPKI  +  G +    
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNER--- 194

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH- 223
                    +H+KGC+CK+SGCLK YCEC++A I C+  C+C +CKN E   ER+ L   
Sbjct: 195 ---------RHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245

Query: 224 GDHNGIAFMQRAA 236
            D   I   Q+ A
Sbjct: 246 ADAADIRVKQQTA 258


>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
          Length = 622

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/126 (48%), Positives = 79/126 (62%), Gaps = 11/126 (8%)

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
           N Q K     YCECFA+G +C+ CNC NC+NN  HE  R +A+   L+RNP AF PKI  
Sbjct: 417 NLQVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQ 476

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           S  G            +   HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E 
Sbjct: 477 SKLG-----------DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525

Query: 215 SEERRA 220
           S +++ 
Sbjct: 526 SPDKKT 531


>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
          Length = 299

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/124 (47%), Positives = 81/124 (65%), Gaps = 11/124 (8%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G
Sbjct: 92  KITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG 151

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
                       +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER
Sbjct: 152 -----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPER 200

Query: 219 RALF 222
           + L 
Sbjct: 201 KTLM 204


>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 471

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 80/172 (46%), Gaps = 50/172 (29%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-C 118
           KKQK CNC+ ++CLKL                             YC+CFAAG  C G C
Sbjct: 127 KKQKTCNCKKTKCLKL-----------------------------YCDCFAAGELCGGEC 157

Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
           NC  C N    E  R  A+ + ++R P+AF PK+  + H                   KG
Sbjct: 158 NCFGCFNTSNSE-KRAGAIVSIMDRQPDAFGPKVQQNTHK------------------KG 198

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRALFHGDHNGI 229
           C+C +SGCLKKYCEC+   + C E C+C  CKN +    ER AL     NGI
Sbjct: 199 CNCTRSGCLKKYCECYTLGVNCGEYCKCTQCKNVDLNHSERLALSMNGSNGI 250


>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 365

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 96/213 (45%), Gaps = 73/213 (34%)

Query: 41  QESPRSRPRANTDGKDGTP-----------KKQKQCNCRNSRCLKLDFVEIWINIYSNTS 89
           QE+ RS P   + G  GTP           +  K+CNCR S+CLKL              
Sbjct: 129 QEARRSPPLNGSGG--GTPQGRGTSGLAATRSSKKCNCRKSKCLKL-------------- 172

Query: 90  PGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
                          YCECFA G YC D C C++C+N +  E  R +AV + L RNP+AF
Sbjct: 173 ---------------YCECFAMGAYCRDDCGCLDCNNTLAFEQVRTQAVESILSRNPHAF 217

Query: 149 RPKI------ASSP-----------HGAQDAREDAREAQL-------------AAKHNKG 178
             KI      A+ P            G Q +  D  E  +              A H +G
Sbjct: 218 TEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQGTAGGALPTAAHVRG 277

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C+CKKS C++KYCECF A + C ENC+C  C+N
Sbjct: 278 CNCKKSLCIQKYCECFFAGVYCGENCQCEGCQN 310



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 30/48 (62%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           A + +K C+C+KS CLK YCECF     C ++C CLDC N    E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202



 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVE 128
           + KYCECF AG+YC + C C  C NN E
Sbjct: 286 IQKYCECFFAGVYCGENCQCEGCQNNPE 313


>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
          Length = 929

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 65/116 (56%), Gaps = 14/116 (12%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CFA G YC   C+C +C N VE+         A  +RNPNAF  KI +         
Sbjct: 541 YCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA--------- 591

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
               +A L  +H +GC+CKKS C+KKYCECFQA + C E+C+C  C N  G   RR
Sbjct: 592 ----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643


>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1040

 Score =  105 bits (261), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 82/184 (44%), Gaps = 54/184 (29%)

Query: 58  TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG 117
           T K++ +CNC+ S+CLKL                             YC+CF  G +C+ 
Sbjct: 707 TSKQKVKCNCKKSKCLKL-----------------------------YCDCFNQGQFCNS 737

Query: 118 -CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK-- 174
            CNC  C N   ++  R+ A+    ERNP+AF+PKI +     Q      +   L     
Sbjct: 738 ECNCTECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSL 797

Query: 175 ----------------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
                                 HNKGC+CKKSGC KKYCEC+   + CS+ C+C  C+N 
Sbjct: 798 LSKNQIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNR 857

Query: 213 EGSE 216
           + SE
Sbjct: 858 DPSE 861



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 28/50 (56%)

Query: 170 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           Q  +K    C+CKKS CLK YC+CF     C+  C C +C N E ++  R
Sbjct: 705 QTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAER 754


>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 742

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/157 (36%), Positives = 76/157 (48%), Gaps = 48/157 (30%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + +P + K CNCRNS+CLK                             +YC+CFAAG+YC
Sbjct: 493 EKSPTEAKPCNCRNSKCLK-----------------------------RYCDCFAAGLYC 523

Query: 116 DG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
              C C  CHN  E+E  RQ+A+    +R  +AF P I ++ H                 
Sbjct: 524 QAECKCEECHNKPEYEDERQKAIQKKKKRIKDAFLPVIQNNSHI---------------- 567

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
             KGCHCK S C KKYC C Q  +LCS  C+C++C+N
Sbjct: 568 --KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECEN 602



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C+C+ S CLK+YC+CF A + C   C+C +C N    E+ R
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDER 542


>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 766

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 94/181 (51%), Gaps = 39/181 (21%)

Query: 41  QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           Q SP+ + R   D  DGT    K+C+C+ S+CLKL                         
Sbjct: 440 QGSPKKK-RHKFDDGDGT--SCKRCSCKKSKCLKL------------------------- 471

Query: 101 NLVKYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSP 156
               YCECF AG++C + C+C +C N   N+E  ++ ++ + +   RNP AF PK+  + 
Sbjct: 472 ----YCECFHAGVFCSEPCSCQDCLNKPSNMEIVLSTRKQIES---RNPLAFAPKVIRTS 524

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
              Q+  E +     +A+H +GC+CKKS CLKKYCEC+Q  + CS +CR   CKN  G  
Sbjct: 525 EPGQELGEYSNRTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRYEGCKNAFGRR 584

Query: 217 E 217
           E
Sbjct: 585 E 585


>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
          Length = 783

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 12/99 (12%)

Query: 139 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 198
           A L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652

Query: 199 LCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691


>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
          Length = 503

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/126 (47%), Positives = 78/126 (61%), Gaps = 11/126 (8%)

Query: 95  NTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 154
           N   K     YCECFA+G +C+ CNC NC+NN  HE  R +A+   L+RNP AF PKI  
Sbjct: 297 NLHVKMTSAGYCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQ 356

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           S  G            +   HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E 
Sbjct: 357 SELG-----------DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405

Query: 215 SEERRA 220
           S +++ 
Sbjct: 406 SPDKKT 411


>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
 gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
          Length = 710

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 69/205 (33%), Positives = 90/205 (43%), Gaps = 64/205 (31%)

Query: 45  RSRPRANTDGKDG----TPKKQ------------KQCNCRNSRCLKLDFVEIWINIYSNT 88
           R R    +D  DG    TPKK+            K+C+C+ S+CLKL             
Sbjct: 356 RKRLSMQSDSMDGEMGRTPKKRRKSLGEKAGDGCKRCHCKKSKCLKL------------- 402

Query: 89  SPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNA 147
                           YCECFAAG+YC D C C +C N  E              RNP A
Sbjct: 403 ----------------YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLA 446

Query: 148 FRPKIAS--------------SPHGAQDA----REDAREAQLAAKHNKGCHCKKSGCLKK 189
           F PKI                S H   ++    + +  +   +A+H +GC+CKKS CLKK
Sbjct: 447 FAPKIVQTSDNTPTEGVGIWFSGHAMTNSLFALQPNCPDTTGSARHKRGCNCKKSLCLKK 506

Query: 190 YCECFQANILCSENCRCLDCKNFEG 214
           YCEC+QA + CS+ C+C  CKN  G
Sbjct: 507 YCECYQAGVGCSDGCKCNGCKNIYG 531


>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
 gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 91

 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 52/92 (56%), Positives = 61/92 (66%), Gaps = 1/92 (1%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFA+  YC+ CNC+ C NN E+E  RQ AV A +ERNPNAF+PKI    H       
Sbjct: 1   YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 196
            A  A  + +H KGC+CKKS CLKKYCECFQ 
Sbjct: 61  VA-AAGASGRHLKGCNCKKSFCLKKYCECFQV 91


>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
          Length = 364

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S CLKL                             YCECFA G  C  CNC 
Sbjct: 145 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCN 175

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN+ +E  R  A+  TLERNP AF PKI           E  R      KH KGC+C
Sbjct: 176 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 221

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           K+SGCLK YCEC++A I CS+ CRC  C+N E S ERR+L 
Sbjct: 222 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 262



 Score = 42.0 bits (97), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 42/106 (39%), Gaps = 34/106 (32%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           +G  K  K CNC+ S CLK                              YCEC+ A I C
Sbjct: 210 EGERKHTKGCNCKRSGCLK-----------------------------NYCECYEAKISC 240

Query: 116 -DGCNCVNCHN---NVEHE-VARQEAVGATLERNPNAFRPKIASSP 156
            D C C  C N   +VE   + R  A+GA   R P++F+  +   P
Sbjct: 241 SDLCRCQGCRNTEDSVERRSLMRLAAMGALRSRQPSSFKKHLVKPP 286


>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
          Length = 363

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 82/161 (50%), Gaps = 43/161 (26%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S CLKL                             YCECFA G  C  CNC 
Sbjct: 144 RKPCNCTKSHCLKL-----------------------------YCECFAQGQLCQNCNCN 174

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC NN+ +E  R  A+  TLERNP AF PKI           E  R      KH KGC+C
Sbjct: 175 NCMNNLAYEEERGRAIKMTLERNPTAFHPKIGRG--------EGER------KHTKGCNC 220

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           K+SGCLK YCEC++A I CS+ CRC  C+N E S ERR+L 
Sbjct: 221 KRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLM 261



 Score = 42.0 bits (97), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 42/106 (39%), Gaps = 34/106 (32%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           +G  K  K CNC+ S CLK                              YCEC+ A I C
Sbjct: 209 EGERKHTKGCNCKRSGCLK-----------------------------NYCECYEAKISC 239

Query: 116 -DGCNCVNCHN---NVEHE-VARQEAVGATLERNPNAFRPKIASSP 156
            D C C  C N   +VE   + R  A+GA   R P++F+  +   P
Sbjct: 240 SDLCRCQGCRNTEDSVERRSLMRLAAMGALRSRQPSSFKKHLVKPP 285


>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
          Length = 593

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 52/135 (38%), Positives = 69/135 (51%), Gaps = 30/135 (22%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   H+            RNP AF PK+  +     +  EDA +   +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418

Query: 182 KKSGCLKKYCECFQA 196
           KKS CLKKYCEC+Q+
Sbjct: 419 KKSNCLKKYCECYQS 433



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 8/128 (6%)

Query: 91  GAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATLER 143
           G HLNT     KD  V     F+  I     + +    + +H++ R+    +  G  +E 
Sbjct: 236 GLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQIIEV 295

Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
            P +    +  +P   +  R  + ++       K C+CKKS CLK YCECF A   C E 
Sbjct: 296 VPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCIEP 354

Query: 204 CRCLDCKN 211
           C C++C N
Sbjct: 355 CSCINCFN 362


>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 490

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 40/151 (26%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCVNC 123
           C C+ +RCLKL                             YC+CF   IYC   C C++C
Sbjct: 217 CTCKKTRCLKL-----------------------------YCQCFGVKIYCGPNCRCLDC 247

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 180
           HN    E ARQ A+   L RNP+AF  K   +P        D    +  +K   H  GC 
Sbjct: 248 HNVPAQEDARQNAMKVILSRNPHAFDTKFQKTP-------VDGATVETPSKLLTHKLGCK 300

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C+KS C+KKYCEC+  ++ C+ +CRC  CKN
Sbjct: 301 CRKSACMKKYCECYAGHVYCNTHCRCTGCKN 331



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 37/67 (55%), Gaps = 6/67 (8%)

Query: 159 AQDAREDARE-AQLAAKHNK-----GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           A D RE A   +Q+   H +     GC CKK+ CLK YC+CF   I C  NCRCLDC N 
Sbjct: 191 ANDGREGATSPSQIETDHLEFVQAVGCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNV 250

Query: 213 EGSEERR 219
              E+ R
Sbjct: 251 PAQEDAR 257


>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 113

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 67/111 (60%), Gaps = 1/111 (0%)

Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFAA ++C G C C NC N  E+E            R+P AF PKI  +       R
Sbjct: 1   YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +DA +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC  CKN  G
Sbjct: 61  DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111


>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 394

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 89/189 (47%), Gaps = 51/189 (26%)

Query: 59  PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
           P++ K C+C+ S CLKL                             YCEC AA   CD  
Sbjct: 45  PERTKGCSCKKSNCLKL-----------------------------YCECLAAQRMCDHR 75

Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           CNC  C N  E    R+ AV A L+RNP AF+PK+AS                 +++H +
Sbjct: 76  CNCEGCKNCSETLAERERAVAAILDRNPLAFQPKVASG----------------SSQHLR 119

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG--SEERRALFHG---DHNGIAFM 232
           GC+C+KSGC+K YCEC QA + C+  C C  C+N E   S ++  +F G   D N +   
Sbjct: 120 GCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSADDNDLRVT 179

Query: 233 QRAANAAII 241
            R +   ++
Sbjct: 180 IRPSQRKLV 188



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 3/75 (4%)

Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 224
           + Q   +  KGC CKKS CLK YCEC  A  +C   C C  CKN        ER      
Sbjct: 40  QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAIL 99

Query: 225 DHNGIAFMQRAANAA 239
           D N +AF  + A+ +
Sbjct: 100 DRNPLAFQPKVASGS 114


>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
           CCMP2712]
          Length = 113

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 60/151 (39%), Positives = 77/151 (50%), Gaps = 41/151 (27%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
           ++K C+C+ SRCLKL                             YCECFAAG  C GC C
Sbjct: 4   RKKVCSCKKSRCLKL-----------------------------YCECFAAGEICSGCKC 34

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
           V+C N+ +HE  R +AV    +RN NAF PKI                 Q    H +GC 
Sbjct: 35  VDCANDGDHEDMRLQAVDTIKQRNNNAFAPKIVDEIQ------------QDKGMHARGCR 82

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           CKKS CLKKYCEC+QA + C++ C+C +C+N
Sbjct: 83  CKKSHCLKKYCECYQAGVQCTDKCKCEECQN 113



 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K C CKKS CLK YCECF A  +CS  C+C+DC N    E+ R
Sbjct: 6   KVCSCKKSRCLKLYCECFAAGEICS-GCKCVDCANDGDHEDMR 47


>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
          Length = 407

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 81/169 (47%), Gaps = 48/169 (28%)

Query: 59  PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
           P++ K C+C+ S CLKL                             YCEC AA   CD  
Sbjct: 45  PERTKGCSCKKSNCLKL-----------------------------YCECLAAQRMCDHL 75

Query: 118 CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           CNC  C N  E    R+ AV A LERNP AF PK+AS                 +++H +
Sbjct: 76  CNCEGCKNCQETLAERERAVAAILERNPLAFLPKVASG----------------SSQHLR 119

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG--SEERRALFHG 224
           GC+C+KSGC+K YCEC QA + C+  C C  C+N E   S ++  +F G
Sbjct: 120 GCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAG 168



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 3/73 (4%)

Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 224
           + Q   +  KGC CKKS CLK YCEC  A  +C   C C  CKN +      ER      
Sbjct: 40  QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAIL 99

Query: 225 DHNGIAFMQRAAN 237
           + N +AF+ + A+
Sbjct: 100 ERNPLAFLPKVAS 112


>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
          Length = 1430

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/167 (35%), Positives = 80/167 (47%), Gaps = 43/167 (25%)

Query: 54   GKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGI 113
            G+DG+    K CNC+ S+CLKL                             YCEC+AAG+
Sbjct: 1003 GRDGSGGGFKPCNCKKSKCLKL-----------------------------YCECYAAGV 1033

Query: 114  YCDGCNCVNCHNNVEHEVAR-----QEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 168
            +C  CNC NC N  E   A        A+G  L     A    + +S   AQ      R+
Sbjct: 1034 FCRDCNCSNCLNKPEDGEAHPPYPATHALG-RLRHGDVAVPTAVLTS---AQK-----RK 1084

Query: 169  AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
             Q   +   GC CKKS CLKKYCECF+A + C  +C+C +C+N++GS
Sbjct: 1085 PQSKQRQESGCFCKKSKCLKKYCECFEAGVHCEASCKCENCENYDGS 1131



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 21/88 (23%)

Query: 144  NPNAFRPKIASSP-------HGAQDAREDAREAQL---AAKHNKG----------CHCKK 183
             P+A  P  AS+P        G+ D+R  A  A L   +A+  KG          C+CKK
Sbjct: 959  TPSAMAPPPASTPLCGPPPPPGSADSRGRAAGATLVSASARKRKGRDGSGGGFKPCNCKK 1018

Query: 184  SGCLKKYCECFQANILCSENCRCLDCKN 211
            S CLK YCEC+ A + C  +C C +C N
Sbjct: 1019 SKCLKLYCECYAAGVFC-RDCNCSNCLN 1045


>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1114

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 77/155 (49%), Gaps = 38/155 (24%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC  ++CLKL                             YC CF  G+ C D C C +C
Sbjct: 714 CNCGKTKCLKL-----------------------------YCVCFQQGLMCSDQCRCTDC 744

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA---QDA-REDAR---EAQLAAKHN 176
           +N + +   R+ A+ A  +R P+AF  K+   P  A   +DA R DA    + QL A H 
Sbjct: 745 YNKIAYLEERERAIKAIKQRYPDAFEKKVNFLPELARKIEDAERLDADLELQNQLLA-HK 803

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KGC+CKKS C KKYCECF A + C+  CRC  C+N
Sbjct: 804 KGCNCKKSACKKKYCECFLAGVKCTYLCRCEQCQN 838



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 24/34 (70%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            C+C K+ CLK YC CFQ  ++CS+ CRC DC N
Sbjct: 713 ACNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYN 746


>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 492

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 12/113 (10%)

Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
           A +    C+C NC NN+ +E+ R  A+ + LERNP AF+PKI          R D     
Sbjct: 275 APVLKSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG-------KGRADTERL- 326

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
               HNKGC+CKKS CLK YCEC++A + C+  C+C+ C+N E     R   H
Sbjct: 327 ----HNKGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEADRNHRNRGH 375


>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 495

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/159 (38%), Positives = 75/159 (47%), Gaps = 32/159 (20%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           D   K ++ C C+NS+CLKL                             YCECFA G YC
Sbjct: 247 DSEQKAREGCRCKNSKCLKL-----------------------------YCECFAKGAYC 277

Query: 116 -DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 174
            D C C+ C N  E E   +EA   TL RNP+AF  K+       QD   +A    L  K
Sbjct: 278 RDICKCLQCSNTEECENEIKEARKVTLTRNPDAFTSKLEVVGTIVQDEDIEANRGMLGYK 337

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
             KGC CK++ C KKYCEC+ A + CS  C C +C N E
Sbjct: 338 --KGCKCKRTYCKKKYCECYNAGVKCSYLCICENCHNQE 374



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 29/56 (51%), Gaps = 9/56 (16%)

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           D+ + ARE         GC CK S CLK YCECF     C + C+CL C N E  E
Sbjct: 247 DSEQKARE---------GCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECE 293


>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
          Length = 223

 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 12/99 (12%)

Query: 126 NVEHEV-ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           NV + V   Q    A L+RNP AF+PKI     G  D R           H+KGC+CK+S
Sbjct: 30  NVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRS 78

Query: 185 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           GCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 79  GCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 117


>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
          Length = 466

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/132 (41%), Positives = 82/132 (62%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  +C+CFA+G + + CNC NC NN+ HE+ R +A+ A L R+P AF+P
Sbjct: 294 GSALPGPPKTTLAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQP 353

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           +I     G           ++  +H+KGC+C++ GCL+ YCEC +A I+CS  C+C+ CK
Sbjct: 354 QIGKGQLG-----------EVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCK 402

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 403 NYEESPERKTLM 414


>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 974

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 76/153 (49%), Gaps = 36/153 (23%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ ++CLKL                             YC+CFA G  C   C CV C
Sbjct: 636 CNCKKTKCLKL-----------------------------YCDCFAVGELCGQDCKCVEC 666

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK--GCHC 181
            NN      R +A+   L+RNP AF  K    P  A+ ++++   + L   ++K  GC+C
Sbjct: 667 CNNEATLNLRNQAIEGILQRNPFAFNVK----PEEAKKSKKELNNSILQNSNSKVIGCNC 722

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +KS CLKKYC C+ + + CSE C+C +CKN E 
Sbjct: 723 RKSNCLKKYCLCYHSGMKCSEACKCTECKNLEN 755



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 28/41 (68%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+CKK+ CLK YC+CF    LC ++C+C++C N E +   R
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLR 676


>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
          Length = 289

 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 71/169 (42%), Positives = 92/169 (54%), Gaps = 18/169 (10%)

Query: 92  AHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
           +H++ Q K     YC+CFA G +C  CNC NC NN  HE  R  A+ A LERNP AF PK
Sbjct: 138 SHMHGQVKMTSEGYCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPK 197

Query: 152 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           I     G           ++   HNKGC+CK+SGCL+ YCECF+A I+CS  C+C  CKN
Sbjct: 198 IGKRKLG-----------EIKHHHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKN 246

Query: 212 FEGS-EERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 259
           +E S +E + L     N I +M    N      V +S +  L  S K +
Sbjct: 247 YEESPDENKQL-----NVINYMD-IGNKEGNSPVLTSAFNMLPKSRKDR 289


>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 712

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 77/176 (43%), Gaps = 58/176 (32%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
           C C+ S+CLKL                             YC CF+A   C   C C  C
Sbjct: 338 CTCKKSKCLKL-----------------------------YCHCFSANSMCSSFCRCTEC 368

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE-----DAREAQL------A 172
            N  E    R++AV ATL+RNPNAFR K      G + A+E      A++ QL       
Sbjct: 369 MNTTEAGTVREDAVKATLQRNPNAFRSKFVKVTGGEEFAKEWEKNERAKQMQLLNVGDSP 428

Query: 173 AK-----------------HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           AK                 H  GC+C+KS CLK+YCECF A   C  NC+CL C+N
Sbjct: 429 AKTKTTGVEMAAVAIANPTHKLGCNCRKSFCLKRYCECFGAQTPCGLNCKCLGCQN 484



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 61/153 (39%), Gaps = 23/153 (15%)

Query: 131 VARQEAVGATLERNP--NAFRPKIASSPHGAQDARED----AREAQLAAKHNKG-----C 179
            A   A  ++ E NP  N   P I S P      R      A  A+L A H        C
Sbjct: 279 TAAATAFKSSTETNPPANYPAPTIPSRPSVPVAQRSSIVLQAATARLKAHHYTDNSSVTC 338

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE--GSEERRALFHGDHNGIAFMQRAAN 237
            CKKS CLK YC CF AN +CS  CRC +C N    G+    A+        A +QR  N
Sbjct: 339 TCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAV-------KATLQRNPN 391

Query: 238 A---AIIGAVGSSGYGTLMTSNKRKSEELLFGV 267
           A     +   G   +      N+R  +  L  V
Sbjct: 392 AFRSKFVKVTGGEEFAKEWEKNERAKQMQLLNV 424


>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 978

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 55/148 (37%), Positives = 73/148 (49%), Gaps = 38/148 (25%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ ++CLKL                             YCECFA   YC +GC+C  C
Sbjct: 405 CNCKKTKCLKL-----------------------------YCECFAKLGYCGEGCHCHEC 435

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            N  E+E  RQ A+     RN +AF+PK  S     Q+ +  A   Q+     KGC+C+K
Sbjct: 436 KNRPENEDERQNAIQEAKSRNNDAFQPKTESE----QERKNKANNEQMK----KGCNCRK 487

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
           + CLKKYCECF A   C+  C+C  C+N
Sbjct: 488 TKCLKKYCECFNAGTYCNNMCKCDSCEN 515



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 26/41 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+CKK+ CLK YCECF     C E C C +CKN   +E+ R
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDER 445


>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
          Length = 664

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/157 (35%), Positives = 72/157 (45%), Gaps = 48/157 (30%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGC 118
           ++Q+ CNC+ S CLK+                             YCECFAAG +C   C
Sbjct: 260 REQRPCNCKRSMCLKM-----------------------------YCECFAAGGFCAPSC 290

Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
           +C++C N          A    L +NPNAF  K+ +                 A  H +G
Sbjct: 291 SCLSCSNTPAEMGVVMAAREVVLAKNPNAFEVKVTA-----------------ATGHRRG 333

Query: 179 CHCKKSGCLKKYCECFQANILCS-ENCRCLDCKNFEG 214
           C CK+S CLKKYCECF A   C+ E C+C DC+N EG
Sbjct: 334 CRCKRSKCLKKYCECFHAGARCNPEVCQCEDCRNTEG 370



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 26/39 (66%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           A+  + C+CK+S CLK YCECF A   C+ +C CL C N
Sbjct: 259 AREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSN 297


>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1022

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 71/147 (48%), Gaps = 35/147 (23%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCH 124
           C C+ ++CLKL                             YCECF A  YC G NC  C 
Sbjct: 766 CGCKKTKCLKL-----------------------------YCECFQAMKYCVGTNCQGCL 796

Query: 125 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 184
           N  E+E  R+ A+    +RN +AF PK   S    Q  + D+      A H+KGC+CKKS
Sbjct: 797 NKPEYEDQRKHAMELIQQRNSSAFDPKADKS---DQYYKSDSG---TKAIHSKGCNCKKS 850

Query: 185 GCLKKYCECFQANILCSENCRCLDCKN 211
            C KKYCEC+Q    C+  C+C +CKN
Sbjct: 851 DCRKKYCECYQMGAKCTHLCKCYNCKN 877



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 179 CHCKKSGCLKKYCECFQANILC-SENCR-CLDCKNFEGSEERRALFHGDHNGIAFMQRA 235
           C CKK+ CLK YCECFQA   C   NC+ CL+   +E   +         N  AF  +A
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKA 824


>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
          Length = 712

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 78/179 (43%), Gaps = 57/179 (31%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           K CNC+ S+CLKL                             YCECFA   +C   C C 
Sbjct: 449 KSCNCKKSKCLKL-----------------------------YCECFANNKFCGANCACN 479

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR----------EAQL 171
            C N+ EH+  R  A    L RNP AFRPK+ +         +D +          +A L
Sbjct: 480 GCSNHAEHDDERLHAKEQILMRNPLAFRPKVETQQDETSLFSKDIQITKINIQEQSDALL 539

Query: 172 AA-----------------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           +                  +H +GC+CKKS C KKYCECFQ  + CS+ C+C +CKN E
Sbjct: 540 SGLSKPQTFGALAVSDKDKRHFRGCNCKKSNCQKKYCECFQQGVNCSDLCKCEECKNNE 598



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 44/97 (45%), Gaps = 7/97 (7%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-----NGIAF 231
           K C+CKKS CLK YCECF  N  C  NC C  C N    ++ R   H        N +AF
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDER--LHAKEQILMRNPLAF 506

Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVA 268
             +        ++ S        + + +S+ LL G++
Sbjct: 507 RPKVETQQDETSLFSKDIQITKINIQEQSDALLSGLS 543


>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 90

 Score = 95.5 bits (236), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 68/95 (71%), Gaps = 9/95 (9%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI-ASSPH---GAQ 160
           YCECFA+G YCDGCNCVNC NN E E  RQ AV A LERNPNAFRPKI    P    G  
Sbjct: 1   YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
            A + A     AA+HNKGC+CKKSGCLKKYCECFQ
Sbjct: 61  VAIQGA-----AARHNKGCNCKKSGCLKKYCECFQ 90


>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
 gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
          Length = 641

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 83/190 (43%), Gaps = 38/190 (20%)

Query: 45  RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVK 104
           R R   N DGK       + C+C+ S+CLKL                             
Sbjct: 344 RRRKLQNDDGKSC-----RHCSCKKSKCLKL----------------------------- 369

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC CFAA +YC + C+C  C NN  HE A       T  RNP AF P +        +  
Sbjct: 370 YCACFAAKVYCSEFCSCQGCSNNHMHEEAVSHIRKQTESRNPLAFAPTVTRKCGSVSELG 429

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS--ENCRCLDCKNFEGSEERRAL 221
           +D+     +A+H +GC+C+KS CLKKYCECFQ  +L    E    L  +  +   E +  
Sbjct: 430 DDSNNTPASARHKRGCNCRKSSCLKKYCECFQVLMLLFSLEGVLLLTTEKLKKGGEAKGT 489

Query: 222 FHGDHNGIAF 231
            HG    +AF
Sbjct: 490 -HGKEEKLAF 498


>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
 gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
          Length = 478

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 55/149 (36%), Positives = 69/149 (46%), Gaps = 60/149 (40%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K+C+C+NS+CLK+                             YCECFA  I C+ C C  
Sbjct: 103 KKCHCKNSKCLKM-----------------------------YCECFANKILCNNCQCFG 133

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C NN     A  E +G                             ++ L  KH++GCHC+
Sbjct: 134 CQNNE----ANIEKIGND---------------------------QSVLIDKHSRGCHCR 162

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
           KSGCLKKYCECFQA I C+ENC+C DCKN
Sbjct: 163 KSGCLKKYCECFQAGIPCNENCKCYDCKN 191



 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           K CHCK S CLK YCECF   ILC+ NC+C  C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143


>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
          Length = 496

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 61/108 (56%), Gaps = 5/108 (4%)

Query: 104 KYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           +YC CF A + C+GC+CV+C+N+ +HE  R  A+      +P AF  ++ +    A  + 
Sbjct: 299 QYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVRAEGDAASVSV 358

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           E         +H +GC CK S CLKKYCECF+  + CS  C C DC N
Sbjct: 359 ESK-----PKQHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401


>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
          Length = 260

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 88/182 (48%), Gaps = 44/182 (24%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCN 119
           +Q+QCNC+ S+CLKL                             YC+CFA G YC+  C+
Sbjct: 60  QQRQCNCKKSKCLKL-----------------------------YCDCFAIGEYCNPICH 90

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C  C NN  ++  R+ A+    ERNPNAF  +I              +++QL   H +GC
Sbjct: 91  CFECKNNESNKEKREMALKQNKERNPNAFTSRII-------------QQSQLNN-HQRGC 136

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 239
           +CKKSGC KKYC+C+   + CS  CRC+ C+N    E ++   +  +      Q+  N+ 
Sbjct: 137 NCKKSGCQKKYCQCYLDGMECSRYCRCIGCENCNIFENKKICLNNKNIEKQIQQQKGNSD 196

Query: 240 II 241
            I
Sbjct: 197 YI 198



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           L A   + C+CKKS CLK YC+CF     C+  C C +CKN E ++E+R +
Sbjct: 56  LKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREM 106


>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
          Length = 371

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/107 (42%), Positives = 65/107 (60%), Gaps = 15/107 (14%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A+
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAS 355


>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
          Length = 276

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 72/153 (47%), Gaps = 44/153 (28%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-C 118
           ++ K CNCR S+CLKL                             YC+CFA G  C   C
Sbjct: 144 QEHKPCNCRKSKCLKL-----------------------------YCDCFAVGKLCSSKC 174

Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
           NC  C NN  + + R   +   +ERNP AF  K+              +E +    H KG
Sbjct: 175 NCCGCFNNSSNLLERNSFIQKMIERNPQAFNQKV--------------QEVESKMTHAKG 220

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C+C+KSGC KKYCEC+Q  I CS+NC+C  C N
Sbjct: 221 CNCRKSGCQKKYCECYQMGIECSDNCKCDGCLN 253


>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
          Length = 218

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/150 (37%), Positives = 73/150 (48%), Gaps = 44/150 (29%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           K CNC+ S+CLKL                             YC+CFAAG  C   CNC 
Sbjct: 83  KPCNCKKSKCLKL-----------------------------YCDCFAAGKLCSSKCNCC 113

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C NN  + + R + +   +ERNP AF  K+              +E      H+KGC+C
Sbjct: 114 GCFNNSSNLLERNQFIEKMVERNPEAFNQKV--------------KEVDYKLAHSKGCNC 159

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           +KSGC KKYCEC+Q  I CS+NC+C  C+N
Sbjct: 160 RKSGCKKKYCECYQLGIECSDNCKCDGCQN 189


>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
          Length = 348

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 71/149 (47%), Gaps = 30/149 (20%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           QCNC+ S+CLKL                             YCECF     C   CNC  
Sbjct: 167 QCNCKKSKCLKL-----------------------------YCECFTNNWVCSQSCNCTE 197

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N +++   R +A+   L RNP AF P + ++    Q  +E   +  +  +  KGC+CK
Sbjct: 198 CKNRIDNPNERSKAIEEALLRNPEAFAPILTNNGQQPQVIQEQKSQKDIQKETKKGCNCK 257

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
           KS C KKYCEC+  N  C+E C+C +C N
Sbjct: 258 KSECKKKYCECYSINQKCTELCKCENCLN 286


>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
          Length = 373

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 89/175 (50%), Gaps = 39/175 (22%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA-GIYCDGCNCVN 122
           QCNC+ S+CLKL                             YCECFA  G+    CNC +
Sbjct: 187 QCNCKKSKCLKL-----------------------------YCECFANNGVCSQSCNCQD 217

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
           C N +++   R +A+   L RN +AF  +  S+  GAQ  ++D +  +  +K N     K
Sbjct: 218 CKNRIDNPQERSKAIEEALLRNTDAFV-QCFSTKGGAQFVQQD-KPLKEPSKDNSSVIRK 275

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE--ERRALFHGDHNGIA 230
           GC+CKKSGC KKYCEC+  NI C+E C+C  C N   +E   ++ L   DHN ++
Sbjct: 276 GCNCKKSGCKKKYCECYSQNIRCNELCKCEHCLNKTDAEIQAQQDLGQEDHNNLS 330


>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 617

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 59/108 (54%), Gaps = 14/108 (12%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECF  G YC D C C  C NN ++   R++A+     RN +AF  K   + +      
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNST---- 412

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                      H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N
Sbjct: 413 ---------VTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451


>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
          Length = 396

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/150 (35%), Positives = 71/150 (47%), Gaps = 46/150 (30%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVNC 123
           CNC+ SRCLKL                             YCECF +  YC   CNC+NC
Sbjct: 151 CNCKKSRCLKL-----------------------------YCECFKSQEYCSSECNCLNC 181

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
           +NN +HE  R+EAV     RNP+ F   +        D ++   ++        GC C++
Sbjct: 182 YNNRKHESEREEAVQRMKARNPHCFENHV--------DEKKGVNKS--------GCRCRR 225

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKNFE 213
           + C KKYCECF   + C + C+C DCKN E
Sbjct: 226 THCDKKYCECFSHGVPCGDQCQCKDCKNEE 255



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 12/85 (14%)

Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----------CHCKKSGCLKKYCECF 194
           P +F   ++++   ++DA E A   + +AK  +           C+CKKS CLK YCECF
Sbjct: 109 PTSFSFLLSAA--SSRDALESAFPREKSAKSTRATQPSIPGLVTCNCKKSRCLKLYCECF 166

Query: 195 QANILCSENCRCLDCKNFEGSEERR 219
           ++   CS  C CL+C N    E  R
Sbjct: 167 KSQEYCSSECNCLNCYNNRKHESER 191


>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
 gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
          Length = 155

 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 80/173 (46%), Gaps = 43/173 (24%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           +C+C+ S+CLKL                             YCECFA+G YC D C C N
Sbjct: 1   RCSCKKSKCLKL-----------------------------YCECFASGSYCLDSCACGN 31

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----------- 171
           C N  EHE    E       RNP AF P++ S     +  +       L           
Sbjct: 32  CSNRQEHEDVVSETRQLIQSRNPLAFAPRVISPAEVVRVCQAFVSSNSLIFVSPAAAAIP 91

Query: 172 --AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
              +KH +GC+CKKS CLKKYCECFQ+ + CS+ C+C  CKN  G+++    F
Sbjct: 92  AATSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTFGAKQGDDFF 144



 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQR 234
           C CKKS CLK YCECF +   C ++C C +C N +  E    E R L     N +AF  R
Sbjct: 2   CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQ-SRNPLAFAPR 60

Query: 235 AANAA 239
             + A
Sbjct: 61  VISPA 65


>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 75/156 (48%), Gaps = 30/156 (19%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           QCNC+ S+CLKL                             YCECFA    C   CNC  
Sbjct: 163 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCTE 193

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N +++   R +A+   L RNP AF   + ++    Q   E   + + + +  KGC+CK
Sbjct: 194 CKNRIDNPNERSKAIEEALLRNPEAFSTILTNNGQQPQIIPEPKSQKEQSKETKKGCNCK 253

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           KS C KKYCEC+  N  C++ C+C +C N E  +E+
Sbjct: 254 KSECKKKYCECYSINQKCTDLCKCENCLNKEEPQEQ 289



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 50/95 (52%), Gaps = 15/95 (15%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANA 238
           C+CKKS CLK YCECF  N +CS++C C +CKN   +   R+             +A   
Sbjct: 164 CNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERS-------------KAIEE 210

Query: 239 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQS 273
           A++       + T++T+N ++ + +    + K+QS
Sbjct: 211 ALLR--NPEAFSTILTNNGQQPQIIPEPKSQKEQS 243


>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
 gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
          Length = 781

 Score = 92.4 bits (228), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 83/157 (52%), Gaps = 32/157 (20%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           ++CNC+ SRCLKL                             YCECFAAGIYC D C+CV
Sbjct: 441 RRCNCKKSRCLKL-----------------------------YCECFAAGIYCLDTCSCV 471

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIAS-SPHGAQDAREDAREAQLAA-KHNKGC 179
           NC N  E+E    +    T  RNP AF PK+ + + +   +  E+ +  + ++ +H KGC
Sbjct: 472 NCINKPEYEDTVLDMRQQTEARNPLAFAPKVVNNATNSPANMMEEGKWMKTSSSRHKKGC 531

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           +CKKS C KKYCECFQ  + C   CRC  C N  G++
Sbjct: 532 NCKKSKCSKKYCECFQGGVGCCNGCRCEGCYNPYGNK 568


>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
          Length = 386

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 73/159 (45%), Gaps = 35/159 (22%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           QCNC+ S+CLKL                             YCECFA    C   CNC +
Sbjct: 191 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCQD 221

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
           C N +++   R +A+   L RNP+AF              ++  +  +   K N     K
Sbjct: 222 CKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKSQTQFSVQQQDKPLKEPTKDNSNITRK 281

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           GC+CKKSGC KKYCEC+  N+ CS+ C+C  C N   +E
Sbjct: 282 GCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAE 320



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
           C+CKKS CLK YCECF  N +CS++C C DCKN  +  +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232


>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
           tropicalis]
          Length = 183

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 84/154 (54%), Gaps = 40/154 (25%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           K +K CNC  S+CLKL                             YC+CFA G  C+ CN
Sbjct: 9   KTKKPCNCTRSQCLKL-----------------------------YCDCFANGDLCNNCN 39

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN EHE+ R +AV A L RNP+AF+PKI           E  +   +  +HNKGC
Sbjct: 40  CSNCYNNAEHELERFKAVKACLYRNPDAFQPKI-----------EKGKIGDVKPRHNKGC 88

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           +CK+SGC K YCEC++A I CS  C+C+ CKN +
Sbjct: 89  NCKRSGCFKNYCECYEAKIACSSVCKCVGCKNVK 122


>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
          Length = 384

 Score = 92.0 bits (227), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 73/159 (45%), Gaps = 35/159 (22%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           QCNC+ S+CLKL                             YCECFA    C   CNC +
Sbjct: 191 QCNCKKSKCLKL-----------------------------YCECFANNWVCSQSCNCQD 221

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN-----K 177
           C N +++   R +A+   L RNP+AF              ++  +  +   K N     K
Sbjct: 222 CKNRIDNPQERSKAIEEALLRNPDAFAQCFQQKGQTQFSVQQQDKPLKEPTKDNSNITRK 281

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           GC+CKKSGC KKYCEC+  N+ CS+ C+C  C N   +E
Sbjct: 282 GCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAE 320



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
           C+CKKS CLK YCECF  N +CS++C C DCKN  +  +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232


>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
 gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
          Length = 141

 Score = 92.0 bits (227), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 60/168 (35%), Positives = 79/168 (47%), Gaps = 43/168 (25%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVN 122
           +C+C+ S+CLKL                             YCECFA+G YC D C C N
Sbjct: 1   RCSCKKSKCLKL-----------------------------YCECFASGSYCLDSCACGN 31

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----------- 171
           C N  EHE    E       RNP AF P++ S     +  +       L           
Sbjct: 32  CSNRQEHEDVVSETRQLIQSRNPLAFAPRVISPAEVVRVCQAFVSSNSLIFVSPAAAAIP 91

Query: 172 --AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
              +KH +GC+CKKS CLKKYCECFQ+ + CS+ C+C  CKN  G+++
Sbjct: 92  AATSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTFGAKQ 139



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQR 234
           C CKKS CLK YCECF +   C ++C C +C N +  E    E R L     N +AF  R
Sbjct: 2   CSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQ-SRNPLAFAPR 60

Query: 235 AANAA 239
             + A
Sbjct: 61  VISPA 65


>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
           trifallax]
          Length = 600

 Score = 92.0 bits (227), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/113 (45%), Positives = 67/113 (59%), Gaps = 10/113 (8%)

Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEH-EVARQEAVGATLERNPNAFRPKIAS-SPHG 158
           L  YC+CFA G  C   C C+NC N  ++ E    EA    + RNP AF+PK+    P  
Sbjct: 456 LKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP-- 513

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           AQ ++ DA +     +H KGC+CK SGC K YCECFQ  + C++NCRC+ C N
Sbjct: 514 AQMSQNDADK-----QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561


>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
          Length = 713

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 69/134 (51%), Gaps = 13/134 (9%)

Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF++  +C  GC C +C N  +H+  R EA+   L  +P AF    AS P  A 
Sbjct: 543 LKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAF--SFASLPQSAS 600

Query: 161 DA-REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS---- 215
                     + +A   +GC CKKS CLKKYCECFQ  I C+ +CRC+DC N   S    
Sbjct: 601 TTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSNHSESTAAH 660

Query: 216 -----EERRALFHG 224
                ++R A  HG
Sbjct: 661 QHPHLQKRSATAHG 674



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 52/109 (47%), Gaps = 16/109 (14%)

Query: 157 HGAQDAREDAREAQLAAKHNK-----------GCHCKKSGCLKKYCECFQANILCSENCR 205
           H  + A+ED R+ +L  +H +            C CK +GCLK YC CF +   C   C 
Sbjct: 504 HEPELAKEDKRQ-RLLLRHAQQIQASSVSSSTSCGCK-TGCLKMYCMCFSSRGFCHAGCA 561

Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAAN-AAIIGAVGSSGYGTLM 253
           C DCKN  G + +        N +A   RA + A++  +  ++G+  L+
Sbjct: 562 CDDCKN--GRKHQTERVEAIQNYLANDPRAFSFASLPQSASTTGFLHLL 608


>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
 gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
          Length = 191

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 15/99 (15%)

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
           K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     G
Sbjct: 92  KIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLG 147

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 197
           A   R           H+KGC+CK+SGCLK YCEC++A+
Sbjct: 148 AAKLR-----------HSKGCNCKRSGCLKNYCECYEAS 175


>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 120

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/118 (45%), Positives = 67/118 (56%), Gaps = 8/118 (6%)

Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA----SSPH-- 157
           YCECFAAG+YC G C C +C N  E+             RNP AF PKI     SSP   
Sbjct: 1   YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60

Query: 158 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +    ++  +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC  C N  G
Sbjct: 61  VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKYG 118


>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/150 (37%), Positives = 70/150 (46%), Gaps = 44/150 (29%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCV 121
           K CNC+ S+CLKL                             YC+CFAA   C   CNC 
Sbjct: 107 KPCNCKKSKCLKL-----------------------------YCDCFAANKLCSSKCNCC 137

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C NN  + + R       +ERNP AF  K+              +E      H+KGC+C
Sbjct: 138 GCFNNSSNLIERNLYREKMVERNPEAFNQKV--------------KEVDQKMAHSKGCNC 183

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           +KSGC KKYCEC+Q  I CS+NC+C  CKN
Sbjct: 184 RKSGCKKKYCECYQMGIECSDNCKCDGCKN 213



 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 28/46 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
           K C+CKKS CLK YC+CF AN LCS  C C  C N   +   R L+
Sbjct: 107 KPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLY 152


>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 159

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 70/132 (53%), Gaps = 22/132 (16%)

Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA- 162
           YCECFA G+YC G C C +C N  E+             RNP AF PKI     GA+ + 
Sbjct: 1   YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIV---QGAEPSP 57

Query: 163 -----------------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
                            +++A +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58  VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117

Query: 206 CLDCKNFEGSEE 217
           C  C N  G +E
Sbjct: 118 CEGCMNKYGKKE 129


>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 710

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 42/160 (26%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCV 121
           K+C C  S+CLKL                             YCECFAAG+ C   C C 
Sbjct: 525 KKCTCSKSKCLKL-----------------------------YCECFAAGLVCGLDCGCK 555

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           +C N  +   A Q+A    ++R+P AF  K++      Q++  D  +      H KGC C
Sbjct: 556 DCCNTEDSSDAIQKAKDEIMKRDPLAFEIKVS------QNSGTDKLQ------HRKGCTC 603

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           KKSGC K YCECFQ  + C++ C+C  C N E   ++  L
Sbjct: 604 KKSGCQKGYCECFQLGVPCTDFCKCSGCANCEKKSDQPCL 643



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%)

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           +I  + H   + ++     Q      K C C KS CLK YCECF A ++C  +C C DC 
Sbjct: 499 QITHNHHSHHNEQKLGNSPQQPGVSTKKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCC 558

Query: 211 NFEGSEE 217
           N E S +
Sbjct: 559 NTEDSSD 565


>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
 gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
          Length = 144

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 65/118 (55%), Gaps = 8/118 (6%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH------ 157
           YCECFAAG+YC D C C +C N  E              RNP AF PKI  +        
Sbjct: 1   YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60

Query: 158 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             +   + +  +   +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C  CKN  G
Sbjct: 61  VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 118


>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
 gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
          Length = 777

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/115 (42%), Positives = 62/115 (53%), Gaps = 10/115 (8%)

Query: 102 LVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF++  +C  GC C +C N    +  R EA+ + L  +P AF    AS P   Q
Sbjct: 547 LKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SFASLP---Q 601

Query: 161 DAREDAREAQLAAKHN----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           D         L  K +    +GC CKKS CLKKYCECFQ  I C+ +CRC+DC N
Sbjct: 602 DTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSN 656



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C CK +GCLK YC CF +   C   C C DCKN   S+  R
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTER 579


>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
          Length = 735

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 61/123 (49%), Gaps = 38/123 (30%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVN 122
           K C+C+NS+CLK+                             YCECFA+   C+GC C  
Sbjct: 326 KMCHCKNSKCLKM-----------------------------YCECFASKQLCNGCQCFG 356

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           CHNN +H  + + A  +TLERNP+AF PK   S +G         E   A KH KGCHC+
Sbjct: 357 CHNNEDHIESVENARASTLERNPDAFGPKFKQSFNG---------EKPKAEKHLKGCHCR 407

Query: 183 KSG 185
           +SG
Sbjct: 408 RSG 410



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K CHCK S CLK YCECF +  LC+  C+C  C N E   E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365


>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
          Length = 374

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 73/150 (48%), Gaps = 32/150 (21%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ S+CLKL                             YCECFA    C   CNC  C
Sbjct: 169 CNCKKSKCLKL-----------------------------YCECFANNWVCSQNCNCCEC 199

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA--REAQLAAKHNKGCHC 181
            N +++   R +A+   L RNP+AF   + ++    Q  +E+   +E+Q      KGC+C
Sbjct: 200 KNRIDNPNERSKAIEEALLRNPDAFAAILTNNGQQPQIIQEEKSQKESQKDITTRKGCNC 259

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KKS C KKYCEC+  N  C++ C+C +C N
Sbjct: 260 KKSECKKKYCECYSINQRCTDLCKCENCLN 289


>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
          Length = 265

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 92/215 (42%), Gaps = 50/215 (23%)

Query: 24  VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK------KQK----QCNCRNSRCL 73
           +PH  H +P    PP        +   NT  K+   K       QK     CNC+ S+CL
Sbjct: 1   MPHYPHPIPYWYRPPVYFPQYDGQQYQNTLSKELQSKVSQLVQYQKVSHFACNCKKSKCL 60

Query: 74  KLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVA 132
           KL                             YCECFA    C   CNC  C N +++   
Sbjct: 61  KL-----------------------------YCECFANNWVCSQNCNCCECKNRIDNPNE 91

Query: 133 RQEAVGATLERNPNAFRPKIASS---PHGAQDA-------REDAREAQLAAKHNKGCHCK 182
           R +A    L RNP+AF   + ++   P   QD         +  +E+Q+     KGC+CK
Sbjct: 92  RSKATEEALLRNPDAFAAILTNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCK 151

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           KS C KKYCEC+  N  C++ C+C +C N   S+E
Sbjct: 152 KSECKKKYCECYSINQKCTDLCKCENCLNKAESQE 186


>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
           distachyon]
          Length = 377

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 78/194 (40%), Gaps = 42/194 (21%)

Query: 25  PHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINI 84
           P   ++L L T P       R R     DG        K C C+ S+CLKL         
Sbjct: 62  PSAANQLALATTPK------RQRVEEAADGNGC-----KHCACKKSKCLKL--------- 101

Query: 85  YSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLER 143
                               YC CFA G YC + C CV C N  ++    Q      L R
Sbjct: 102 --------------------YCPCFAGGGYCSEKCGCVPCFNKSDYAETVQTTRKVLLSR 141

Query: 144 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 203
                  KI        +A EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  
Sbjct: 142 QKR-MSLKINRRSEANAEAMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLF 200

Query: 204 CRCLDCKNFEGSEE 217
           CRC DCKN  G  E
Sbjct: 201 CRCEDCKNPYGKNE 214


>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 625

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 47/107 (43%), Positives = 55/107 (51%), Gaps = 14/107 (13%)

Query: 105 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 164
           YCECFAAG +C  C+C NC N  ++    Q        RNP AF  KI ++         
Sbjct: 281 YCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT--------- 331

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           D  E      H K     KS CLKKYCECFQA +LC E C+C  CKN
Sbjct: 332 DGEEKHKKGCHCK-----KSACLKKYCECFQAGVLCQEYCKCEGCKN 373


>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
          Length = 223

 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 15/125 (12%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
           YC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+I        A+ 
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           P  +  A  D +    A       KH KGC C+KS C K YCECF+ ++ C+  C+CL+C
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCLEC 140

Query: 210 KNFEG 214
            N  G
Sbjct: 141 SNSFG 145



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 375

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 78/190 (41%), Gaps = 50/190 (26%)

Query: 58  TPKKQK---------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCEC 108
           TPK+QK          C C+ SRCLKL                             YC C
Sbjct: 66  TPKRQKVEDAADGCKHCACKKSRCLKL-----------------------------YCPC 96

Query: 109 FAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAR 167
           FA G YC + C CV C N  +     Q      L R       KI        +  EDA 
Sbjct: 97  FAGGGYCSEKCGCVPCFNKADFAETVQTTRKVLLSRQ-KRMSLKINRRLEANAETMEDAH 155

Query: 168 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRA 220
            +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DCKN  G        E +R 
Sbjct: 156 NSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCKNPFGKNEGIMADESKRF 215

Query: 221 LFHG---DHN 227
           L+ G   DH+
Sbjct: 216 LYTGADLDHS 225


>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
          Length = 142

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 51/158 (32%), Positives = 71/158 (44%), Gaps = 39/158 (24%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GC 118
           +K + C C+ SRCL+L                             YCECF  GI+C+  C
Sbjct: 10  RKVEFCKCKKSRCLQL-----------------------------YCECFVNGIFCNKSC 40

Query: 119 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
            C NC N   ++   + A      RNP+AF  K         +             H KG
Sbjct: 41  ICTNCGNTENNKKQIESAKQEAKMRNPDAFSQKFLVVKQNQYEG---------IVSHKKG 91

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           C+C K+ C KKYCECF A I C+ENC+C +C+N++  +
Sbjct: 92  CNCTKTQCTKKYCECFNAGIKCTENCKCENCENYKDEQ 129



 Score = 47.0 bits (110), Expect = 0.028,   Method: Composition-based stats.
 Identities = 19/40 (47%), Positives = 28/40 (70%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 218
           C CKKS CL+ YCECF   I C+++C C +C N E ++++
Sbjct: 15  CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54


>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
           [Oryza sativa Japonica Group]
          Length = 194

 Score = 85.9 bits (211), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 65/125 (52%), Gaps = 15/125 (12%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
           YC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+I        A+ 
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           P  +  A  D +    A       KH KGC C+KS C K YCECF+ ++ C+  C+CL+C
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCLEC 140

Query: 210 KNFEG 214
            N  G
Sbjct: 141 SNSFG 145



 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
           niloticus]
          Length = 310

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 11/77 (14%)

Query: 141 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILC 200
           L  NP+AFRPKIA    GA           +   HNKGC+CK+SGCLK+YCEC++ANI+C
Sbjct: 172 LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRSGCLKRYCECYEANIMC 220

Query: 201 SENCRCLDCKNFEGSEE 217
           + +C+C+ C+N++   +
Sbjct: 221 TSSCKCVGCRNYDAGSQ 237


>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 51/150 (34%), Positives = 65/150 (43%), Gaps = 54/150 (36%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K CNC+ S+CLK                             +YC+CFA G  C + CNCV
Sbjct: 89  KSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNCV 119

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C NN  +   R+EA    + R+P AF+                           KGC+C
Sbjct: 120 GCFNNTLNMEQRKEAKVQIINRDPGAFKQSF------------------------KGCNC 155

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KKSGC KKYCECF + + C+  CRC  C N
Sbjct: 156 KKSGCQKKYCECFLSGLACTHLCRCDGCLN 185



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 23/45 (51%), Positives = 31/45 (68%)

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           + K C+CKKS CLK+YC+CF    +CSENC C+ C N   + E+R
Sbjct: 87  NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 131


>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 68/145 (46%), Gaps = 33/145 (22%)

Query: 58  TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-D 116
           T   +K CNC+ ++CLKL                             YC+C AAG+YC D
Sbjct: 10  TESDRKHCNCKRTQCLKL-----------------------------YCDCLAAGVYCTD 40

Query: 117 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--AREDAREAQ-LAA 173
            C C NC N  E+E   Q        R+P AF P+I +      D   +ED       +A
Sbjct: 41  SCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRIVNPDTDTTDNVLQEDGNWTTPSSA 100

Query: 174 KHNKGCHCKKSGCLKKYCECFQANI 198
           +H +GC+CKKS C KKYCEC+Q  I
Sbjct: 101 RHKRGCNCKKSMCQKKYCECYQVYI 125



 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 25/35 (71%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           K C+CK++ CLK YC+C  A + C+++C C +C N
Sbjct: 15  KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLN 49


>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
          Length = 286

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 69/131 (52%), Gaps = 12/131 (9%)

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
           + +YC+CF    YC  CN   C NN E+E  RQ+A+   L +NP AF  K  +  +  + 
Sbjct: 151 IKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKTRKNFIKK 210

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--LDCKNFEGS---E 216
           +++D+            C+C KSGC  KYC C +  ILCS+NC+C    CKN +G    +
Sbjct: 211 SKKDSNLIHT-------CNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNKQGDRNQK 263

Query: 217 ERRALFHGDHN 227
           +R  +   D N
Sbjct: 264 QRSVIISNDEN 274



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 6/67 (8%)

Query: 153 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           ++  H   D  +D    +L  K+     C  + C+K+YC+CF+ N  C+ NC   +CKN 
Sbjct: 122 STQNHHDHDLDKDMAIIELCCKN-----CSNTYCIKRYCQCFKLNKYCT-NCNHPECKNN 175

Query: 213 EGSEERR 219
           + +EE R
Sbjct: 176 QENEEER 182


>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
          Length = 214

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 64/125 (51%), Gaps = 15/125 (12%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI--------ASS 155
           YC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+I        A+ 
Sbjct: 21  YCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPRIIVSVQDATAAD 80

Query: 156 PHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           P  +  A  D +    A       KH KGC C+KS C K YCECF+ ++ C+  C+C +C
Sbjct: 81  PQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNSVGCTAKCKCQEC 140

Query: 210 KNFEG 214
            N  G
Sbjct: 141 SNSFG 145



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 356

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 72/176 (40%), Gaps = 41/176 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K C C+ SRCLKL                             YC CFA G YC D C C 
Sbjct: 62  KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N        Q      L R       KI   P    +  EDA  +  +    +GC+C
Sbjct: 93  PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 227
           KKS CLKKYC+C+Q    CS  CRC DC+N  G        + +R L+ G   DH+
Sbjct: 152 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGIIADDSKRYLYTGADLDHS 207


>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
 gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
          Length = 466

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 50/91 (54%), Gaps = 14/91 (15%)

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N VE+    ++       RNPNAF  KI       +D  +DAR       H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-------EDDGDDAR-------HTKGCHCK 216

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           KS CLKKYCECFQA + C E C+C  C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247


>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
 gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
          Length = 359

 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 69/170 (40%), Gaps = 38/170 (22%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K C C+ SRCLKL                             YC CF+ G YC D C C 
Sbjct: 67  KHCACKKSRCLKL-----------------------------YCPCFSGGGYCSDKCGCQ 97

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N        Q      L R       KI   P    +  EDA  +  +    +GC+C
Sbjct: 98  PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 156

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE-------RRALFHG 224
           KKS CLKKYC+C+Q    CS  CRC DC+N  G  E       +R L+ G
Sbjct: 157 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNEGIIADDSKRYLYTG 206


>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 886

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/118 (39%), Positives = 60/118 (50%), Gaps = 17/118 (14%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVA---RQEAVGATLERNPNAFRPKIASSPH 157
           L+ YC+CF   I+C + C C++C N VE+  A   R  AV   LER P AF         
Sbjct: 19  LLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERRPEAF--------- 69

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
                RE    A+   K + GC C K+ CLKKYC CF A + C   C C +CKN  G+
Sbjct: 70  ----GRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPFGT 123



 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 61/114 (53%), Gaps = 13/114 (11%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNP-NAFRPKIASSPHGA 159
           L KYC CF A + C   CNC NC N     +  ++ V  T+E    NA +        GA
Sbjct: 95  LKKYCVCFGAKMQCRARCNCSNCKNPF-GTIRIEDGVTCTIEGGKVNALK--------GA 145

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
             A    R+    AK N  C C ++ CLK YC+CF+ +++C++NC CL+CKN E
Sbjct: 146 --AFVPIRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSE 197



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 15/77 (19%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------------FEGSEE 217
           AAK    C C K+ CL  YC+CF  NI C E+C C+DCKN               EG  E
Sbjct: 4   AAKKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLE 63

Query: 218 RRALFHGDHNGIAFMQR 234
           RR    G  +G AF +R
Sbjct: 64  RRPEAFGRESG-AFAER 79



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 48/113 (42%), Gaps = 24/113 (21%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEV---ARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           YC+CF   + C D C+C+ C N+ E+      R   +   LE+ P+ F+ K A    G  
Sbjct: 174 YCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKRPHIFQ-KTAKRKAGG- 231

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCLDCKNF 212
                            GC C+K+ C   YC+C  +   C    C C++C N 
Sbjct: 232 -----------------GCSCQKNKCRAGYCDCKGSGGKCDPAVCTCINCDNM 267


>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
 gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKG 178
           C N  E+E    E       RNP AF PKI    H  +    D  +  L    + +H  G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           C+CK+S C+KKYCEC+QAN+ CS  CRC  C+N  G +E  A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544


>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
          Length = 374

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 78/192 (40%), Gaps = 52/192 (27%)

Query: 58  TPKKQK-----------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
           TPK+QK            C C+ SRCLKL                             YC
Sbjct: 70  TPKRQKVEESADGNGCKHCACKKSRCLKL-----------------------------YC 100

Query: 107 ECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
            CFA G YC + C C  C N   +    Q      L R       KI        +A ED
Sbjct: 101 PCFAGGGYCSEKCGCQPCFNKALYAETVQTTRKVLLSRQ-KRMSLKINRRSEANAEAVED 159

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEER 218
           A  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+N  G        E +
Sbjct: 160 AHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCRNPFGKNEGIMAEESK 219

Query: 219 RALFHG---DHN 227
           R L+ G   DH+
Sbjct: 220 RFLYTGADLDHS 231


>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
          Length = 573

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 8/66 (12%)

Query: 147 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
           AF+PKI++S     D++ D        KHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484

Query: 207 LDCKNF 212
           ++C+N+
Sbjct: 485 IECRNY 490


>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
          Length = 356

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 72/176 (40%), Gaps = 41/176 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K C C+ SRCLKL                             YC CFA G YC D C C 
Sbjct: 62  KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C +        Q      L R       KI   P    +  EDA  +  +    +GC+C
Sbjct: 93  PCFSKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 227
           KKS CLKKYC+C+Q    CS  CRC DC+N  G        + +R L+ G   DH+
Sbjct: 152 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGIIADDSKRYLYTGADLDHS 207


>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
 gi|194696492|gb|ACF82330.1| unknown [Zea mays]
          Length = 278

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 50/80 (62%)

Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
           RNP AF PK+  +     + R+ + +   +A+H +GC+CKKS CLKKYCECFQ  + CS 
Sbjct: 20  RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSI 79

Query: 203 NCRCLDCKNFEGSEERRALF 222
           +CRC  CKN  G  E  A+ 
Sbjct: 80  SCRCEGCKNAFGKREGAAVL 99


>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|194699308|gb|ACF83738.1| unknown [Zea mays]
 gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|224028923|gb|ACN33537.1| unknown [Zea mays]
 gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 359

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 68/170 (40%), Gaps = 38/170 (22%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K C C+ SRCLKL                             YC CF+ G YC D C C 
Sbjct: 67  KHCACKKSRCLKL-----------------------------YCPCFSGGGYCSDKCGCQ 97

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N               L R       KI   P    +  EDA  +  +    +GC+C
Sbjct: 98  PCFNKEAFAETVHTTRKVLLSRQ-KRMSMKINRRPEANTEPMEDAHHSSSSTPPKRGCNC 156

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE-------RRALFHG 224
           KKS CLKKYC+C+Q    CS  CRC DC+N  G  E       +R L+ G
Sbjct: 157 KKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNEGIMADDSKRYLYTG 206


>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|219884557|gb|ACL52653.1| unknown [Zea mays]
 gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
          Length = 394

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 74/170 (43%), Gaps = 45/170 (26%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           S R R R   DG DG     K C+C+ SRCLKL                           
Sbjct: 72  SDRKRRRKAEDG-DGC----KTCSCKKSRCLKL--------------------------- 99

Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
             YC CFA+G++C + C C  CHN  + +   + A    L         K   +    QD
Sbjct: 100 --YCVCFASGLHCTESCGCEPCHNKPQLQGTLRMATVLPL---------KPVQTFEAGQD 148

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
             E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DC+N
Sbjct: 149 IVEQVIRSPMDLIRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRN 197


>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
          Length = 434

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 132/337 (39%), Gaps = 77/337 (22%)

Query: 52  TDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA 111
           TD      +K+K C C  ++CLK                             +YC CF  
Sbjct: 4   TDNAQKKSEKEKGCRCGRTKCLK-----------------------------QYCACFRN 34

Query: 112 GIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 170
            + C + C CV+CHN+ +HE AR  AV      +P AF+          +   ++     
Sbjct: 35  DVRCTNDCVCVDCHNDGKHEQARMMAVRLVRLNDPMAFKGT------SLELENQEVHTPN 88

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF--HGDHNG 228
              K  +GC C++S C KKYCECF A + CS NC C  C N  G++  +  +   G  + 
Sbjct: 89  GTLKTVRGCRCRRSKCQKKYCECFGAGLKCSTNCVCEGCLN--GNDPSKPFYPSGGSASS 146

Query: 229 IAFMQRA--ANAAIIGAVGSSGY-----GTLMTSNKRKSEELLFG---------VAAKDQ 272
           +A M  A  A  ++    G + +      +L TS   +  +L F           A K Q
Sbjct: 147 VARMAPAIPAGFSVSNMAGKTHFDRTSKASLDTSRASRPPQLSFSRTSAPSAEQKAEKSQ 206

Query: 273 SV----------IRNPQSQQGN--------NVQNSAVPCSQSVPVSQTANASVLGSTKSA 314
           S+          +RNP S +          +   S  P +Q V +      +      S+
Sbjct: 207 SLASRRPPISVEVRNPTSMRVACAVTEIPLHDSRSRAPVAQEVKLEDKVEVNKENKPLSS 266

Query: 315 LRSPLADILQPQDMKELCSLLVMVSSEATKTLTKKVG 351
           L +P AD++     +E  SL   +SS     LT   G
Sbjct: 267 LDNPAADLVL---ERESSSLNYYMSSTPQLNLTPTWG 300


>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 455

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 69/155 (44%), Gaps = 51/155 (32%)

Query: 59  PKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG- 117
           P  Q  CNC  SRCLKL                             YC CF AG  C   
Sbjct: 332 PAGQVACNCIRSRCLKL-----------------------------YCTCFQAGKMCQPG 362

Query: 118 -CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
            C C+ C N+VE    R++A+  TL++ P+AF+ K    P G                  
Sbjct: 363 ICTCIACANSVEDHPERKQAIKHTLQKRPDAFQTK--DRPVGL----------------- 403

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            GC CK + C++KYCECF+  + C++ C CL+C+N
Sbjct: 404 -GCACKNNKCIRKYCECFRNGLSCADKCCCLNCEN 437



 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 20/127 (15%)

Query: 105 YCECFAAGIYCD--GCNCVNCHNNVEH---EVARQEAVGATLERNPNAFRPKIASSPHGA 159
           YC+CF AG  C+   CNC  C N +     + AR  A+ A L RNP AF         G 
Sbjct: 271 YCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAFVTA------GV 324

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF--EGSE 216
             A +     Q+A      C+C +S CLK YC CFQA  +C    C C+ C N   +  E
Sbjct: 325 ASAVQKLPAGQVA------CNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPE 378

Query: 217 ERRALFH 223
            ++A+ H
Sbjct: 379 RKQAIKH 385



 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSE-NCRCLDCKN 211
           GC C KS C+  YC+CF+A   C+  NC C  CKN
Sbjct: 259 GCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293


>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
          Length = 184

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 1/64 (1%)

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFM 232
           +H KGC+CK+SGCLK YCEC++A ILCS  C+C+ CKNFE S ER+ L    D   +   
Sbjct: 3   RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62

Query: 233 QRAA 236
           Q+AA
Sbjct: 63  QQAA 66


>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
          Length = 484

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 13/116 (11%)

Query: 82  INIYSNTSPGAHLNTQFKDNL--VKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
           +N+ +  SP     +    ++   KYC+CFA+G  C+ CNC NC NN+ HE+ R +A+ A
Sbjct: 273 LNLITQQSPAEVTESSMDSDVGVRKYCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKA 332

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
            L+RNP AF+PKI   P    DA+          +HN+GC+CK+SGCLK YCEC++
Sbjct: 333 CLDRNPEAFQPKIG--PGSLGDAK---------PRHNRGCNCKRSGCLKNYCECYE 377


>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
 gi|238005872|gb|ACR33971.1| unknown [Zea mays]
          Length = 272

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 47/75 (62%)

Query: 143 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 202
           RNP AF PK+  +     D  + + +   +A+H +GC+CKKS CLKKYCECFQ  + CS 
Sbjct: 14  RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSV 73

Query: 203 NCRCLDCKNFEGSEE 217
           +CRC  CKN  G  E
Sbjct: 74  SCRCEGCKNAFGKRE 88


>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
          Length = 176

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 61/126 (48%), Gaps = 37/126 (29%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC+ S+CLKL                             YC+CF AG+ C   CNC +C
Sbjct: 85  CNCKKSKCLKL-----------------------------YCDCFTAGVTCGKDCNCCSC 115

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
           HNN EH   R+  +   +ERNP+AFRPK+ S  +      ED  E     +H KGC+CKK
Sbjct: 116 HNNDEHPKEREIVIKQIMERNPSAFRPKVESKSNS-----ED--EQDHKPRHFKGCNCKK 168

Query: 184 SGCLKK 189
           S   KK
Sbjct: 169 SNWSKK 174



 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           C+CKKS CLK YC+CF A + C ++C C  C N +   + R +
Sbjct: 85  CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREI 127


>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 405

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 77/176 (43%), Gaps = 49/176 (27%)

Query: 48  PRANTDGKDGTPKKQK--------QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           P A  +G     KK+K         C+C+ SRCLKL                        
Sbjct: 72  PPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKL------------------------ 107

Query: 100 DNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 158
                YC CFA+G +C + C C  C+N   H V +    G  L+         +  +   
Sbjct: 108 -----YCVCFASGSHCSELCGCDPCYNKPIHGVQQNTPPGLPLQ---------VVRTAEA 153

Query: 159 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
            Q++ E AR  Q+     K C CKKSGCLKKYC+C+Q    CS NC+C DCKN  G
Sbjct: 154 GQNSAEFAR-YQMDFFRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 207



 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 2/61 (3%)

Query: 153 ASSPHGAQDAREDAREAQLA--AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           A SP    +   D R+ + A   +  + C CKKS CLK YC CF +   CSE C C  C 
Sbjct: 69  AVSPPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCY 128

Query: 211 N 211
           N
Sbjct: 129 N 129


>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
          Length = 387

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 74/159 (46%), Gaps = 40/159 (25%)

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAA-GIYCDGCNCVN 122
           QCNC+ S+CLKL                             YCECFA  G+    CNC  
Sbjct: 194 QCNCKKSKCLKL-----------------------------YCECFANNGVCSQSCNCQE 224

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIAS--SPHGAQDAREDA---REAQLAAKHN- 176
           C N +++   R +A+   L RN +AF     +  +P   Q    +    +  +  +K N 
Sbjct: 225 CKNRIDNPQERSKAIEEALLRNSDAFAQCFTTKGAPQFVQQGIFNFIQDKPIKEPSKDNS 284

Query: 177 ----KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
               KGC+CKKSGC KKYCEC+  N+ C++ C+C  C N
Sbjct: 285 SVVHKGCNCKKSGCKKKYCECYSQNLKCNDLCKCEHCLN 323



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
           C+CKKS CLK YCECF  N +CS++C C +CKN  +  +ER
Sbjct: 195 CNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQER 235


>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
 gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
          Length = 409

 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 59/173 (34%), Positives = 74/173 (42%), Gaps = 45/173 (26%)

Query: 43  SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNL 102
           S R R R   DG DG     K C+C+ SRC+KL                           
Sbjct: 87  SDRKRRRKAEDG-DGC----KTCSCKKSRCMKL--------------------------- 114

Query: 103 VKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
             YC CFA+G  C + C C  C N    +VA + A+   L         K   +    QD
Sbjct: 115 --YCVCFASGSRCTESCGCEPCENKQPLQVAPRTALILPL---------KPVQTSEAGQD 163

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
             E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DC+N  G
Sbjct: 164 IVEQVIRSPMDLIRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPFG 215


>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
          Length = 298

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 9/131 (6%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           +  YC CFA G YC D C C  C N        Q      L R       KI   P    
Sbjct: 14  ICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANA 72

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG------ 214
           +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+N  G      
Sbjct: 73  EPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGII 132

Query: 215 -SEERRALFHG 224
             + +R L+ G
Sbjct: 133 ADDSKRYLYTG 143


>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
          Length = 175

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/111 (41%), Positives = 56/111 (50%), Gaps = 25/111 (22%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC+CFA G  C + CNCV C NN  +   R+EA      R+P AF+           
Sbjct: 86  LKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQAF-------- 137

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                           KGC+CKKSGC KKYCECFQ N+ C+  CRC  C N
Sbjct: 138 ----------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCVN 172



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 26/35 (74%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           K C+CKKS CLK+YC+CF    +CSENC C+ C N
Sbjct: 76  KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFN 110


>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
          Length = 190

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 65/150 (43%), Gaps = 54/150 (36%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K CNC+ S+CLK                             +YC+CFA G  C + CNCV
Sbjct: 91  KTCNCKKSQCLK-----------------------------QYCDCFANGQVCSENCNCV 121

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C NN  +   R++A      R+P AF+                           KGC+C
Sbjct: 122 GCFNNSLNNSQRKDAKLQIQARDPGAFKQAF------------------------KGCNC 157

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KKSGC KKYCECFQ N+ C+  CRC  C N
Sbjct: 158 KKSGCQKKYCECFQNNLQCTHQCRCEGCVN 187


>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 586

 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 30/111 (27%)

Query: 39  GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           G ++  R  P   T   D   KK + CNC  S+CLKL                       
Sbjct: 506 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKL----------------------- 541

Query: 99  KDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 149
                 YC+CFA G +C  CNC +C+NN+++E  RQ+A+ ATLERNP+AF+
Sbjct: 542 ------YCDCFANGEFCYNCNCRDCYNNLDNEEERQKAIRATLERNPSAFK 586



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           A   + C+C KS CLK YC+CF     C  NC C DC N   +EE R
Sbjct: 525 AYKKRPCNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 570


>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
 gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
          Length = 407

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC CFA+G +C + C C  C+N   H V R             A   K   +    QD+ 
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 163

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DCKN  G
Sbjct: 164 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 213


>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
 gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
          Length = 421

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC CFA+G +C + C C  C+N   H V R             A   K   +    QD+ 
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 177

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DCKN  G
Sbjct: 178 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 227


>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
          Length = 196

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 66/151 (43%), Gaps = 54/151 (35%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNC 120
           ++ CNC+ S+CLK                             +YC+CFA G  C + CNC
Sbjct: 96  ERSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNC 126

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
           V C NN  +   R+EA    + R+P AF+                           KGC+
Sbjct: 127 VGCFNNTLNMEQRKEAKVQIINRDPGAFKQSF------------------------KGCN 162

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           CKKSGC KKYCECF + + C+  CRC  C N
Sbjct: 163 CKKSGCQKKYCECFLSGLACTHLCRCDGCLN 193



 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 9/99 (9%)

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 180
            N +E E   +E V  +++     F+  ++++       +   +  Q+ +K   + + C+
Sbjct: 47  QNRIEME---EETVETSIQL---VFKEYLSTNQDYYFLLQISQKSLQIQSKLFINERSCN 100

Query: 181 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           CKKS CLK+YC+CF    +CSENC C+ C N   + E+R
Sbjct: 101 CKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 139


>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
           CCMP2712]
          Length = 125

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 61/118 (51%), Gaps = 21/118 (17%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFR-------PKIA 153
           L +YC+CF   I C   C C +CHN+ +HE  R EA+      NP+AF+        +  
Sbjct: 21  LKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAFKGTALELEDQEV 80

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           ++P G +             K  +GC CK+S C KKYCECF A I C+ NC C DC N
Sbjct: 81  TTPKGGK-------------KTVRGCRCKRSKCQKKYCECFSAGIPCTSNCVCTDCAN 125



 Score = 45.8 bits (107), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 30/43 (69%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +GC C ++ CLK+YC+CF+ +I C+ +C C DC N    EE+R
Sbjct: 11  RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKR 53


>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 932

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 15/117 (12%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L KYC+CFAAG +C   C C  C N+ +++   + A  +  +R+         S+   A 
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRS---------STSGEAL 869

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           +    A E     + ++GC CK++GCLK+YCECFQA   C+  C C  C N  G  E
Sbjct: 870 NGEPSASE-----RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921


>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
 gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
          Length = 554

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 29/122 (23%)

Query: 42  ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           +SP+      TD   G   ++K CNC  S+CLKL                          
Sbjct: 456 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKL-------------------------- 489

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C  C C +C NN+++EV R+ A+ + L+RNP+AF  K     + A+D
Sbjct: 490 ---YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFNLKHGEDSNNAKD 546

Query: 162 AR 163
            +
Sbjct: 547 VK 548



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           +I  SP  A     DA      A   K C+C KS CLK YC+CF     C ++C C DC 
Sbjct: 453 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 509

Query: 211 N---FEGSEERRALFHGDHNGIAF 231
           N   +E   ER      D N  AF
Sbjct: 510 NNLDYEVERERAIRSCLDRNPSAF 533


>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 76/174 (43%), Gaps = 60/174 (34%)

Query: 41  QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           Q   R + R  T+     P  +KQ+QC C+ S+CLK+                       
Sbjct: 16  QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKM----------------------- 52

Query: 99  KDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                 YC+C A G YC   C+C NCHN+ +H   R + +   +E+N +A          
Sbjct: 53  ------YCDCLAFGEYCGSECSCQNCHNDDKHPEQRSKVLEQMVEKNSSA---------- 96

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                       QL       C+C+KS C KKYCEC+ A + CSE C+C DCKN
Sbjct: 97  ------------QLK------CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKN 132


>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
          Length = 391

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)

Query: 37  PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           PP   E    S R R R   DG DG     K C+C+ SRCLKL                 
Sbjct: 51  PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 88

Query: 93  HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
                       YC C+A+G +C + C C  CHN            GA          P 
Sbjct: 89  ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 122

Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
           +   P    +A ++  E  + +  +   + C CKKSGCLKKYC+C+Q    CS NC+C D
Sbjct: 123 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 182

Query: 209 CKNFEG 214
           C+N  G
Sbjct: 183 CRNPFG 188


>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
 gi|194698372|gb|ACF83270.1| unknown [Zea mays]
 gi|194708474|gb|ACF88321.1| unknown [Zea mays]
 gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
          Length = 394

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)

Query: 37  PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           PP   E    S R R R   DG DG     K C+C+ SRCLKL                 
Sbjct: 53  PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 90

Query: 93  HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
                       YC C+A+G +C + C C  CHN            GA          P 
Sbjct: 91  ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 124

Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
           +   P    +A ++  E  + +  +   + C CKKSGCLKKYC+C+Q    CS NC+C D
Sbjct: 125 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 184

Query: 209 CKNFEG 214
           C+N  G
Sbjct: 185 CRNPFG 190


>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
 gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 84

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
           L  YC+CFAAG +C  C+C +C N  E+    Q+       R+P AF  KI  +P     
Sbjct: 2   LKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAP----- 56

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
                       KH +GC+CK+S CLKKYCECFQ
Sbjct: 57  -------GGGGGKHKRGCNCKRSHCLKKYCECFQ 83


>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 77/174 (44%), Gaps = 60/174 (34%)

Query: 41  QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQF 98
           Q   R + R  T+     P  +KQ+QC C+ S+CLK+                       
Sbjct: 16  QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKM----------------------- 52

Query: 99  KDNLVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
                 YC+C A G YC   C+C NCHN+ +H   R + +   +E+N          +P 
Sbjct: 53  ------YCDCLAFGEYCGSECSCQNCHNDDKHPEQRNKVLEQMVEKN----------TP- 95

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                      AQ+       C+C+KS C KKYCEC+ A + CSE C+C DCKN
Sbjct: 96  -----------AQIK------CNCRKSNCQKKYCECYNAGVKCSELCKCDDCKN 132


>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 81

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 36/91 (39%), Positives = 49/91 (53%), Gaps = 17/91 (18%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CFA G++C D C C +C N  +   A  +A    + ++P AF+PK+  +  G     
Sbjct: 5   YCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDASGG----- 59

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECF 194
                      H KGC C+KS CLKKYCECF
Sbjct: 60  -----------HVKGCACRKSRCLKKYCECF 79



 Score = 39.7 bits (91), Expect = 4.8,   Method: Composition-based stats.
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 186 CLKKYCECFQANILCSENCRCLDCKN 211
           CLK YC+CF   + C++ C C DC+N
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCEN 26


>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 238

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 72/165 (43%), Gaps = 49/165 (29%)

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           ++ CNC  S+CLKL                             YC+CFA G +C  C+C 
Sbjct: 55  RRPCNCTKSQCLKL-----------------------------YCDCFANGEFCSNCHCT 85

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN---KG 178
           NC NN+ +E+ R  A+ + LERNP AF+PKI +            + +++ AK      G
Sbjct: 86  NCLNNLTNELDRSRAIKSCLERNPMAFQPKIGN------------KTSEIMAKDELTLNG 133

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           C   +   +K       A + C+  C+C+ C+N E     R   H
Sbjct: 134 CT-TRDAIVKSQ----AARVSCTVRCKCVGCRNTEADRNHRNRGH 173



 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 32/62 (51%), Gaps = 4/62 (6%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG---DHNGIAFMQRA 235
           C+C KS CLK YC+CF     CS NC C +C N   +E  R+       + N +AF  + 
Sbjct: 58  CNCTKSQCLKLYCDCFANGEFCS-NCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKI 116

Query: 236 AN 237
            N
Sbjct: 117 GN 118


>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
          Length = 96

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 41/81 (50%), Gaps = 9/81 (11%)

Query: 136 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 195
           AV     RNPNAF  K         D             H KGC CKKS CLKKYCECF 
Sbjct: 2   AVNDAKNRNPNAFNIKFQIIEEKQDDK---------IVIHKKGCTCKKSNCLKKYCECFN 52

Query: 196 ANILCSENCRCLDCKNFEGSE 216
           A +LC++NC C  CKN +  E
Sbjct: 53  AGVLCNQNCVCDSCKNMQKEE 73


>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
          Length = 341

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 53/134 (39%), Gaps = 31/134 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K C C+ SRCLKL                             YC CFA G YC D C C 
Sbjct: 62  KHCACKKSRCLKL-----------------------------YCPCFAGGGYCSDKCGCQ 92

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C N        Q      L R       KI   P    +  EDA  +  +    +GC+C
Sbjct: 93  PCFNKEAFSETVQTTRKVLLSRQ-KRMSMKINRRPEANAEPMEDAHHSSSSTPPRRGCNC 151

Query: 182 KKSGCLKKYCECFQ 195
           KKS CLKKYC+C+Q
Sbjct: 152 KKSSCLKKYCDCYQ 165



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 22/41 (53%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YC CF     CS+ C C  C N E   E
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSE 102


>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
          Length = 159

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 63/150 (42%), Gaps = 54/150 (36%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K CNC+ S+CLK                             +YC+CFA G  C + CNCV
Sbjct: 60  KSCNCKKSQCLK-----------------------------QYCDCFANGQMCSENCNCV 90

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
            C NN  +   R+ A    + R+P AF+                           KGC+C
Sbjct: 91  GCFNNTLNIEQRKVAKVQIVNRDPGAFKQSF------------------------KGCNC 126

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KKSGC KKYCECF   + C+  CRC  C N
Sbjct: 127 KKSGCQKKYCECFLNGLTCTHLCRCDGCLN 156



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%)

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           + K C+CKKS CLK+YC+CF    +CSENC C+ C N   + E+R +
Sbjct: 58  NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKV 104


>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
          Length = 326

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 7/66 (10%)

Query: 148 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 207
           FRPK+ S  +      ED ++ +   +H KGC+CKKS CLKKYCEC+Q  + CSE C+C 
Sbjct: 244 FRPKVESKSNS-----EDEQDHK--PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296

Query: 208 DCKNFE 213
           +CKN E
Sbjct: 297 ECKNCE 302


>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
          Length = 372

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 56/114 (49%), Gaps = 18/114 (15%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC C+A+G +C + C C  CHN            GA          P +   P    +A 
Sbjct: 69  YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PVLPLKPVQTLEAG 114

Query: 164 EDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           ++  E  + +  +   + C CKKSGCLKKYC+C+Q    CS NC+C DC+N  G
Sbjct: 115 QEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPFG 168


>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 319

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 11/81 (13%)

Query: 133 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 192
           R +A+ + LERNP AF+PK+ ++     +             H +GC+C+KS CLK YCE
Sbjct: 20  RSQAIASILERNPLAFQPKVVANGSETNNLH-----------HLRGCNCRKSNCLKNYCE 68

Query: 193 CFQANILCSENCRCLDCKNFE 213
           C QA + C+  C C  C N E
Sbjct: 69  CHQARVSCTNRCACHKCCNTE 89


>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
          Length = 482

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 65/154 (42%), Gaps = 36/154 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCD-GCNCV 121
           K C C+ S+C+KL                             YCECF +  +C   C C 
Sbjct: 357 KGCTCKRSKCIKL-----------------------------YCECFLSKGFCGPSCTCN 387

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKI----ASSPHGAQDAREDAREAQLAAKHNK 177
           +C N  E+E    +     L RNP AF  K+    AS P G       A  + L  +H K
Sbjct: 388 DCENMEENEEEIVKIREKILSRNPKAFEKKLQIEEASQPLGMGPKAASAPSSNL--RHIK 445

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           GC CK+S C   YCEC Q    C+E C C DC N
Sbjct: 446 GCTCKRSKCSNNYCECHQHGAKCTELCSCTDCNN 479



 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           +EA L+    KGC CK+S C+K YCECF +   C  +C C DC
Sbjct: 349 KEANLS--ECKGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDC 389


>gi|38637223|dbj|BAD03489.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|38637276|dbj|BAD03540.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 141

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 50/73 (68%), Gaps = 6/73 (8%)

Query: 411 NERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYAEQERLILTRFR 470
           N+RPLSPGT+ALMCDE+D+ F  +  +   A+ ST +++       I+ EQE+ +LT FR
Sbjct: 66  NKRPLSPGTQALMCDEQDAVFQDS--KAPDATPSTTEEN----LVDIFKEQEKRVLTNFR 119

Query: 471 DFLNRLITCGSIK 483
           D+L +L TCG ++
Sbjct: 120 DYLCKLATCGKLQ 132


>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 313

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 62/148 (41%), Gaps = 43/148 (29%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           C C+  RCLKL                             YC+CFA   +C   C C  C
Sbjct: 201 CTCKKCRCLKL-----------------------------YCDCFAQSRFCGSACLCKQC 231

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            N +   V  Q A  A LERNP+AF        H A     + R   + +    GC C +
Sbjct: 232 CNRL-GAVVVQAARRAVLERNPDAF--------HRAHSL--ETRPGSVTS--GAGCGCTR 278

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
           S CLKKYC CFQ ++LC   C+C  C N
Sbjct: 279 SHCLKKYCVCFQKSLLCGAGCKCEGCLN 306



 Score = 38.9 bits (89), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 42/97 (43%), Gaps = 11/97 (11%)

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           +R     R K A++     D R     A  AA+ +  C CKK  CLK YC+CF  +  C 
Sbjct: 164 QRQAAGKRLKKAAAGVKVDDTRSPPPSAAAAAEQHNPCTCKKCRCLKLYCDCFAQSRFCG 223

Query: 202 ENCRCLDCKNFEGS----EERRAL-------FHGDHN 227
             C C  C N  G+      RRA+       FH  H+
Sbjct: 224 SACLCKQCCNRLGAVVVQAARRAVLERNPDAFHRAHS 260


>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 65.5 bits (158), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 54/115 (46%), Gaps = 16/115 (13%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC CF +G  C D C C +C N  E    R EA+    ++                +
Sbjct: 159 LKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKK--------------CHR 204

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
           D +    E  L+ + + GC CK +GC KKYCECF+   +C E C C DC N   S
Sbjct: 205 DKKVPVNEL-LSLQMSYGCKCKSTGCQKKYCECFKRGQICGEQCACEDCLNIPFS 258


>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
          Length = 204

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 78/186 (41%), Gaps = 56/186 (30%)

Query: 37  PPGKQE----SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           PP   E    S R R R   DG DG     K C+C+ SRCLKL                 
Sbjct: 53  PPAPAEANGNSDRKRRRKADDG-DGC----KACSCKKSRCLKL----------------- 90

Query: 93  HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
                       YC C+A+G +C + C C  CHN            GA          P 
Sbjct: 91  ------------YCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PV 124

Query: 152 IASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
           +   P    +A ++  E  + +  +   + C CKKSGCLKKYC+C+Q    CS NC+C D
Sbjct: 125 LPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDD 184

Query: 209 CKNFEG 214
           C+N  G
Sbjct: 185 CRNPFG 190


>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 57/121 (47%), Gaps = 16/121 (13%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC CF +G  C D C C +C N  +    R EA+    ++                +
Sbjct: 159 LKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------------CHR 204

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           D +    E  L+ + + GC CK +GC KKYCECF+   +C E C C  C N   ++ ++ 
Sbjct: 205 DKKVPVNEL-LSLQMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNIPSAQNQKE 263

Query: 221 L 221
           L
Sbjct: 264 L 264



 Score = 45.8 bits (107), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+C +S CLK+YC CF +  +C + C+C DCKN     E R
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEER 191


>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
 gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
          Length = 459

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 33/46 (71%)

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           ++ +    + KGC+CKK+GCLK+YCECFQ N  C+  CRC  CKN+
Sbjct: 186 QQPKFVDNNAKGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231


>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 103

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 45/71 (63%), Gaps = 12/71 (16%)

Query: 137 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 196
           + A L ++P   +PKI     GA  A EDA       +H KGC+CK+SGCLK YCEC++A
Sbjct: 35  ITAGLGQSP--VQPKI-----GATSAAEDATR-----RHTKGCNCKRSGCLKNYCECYEA 82

Query: 197 NILCSENCRCL 207
            I CS NC+C+
Sbjct: 83  KIACSGNCKCI 93


>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
 gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
          Length = 141

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 12/107 (11%)

Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CF  G  C+  CNC +C N  E    R+  +   L+ NP AF          ++D+ 
Sbjct: 46  YCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAF----------SEDSD 95

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           +   + Q  ++ N GC C  S CLKKYC CF A+  C++ C C DC+
Sbjct: 96  KFNTKRQRMSRGN-GCACPSSHCLKKYCSCFGADAGCTDKCSCNDCE 141



 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 218
           C C KS C+K YC+CF    LC+  C C DCKN  E  EER
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREER 75


>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
 gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
          Length = 916

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 58/136 (42%), Gaps = 35/136 (25%)

Query: 2   RSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKK 61
           +SP Q +P  Q+        N++  P   L    L P    +P     A+ D   G+  +
Sbjct: 700 KSPQQKLPILQSSSTQD---NKLCVPAKPL-FDILKPAPMPTPSPAASASLDVLGGS--R 753

Query: 62  QKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCV 121
           +K CNC  S+CLKL                             YC+CFA G +C  C C 
Sbjct: 754 RKHCNCSKSQCLKL-----------------------------YCDCFANGEFCQDCTCK 784

Query: 122 NCHNNVEHEVARQEAV 137
           +C NN+++EV R+ A+
Sbjct: 785 DCFNNLDYEVKRERAI 800



 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 8/76 (10%)

Query: 146 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
           +  +P    +P  A  A  D     L     K C+C KS CLK YC+CF     C ++C 
Sbjct: 728 DILKPAPMPTPSPAASASLDV----LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCT 782

Query: 206 CLDCKN---FEGSEER 218
           C DC N   +E   ER
Sbjct: 783 CKDCFNNLDYEVKRER 798


>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
 gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
          Length = 296

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 203
           P+A   ++ ++P      R   R  +   +   KGC CK++ C+K YC+C+Q+  +C E 
Sbjct: 5   PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64

Query: 204 CRCLDCKN 211
           C+C+DCKN
Sbjct: 65  CKCVDCKN 72


>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
          Length = 377

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 29/78 (37%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K C+C  SRCLKL                             YC+CF++G+ C  CN
Sbjct: 325 RPRKTCHCTKSRCLKL-----------------------------YCDCFSSGMMCSSCN 355

Query: 120 CVNCHNNVEHEVARQEAV 137
           C+NCHNN E+E  R EA+
Sbjct: 356 CINCHNNAENESQRHEAI 373



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           K CHC KS CLK YC+CF + ++CS +C C++C N   +E +R
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHNNAENESQR 369


>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
 gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
          Length = 296

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 145 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 203
           P+A   ++ ++P      R   R  +   +   KGC CK++ C+K YC+C+Q+  +C E 
Sbjct: 5   PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64

Query: 204 CRCLDCKN 211
           C+C+DCKN
Sbjct: 65  CKCVDCKN 72


>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
 gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 48/114 (42%), Gaps = 33/114 (28%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YCECFAAG+YC + C+C++C NN  HE    E  G    RNP AF PK+  +   A 
Sbjct: 8   LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSAS 67

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +                                F+  + CS  CRC  CKN  G
Sbjct: 68  E--------------------------------FRGGVGCSSYCRCEGCKNTFG 89



 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 32/72 (44%), Gaps = 10/72 (13%)

Query: 187 LKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQRA-----AN 237
           L  YCECF A + C E C CLDC N  G E    E R       N +AF  +      + 
Sbjct: 8   LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIES-RNPLAFAPKVIRNLDSA 66

Query: 238 AAIIGAVGSSGY 249
           +   G VG S Y
Sbjct: 67  SEFRGGVGCSSY 78


>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
          Length = 123

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 56/108 (51%), Gaps = 18/108 (16%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCEC  AG  C D C CV+C N  +    R++A+    + NP AF     S+  G +D+R
Sbjct: 26  YCECLKAGRTCTDLCACVDCLNRAKFP-ERKKALKHIKKTNPKAFL--RVSNLKGHEDSR 82

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                          C C+KS C KKYC C+Q+   C+  C+C+DCKN
Sbjct: 83  --------------CCACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER-RALFH-GDHNGIAFMQ 233
           C C K+ CLK YCEC +A   C++ C C+DC N     ER +AL H    N  AF++
Sbjct: 15  CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71


>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 54/137 (39%), Gaps = 40/137 (29%)

Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFAAG  C+  CNC  C+N +  +  ++   G                         
Sbjct: 89  YCECFAAGKACNMDCNCQGCNNTICSQSIKKVQTG------------------------- 123

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
                         GC+CKK+GCLKKYCEC+     C+  C C+DC N E SE  +    
Sbjct: 124 --------------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCNQEDSESEKENEQ 169

Query: 224 GDHNGIAFMQRAANAAI 240
                I   Q   N  I
Sbjct: 170 LQQQSIDCGQTITNQII 186


>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 46/151 (30%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCN 119
           +Q+ CNC+NS CLK                             +YC CF +G  C   C 
Sbjct: 144 QQQPCNCKNSGCLK-----------------------------RYCRCFHSGRMCLKECQ 174

Query: 120 CV-NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 178
           C  +C NN +H+  R  A+   +  +   +R             R   R+A        G
Sbjct: 175 CSEDCQNNEQHQEQRNNAI---IHVDQKCYR------------NRRMPRDALFKLDVIYG 219

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDC 209
           C C KS C K+YCEC+  N  C++ C+C DC
Sbjct: 220 CSCTKSKCRKRYCECYLRNQKCTDKCKCFDC 250



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 3/51 (5%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRC-LDCKNFEGSEERR--ALFHGDH 226
           C+CK SGCLK+YC CF +  +C + C+C  DC+N E  +E+R  A+ H D 
Sbjct: 148 CNCKNSGCLKRYCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAIIHVDQ 198


>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
 gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
          Length = 250

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 45/69 (65%), Gaps = 8/69 (11%)

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K A S  G + AR  A++  +     KGC CK+S C+K YC+C+Q+  +CS++CRC+ C+
Sbjct: 3   KPAPSGTGKRSARGQAQQPTI-----KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCR 57

Query: 211 NFEGSEERR 219
           N   ++ER+
Sbjct: 58  N---TQERK 63


>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
 gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
          Length = 590

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 49/110 (44%), Gaps = 32/110 (29%)

Query: 105 YCECFAAGIYCD--GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
           YC+C   G  CD   C+C +C N +E++  R E                           
Sbjct: 481 YCDCLKNGQACDPSSCSCASCLNTIENQNLRLEI-------------------------- 514

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRCLDCKN 211
            ++ ++ Q  AK  +GC CK S C K+YCECFQ    C    C+C DCKN
Sbjct: 515 -QEKKQKQGQAK--EGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKN 561


>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 255

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 51/115 (44%), Gaps = 19/115 (16%)

Query: 99  KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
           K N +K YC CF     C D C C  C N  EH   R  A+             K+    
Sbjct: 140 KSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNAL------------EKVKQKL 187

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           H  ++  +    +++      GC C+KS C K YCECF  N  CS +CRC DC N
Sbjct: 188 HRQKNDDDLFDRSKIW-----GCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237



 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 30/66 (45%), Gaps = 8/66 (12%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------FEGSEERRALFHGDHNGI 229
            C C KS CLK YC CF  N  CS+ C+C +CKN        F   E+ +   H   N  
Sbjct: 135 ACRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNALEKVKQKLHRQKNDD 194

Query: 230 AFMQRA 235
               R+
Sbjct: 195 DLFDRS 200


>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
          Length = 677

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 172 AAKHNKGCHCKKSGCLKKY--CECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNG 228
           A++  K C+C KS CLK +       A I+CS  C+C+ CKNFE S ER+ L H  D   
Sbjct: 518 ASRPRKPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMHLADAAE 577

Query: 229 IAFMQRAA 236
           +   Q+ A
Sbjct: 578 VRVQQQTA 585


>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
          Length = 303

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 56/148 (37%), Gaps = 47/148 (31%)

Query: 65  CNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCVNC 123
           CNC  S CLKL                             YC+CF     C + C C++C
Sbjct: 183 CNCSKSNCLKL-----------------------------YCQCFHQNKQCTELCKCLDC 213

Query: 124 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
            N   H   RQ A+             K+ S      +   D  +         GC C+K
Sbjct: 214 KNCDYHTQVRQTALEKI----------KMKSQRQKHDNDLFDLSKVW-------GCKCQK 256

Query: 184 SGCLKKYCECFQANILCSENCRCLDCKN 211
           S C K YCECF+ N  C+ +CRC DC N
Sbjct: 257 SQCQKNYCECFRRNQKCNSSCRCKDCAN 284



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 22/83 (26%)

Query: 133 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----CHCKKSGCLK 188
           R++ + A L+RN                   E  +E+ L ++ N+     C+C KS CLK
Sbjct: 151 RKQNINANLQRN------------------YEYNQESDLNSQSNEHTMNPCNCSKSNCLK 192

Query: 189 KYCECFQANILCSENCRCLDCKN 211
            YC+CF  N  C+E C+CLDCKN
Sbjct: 193 LYCQCFHQNKQCTELCKCLDCKN 215


>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
          Length = 710

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 87/226 (38%), Gaps = 75/226 (33%)

Query: 1   MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK 60
           +R+P  A+P       H+  +  VP       +  +    Q SP+ + R   D  DGT  
Sbjct: 383 LRTPPCALPGIGL---HLNALATVPKDKMTAVVAYVGESSQGSPKKK-RHKFDNGDGT-- 436

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCN 119
             K+C+C+ S+CLKL                             YCECF AG++C + C+
Sbjct: 437 SCKRCSCKKSKCLKL-----------------------------YCECFHAGVFCSEPCS 467

Query: 120 CVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 176
           C  C N   N+E  ++ +E + +   RNP AF PK+  +  G + A              
Sbjct: 468 CQGCLNMPSNMETVLSTREQIES---RNPLAFAPKVIRTEPGQELA-------------- 510

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 222
                                + CS +CRC  CKN  G  E  AL 
Sbjct: 511 -------------------GGVGCSVSCRCEGCKNAFGRREGVALL 537


>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 50/115 (43%), Gaps = 19/115 (16%)

Query: 99  KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
           K N +K YC+CF     C + C C  C N  +H   RQ A+           + KI S  
Sbjct: 205 KSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKIKSQR 254

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
               D   D            GC C+KS C K YCEC+  N  CS +CRC DC N
Sbjct: 255 QKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 128 EHEVARQEAVGATLERNPNAFRP-KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSG 185
           +H+    +    TLE+     R  K+  S +  QD  +++   +Q    +   C C KS 
Sbjct: 148 DHQKKADDPCSHTLEQRSRRIRKNKLNPSFNTTQDPNQESDLHSQQQEPNTTPCKCTKSN 207

Query: 186 CLKKYCECFQANILCSENCRCLDCKN 211
           CLK YC+CF  N  C+E C+C  CKN
Sbjct: 208 CLKLYCQCFHQNRQCTELCKCFACKN 233


>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
          Length = 330

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 30/113 (26%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC+CF  G +C + C C++C NN E      E   + + RN N              
Sbjct: 224 LKFYCQCFQQGKFCGETCECLDCCNN-EQNKELIEQTRSQINRNLN-------------- 268

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                             C CKKS CLKKYCEC+ A   C  +C+C  C N+E
Sbjct: 269 --------------DEVKCFCKKSKCLKKYCECYNAGKKCGVDCKCEGCMNYE 307



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 28/41 (68%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           +GC CKK+ CLK YC+CFQ    C E C CLDC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254


>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
 gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
          Length = 324

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 35/45 (77%), Gaps = 3/45 (6%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N   +E R+A+
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN---TEVRKAV 144


>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 78

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 157
           L  YC+CFA  + C D C C NC+N+ EH V R+ AV   L R+P+AF  K  + PH
Sbjct: 3   LKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59



 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 186 CLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 220
           CLK YC+CF   ++CS+ CRC +C N  E   ERRA
Sbjct: 2   CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRA 37


>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 46/108 (42%), Gaps = 23/108 (21%)

Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCEC A G  C   C C NC N      + ++ +    E + N    KI S         
Sbjct: 83  YCECLAKGRLCSSACRCENCQN----RTSNKKVLHVIKELDLNKNLKKIKS--------- 129

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                     +   GC CKKS CLKKYCECF +   C   C C +C+N
Sbjct: 130 ---------KRFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168



 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%)

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           R+  L    ++ C+C KS C K YCEC     LCS  CRC +C+N
Sbjct: 60  RDDSLLLSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQN 104


>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 54/118 (45%), Gaps = 23/118 (19%)

Query: 105 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCEC A G  C   C C NCHN   +++     V   L++N                   
Sbjct: 83  YCECLAKGRLCSSFCRCQNCHNKTSNKLVLN--VIEELDQNR------------------ 122

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
              R+   + +   GC C+KS CLKKYCECF +   C   C+C++C+N+   E  + +
Sbjct: 123 --NRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKSCGSGCKCVNCENYVLDEVEKII 178



 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 21/33 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C+C +SGC K YCEC     LCS  CRC +C N
Sbjct: 72  CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHN 104



 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 155
           L KYCECF +G  C  GC CVNC N V  EV  ++ + +  ++  + F  K  S+
Sbjct: 144 LKKYCECFNSGKSCGSGCKCVNCENYVLDEV--EKIICSPNQKKTSKFSSKFESA 196


>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
 gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
          Length = 237

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 154 SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           S+P      R + R+A+       KGC CK+S C+K YC+C+Q+  +C+  CRC+DC+N 
Sbjct: 2   STPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNT 61

Query: 213 E 213
           E
Sbjct: 62  E 62


>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
          Length = 262

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 48/110 (43%), Gaps = 37/110 (33%)

Query: 104 KYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 162
           KYC C   G  C   C CV+C NN                    +F              
Sbjct: 141 KYCFCAKVGQSCSNLCKCVDCSNN-------------------KSFM------------- 168

Query: 163 REDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
              +++ Q+  A+  KGC CKK+ CLK YC+CF   + CS NC+C+ CKN
Sbjct: 169 ---SQQIQVTNAQEQKGCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKN 215


>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
          Length = 275

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 16/112 (14%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC CF +G  C D C C +C N       R+ A+    ++                +
Sbjct: 161 LKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREAAINYVFKK--------------CNR 206

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           D    A E  L+ + + GC CK +GC KKYCEC +    C + C C DC N 
Sbjct: 207 DKNVPANEL-LSLQISYGCKCKSTGCQKKYCECLKRGQACGDLCSCEDCLNI 257



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRA 220
           C+C +S CLK+YC CF +  +C + C+C DCKN +  SEER A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195


>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 52/115 (45%), Gaps = 17/115 (14%)

Query: 99  KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
           K N +K YC CF  G  C + C C +C N  E+   R EAV              I    
Sbjct: 86  KTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVEY------------IKKKA 133

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           H  +      +E     K   GC+CKK+ CLK+YCECF     C+ +C C  C+N
Sbjct: 134 HRNKKV---TQEKLFETKEVWGCNCKKTRCLKRYCECFIRQKTCTVDCNCNYCEN 185



 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
           C C K+ CLK YC CF     C E C+C DCKN +    +R      H  + ++++ A+
Sbjct: 82  CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQR------HEAVEYIKKKAH 134


>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
 gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
          Length = 163

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 40/91 (43%), Gaps = 30/91 (32%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           ++CNC+ SRCLKL                             YCECFAAG YC D C+C 
Sbjct: 81  RRCNCKRSRCLKL-----------------------------YCECFAAGNYCLDTCSCE 111

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
           NC N  E+  A          RNP AF PK+
Sbjct: 112 NCFNKSEYADAVLHMRQQIEARNPVAFAPKV 142



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 34/71 (47%), Gaps = 7/71 (9%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-----NGIAFMQ 233
           C+CK+S CLK YCECF A   C + C C +C  F  SE   A+ H        N +AF  
Sbjct: 83  CNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLHMRQQIEARNPVAFAP 140

Query: 234 RAANAAIIGAV 244
           +      I  V
Sbjct: 141 KVVKHTTISPV 151


>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
          Length = 246

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 50/112 (44%), Gaps = 17/112 (15%)

Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
           + +YC CF +G  C   C C + C N   +   R EA+      N   +R          
Sbjct: 144 IKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEKCYR---------- 190

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
              R+  +EA        GC C KS C K+YCECF  N  C+E C+C DC N
Sbjct: 191 --NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKCFDCCN 240


>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
          Length = 524

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 53/120 (44%), Gaps = 34/120 (28%)

Query: 61  KQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNC 120
           K++ CNC  S+CLKL                             YC+CFA G +C  CNC
Sbjct: 325 KKRPCNCTKSQCLKL-----------------------------YCDCFANGEFCYNCNC 355

Query: 121 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 180
            +CHN  + + ARQ+A+     RN + F  +      GA ++   + E  ++ K + G +
Sbjct: 356 KDCHNTQKDDYARQKAIRC---RNTDQFLQEFHY--FGASESSRKSLEDGISPKLSPGLY 410



 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C+C KS CLK YC+CF     C  NC C DC N +  +  R
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHNTQKDDYAR 368


>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 957

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 163 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           R++A  ++   K  + C CKKS CLKKYCECF  ++ C+  C+C +C N
Sbjct: 643 RQEAINSKALTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 32/59 (54%), Gaps = 6/59 (10%)

Query: 174 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEGSEERRALFHGDHNGIAF 231
           K +KGC CKK+ CLK YC CF  +I C  + C C  C N +    R     GD NG+A 
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARD----GD-NGMAM 366


>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
 gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
          Length = 225

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   +EER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66


>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
 gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
          Length = 239

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 29/37 (78%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
           KGC CK+S C+K YC+C+Q+  +C++ CRC+ C+N E
Sbjct: 26  KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62


>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 636

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 212
           GC C++S CLKKYCECFQ    C  NC+C +C+N 
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279


>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
          Length = 285

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 52/115 (45%), Gaps = 23/115 (20%)

Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RNPNAFRPKIASSP 156
           L +YC CF +G  C   C CV  C NN +H   R +A+    E   RN N          
Sbjct: 168 LKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRNKNV--------- 218

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                     ++A    K   GC+CKK+ C   YCECF     C+ +C+C +C+N
Sbjct: 219 ---------PKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMDCQCDNCEN 264



 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLD 208
           C C+ SGCLK+YC CF +  +C ++C+C++
Sbjct: 160 CFCRNSGCLKRYCRCFHSGRMCLKDCQCVE 189


>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 47/108 (43%), Gaps = 18/108 (16%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC CF   I C + C C +CHN  ++   R +A+           + K+        D  
Sbjct: 236 YCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKVKQQRRKHDDDL 285

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
            D            GC C+KS C K YCECF  N  CS  C+C +C+N
Sbjct: 286 FDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 155 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           + H  +   +D  E  L   +N  C C KS CLK YC CF  NI CSE C+C DC N
Sbjct: 204 TTHYREFFDKDLDEHNL---NNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257


>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 49/111 (44%), Gaps = 16/111 (14%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF  G  C + C C +C N   H   R EAV    ++               A 
Sbjct: 90  LKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKK---------------AH 134

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
             ++  +E     K   GC+CKK+ CLK+YCEC+     C+  C C  C+N
Sbjct: 135 RNKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVECNCNHCEN 185



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 237
           C C K+ CLK YC CF    +C E+C+C DCKN          FH     + ++++ A+
Sbjct: 82  CSCTKTYCLKMYCSCFHNGQICGESCKCEDCKN------TNMHFHQRDEAVEYIKKKAH 134


>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
          Length = 514

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 46/121 (38%), Gaps = 45/121 (37%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC+C AAG+YC D C C NC N  E+E   Q        R+P AF P+I        
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERR 219
                                              A I CS+ CRC DC+N F    +RR
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFGIKADRR 320

Query: 220 A 220
           A
Sbjct: 321 A 321


>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 381

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 50/134 (37%), Gaps = 53/134 (39%)

Query: 58  TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG 117
            PK  K C C+N+ CLKL                             YC CF  G +CD 
Sbjct: 272 VPKTVKPCKCKNTHCLKL-----------------------------YCTCFQKGSFCDP 302

Query: 118 --CNCVNCHNNVEHEVA---RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 172
             C C++C+N  E       RQEAV   L R                   R DA E++  
Sbjct: 303 DICKCIDCYNLREFNETGGKRQEAVSEILLR-------------------RIDAFESRPK 343

Query: 173 AKHNKGCHCKKSGC 186
            K  +GC CKK+ C
Sbjct: 344 KKTGEGCACKKNRC 357



 Score = 45.4 bits (106), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF 212
           K C CK + CLK YC CFQ    C  + C+C+DC N 
Sbjct: 277 KPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNL 313


>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
 gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
          Length = 234

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   ++ER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66


>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
          Length = 754

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 25/42 (59%)

Query: 175 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           H  GC C+KS CLKKYCECF   + C  +C C +C N    E
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGNQPAQE 462



 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 1/40 (2%)

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEG 214
           +KGC C K+ CLK YC CF  N+LC+++ C C DCKN E 
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEA 270


>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
 gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
          Length = 225

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           +GC CK+S C+K YC+C+Q+  +CS +C+C+ C+N   +E+R A+
Sbjct: 22  RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63


>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 53/115 (46%), Gaps = 17/115 (14%)

Query: 99  KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
           K N +K YC CF  G  C + C C +C N   +   R +AV    ++   A R K  S  
Sbjct: 86  KTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKK---AHRNKKVS-- 140

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                     +E     K   GC+CKK+ CLK+YCEC+     C+  C C  C+N
Sbjct: 141 ----------QETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECNCTHCEN 185



 Score = 42.4 bits (98), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 21/33 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C C K+ CLK YC CF     C E+C+C DCKN
Sbjct: 82  CSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKN 114


>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
 gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
          Length = 240

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 27/35 (77%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           KGC CK+S C+K YC+C+Q+  +C++ CRC  C+N
Sbjct: 23  KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57


>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 47/111 (42%), Gaps = 16/111 (14%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF  G  C + C C +C N  E+   R  AV              I    H  +
Sbjct: 90  LKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEY------------IRKKAHRNK 137

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                 +E     K   GC+CKK+ C K+YCECF     C+ +C C  C+N
Sbjct: 138 KV---TQEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTCTVDCNCNHCEN 185



 Score = 45.4 bits (106), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 7/77 (9%)

Query: 161 DAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           D  E+  E+Q L     + C C K+ CLK YC CF     C E C+C DC+N +    +R
Sbjct: 63  DLDENKEESQILENTKWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQR 122

Query: 220 ALFHGDHNGIAFMQRAA 236
                 HN + ++++ A
Sbjct: 123 ------HNAVEYIRKKA 133


>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
          Length = 278

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 49/115 (42%), Gaps = 23/115 (20%)

Query: 102 LVKYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RNPNAFRPKIASSP 156
           L +YC CF +G  C   C CV  C NN +H   R  A+    E   RN N          
Sbjct: 161 LKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRNKNI--------- 211

Query: 157 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                     ++A    K   GC CKKS C   YCECF     C+ +C C +C+N
Sbjct: 212 ---------PKDALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDCECQNCEN 257



 Score = 42.0 bits (97), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 1/44 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLD-CKNFEGSEERRAL 221
           C C+ SGCLK+YC CF +  +C + C+C++ C N E   E R +
Sbjct: 153 CFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNM 196


>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 47/111 (42%), Gaps = 16/111 (14%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF  G +C   C C  C N  E ++ R +AV    ++               A 
Sbjct: 97  LKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKK---------------AH 141

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
             ++  +E         GC+C K+ C+KKYCECF     C+  C C  C N
Sbjct: 142 RNKKVPKEKIFETVEIWGCNCSKTRCVKKYCECFIRGKKCTVECNCDHCDN 192



 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 26/41 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C C+K+ CLK YC CF     C ++CRC +C+N E  + +R
Sbjct: 89  CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKR 129


>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
          Length = 97

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 30/47 (63%)

Query: 171 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           ++   ++ C C KS C+K YC CF A   CS+NCRC +CKN E  E+
Sbjct: 1   MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47



 Score = 46.2 bits (108), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 34/68 (50%), Gaps = 6/68 (8%)

Query: 99  KDNLVK-YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 156
           K N VK YC CF+A  YC   C C NC N   +E   +E V     +NP AF PKI    
Sbjct: 13  KSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKIVR-- 70

Query: 157 HGAQDARE 164
              QDA E
Sbjct: 71  --VQDASE 76


>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 46/108 (42%), Gaps = 24/108 (22%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFA G  C   C C NC N         + +   LE+                    
Sbjct: 108 YCECFARGKTCGKHCGCQNCQN---------KQMNKKLEKIQKK--------------LI 144

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           +  R  Q  +++ KGC C KS C  K+C C Q    C+  CRC+DCKN
Sbjct: 145 KQKRIKQRTSRYLKGCTCGKSKCQNKFCSCHQDGKYCNSGCRCVDCKN 192


>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
          Length = 523

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 48/93 (51%), Gaps = 29/93 (31%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 451 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 481

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 482 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 514


>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
          Length = 259

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 52/111 (46%), Gaps = 17/111 (15%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L  YC CF     C   C C  C N+ +H     + V   +E+     + K   + H   
Sbjct: 151 LQLYCSCFHNRRQCTQECKCSECFNDGQHS----DEVLKAIEQ----IKIKEQRASHHDL 202

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           D+  D R+         GC CKK+ C+K YCECF  +  CS +C+C +C+N
Sbjct: 203 DSF-DTRQVW-------GCKCKKTQCIKGYCECFIRHKKCSSHCQCSNCQN 245



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           K CHC K+ CL+ YC CF     C++ C+C +C N
Sbjct: 141 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCSECFN 175


>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
 gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
 gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
          Length = 243

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 28/38 (73%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
           C CK+S C+K YC+C+Q+  +C++ CRC+ C+N E  E
Sbjct: 28  CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65


>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
          Length = 254

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 53/145 (36%), Gaps = 49/145 (33%)

Query: 80  IWINIYSNTSPGAHLNTQF-------------KDNLVKY-CECFAAGIYCD-GCNCVNCH 124
           + +N      P  HL  Q+             K   +KY CECF  G  C   C C  C 
Sbjct: 116 VIVNKLEKVQPSDHLQQQYQNIRNESITCRCKKTKCIKYYCECFQYGQECSFKCECTGCC 175

Query: 125 NN-VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 183
           N  V+    + + V  T+E                                  +GC C+K
Sbjct: 176 NKQVKQSSGKMQLVSPTIEF---------------------------------QGCKCQK 202

Query: 184 SGCLKKYCECFQANILCSENCRCLD 208
             C KKYCEC + N  C++ C+C+D
Sbjct: 203 QNCSKKYCECLKRNQRCTDLCKCID 227



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C CKK+ C+K YCECFQ    CS  C C  C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176


>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
          Length = 192

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 46/110 (41%), Gaps = 40/110 (36%)

Query: 105 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFA G  C   CNC+ C+N+   +  ++   G                         
Sbjct: 85  YCECFAVGRSCSIDCNCLGCNNSNCSQKIKKSQCG------------------------- 119

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
                         GC+CKK+GC+KKYCEC+     C+  C C+DC N E
Sbjct: 120 --------------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQE 155


>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 379

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 57/121 (47%), Gaps = 19/121 (15%)

Query: 104 KYCECFAAGIYC--DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
           KYC C+A   +C    C C +C N        Q++  +T ++     +PK+ SS      
Sbjct: 257 KYCSCYANNNFCVPSICKCQSCMNTF------QQSYNSTSQK-----KPKLISSQKSLSQ 305

Query: 162 A--REDAREAQLAAKHNKG---CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 216
               +     QL  +       C C+KS C K+YC+C+++   CS  C+C +C N +G +
Sbjct: 306 VILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCKCQNCFN-QGQK 364

Query: 217 E 217
           E
Sbjct: 365 E 365


>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
          Length = 244

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 58/151 (38%), Gaps = 26/151 (17%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YCECFA G  C   C C NC N     +   + +G   +R                    
Sbjct: 108 YCECFARGKTCGKHCGCQNCQN-----MQMNKKLGKIQKR------------------LV 144

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR--AL 221
           +  R  Q   ++ KGC C KS C   +C C Q    C+  C+C+DCKN      +    +
Sbjct: 145 KQKRIKQRTPRYLKGCTCGKSKCQNNFCSCHQDGKYCNSECKCVDCKNVYNFCLKIFPEI 204

Query: 222 FHGDHNGIAFMQRAANAAIIGAVGSSGYGTL 252
            H + N +          IIG   SS +  L
Sbjct: 205 EHVESNEMESTNANVVLFIIGCENSSNFQNL 235


>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 622

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%)

Query: 156 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 215
           P+    + + ++E ++       C C+KS CLK YCECF    +C   C CL+CKN +  
Sbjct: 306 PNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNH 365

Query: 216 EERR 219
            E R
Sbjct: 366 LELR 369



 Score = 45.4 bits (106), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 46/117 (39%), Gaps = 42/117 (35%)

Query: 40  KQESPRSRPRANTDGKDG-------TPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGA 92
           K++ P ++ ++    K+G       TP     C CR S CLKL                 
Sbjct: 302 KKQQPNNKLQSIDSSKEGEILRVSKTP-----CKCRKSLCLKL----------------- 339

Query: 93  HLNTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 148
                       YCECFA G  C   C C+ C N+  H   R EA+     +NP+AF
Sbjct: 340 ------------YCECFARGEICGHACVCLECKNSKNHLELRNEAIKVIEAKNPSAF 384


>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
 gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
          Length = 113

 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 19  KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59



 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 43/107 (40%), Gaps = 34/107 (31%)

Query: 47  RPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYC 106
           R +   D  D      K+C+C+ S+CLKL                             YC
Sbjct: 7   RQKCQNDNVDSC----KRCSCKKSKCLKL-----------------------------YC 33

Query: 107 ECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 152
           ECFA+ +YC + C+C  C N+  HE            RNP AF PK+
Sbjct: 34  ECFASKVYCSESCSCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKV 80


>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score = 46.6 bits (109), Expect = 0.040,   Method: Composition-based stats.
 Identities = 43/165 (26%), Positives = 62/165 (37%), Gaps = 50/165 (30%)

Query: 48  PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCE 107
           P+    G D    K+  C C+ ++CLKL                             YC+
Sbjct: 6   PKLQIQGNDSEETKEG-CQCQKTKCLKL-----------------------------YCQ 35

Query: 108 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 166
           CF  G  C   C C  C N++   + R +A+G  + +  + ++ K               
Sbjct: 36  CFHDGKCCGQSCGCNGCKNSLNDYLERNKAIGKIIHKYKDDYKQK--------------- 80

Query: 167 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
               L+    +GC CKKS C   YCECF     C E C C  C+N
Sbjct: 81  ----LSGDKGQGCCCKKSKCKLNYCECFIKGRSCGEQCHCKKCEN 121


>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 45.8 bits (107), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 45/114 (39%), Gaps = 22/114 (19%)

Query: 102 LVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGAT---LERNPNAFRPKIASSPH 157
           L  YC CF  G  C   C C  C N  ++ + R++AV       +RN      KI  +  
Sbjct: 97  LQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYAKKKAQRNKKVPEGKIFETTE 156

Query: 158 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
                               GC+C K+ C+K YCECF     CS  C C  C N
Sbjct: 157 IW------------------GCNCSKTRCVKNYCECFIRGKKCSVECNCQHCDN 192



 Score = 38.5 bits (88), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
            C C KS CL+ YC CF    +C ++C+C +C+N E +  +R
Sbjct: 88  SCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKR 129


>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
          Length = 289

 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 23/33 (69%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C CK+S C+KKYCECFQ +  C+  C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190


>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
          Length = 177

 Score = 45.8 bits (107), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 33/73 (45%), Gaps = 5/73 (6%)

Query: 93  HLNTQFKDNLVKYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 151
           H   + +     YC CF+A  YC   C C NC N   +E   +E V     +NP AF PK
Sbjct: 88  HFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPK 147

Query: 152 IASSPHGAQDARE 164
           I       QDA E
Sbjct: 148 IVR----VQDASE 156



 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 179 CHCKKS--GCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           CH KK   G  + YC CF A   CS+NCRC +CKN E  E+
Sbjct: 87  CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127


>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 23/111 (20%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC+CFA G  C   C C  CHN  ++   R+EA+     +N +  +      P G  
Sbjct: 66  LKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQNQSIEK----EVPVGIS 121

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
              + ++                   LK+YC+CFQ N  CSE+C+C +C N
Sbjct: 122 CKCKKSKC------------------LKRYCDCFQNNQKCSESCQCNNCSN 154



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           C C KS CLK+YC+CF    +C+  C+C  C N E   E R
Sbjct: 58  CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEER 98


>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
          Length = 264

 Score = 45.1 bits (105), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           K CHC K+ CL+ YC CF     C++ C+C DC N
Sbjct: 146 KPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYN 180


>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 1/56 (1%)

Query: 165 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           D +++Q   K  K C C KS CLK+YC+CF    +C+  C+C  C N E  +E R 
Sbjct: 45  DKQQSQNIPKSLK-CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERG 99



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 49/111 (44%), Gaps = 23/111 (20%)

Query: 102 LVKYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
           L +YC+CFA G  C   C C  CHN  +++  R EA+     +N +  +      P G  
Sbjct: 66  LKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQNQSIEK----EVPVGIS 121

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
              + ++                    K+YC+CFQ N  C E+C+C +C N
Sbjct: 122 CKCKKSKCL------------------KRYCDCFQNNQKCHESCQCSNCSN 154


>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
          Length = 280

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 22/41 (53%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           K CHC K+ CL+ YC CF     C   CRC DC N    EE
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEE 189


>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
          Length = 289

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           K CHC K+ CL+ YC CF     C   C+C DC N +G  E   L
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYN-DGQHEEDVL 199


>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           CHC K+ CL+ YC CF     C+  C+C DC N +G  E   L
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYN-DGKHEDEVL 190


>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
          Length = 151

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 51/127 (40%), Gaps = 20/127 (15%)

Query: 105 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 163
           YC+CF  G  C + C+C  C N++     R +A+   + +                    
Sbjct: 38  YCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHK-------------------Y 78

Query: 164 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 223
           +D  + + +     GC CKKS C   YCECF     C + C C  C+N +   ++    H
Sbjct: 79  KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQNQKICANH 138

Query: 224 GDHNGIA 230
            D   I 
Sbjct: 139 EDQIQIC 145



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 12/113 (10%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--FEGSEERRALFHGDHNGIAFMQRA 235
           GC C+K+ CLK YC+CF     C E C C  CKN   +  E  +A+       +  + + 
Sbjct: 26  GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAI-------VKIIHKY 78

Query: 236 ANAAIIGAVGSSGYGTLMTSNKRK---SEELLFGVAAKDQSVIRNPQSQQGNN 285
            + + +   G  G G     +K +    E  + G +  DQ   +  Q+ +  N
Sbjct: 79  KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQN 131


>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 42.7 bits (99), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           K CHC K+ CL+ YC CF  N  CS  C+C  C N
Sbjct: 145 KPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFN 179



 Score = 38.9 bits (89), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 46/209 (22%), Positives = 75/209 (35%), Gaps = 51/209 (24%)

Query: 9   PQWQARPQHVRMVNRVPHPVHKLPLPTLPPG---KQESPRSRPRANTDGKDGTPKKQ--K 63
           P++Q+  + +R  +R  + + +   P    G   K++  R++ R +    D        K
Sbjct: 86  PEYQSSQKQLRFSDRSQNHIEEENKPKQARGIKDKEQRQRTKRRVDYGVGDSFSNDSDLK 145

Query: 64  QCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDG-CNCVN 122
            C+C  ++CL+L                             YC CF     C   C C  
Sbjct: 146 PCHCSKTKCLQL-----------------------------YCSCFHKNKACSSKCQCKG 176

Query: 123 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 182
           C N+  H+V   E V A  +     +R  ++ S   + D R+            KG    
Sbjct: 177 CFNDGNHKV---EMVKALQKVKLKEYR--VSQSDLDSFDTRQVLGCKCKKTHCKKG---- 227

Query: 183 KSGCLKKYCECFQANILCSENCRCLDCKN 211
                  YCECF     C+  C+C +C N
Sbjct: 228 -------YCECFIRGRKCTSQCKCCECNN 249


>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           K CHC K+ CL+ YC CF     C+  C+C DC N +G  E   +
Sbjct: 124 KACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFN-DGKHEDEVI 167


>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
          Length = 261

 Score = 42.0 bits (97), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 211
           C C KS CLK+YC+CF    +C+  C+C  C N
Sbjct: 90  CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122


>gi|167382874|ref|XP_001736307.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165901477|gb|EDR27560.1| hypothetical protein EDI_093510 [Entamoeba dispar SAW760]
          Length = 176

 Score = 42.0 bits (97), Expect = 0.86,   Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 22/39 (56%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C+ S CL+  C CFQ    CS  C C  CKN + +E+
Sbjct: 15  CRCQTSACLESNCSCFQGRNFCSPLCTCNCCKNRKENEQ 53


>gi|345309828|ref|XP_001507154.2| PREDICTED: protein lin-54 homolog [Ornithorhynchus anatinus]
          Length = 182

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 196 ANILCSENCRCLDCKNFEGSEERRALFH 223
           A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 49  AKIMCSSICKCIGCKNFEESPERKTLMH 76


>gi|414884761|tpg|DAA60775.1| TPA: hypothetical protein ZEAMMB73_412328 [Zea mays]
          Length = 581

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 33/71 (46%), Gaps = 16/71 (22%)

Query: 278 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPL-ADILQPQDMKELCSLLV 336
           PQS+ G  V      C    P                +R PL ADI+Q +D+KELC LLV
Sbjct: 481 PQSRAGQLVMKPLTECITKGP---------------RMREPLLADIVQAEDIKELCKLLV 525

Query: 337 MVSSEATKTLT 347
           +VS EA K   
Sbjct: 526 VVSREAAKAYV 536


>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
          Length = 235

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/58 (31%), Positives = 32/58 (55%)

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 208
           K+    +   + +  +  +QL++    GC+C+K  C K+YCEC + NI C+  C C +
Sbjct: 160 KLNCECYNCSNKQPKSTPSQLSSIEFNGCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217


>gi|407037160|gb|EKE38521.1| hypothetical protein ENU1_163380 [Entamoeba nuttalli P19]
          Length = 176

 Score = 41.2 bits (95), Expect = 1.6,   Method: Composition-based stats.
 Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C+ + CL+  C CFQ    CS  C C  CKN + +E+
Sbjct: 15  CRCQTTACLESNCSCFQGGNFCSPLCTCNCCKNRKENEQ 53


>gi|67482063|ref|XP_656381.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56473579|gb|EAL50998.1| hypothetical protein EHI_153340 [Entamoeba histolytica HM-1:IMSS]
 gi|449705185|gb|EMD45288.1| Hypothetical protein EHI5A_018850 [Entamoeba histolytica KU27]
          Length = 176

 Score = 40.8 bits (94), Expect = 1.7,   Method: Composition-based stats.
 Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 217
           C C+ + CL+  C CFQ    CS  C C  CKN + +E+
Sbjct: 15  CRCQTTACLESNCSCFQGGNFCSPLCTCNCCKNRKENEQ 53


>gi|344241563|gb|EGV97666.1| Solute carrier family 2, facilitated glucose transporter member 10
           [Cricetulus griseus]
          Length = 423

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%)

Query: 521 GYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNG 559
           G G    AVS+   PSC TTS +SRQ  DLP  +GL NG
Sbjct: 197 GIGLVSFAVSLDSGPSCLTTSNTSRQMGDLPQSLGLPNG 235


>gi|354476758|ref|XP_003500590.1| PREDICTED: solute carrier family 2, facilitated glucose transporter
           member 10 [Cricetulus griseus]
          Length = 544

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%)

Query: 521 GYGSAKSAVSVTLEPSCTTTSVSSRQNNDLPMRVGLSNG 559
           G G    AVS+   PSC TTS +SRQ  DLP  +GL NG
Sbjct: 318 GIGLVSFAVSLDSGPSCLTTSNTSRQMGDLPQSLGLPNG 356


>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
            purpuratus]
          Length = 1617

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)

Query: 179  CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 211
            C CKK+ C       K  C C Q NI C++ CRC++CKN
Sbjct: 1560 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1598


>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
           C-169]
          Length = 556

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 44/70 (62%), Gaps = 8/70 (11%)

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           +RNP AF+ KI        + + + + + ++  H +GCHCKKS C KKYCECFQA + C 
Sbjct: 162 QRNPQAFQEKI--------EKQANGQGSGVSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213

Query: 202 ENCRCLDCKN 211
            +C+C +C N
Sbjct: 214 VHCKCDECHN 223


>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
            purpuratus]
          Length = 1742

 Score = 39.3 bits (90), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)

Query: 179  CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 211
            C CKK+ C       K  C C Q NI C++ CRC++CKN
Sbjct: 1685 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1723


>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
          Length = 226

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 23/35 (65%), Gaps = 1/35 (2%)

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 211
           GC+C+K  C K YCEC +  I C+  CRC D CKN
Sbjct: 178 GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212


>gi|291227245|ref|XP_002733597.1| PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
          Length = 1578

 Score = 38.9 bits (89), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query: 173  AKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 206
            A H   C CKKS CL ++C C +A + C   C C
Sbjct: 1527 ATHKNYCRCKKSKCLTRHCICVKAGVKCVVACMC 1560


>gi|226069328|dbj|BAH36888.1| Polycomb group protein [Physcomitrella patens subsp. patens]
 gi|237637250|gb|ACR07925.1| CLF protein [Physcomitrella patens]
          Length = 999

 Score = 38.9 bits (89), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 7/90 (7%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---ALFHGDHNGIAFMQ 233
           +GCHC KS C  + C CF A   C  +  C +C  + G  E+    A+ +  HN +  M 
Sbjct: 791 RGCHCAKSNCCSRQCPCFAAGRECDPDV-CRNC--WVGCGEKHVKPAVEYTCHN-MKLML 846

Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
           +     ++G    +G+G  +     K E L
Sbjct: 847 KQQQRVLLGRSDVAGWGAFLKKTVAKHEYL 876


>gi|224010277|ref|XP_002294096.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970113|gb|EED88451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 401

 Score = 38.9 bits (89), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 18/29 (62%), Gaps = 5/29 (17%)

Query: 179 CHCKKSGCLKKYCECFQANILCSENCRCL 207
           CHC KS CLK YC+CFQ      +  RCL
Sbjct: 372 CHCSKSKCLKLYCDCFQ-----QKEVRCL 395


>gi|345481297|ref|XP_003424335.1| PREDICTED: hypothetical protein LOC100679675 [Nasonia vitripennis]
          Length = 539

 Score = 38.5 bits (88), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 25/47 (53%), Gaps = 3/47 (6%)

Query: 173 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD--CKNFEGSEE 217
            K+NKGC CK   C  K C C +  I+C   CRC D  C N E  +E
Sbjct: 104 TKNNKGCSCK-GNCSNKICGCVKKGIMCQVFCRCNDAQCMNQEPEKE 149


>gi|325191197|emb|CCA25983.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 511

 Score = 38.5 bits (88), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 8/88 (9%)

Query: 156 PHGAQDAREDAREAQLAAKHNKGCHCKKSG-CLKKYCECFQANILCSENCRCLDCKN-FE 213
           P G+ D REDA E       NK   C+ SG C    C C + N +C+  CRC  C+N F 
Sbjct: 299 PLGSSDRREDALEK------NKVVSCRCSGGCRNGRCACVRENGMCTPACRCTSCQNPFS 352

Query: 214 GSEERRALFHGDHNGIAFMQRAANAAII 241
             +   A      N   FMQ A    ++
Sbjct: 353 IVKAAGANMDILKNDSCFMQNACKGRVM 380


>gi|168031358|ref|XP_001768188.1| polycomb group protein [Physcomitrella patens subsp. patens]
 gi|162680626|gb|EDQ67061.1| polycomb group protein [Physcomitrella patens subsp. patens]
          Length = 892

 Score = 38.5 bits (88), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 7/90 (7%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---ALFHGDHNGIAFMQ 233
           +GCHC KS C  + C CF A   C  +  C +C  + G  E+    A+ +  HN +  M 
Sbjct: 681 RGCHCAKSNCCSRQCPCFAAGRECDPDV-CRNC--WVGCGEKHVKPAVEYTCHN-MKLML 736

Query: 234 RAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
           +     ++G    +G+G  +     K E L
Sbjct: 737 KQQQRVLLGRSDVAGWGAFLKKTVAKHEYL 766


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.128    0.377 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,707,507,268
Number of Sequences: 23463169
Number of extensions: 363981260
Number of successful extensions: 819997
Number of sequences better than 100.0: 669
Number of HSP's better than 100.0 without gapping: 565
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 816086
Number of HSP's gapped (non-prelim): 2268
length of query: 571
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 423
effective length of database: 8,886,646,355
effective search space: 3759051408165
effective search space used: 3759051408165
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 80 (35.4 bits)