BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014381
         (425 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255551004|ref|XP_002516550.1| transcription factor, putative [Ricinus communis]
 gi|223544370|gb|EEF45891.1| transcription factor, putative [Ricinus communis]
          Length = 629

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 256/411 (62%), Positives = 297/411 (72%), Gaps = 25/411 (6%)

Query: 3   QGETISDFAPKKLARQLDFT-ICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQV 61
           Q E ISDFAPKKLARQLDFT +CRAS N  L E S + Q  QP    P  P P     Q+
Sbjct: 5   QSEAISDFAPKKLARQLDFTAVCRASVNVALSEHSSRLQQQQPPPLPPPPPPPMSQP-QL 63

Query: 62  QSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM---VNRVPHP 118
             K+    Q    ++L+  PP  QP                QARPQHV +   + R+PHP
Sbjct: 64  HLKLQQQPQPVLQLKLET-PPKQQPQA-------------TQARPQHVMVPPAMRRIPHP 109

Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKD--GTPKKQKQCNCRNSRCLKLYCECFAAGIY 176
           V KL L TL   KQ+SP SRPR N + KD  GTPKK K CNC+NSRCLKLYCECFAAG++
Sbjct: 110 VQKLSLHTLQLIKQQSPSSRPRNNIEAKDNSGTPKKAKHCNCKNSRCLKLYCECFAAGVH 169

Query: 177 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 236
           C GCNC NCHNNVE+E +RQEAVGA LERNP AF+PKIASSPHG++DA+EDA E QL  K
Sbjct: 170 CTGCNCTNCHNNVENEASRQEAVGAVLERNPEAFKPKIASSPHGSRDAKEDAMEVQLVGK 229

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
           HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH +HNGI +MQ+
Sbjct: 230 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNHNGIVYMQQ 289

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 356
           AANAAI GA+GSSGYGT +TSNKRKSEE LFG   +DQ   R  ++QQ N+ +N A   S
Sbjct: 290 AANAAICGAIGSSGYGTPLTSNKRKSEE-LFGSTNRDQLGRRVAKTQQENHQRNPAASAS 348

Query: 357 -QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
             SVPV    NA+ LGS+K   +SPLA ILQPQD+K++CSLLV++S EA K
Sbjct: 349 PLSVPVPH--NATALGSSKFTYKSPLAGILQPQDVKKMCSLLVVLSQEARK 397


>gi|224141773|ref|XP_002324239.1| predicted protein [Populus trichocarpa]
 gi|222865673|gb|EEF02804.1| predicted protein [Populus trichocarpa]
          Length = 682

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 233/417 (55%), Positives = 280/417 (67%), Gaps = 63/417 (15%)

Query: 1   MEQGETISDFAPKKLARQ-LDFT--ICR-ASANGLLPEQSVKSQSSQPQSHSPLQPQPPK 56
           MEQ ET++++ PKKLA Q LDFT  ICR A AN       V SQ+ Q Q +  LQ Q P 
Sbjct: 1   MEQSETVTEYTPKKLAPQKLDFTAPICRTAPAN-------VPSQTQQVQVN--LQTQTPP 51

Query: 57  SQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARP--QH--VRMV 112
                                                      P +Q RP  QH  V + 
Sbjct: 52  RP-------------------------------------EMLPPPYQERPVSQHEAVAVA 74

Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
            R+PHPVHK+    +P GKQ+SP SR  AN    DGTPKK KQCNC+NS CLKLYCECFA
Sbjct: 75  QRIPHPVHKIS--AMPRGKQDSPGSR-LAN----DGTPKKPKQCNCKNSMCLKLYCECFA 127

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
           AGI+C+GCNC+NC NNVE+E  R+EAV  TL+RNPNAFRPKIASSPHG++D  EDA+E Q
Sbjct: 128 AGIHCNGCNCLNCRNNVENEKERKEAVETTLQRNPNAFRPKIASSPHGSRDTTEDAQEVQ 187

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA 292
           +  KHNKGC+CK+SGCLKKYCECFQ NI CSENC+CL+CKNFEGS+ERRALFHG  +  A
Sbjct: 188 MLGKHNKGCNCKRSGCLKKYCECFQGNIFCSENCKCLECKNFEGSDERRALFHGSPHATA 247

Query: 293 FMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSA 352
           + Q+A NAAI GA+GSSGYGT + S KRKSEE LFG+A KDQ      +  QGN ++NSA
Sbjct: 248 YAQQATNAAISGAIGSSGYGTTLASKKRKSEE-LFGIATKDQPAHSTAKFSQGNTLRNSA 306

Query: 353 VPCS-QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
              S  SVPVS+TAN +VLGSTK   RSPLA +L+PQ +KE+C+LLV +S +AT+ L
Sbjct: 307 AFSSPLSVPVSRTANGAVLGSTKPTYRSPLAGVLKPQFVKEICTLLVTISGKATEAL 363


>gi|356510007|ref|XP_003523732.1| PREDICTED: uncharacterized protein LOC100811459 [Glycine max]
          Length = 515

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 221/409 (54%), Positives = 270/409 (66%), Gaps = 30/409 (7%)

Query: 1   MEQGETISDFAPKKLARQLDFTICRA--SANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQ 58
           MEQ ET SD AP+KLARQLDF       S   L P         +P SHSPL+ Q     
Sbjct: 1   MEQSETASDLAPRKLARQLDFAAVYGDPSHQKLPPPLPPPQSPLRPPSHSPLRLQ----- 55

Query: 59  PQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHP 118
                             L L PP  +P + +  +     +  +  RP+ V  V R+PHP
Sbjct: 56  ------------------LDLQPPGQRPWLYSEAQEQQPGL--YSPRPKLVSPVRRIPHP 95

Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD 178
           V K P+  L   K ESPRS+PR N + KD TPKKQKQCNC+NSRCLKLYCECFAAGIYCD
Sbjct: 96  VVKFPVKVLQVVKPESPRSQPRCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCD 155

Query: 179 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 238
           GCNCVNCHNNV++E ARQEAVG TLERNPNAFRPKIASSP   +D++E   E ++  KHN
Sbjct: 156 GCNCVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPQEQRDSKE--CEIKVIGKHN 213

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
           KGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS+ER A+FH D+N +  M++AA
Sbjct: 214 KGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSDERIAIFHKDYN-LVHMKQAA 272

Query: 299 NAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS 358
           NA I GAVGSSGYGT +T  KRK++E+  G +A DQ+V    Q  Q  +   S+      
Sbjct: 273 NATISGAVGSSGYGTHITPKKRKNQEMFPGKSAMDQTVNMTAQYLQEIDPMASSPSSLSD 332

Query: 359 VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
             +S  +N  + GS++S  RS LAD+L+P ++K LCSL V++S  A KT
Sbjct: 333 SFISDPSNTRISGSSRSTYRSVLADVLKPNNVKNLCSLFVVLSRVAAKT 381


>gi|255546437|ref|XP_002514278.1| transcription factor, putative [Ricinus communis]
 gi|223546734|gb|EEF48232.1| transcription factor, putative [Ricinus communis]
          Length = 601

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 224/414 (54%), Positives = 274/414 (66%), Gaps = 62/414 (14%)

Query: 3   QGETISDFAPKKLARQLDFTI----CRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQ 58
           Q E++ DF  KKLARQLDFT        + N +LPE         PQS    Q QPP + 
Sbjct: 18  QSESL-DFPTKKLARQLDFTTQGNNNNNNNNVVLPEH--------PQS----QQQPPATV 64

Query: 59  PQVQSKVASPSQSQPLV--RLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVP 116
              Q  V +P Q Q +      LLPP  QP V+A                          
Sbjct: 65  ALPQEPVVAPQQPQAVAIPEAPLLPP--QPPVRAI------------------------- 97

Query: 117 HPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIY 176
                         K ESP+S+PR N + KDGTPK+QKQCNC++SRCLKLYCECFA+G Y
Sbjct: 98  --------------KPESPKSKPRPNAELKDGTPKRQKQCNCKHSRCLKLYCECFASGTY 143

Query: 177 CDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK 236
           CDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+  E  +  K
Sbjct: 144 CDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREENGEGLILGK 203

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQ 295
           HNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q
Sbjct: 204 HNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHTNNMAYIQ 263

Query: 296 RAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVP 354
           +AANAAI GA+GSSGY +   S KRK +ELLFG   KD S  R    QQ  +++ +++  
Sbjct: 264 QAANAAITGAIGSSGYVSPPISKKRKGQELLFGPTTKDPSFHRLGHFQQAVHIRPSTSSS 323

Query: 355 CSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
              S P+++  +++ LG +K   RS LADI+QPQD+KELCS+LV++S EA KTL
Sbjct: 324 SLSSNPIARAGSSATLGPSKFTYRSLLADIIQPQDLKELCSVLVVLSGEAAKTL 377


>gi|225445198|ref|XP_002280793.1| PREDICTED: uncharacterized protein LOC100249023 [Vitis vinifera]
          Length = 592

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 200/308 (64%), Positives = 242/308 (78%), Gaps = 9/308 (2%)

Query: 117 HPVHKLPLP--TLPPG----KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
           HPV  +P+P  T  P     K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKLYCEC
Sbjct: 65  HPV-TMPVPPQTTHPSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCEC 123

Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
           FA+GIYCDGCNCVNCHNNVE+E AR+EAV  TLERNPNAFRPKIASSPHGA+D+RE++ E
Sbjct: 124 FASGIYCDGCNCVNCHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGE 183

Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-N 289
           A +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N
Sbjct: 184 ALVLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHAN 243

Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
            +A++Q+AANAAI GA+GSSG+G+   S KRK +EL +G  +KD S+ R  Q QQ +  +
Sbjct: 244 SMAYIQQAANAAITGAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTK 303

Query: 350 -NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
            +       S PVS+  N++  G++K   RS LADI+QPQD+KELCS+LV+VS EA +T 
Sbjct: 304 ASVTTSSLSSTPVSRVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTF 363

Query: 409 TSIYHYIQ 416
               + ++
Sbjct: 364 ADKRNLVE 371


>gi|224119800|ref|XP_002331164.1| predicted protein [Populus trichocarpa]
 gi|222873247|gb|EEF10378.1| predicted protein [Populus trichocarpa]
          Length = 519

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 193/282 (68%), Positives = 229/282 (81%), Gaps = 4/282 (1%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVE 190
           K ESP+SRPR N + KDGTPKKQKQCNC++SRCLKLYCECFA+G YCDGCNCVNC+NNVE
Sbjct: 90  KPESPKSRPRLNAELKDGTPKKQKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNVE 149

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           +E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+  E  +  KHNKGCHCKKSGCLK
Sbjct: 150 NEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGEGLVFVKHNKGCHCKKSGCLK 209

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSS 309
           KYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GSS
Sbjct: 210 KYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGSS 269

Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS--QSVPVSQTANA 367
           GY +L  S KRK +EL FG   KD S  R    QQ ++++  A P S   S PV++  N 
Sbjct: 270 GYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQASHIR-PAAPSSPLPSNPVARAGNT 328

Query: 368 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
           + LGS+K   RS LADI+Q QD+KELCS+LV++S EA KTL+
Sbjct: 329 TTLGSSKITYRSLLADIIQSQDLKELCSVLVVLSGEAAKTLS 370


>gi|356522716|ref|XP_003529992.1| PREDICTED: uncharacterized protein LOC100818116 [Glycine max]
          Length = 559

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 198/297 (66%), Positives = 230/297 (77%), Gaps = 8/297 (2%)

Query: 118 PVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAA 173
           PV  LPLP   P    GK ESP+SR R N + KD TPKKQKQCNC++S+CLKLYCECFA+
Sbjct: 52  PVAVLPLPLQAPPARVGKPESPKSRSRPNFEMKDATPKKQKQCNCKHSKCLKLYCECFAS 111

Query: 174 GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 233
           GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+REDA E  +
Sbjct: 112 GIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREDAGEVLI 171

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAF 293
             KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD N +A+
Sbjct: 172 LGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNMAY 231

Query: 294 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV 353
           +Q+AANAAI GA+GSSGY +   S KRK +EL F    KD S+ +    QQ N+V+  A 
Sbjct: 232 IQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSISK--LGQQVNHVRGPA- 288

Query: 354 PCSQSVPVSQT-ANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
           P S   PVS      + LG +K   RS LADI+QPQ +KELCS+LV+VS +A KTLT
Sbjct: 289 PSSSLSPVSGARVGTATLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLT 345


>gi|297738803|emb|CBI28048.3| unnamed protein product [Vitis vinifera]
          Length = 524

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/294 (65%), Positives = 236/294 (80%), Gaps = 2/294 (0%)

Query: 125 PTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVN 184
           P++   K ESPRSRPR N D KDGTPKKQKQCNC++SRCLKLYCECFA+GIYCDGCNCVN
Sbjct: 10  PSVRVVKPESPRSRPRPNIDVKDGTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVN 69

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
           CHNNVE+E AR+EAV  TLERNPNAFRPKIASSPHGA+D+RE++ EA +  KHNKGCHCK
Sbjct: 70  CHNNVENEAARREAVEVTLERNPNAFRPKIASSPHGARDSREESGEALVLGKHNKGCHCK 129

Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAII 303
           KSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI 
Sbjct: 130 KSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANSMAYIQQAANAAIT 189

Query: 304 GAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPCSQSVPVS 362
           GA+GSSG+G+   S KRK +EL +G  +KD S+ R  Q QQ +  + +       S PVS
Sbjct: 190 GAIGSSGFGSPPVSKKRKGQELFYGPTSKDPSLHRLAQFQQASLTKASVTTSSLSSTPVS 249

Query: 363 QTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
           +  N++  G++K   RS LADI+QPQD+KELCS+LV+VS EA +T     + ++
Sbjct: 250 RVPNSAASGTSKFTYRSLLADIIQPQDLKELCSVLVVVSEEAARTFADKRNLVE 303


>gi|224143568|ref|XP_002325000.1| predicted protein [Populus trichocarpa]
 gi|222866434|gb|EEF03565.1| predicted protein [Populus trichocarpa]
          Length = 545

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 223/405 (55%), Positives = 270/405 (66%), Gaps = 37/405 (9%)

Query: 10  FAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPS 69
           F  KKLARQLDFT       G+LP+         PQS    QP P +  P + S  A+ S
Sbjct: 23  FQTKKLARQLDFT------QGVLPDH--------PQS----QPSPLQKPPLLVSPAAAQS 64

Query: 70  QSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPP 129
           Q QP  + QL        +Q   +  +Q   Q        R V R+  P+ + P      
Sbjct: 65  QPQPHPQSQLQLQPQVAEIQVVPQQQTQQQQQP------ARAVLRLLPPMFRKP------ 112

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
              ESP+S P  NT+ KDGTPKKQ+QCNC++SRCLKLYCECFA+G YCDGCNCVNC+NNV
Sbjct: 113 ---ESPKSIPIPNTELKDGTPKKQRQCNCKHSRCLKLYCECFASGTYCDGCNCVNCYNNV 169

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           E+E AR+EAV ATLERNPNAFRPKIASSPHG +D+RE+  +  +  KHNKGCHCKKSGCL
Sbjct: 170 ENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREETGDGLVFVKHNKGCHCKKSGCL 229

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGS 308
           KKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N +A++Q+AANAAI GA+GS
Sbjct: 230 KKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHGNNMAYIQQAANAAITGAIGS 289

Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN---SAVPCSQSVPVSQTA 365
           SGY +   S KRK +EL FG   KDQS  R    QQ N       +      S P+++  
Sbjct: 290 SGYASPPVSRKRKGQELFFGHTVKDQSFDRLGHFQQVNGSHTRPPAPSSSLPSNPIARAG 349

Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
           NA  LG +K   RS LADI+QPQD+KELCS+LV++S EA KT + 
Sbjct: 350 NAITLGPSKITYRSLLADIIQPQDLKELCSVLVVLSGEAAKTFSD 394


>gi|356511363|ref|XP_003524396.1| PREDICTED: uncharacterized protein LOC100786353 [Glycine max]
          Length = 561

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 198/305 (64%), Positives = 238/305 (78%), Gaps = 8/305 (2%)

Query: 116 PHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
           P P   LPLP   ++  GK ESP+SRPR   + K+ TPKKQKQCNC++S+CLKLYCECFA
Sbjct: 52  PQPPVILPLPPSASVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKLYCECFA 111

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
           +GIYCDGCNCVNC+NNVE+E AR+EAV ATLERNPNAFRPKIASSPHG +D+REDA E  
Sbjct: 112 SGIYCDGCNCVNCYNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDSREDAGEVL 171

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGI 291
           +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N +
Sbjct: 172 ILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNM 231

Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNS 351
            ++Q+AANAAI GA+GSSGY +   S KRK +ELLFG   KD SV R  Q QQ NNV+  
Sbjct: 232 VYIQQAANAAITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR--QGQQANNVRVP 289

Query: 352 AVPCSQS-VPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
           A   S S +P ++   A+ LG +K   RS LADI+QPQ +KELCS+LV+VS +A K  T 
Sbjct: 290 APSSSLSPIPGARVGPAT-LGPSKLMYRSLLADIIQPQHLKELCSVLVLVSGQAAKMFTD 348

Query: 411 IYHYI 415
             +++
Sbjct: 349 HKNFM 353


>gi|356529628|ref|XP_003533391.1| PREDICTED: uncharacterized protein LOC100777698 [Glycine max]
          Length = 559

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 198/314 (63%), Positives = 236/314 (75%), Gaps = 9/314 (2%)

Query: 102 WQARPQHVRMVNRVPHPVHKLPLPTLPP----GKQESPRSRPRANTDGKDGTPKKQKQCN 157
           +   P+H ++      PV  LPL    P    GK ESP+SR R N + KD TPKKQKQCN
Sbjct: 34  FTGAPEHPQLSQPPQLPVAVLPLQPQAPHARVGKPESPKSRSRPNFEIKDATPKKQKQCN 93

Query: 158 CRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
           C++S+CLKLYCECFA+GIYCDGCNCVNC NNVE+E AR+EAV ATLERNPNAFRPKIASS
Sbjct: 94  CKHSKCLKLYCECFASGIYCDGCNCVNCFNNVENEAARREAVEATLERNPNAFRPKIASS 153

Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
           PHG +D+RE+A E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS
Sbjct: 154 PHGTRDSREEAGEVLILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGS 213

Query: 278 EERRALFHGD-HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSV 336
           EER+ALFHGD +N +A++Q+AANAAI GA+GSSGY +   S KRK +EL F    KD S+
Sbjct: 214 EERQALFHGDQNNNMAYIQQAANAAITGAIGSSGYSSPPVSKKRKGQELFFWPTTKDPSI 273

Query: 337 IRNPQSQQGNNVQNSAVPCSQSVPVSQT-ANASVLGSTKSALRSPLADILQPQDMKELCS 395
            +    QQ N+V+  A P S   PVS      + LG +K   RS LADI+QPQ +KELCS
Sbjct: 274 SK--LGQQVNHVRGPA-PSSSLSPVSGARVGTATLGPSKFMYRSLLADIIQPQHLKELCS 330

Query: 396 LLVMVSSEATKTLT 409
           +LV+VS +A KTLT
Sbjct: 331 VLVLVSGQAAKTLT 344


>gi|225429991|ref|XP_002281512.1| PREDICTED: uncharacterized protein LOC100243881 [Vitis vinifera]
          Length = 579

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 200/305 (65%), Positives = 243/305 (79%), Gaps = 5/305 (1%)

Query: 109 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
           V +V R+ +P      PTL   KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKLYC
Sbjct: 70  VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKLYC 125

Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
           ECFA+G+YC+ C+C NC NNV +E AR+EAVG+TL+RNPNAFRPKIA+SPHG+ D  E+A
Sbjct: 126 ECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGEEA 185

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
            +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH + 
Sbjct: 186 GKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNG 245

Query: 289 NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV 348
           N +A++Q+AANAAI GA+GSSGYGT + S KRK +++ F   A +QS  RN Q QQ N++
Sbjct: 246 NYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHRNTQFQQDNHL 304

Query: 349 QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
           + SAV  S S+ V+ T NA+VLGS+K   RSPLADILQ QD+K+LCSLLV+VS EA KTL
Sbjct: 305 RASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLVVVSQEAGKTL 364

Query: 409 TSIYH 413
           T   H
Sbjct: 365 TEREH 369


>gi|296081871|emb|CBI20876.3| unnamed protein product [Vitis vinifera]
          Length = 676

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 199/301 (66%), Positives = 242/301 (80%), Gaps = 5/301 (1%)

Query: 109 VRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
           V +V R+ +P      PTL   KQESPRS PR+N++ KDGTPKK+K CNC+NSRCLKLYC
Sbjct: 151 VPVVPRITNPSR----PTLQAVKQESPRSLPRSNSEVKDGTPKKKKHCNCKNSRCLKLYC 206

Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
           ECFA+G+YC+ C+C NC NNV +E AR+EAVG+TL+RNPNAFRPKIA+SPHG+ D  E+A
Sbjct: 207 ECFASGVYCNNCHCSNCQNNVVNEAARKEAVGSTLDRNPNAFRPKIANSPHGSHDGGEEA 266

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
            +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH + 
Sbjct: 267 GKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFHDNG 326

Query: 289 NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV 348
           N +A++Q+AANAAI GA+GSSGYGT + S KRK +++ F   A +QS  RN Q QQ N++
Sbjct: 327 NYMAYIQQAANAAISGAIGSSGYGTAVASKKRK-QQVSFCTTANNQSNHRNTQFQQDNHL 385

Query: 349 QNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
           + SAV  S S+ V+ T NA+VLGS+K   RSPLADILQ QD+K+LCSLLV+VS EA KTL
Sbjct: 386 RASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQLQDVKDLCSLLVVVSQEAGKTL 445

Query: 409 T 409
           T
Sbjct: 446 T 446


>gi|449449629|ref|XP_004142567.1| PREDICTED: protein tesmin/TSO1-like CXC 5-like [Cucumis sativus]
          Length = 602

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 183/290 (63%), Positives = 222/290 (76%), Gaps = 6/290 (2%)

Query: 121 KLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLYCECFAAGIYC 177
           K PLP  P  K +SP+SR R+N + KD TPKKQKQC   NC++SRCLKLYCECFA+G+YC
Sbjct: 94  KAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLYCECFASGVYC 151

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
           DGCNC NCHNNVEHE +R+EAV  TLERNPNAFRPKIA+SPHG +++R++  E  +  KH
Sbjct: 152 DGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKIANSPHGTRESRDEIGELVMLGKH 211

Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQR 296
           NKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N IA++Q+
Sbjct: 212 NKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANNIAYIQQ 271

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCS 356
           AANAAI GA+GSSGY  L  S KRK  EL FG   KD  +   PQ  Q NN+ +S +  S
Sbjct: 272 AANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQANNIMSSGMSTS 331

Query: 357 QSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
              P +   +    G +K + RS LAD++QP D+KELCS+LV+++SE  K
Sbjct: 332 SPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381


>gi|449519509|ref|XP_004166777.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 5-like
           [Cucumis sativus]
          Length = 556

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 210/415 (50%), Positives = 258/415 (62%), Gaps = 64/415 (15%)

Query: 1   MEQGETISDFAP-KKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQP 59
           +E G  +S   P KKLARQLDFT    +   +LPE                         
Sbjct: 22  VELGNVVSSDVPAKKLARQLDFT---GTGGAVLPEH------------------------ 54

Query: 60  QVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR---SPSQAMPQWQARPQHVRMVNRVP 116
                                     PH Q+  R   S S A+   Q++ Q       V 
Sbjct: 55  --------------------------PHFQSRSRCQESESPAVMVVQSQSQPQSPQQLVV 88

Query: 117 HPV-HKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQC---NCRNSRCLKLYCECFA 172
            P+  K PLP  P  K +SP+SR R+N + KD TPKKQKQC   NC++SRCLKLYCECFA
Sbjct: 89  LPIGTKAPLP--PARKPDSPKSRSRSNVETKDATPKKQKQCKLCNCKHSRCLKLYCECFA 146

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
           +G+YCDGCNC NCHNNVEHE +R+EAV  TLERNPNAFRPK A+SPHG +++R++  E  
Sbjct: 147 SGVYCDGCNCTNCHNNVEHEASRREAVETTLERNPNAFRPKXANSPHGTRESRDEIGELV 206

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGI 291
           +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGDH N I
Sbjct: 207 MLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDHANNI 266

Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNS 351
           A++Q+AANAAI GA+GSSGY  L  S KRK  EL FG   KD  +   PQ  Q NN+ +S
Sbjct: 267 AYIQQAANAAITGAIGSSGYACLPVSKKRKGPELGFGPVGKDSPLNSIPQLHQANNIMSS 326

Query: 352 AVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
            +  S   P +   +    G +K + RS LAD++QP D+KELCS+LV+++SE  K
Sbjct: 327 GMSTSSPFPAAHVGSPVASGPSKFSFRSLLADLIQPNDLKELCSVLVVLTSEVAK 381


>gi|15235524|ref|NP_194629.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75213581|sp|Q9SZD1.1|TCX5_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 5; Short=AtTCX5
 gi|4972046|emb|CAB43914.1| putative transcription factor [Arabidopsis thaliana]
 gi|7269798|emb|CAB79658.1| putative transcription factor [Arabidopsis thaliana]
 gi|26452336|dbj|BAC43254.1| putative transcription factor [Arabidopsis thaliana]
 gi|29028974|gb|AAO64866.1| At4g29000 [Arabidopsis thaliana]
 gi|332660171|gb|AEE85571.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 180/289 (62%), Positives = 218/289 (75%), Gaps = 9/289 (3%)

Query: 133 ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEH 191
           ESP++R PR N +G+DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNV++
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDN 169

Query: 192 EVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKK 251
           E AR+EAV ATLERNP AFRPKIASSPHG +D RED  E  L  KHNKGCHCKKSGCLKK
Sbjct: 170 EPARREAVEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKK 229

Query: 252 YCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSG 310
           YCECFQANILCSENC+CLDCKNFEGSEER+ALFHG+H N +A++Q+AANAAI GAVGSSG
Sbjct: 230 YCECFQANILCSENCKCLDCKNFEGSEERQALFHGEHSNHMAYLQQAANAAITGAVGSSG 289

Query: 311 YGTLMTSNKRKSEELLFGVAAKDQSVIRN-PQSQQGNNVQNSAVPCSQS--VPVSQTANA 367
           +       +RK +E+LF  A KD S + + PQ   G     +  P S +   PVS+    
Sbjct: 290 FAPSPAPKRRKGQEILFNQAIKDSSRLSHFPQVNNG----RTGGPTSGTSPSPVSRAGGN 345

Query: 368 SVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
           +    +K   RS LADI+QP D++ LCS+LV V+ EA KT T   + I+
Sbjct: 346 ASSVPSKFVYRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIE 394


>gi|356523755|ref|XP_003530500.1| PREDICTED: uncharacterized protein LOC100777327 [Glycine max]
          Length = 551

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 193/289 (66%), Positives = 228/289 (78%), Gaps = 5/289 (1%)

Query: 123 PLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNC 182
           P  ++  GK ESP+SRPR   + K+ TPKKQKQCNC++S+CLKLYCECFA+GIYCDGCNC
Sbjct: 64  PSTSVRVGKPESPKSRPRPGFEIKEATPKKQKQCNCKHSKCLKLYCECFASGIYCDGCNC 123

Query: 183 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCH 242
           VNC NNVE+E AR+EAV ATLERNPNAFRPKIASSPH  +D REDA E  +  KHNKGCH
Sbjct: 124 VNCCNNVENEAARREAVEATLERNPNAFRPKIASSPHRTRDIREDAGEILILGKHNKGCH 183

Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD-HNGIAFMQRAANAA 301
           CKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD +N + ++Q+AANAA
Sbjct: 184 CKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNLVYIQQAANAA 243

Query: 302 IIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQS-VP 360
           I GA+GSSGY +   S KRK +ELLFG   KD SV R  Q QQ NNV+  A   S S +P
Sbjct: 244 ITGAIGSSGYSSPPISKKRKGQELLFGPTVKDPSVGR--QGQQANNVRVPAPSSSLSPIP 301

Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
            ++   A+ LG +K   RS LADI+QPQ +KELCS+LV+VS +A KT T
Sbjct: 302 GARVGPAT-LGPSKLMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTFT 349


>gi|297803150|ref|XP_002869459.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315295|gb|EFH45718.1| hypothetical protein ARALYDRAFT_491858 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 601

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 209/411 (50%), Positives = 257/411 (62%), Gaps = 45/411 (10%)

Query: 10  FAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPS 69
           F  KK ARQLDFT       G   EQS+   ++     +         +P V S +  PS
Sbjct: 21  FPTKKPARQLDFT-------GGSDEQSLSKAAAPTVVAT-------AVKPVVTSSI--PS 64

Query: 70  QSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNR-VPHPVHKLPLPTLP 128
             +P V + +            +R P+  M      P   +++N  + HP          
Sbjct: 65  TIRPGVTIAI----------GQVR-PTLPMATTSNPPSQSQILNAPIRHP---------- 103

Query: 129 PGKQESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
             K ESP++R PR   +G+DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC N
Sbjct: 104 --KPESPKARGPRPIVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFN 161

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
           NV++E AR+EAV ATLERNP AFRPKIASSPHG +D RED  E  L  KHNKGCHCKKSG
Sbjct: 162 NVDNEPARREAVEATLERNPFAFRPKIASSPHGVRDKREDIGEVVLLGKHNKGCHCKKSG 221

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAV 306
           CLKKYCECFQANILCSENC+CLDCKNFEGSEER+ALFHG+H N +A++Q+AANAAI GAV
Sbjct: 222 CLKKYCECFQANILCSENCKCLDCKNFEGSEERQALFHGEHANHMAYLQQAANAAITGAV 281

Query: 307 GSSGYGTLMTSNKRKSEELLFGVAAKDQSVI-RNPQSQQGNNVQNSAVPCSQSVPVSQTA 365
           GSSG+       +RK +E+LF  A KD S + + PQ   G          S S       
Sbjct: 282 GSSGFAPSPAPKRRKGQEILFNQATKDSSRLGQFPQVNSG-RASGPTSGSSPSPVSRAGG 340

Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYIQ 416
           NAS   S K   RS LADI+QP D++ LCS+LV V+ EA KT T   + I+
Sbjct: 341 NASSAPS-KFVYRSLLADIIQPHDVRALCSVLVAVAGEAAKTSTDKRNEIE 390


>gi|357500623|ref|XP_003620600.1| Lin-54-like protein [Medicago truncatula]
 gi|357500779|ref|XP_003620678.1| Lin-54-like protein [Medicago truncatula]
 gi|355495615|gb|AES76818.1| Lin-54-like protein [Medicago truncatula]
 gi|355495693|gb|AES76896.1| Lin-54-like protein [Medicago truncatula]
          Length = 629

 Score =  350 bits (899), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 191/345 (55%), Positives = 234/345 (67%), Gaps = 34/345 (9%)

Query: 97  QAMPQWQARPQHVRMVNRVPHPVHKLPLP-----------TLPPGKQESPRSRPRANTDG 145
           Q+ PQ Q   Q   MV + P  V  LP+P           ++  GKQESP+ R R N + 
Sbjct: 61  QSKPQQQVVTQGSVMVQK-PVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPRSRPNFEV 119

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
           K+GTPKKQ+QCNC++S+CLKLYCECFA+GIYCDGCNCVNC NNV++E AR+EAV ATLER
Sbjct: 120 KEGTPKKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLER 179

Query: 206 NPNAFRPKIASSPHGAQDAR---------EDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
           NPNAFRPKIASSP GA+D+R         E+A E  +  KH+KGCHCKKSGCLKKYCECF
Sbjct: 180 NPNAFRPKIASSPQGARDSRVSRSSILWQEEAGEGLILIKHHKGCHCKKSGCLKKYCECF 239

Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMT 316
           QAN+LCSENCRC+DCKNFEGSEER+ALF GD N   ++Q+AANAAI GA+GS G+ +   
Sbjct: 240 QANVLCSENCRCMDCKNFEGSEERQALFRGDQNNNVYLQQAANAAITGAIGSYGFSSPPA 299

Query: 317 SNKRKSEELLFGVAAKDQSVIRNPQ--------SQQG---NNVQNSAVPCSQSVPVSQTA 365
           S KRK +EL     AKD S+ +  Q        S+ G   N V+  A   S S   S   
Sbjct: 300 SKKRKGQELFLWPTAKDPSISKPGQQVKFAYCTSRTGTGVNLVKGPAPSSSASPVSSARG 359

Query: 366 NASVLGSTKSAL--RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
               LG + S L  RS L+D++QP  +KELCS+LV+VS +A KTL
Sbjct: 360 TNPTLGQSPSKLKYRSLLSDVVQPHHLKELCSVLVLVSGQAAKTL 404


>gi|356518324|ref|XP_003527829.1| PREDICTED: uncharacterized protein LOC100812944 [Glycine max]
          Length = 656

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 175/286 (61%), Positives = 210/286 (73%), Gaps = 3/286 (1%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
           L    +   K +SPRS+P+ N + KD TPKKQKQCNC+NSRCLKLYCECFAAGIYCDGCN
Sbjct: 170 LNFSAIKIVKPKSPRSQPQCNVELKDTTPKKQKQCNCKNSRCLKLYCECFAAGIYCDGCN 229

Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
           CVNCHNNV++E ARQEAVG TLERNPNAFRPKIASSP   +D++E   E Q+  KHNKGC
Sbjct: 230 CVNCHNNVDNEAARQEAVGITLERNPNAFRPKIASSPLERRDSKE--CEIQVIGKHNKGC 287

Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAA 301
           HCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGS+ERRA+FH D+N +  +++AANA 
Sbjct: 288 HCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSDERRAIFHKDYN-LVHIKQAANAT 346

Query: 302 IIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPV 361
           I    GSSGYGT +T  KRK  E+  G +A DQ+     Q QQ  +   S+        V
Sbjct: 347 ISRVAGSSGYGTHLTPKKRKIHEMFPGKSAMDQTDNITAQYQQEIDPMASSPSFLSDSFV 406

Query: 362 SQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
           S  +N  +  S++S  RS LAD  QPQ++K LCSL V++S  A KT
Sbjct: 407 SDPSNTRISASSRSTYRSVLADFFQPQNVKNLCSLFVVLSQVAAKT 452


>gi|30680884|ref|NP_849995.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|330251872|gb|AEC06966.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 578

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 199/419 (47%), Positives = 253/419 (60%), Gaps = 63/419 (15%)

Query: 1   MEQGETISDFAPK---------KLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQ 51
           M +GE    F PK         K ARQLDFT   +       E S  +Q+S   + S   
Sbjct: 1   MGEGEEGDKFPPKTDEVTQESMKSARQLDFTGGSSDV-----EHSHSNQASSMAAASIPS 55

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
           P    ++P + S      Q+ P V   + PP                 PQ Q    HV +
Sbjct: 56  PIVTVTRPIITS------QAPPTVATPIPPP-----------------PQSQGIILHVPI 92

Query: 112 VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECF 171
             R P P              ESP S PR   + +DGTP+K+KQCNC++SRCLKLYCECF
Sbjct: 93  --RHPRP--------------ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECF 136

Query: 172 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
           A+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG +D RE+  + 
Sbjct: 137 ASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDV 196

Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NG 290
            + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H + 
Sbjct: 197 VMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHN 256

Query: 291 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 350
           +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  Q+  G     
Sbjct: 257 LAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--- 312

Query: 351 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
                + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 313 -----TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 366


>gi|15225302|ref|NP_179601.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75266022|sp|Q9SL70.1|TCX6_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 6; Short=AtTCX6
 gi|4580462|gb|AAD24386.1| hypothetical protein [Arabidopsis thaliana]
 gi|225898124|dbj|BAH30394.1| hypothetical protein [Arabidopsis thaliana]
 gi|330251873|gb|AEC06967.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 571

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 199/419 (47%), Positives = 253/419 (60%), Gaps = 63/419 (15%)

Query: 1   MEQGETISDFAPK---------KLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQ 51
           M +GE    F PK         K ARQLDFT   +       E S  +Q+S   + S   
Sbjct: 1   MGEGEEGDKFPPKTDEVTQESMKSARQLDFTGGSSDV-----EHSHSNQASSMAAASIPS 55

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
           P    ++P + S      Q+ P V   + PP                 PQ Q    HV +
Sbjct: 56  PIVTVTRPIITS------QAPPTVATPIPPP-----------------PQSQGIILHVPI 92

Query: 112 VNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECF 171
             R P P              ESP S PR   + +DGTP+K+KQCNC++SRCLKLYCECF
Sbjct: 93  --RHPRP--------------ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECF 136

Query: 172 AAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
           A+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG +D RE+  + 
Sbjct: 137 ASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDV 196

Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NG 290
            + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R++LFHG+H + 
Sbjct: 197 VMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGEHSHN 256

Query: 291 IAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQN 350
           +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  Q+  G     
Sbjct: 257 LAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--- 312

Query: 351 SAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
                + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 313 -----TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 366


>gi|26984067|gb|AAN85197.1| hypothetical protein [Arabidopsis thaliana]
          Length = 536

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 164/278 (58%), Positives = 206/278 (74%), Gaps = 10/278 (3%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ESP S PR   + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63  ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
            AR++AV +TLERNPNAFRPKIA+SPHG +D RE+  +  + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
           CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q  ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241

Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
            +     +RK +E+ F    KD S  R  Q+  G          + S   S+    + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293

Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
            +K   +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 331


>gi|26984069|gb|AAN85198.1| hypothetical protein [Arabidopsis thaliana]
          Length = 543

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 164/278 (58%), Positives = 206/278 (74%), Gaps = 10/278 (3%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ESP S PR   + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63  ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
            AR++AV +TLERNPNAFRPKIA+SPHG +D RE+  +  + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
           CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q  ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241

Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
            +     +RK +E+ F    KD S  R  Q+  G          + S   S+    + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293

Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
            +K   +S LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT 331


>gi|326532502|dbj|BAK05180.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 615

 Score =  337 bits (864), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 194/397 (48%), Positives = 248/397 (62%), Gaps = 16/397 (4%)

Query: 13  KKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVASPSQSQ 72
           KKL RQLDF     +        +  S+S  P++     PQP   Q +    +  P Q Q
Sbjct: 34  KKLVRQLDFNYGNPAMAA---AAAAVSRSVPPRALQVGYPQP--HQLRAALPMGVPHQLQ 88

Query: 73  PLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQ 132
           P        P  +PH   H + P  A+P     P   +  + +  PV + P+    P K 
Sbjct: 89  PRGL-----PVMRPHQMVHAQLPRPALPMAMPVPV-PQARSALSQPVQRPPVAV--PLKP 140

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ESP+ R R   DGKDGTP K+K CNCR+S+CLKLYCECFA+GIYCDGCNC NC NNVE+E
Sbjct: 141 ESPKPRGRL-YDGKDGTPTKKKCCNCRHSKCLKLYCECFASGIYCDGCNCTNCFNNVENE 199

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
           VAR+EA+ ATLERNP+AFRPKI SSPH  ++  E A +  L  KHNKGCHCKKSGCLKKY
Sbjct: 200 VARREAIEATLERNPDAFRPKIGSSPHTNRNNMEVAGDLPLIGKHNKGCHCKKSGCLKKY 259

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYG 312
           CECFQANILCSENC+C DCKNF+GSE+R+ L   DH  +  MQ+AANAA+ GA+G++ + 
Sbjct: 260 CECFQANILCSENCKCRDCKNFDGSEDRKHLSL-DHKNVVHMQQAANAAVNGAIGATAFS 318

Query: 313 TLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGS 372
           +  TS KRK  +    ++ K+   +RN Q  Q    Q +A+    S+PV Q+ +  ++G 
Sbjct: 319 SPSTSKKRKHIDSSLDLSTKEHLAMRNGQLSQ-VIYQKNAMASDGSLPVGQSVHPHMMGP 377

Query: 373 TKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
            K   R  LADI+QP D+KELC LLV VS +A K  T
Sbjct: 378 FKVTYRPLLADIIQPGDVKELCKLLVTVSGQAAKAYT 414


>gi|297836236|ref|XP_002886000.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331840|gb|EFH62259.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 568

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 163/267 (61%), Positives = 200/267 (74%), Gaps = 10/267 (3%)

Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 203
           + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNCHNNVE+E  R+EA+G+TL
Sbjct: 107 EARDGTPQKKKQCNCKHSRCLKLYCECFASGSYCDGCNCVNCHNNVENEPERREAIGSTL 166

Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
           ERNPNAFRPKIA+SPHG +D RE+  E  L  +HNKGCHCKKSGCLKKYCECFQANILCS
Sbjct: 167 ERNPNAFRPKIAASPHGGRDNREEVGEVVLLGRHNKGCHCKKSGCLKKYCECFQANILCS 226

Query: 264 ENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKS 322
           ENC+CLDCKNFEGSE R++LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK 
Sbjct: 227 ENCKCLDCKNFEGSEVRQSLFHGEHSHNVAYLQH-ANAAITGAIGSSGFASAPAPKRRKG 285

Query: 323 EELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLA 382
           +E+ F    KD S  R  Q+  G          S         NAS L  +K   RS LA
Sbjct: 286 QEIFFNQGPKDSSTHRLGQANSGRTT-------SPKTGSHAGGNAS-LRPSKVVYRSLLA 337

Query: 383 DILQPQDMKELCSLLVMVSSEATKTLT 409
           DI++P+D+K LCS+LV V+ EA KTLT
Sbjct: 338 DIIKPKDVKALCSVLVSVAGEAAKTLT 364


>gi|26984071|gb|AAN85199.1| hypothetical protein [Arabidopsis thaliana]
          Length = 532

 Score =  333 bits (855), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 164/278 (58%), Positives = 205/278 (73%), Gaps = 14/278 (5%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ESP S PR   + +DGTP+K+KQCNC++SRCLKLYCECFA+G YCDGCNCVNC NNVE+E
Sbjct: 63  ESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENE 122

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
            AR++AV +TLERNPNAFRPKIA+SPHG +D RE+  +  + A+HNKGCHCKKSGCLKKY
Sbjct: 123 PARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKY 182

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGY 311
           CECFQANILCSENC CLDCKNFEGSE R++LFHG+H + +A++Q  ANAAI GA+GSSG+
Sbjct: 183 CECFQANILCSENCXCLDCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGF 241

Query: 312 GTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLG 371
            +     +RK +E+ F    KD S  R  Q+  G          + S   S+    + LG
Sbjct: 242 ASAPPPKRRKGQEIFFNQGTKDSSTHRLGQANNGRT--------TSSQTGSRAGGNASLG 293

Query: 372 STKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
            +KS     LA+I++P D+K LCS+LV V+ EA KTLT
Sbjct: 294 PSKSL----LANIIKPMDVKALCSVLVAVAGEAAKTLT 327


>gi|242058765|ref|XP_002458528.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
 gi|241930503|gb|EES03648.1| hypothetical protein SORBIDRAFT_03g035240 [Sorghum bicolor]
          Length = 550

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 180/332 (54%), Positives = 223/332 (67%), Gaps = 10/332 (3%)

Query: 78  QLLPPSSQPH-VQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPR 136
           Q + P  +PH V  H+  P  A+P     PQ +R +   P     + +P  P    ESP+
Sbjct: 41  QRILPVMRPHQVVGHVPLPRHAVPMAVPVPQ-LRPMPPQPVQRPPVAVPLKP----ESPK 95

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
            R R   +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN E+EVAR+
Sbjct: 96  PRARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNNPENEVARR 154

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
           EA+ ATLERNP+AFRPKI SSPH  ++  E + +  L  KHNKGCHCKKSGCLKKYCECF
Sbjct: 155 EAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKHNKGCHCKKSGCLKKYCECF 213

Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMT 316
           QANILCSENC+C+DCKNFEGSEERR LF GDH     MQ+A NAA+ GA+G++G+ +  T
Sbjct: 214 QANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQATNAAVNGAIGATGFPSPST 273

Query: 317 SNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSA 376
           S KRK  +     + K+    RN    Q +  Q +AVP    +P+SQ+A+   LG  K  
Sbjct: 274 SRKRKHIDPSLDNSNKEHVAHRNYHISQ-SVYQKNAVP-DGPIPISQSAHPPNLGPFKVT 331

Query: 377 LRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
            R  LADI+Q +D+KELC LLV+VS EA KT 
Sbjct: 332 YRPLLADIVQEEDIKELCKLLVVVSGEAAKTY 363


>gi|218189097|gb|EEC71524.1| hypothetical protein OsI_03825 [Oryza sativa Indica Group]
          Length = 576

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 161/277 (58%), Positives = 199/277 (71%), Gaps = 3/277 (1%)

Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 89  KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 147

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           ++E AR+EA+ ATLERNP+AFRPKI SSPH  ++  E A +  L  KHNKGCHCKKSGCL
Sbjct: 148 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 207

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
           KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH     MQ+AANAA+ GA+G++
Sbjct: 208 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 267

Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASV 369
            + +  TS KRK  +     +AK+    R     Q    Q + +P    +P++Q  +   
Sbjct: 268 AFSSPSTSRKRKHIDPSLDHSAKENGAHRTNHVPQV-FYQKNGMPSDGPLPINQPVHPPT 326

Query: 370 LGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
           LG  K   R  LADI+QP D+KELC LLV+VS  A K
Sbjct: 327 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAK 363


>gi|413952361|gb|AFW85010.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 412

 Score =  320 bits (821), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 163/279 (58%), Positives = 200/279 (71%), Gaps = 8/279 (2%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
           P K +SP+   R   +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNN 189

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
            E+EVAR+EA+ ATLERNP+AFRPKI SSPH  ++  E + +  L  KHNKGCHCKKSGC
Sbjct: 190 PENEVARREAIEATLERNPDAFRPKIGSSPHTNRN-NEVSSDLPLVGKHNKGCHCKKSGC 248

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
           LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH     +Q+A NAA+ GA+G+
Sbjct: 249 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINIQQATNAAVNGAIGT 308

Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANAS 368
           +G+ +  TS KRK  +     + K+ +  RN    Q N V +       S+P+SQ+A+  
Sbjct: 309 TGFPSPSTSRKRKHIDPSLDHSNKEHAAHRNCHIPQKNAVPDG------SIPISQSAHPP 362

Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
            LG  K   R  LADI+Q +D+KELC LLV+VS EA KT
Sbjct: 363 TLGPFKVTYRPLLADIVQAEDVKELCKLLVVVSGEAAKT 401


>gi|115440083|ref|NP_001044321.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|57900356|dbj|BAD87346.1| tesmin-like [Oryza sativa Japonica Group]
 gi|113533852|dbj|BAF06235.1| Os01g0761100 [Oryza sativa Japonica Group]
 gi|215768567|dbj|BAH00796.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 619

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 161/278 (57%), Positives = 200/278 (71%), Gaps = 9/278 (3%)

Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 195

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           ++E AR+EA+ ATLERNP+AFRPKI SSPH  ++  E A +  L  KHNKGCHCKKSGCL
Sbjct: 196 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 255

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
           KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH     MQ+AANAA+ GA+G++
Sbjct: 256 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 315

Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNV-QNSAVPCSQSVPVSQTANAS 368
            + +   S KRK  +     +AK+    R       N+V Q + +P    +P++Q  +  
Sbjct: 316 AFSSPSASRKRKHIDPSLDHSAKENGAHRT------NHVPQKNGMPSDGPLPINQPVHPP 369

Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATK 406
            LG  K   R  LADI+QP D+KELC LLV+VS  A K
Sbjct: 370 TLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAK 407


>gi|222619282|gb|EEE55414.1| hypothetical protein OsJ_03534 [Oryza sativa Japonica Group]
          Length = 529

 Score =  314 bits (804), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 160/279 (57%), Positives = 198/279 (70%), Gaps = 3/279 (1%)

Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 43  KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 101

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           ++E AR+EA+ ATLERNP+AFRPKI SSPH  ++  E A +  L  KHNKGCHCKKSGCL
Sbjct: 102 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMEAAGDLPLVGKHNKGCHCKKSGCL 161

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSS 309
           KKYCECFQANILCSENC+C+DCKNFEGSEER+ALF GDH     MQ+AANAA+ GA+G++
Sbjct: 162 KKYCECFQANILCSENCKCMDCKNFEGSEERKALFQGDHKNSIHMQQAANAAVNGAIGAA 221

Query: 310 GYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASV 369
            + +   S KRK  +     +AK+    R     Q    Q + +P    +P++Q  +   
Sbjct: 222 AFSSPSASRKRKHIDPSLDHSAKENGAHRTNHVPQV-FYQKNGMPSDGPLPINQPVHPPT 280

Query: 370 LGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
           LG  K   R  LADI+QP D+KELC LLV+VS  A K  
Sbjct: 281 LGPFKVTYRPLLADIVQPGDVKELCKLLVVVSGNAAKAY 319


>gi|293335083|ref|NP_001169566.1| uncharacterized protein LOC100383445 [Zea mays]
 gi|224030123|gb|ACN34137.1| unknown [Zea mays]
 gi|323433699|gb|ADX60083.1| CPP transcription factor [Zea mays]
          Length = 607

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 160/280 (57%), Positives = 194/280 (69%), Gaps = 10/280 (3%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
           P K ESP+ R R   +GKD TP K+K  NCRNS+CLKLYCECFA+G YCDGCNC NC NN
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCYNCRNSKCLKLYCECFASGTYCDGCNCTNCFNN 184

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
            ++EVAR+EAV A L+RNP+AFRPKI SSPH  ++  E   +  L  KHNKGCHCKKSGC
Sbjct: 185 PDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRN-NEVPSDLPLVGKHNKGCHCKKSGC 243

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
           LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH     MQ+  NAA+ GA+G+
Sbjct: 244 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQVTNAAVNGAIGA 303

Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANAS 368
           +G+ +  TS KRK  +       K+    +N    Q N     AVP   S+P+SQ+A+  
Sbjct: 304 TGFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQKN-----AVP-DGSIPISQSAHPP 357

Query: 369 VLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
            LG      +  LAD +Q +D+KELC LLV+VS EA KT 
Sbjct: 358 TLGPF--TYKPLLADTVQAEDIKELCKLLVVVSGEAAKTY 395


>gi|358344677|ref|XP_003636414.1| Lin-54-like protein [Medicago truncatula]
 gi|355502349|gb|AES83552.1| Lin-54-like protein [Medicago truncatula]
          Length = 646

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 198/470 (42%), Positives = 253/470 (53%), Gaps = 84/470 (17%)

Query: 1   MEQGETISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQP------ 54
           ME+ ET  D A +K ARQLDF    A+  G  P     S  + PQS S   P+P      
Sbjct: 1   MEKSETTLDLANRKFARQLDFA---AACGG--PSYLTLSPPT-PQSLSKTIPRPPTPSYL 54

Query: 55  ---PKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
               +     QS V  P +  P  + Q    S  P V  H +SP         RPQ V  
Sbjct: 55  PPPAQMHLDFQSPVRRPWKQSP-PQWQQQRTSQSPPVMPHFQSP---------RPQLVSP 104

Query: 112 VNRVPHPVHKLPLPTLPPGKQE-SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
           V+R+P+   KLP+       +  SPRS+ +     KD T KKQK+CNC+NS+CLKLYCEC
Sbjct: 105 VHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKCLKLYCEC 164

Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP------------ 218
           +AAGIYCDGCNC NCHNN+ +E AR+EA+G TLE+NPNAFRPKIASSP            
Sbjct: 165 YAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVSSSI 224

Query: 219 -------------------------HGA--------------QDAREDAREAQLAAKHNK 239
                                    +GA              +  +E+  E QL  +HNK
Sbjct: 225 QLPHVFIVGVLHQSFELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGRHNK 284

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 299
           GCHCKK GCLKKYCECF AN+LCSENC+C+DCKNFEGS+  R +   +   +  +++A N
Sbjct: 285 GCHCKK-GCLKKYCECFHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQATN 342

Query: 300 AAIIGAVGS--SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 357
           AAI GAVG   S  GT +T  KRK +E   G +  DQ V    Q Q+  +   S+     
Sbjct: 343 AAINGAVGFGPSISGTHITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPLSLS 402

Query: 358 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
           +  VS TA   +   ++   RS LAD+LQ Q++K LCSLLV++S EA KT
Sbjct: 403 ASFVSDTAYKRI---SRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKT 449


>gi|357520979|ref|XP_003630778.1| Lin-54-like protein, partial [Medicago truncatula]
 gi|355524800|gb|AET05254.1| Lin-54-like protein, partial [Medicago truncatula]
          Length = 220

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 135/181 (74%), Positives = 152/181 (83%), Gaps = 6/181 (3%)

Query: 115 VPHPVHKLPLPTLPP------GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYC 168
            P P H  P PTLPP      GK ESP+S+ R N + KD TPKKQKQCNC++SRCLKLYC
Sbjct: 38  APLPEHPQPSPTLPPLPATKLGKPESPKSKSRPNFETKDATPKKQKQCNCKHSRCLKLYC 97

Query: 169 ECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
           ECFA+GIYCDGCNCVNC NNV++E AR+EAV ATLERNPNAFRPKIASSPHG +D +E+ 
Sbjct: 98  ECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEET 157

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH 288
            E ++  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFHGD 
Sbjct: 158 GEVKVLVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQ 217

Query: 289 N 289
           N
Sbjct: 218 N 218


>gi|79521003|ref|NP_197951.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|395455100|sp|F4JY84.1|TCX7_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 7; Short=AtTCX7
 gi|332006105|gb|AED93488.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 459

 Score =  284 bits (727), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 157/352 (44%), Positives = 215/352 (61%), Gaps = 27/352 (7%)

Query: 62  QSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHK 121
           QS ++ P++     +  L PP   P V++  +S ++  P     PQ          P+H+
Sbjct: 13  QSDISIPAEKMDSFQKSLEPPPCSP-VKSSQQSETEVTP-----PQKP--------PLHQ 58

Query: 122 LPLPTLPPGK---QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLYCECFAAGIYC 177
                +   +   Q+   SR     + K+ TP K+QK CNC+NS+CLKLYCECFA+G YC
Sbjct: 59  FGHNQVQKNRVSNQDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLYCECFASGSYC 118

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
           +GCNCVNCHN +E+E +RQ A+   LERNP+AF+PKIA SPHG +D +E+ ++  L  KH
Sbjct: 119 NGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQENVQQVLLIGKH 178

Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRA 297
           +KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+AL HG      ++Q+ 
Sbjct: 179 SKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERKALLHGSQVSDTYIQQM 238

Query: 298 ANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQ 357
            NAA+  A+  S Y     S KRKS+++   V +         Q Q+ N+V+ +      
Sbjct: 239 TNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQYQRANHVRRNGENSLF 293

Query: 358 SVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLT 409
           S+P     N +V GST SA RS  ++  QP  ++ELCSLLV  S +    L+
Sbjct: 294 SLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNSVDVANKLS 341


>gi|358344762|ref|XP_003636456.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
 gi|355502391|gb|AES83594.1| hypothetical protein MTR_041s0027 [Medicago truncatula]
          Length = 649

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 194/473 (41%), Positives = 249/473 (52%), Gaps = 87/473 (18%)

Query: 1   MEQGETISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQP------ 54
           ME+ ET  D A +K ARQLDF    A+  G  P     S  + PQS S   P+P      
Sbjct: 1   MEKSETTLDLANRKFARQLDFA---AACGG--PSYLTLSPPT-PQSLSKTIPRPPTPSYL 54

Query: 55  ---PKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRM 111
               +     QS V  P +  P  + Q    S  P V  H +SP         RPQ V  
Sbjct: 55  PPPAQMHLDFQSPVRRPWKQSP-PQWQQQRTSQSPPVMPHFQSP---------RPQLVSP 104

Query: 112 VNRVPHPVHKLPLPTLPPGKQE-SPRSRPRANTDGKDGTPKKQKQCNCRNSR---CLKLY 167
           V+R+P+   KLP+       +  SPRS+ +     KD T KKQK+CNC+NS+       Y
Sbjct: 105 VHRLPYSTPKLPVKRFQALNESPSPRSQSQNKAGLKDNTLKKQKRCNCKNSKSDFATARY 164

Query: 168 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP--------- 218
           CEC+AAGIYCDGCNC NCHNN+ +E AR+EA+G TLE+NPNAFRPKIASSP         
Sbjct: 165 CECYAAGIYCDGCNCQNCHNNLNNEAARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMVS 224

Query: 219 ----------------------------HGA--------------QDAREDAREAQLAAK 236
                                       +GA              +  +E+  E QL  +
Sbjct: 225 SSIQLPHVFIVGVLHQSFELKFGAVNNYYGASSYFSLLVVPFKCLEQTQEEVSEIQLIGR 284

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
           HNKGCHCKK GCLKKYCECF AN+LCSENC+C+DCKNFEGS+  R +   +   +  +++
Sbjct: 285 HNKGCHCKK-GCLKKYCECFHANVLCSENCKCIDCKNFEGSDVWRIVLQ-EECSLVQIRQ 342

Query: 297 AANAAIIGAVGS--SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVP 354
           A NAAI GAVG   S  GT +T  KRK +E   G +  DQ V    Q Q+  +   S+  
Sbjct: 343 ATNAAINGAVGFGPSISGTHITPKKRKIQESFSGKSLTDQPVSMTAQHQRELDPIASSPL 402

Query: 355 CSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
              +  VS TA   +   ++   RS LAD+LQ Q++K LCSLLV++S EA KT
Sbjct: 403 SLSASFVSDTAYKRI---SRPKFRSVLADVLQTQNVKNLCSLLVVLSKEAAKT 452


>gi|414880393|tpg|DAA57524.1| TPA: hypothetical protein ZEAMMB73_933477 [Zea mays]
          Length = 356

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 137/234 (58%), Positives = 165/234 (70%), Gaps = 6/234 (2%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
           P K ESP+ R R   +GKD TP K+K CNCRNS+CLKLYCECFA+G YCDGCNC NC NN
Sbjct: 126 PLKPESPKLRARL-YEGKDSTPTKKKCCNCRNSKCLKLYCECFASGTYCDGCNCTNCFNN 184

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
            ++EVAR+EAV A L+RNP+AFRPKI SSPH  ++  E   +  L  KHNKGCHCKKSGC
Sbjct: 185 PDNEVARREAVEAILDRNPDAFRPKIGSSPHMHRN-NEVPSDLPLVGKHNKGCHCKKSGC 243

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGS 308
           LKKYCECFQANILCSENC+C+DCKNFEGSEERR LF GDH     MQ+  NAA+ GA+G+
Sbjct: 244 LKKYCECFQANILCSENCKCMDCKNFEGSEERRNLFQGDHKNAINMQQVTNAAVNGAIGA 303

Query: 309 SGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVS 362
           +G+ +  TS KRK  +       K+    +N    Q    + + VPC  +  VS
Sbjct: 304 TGFPSPSTSRKRKHNDPSLDHFNKEHIAHKNYHIPQ----EQAEVPCYITRSVS 353


>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 912

 Score =  261 bits (666), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 155/291 (53%), Positives = 195/291 (67%), Gaps = 23/291 (7%)

Query: 128 PPGKQESPR---SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVN 184
           P  K  SPR   SR R+  DGKDGTPKK KQCNC+NSRCLKLYCECFA+G YCDGCNCVN
Sbjct: 440 PSIKHSSPRRPVSRSRSALDGKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCDGCNCVN 499

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ-LAAKHNKGCHC 243
           C NN EHEV RQEAV ATLERNPNAFRPKIA+SP    D +ED  +    AAKHNKGCHC
Sbjct: 500 CCNNSEHEVVRQEAVEATLERNPNAFRPKIANSPRILHDGKEDVVDRNGSAAKHNKGCHC 559

Query: 244 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN-GIAFMQRAANAAI 302
           KKSGCLKKYCEC+QANILCSENC+C+DC+NF  S+ERRALF GD   G++F+QR +  A 
Sbjct: 560 KKSGCLKKYCECYQANILCSENCKCVDCRNFITSDERRALFPGDPALGLSFVQRPSGCA- 618

Query: 303 IGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGN--NVQNSAVPCSQSVP 360
                 SG+ + + S KRK++++ +        ++   Q+ + N   V+N A    +   
Sbjct: 619 ------SGHMSPL-SRKRKTQDVEY--------IMPGLQTNKSNLVKVENHAATTPKMSI 663

Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSI 411
            + +A        K   RS LAD+L    +++LC LLV+VS++    ++ I
Sbjct: 664 YTASAEHIPEPMVKVVNRSLLADVLHTDVIQDLCKLLVIVSADVQHEVSGI 714


>gi|168001515|ref|XP_001753460.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695339|gb|EDQ81683.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 405

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 162/287 (56%), Positives = 196/287 (68%), Gaps = 15/287 (5%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVE 190
           K ESPR R R+  D KDGTPKK KQCNC+NSRCLKLYCECFA+G YC+GCNCVNC NNVE
Sbjct: 4   KHESPRQRSRS-FDAKDGTPKKCKQCNCKNSRCLKLYCECFASGTYCEGCNCVNCCNNVE 62

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           +E+ RQEAV ATLERNPNAFRPKI SSP G +D+ ED  E  LA KHNKGCHCKKSGCLK
Sbjct: 63  NEIVRQEAVEATLERNPNAFRPKIFSSP-GIRDSGEDVGEHPLAGKHNKGCHCKKSGCLK 121

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSS 309
           KYCECFQANILCSENC+C+DCKNF+GSEERRALF  DHN  + FMQ    + ++    S+
Sbjct: 122 KYCECFQANILCSENCKCVDCKNFDGSEERRALFQADHNMSVNFMQPLGGSGVL----SN 177

Query: 310 GYGTLMTS---NKRKSEELLF-GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTA 365
               L  S    KR++ EL+F G   K+Q       +   +   +S +    S P SQ  
Sbjct: 178 SPAYLSPSPLLKKRRTHELVFSGPGLKEQLTSGGGTAPSLSLALSSTI---ASTPSSQGV 234

Query: 366 NASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIY 412
           + +V    K   +S LA ++Q + + ELC LLV+VSSE  K +  + 
Sbjct: 235 SPAV-PPAKPMHKSLLAGVVQIEAIHELCKLLVVVSSETQKEILGML 280


>gi|147817113|emb|CAN75367.1| hypothetical protein VITISV_008552 [Vitis vinifera]
          Length = 628

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 125/184 (67%), Positives = 149/184 (80%)

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           E+A +  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEERRALFH
Sbjct: 215 EEAGKVPLVGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERRALFH 274

Query: 286 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 345
            + N +A++Q+AANAAI GA+GSSGYGT + S KRK +E+ F   A +QS  RN Q QQ 
Sbjct: 275 DNGNYMAYIQQAANAAISGAIGSSGYGTAVASKKRKQQEVSFCTTANNQSNHRNTQFQQD 334

Query: 346 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
           N+++ SAV  S S+ V+ T NA+VLGS+K   RSPLADILQPQD+K+LCSLLV+VS EA 
Sbjct: 335 NHLRASAVSPSPSISVTHTNNAAVLGSSKFTYRSPLADILQPQDVKDLCSLLVVVSQEAG 394

Query: 406 KTLT 409
           KTLT
Sbjct: 395 KTLT 398



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C+C+ S CLK YCECF A I C + C C++C N
Sbjct: 225 KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 262


>gi|224059104|ref|XP_002299717.1| predicted protein [Populus trichocarpa]
 gi|222846975|gb|EEE84522.1| predicted protein [Populus trichocarpa]
          Length = 497

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 144/302 (47%), Positives = 187/302 (61%), Gaps = 6/302 (1%)

Query: 119 VHKLPLPTLPPGKQESPR-SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
           +H LP   L   K ESPR S P+A T   +GTPK  +QC+C+ S+CLKLYCECFA+G YC
Sbjct: 57  LHVLPRSPLSFKKSESPRISLPQARTKIINGTPKSHRQCHCKQSKCLKLYCECFASGSYC 116

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
           D C+C NCHNNVE+E  R+EA    LERNPNAF+PKI  SP    D  + A++  + AKH
Sbjct: 117 DECSCANCHNNVENEDVRREAAECILERNPNAFKPKITGSPCTPPDDGDAAKDVLVMAKH 176

Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQR 296
            KGCHCK++GCLKKYCECFQANILCSENC+C+ CKN EG E    +   +H    A++Q+
Sbjct: 177 IKGCHCKRTGCLKKYCECFQANILCSENCKCVSCKNLEGRELGMVVTSENHCKTKAYIQQ 236

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ-NSAVPC 355
               +    V SS   +   S KR   E LF    +D  +    +    N V  + ++P 
Sbjct: 237 ENANSSSAVVSSSHIFS-QESRKRTFRE-LFDSNMRDTEIHEFTKHHAVNPVGVHVSIPT 294

Query: 356 SQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSIYHYI 415
            ++  V Q A+++ LGS+K   RS LAD + P D +ELCSLLV+V SEA K L   +   
Sbjct: 295 LRADCVCQIASSAALGSSKLTYRSLLADAIHPLDTRELCSLLVVV-SEAAKFLADEHCKA 353

Query: 416 QI 417
           QI
Sbjct: 354 QI 355


>gi|357139249|ref|XP_003571196.1| PREDICTED: uncharacterized protein LOC100845699 [Brachypodium
           distachyon]
          Length = 487

 Score =  240 bits (613), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 181/308 (58%), Gaps = 22/308 (7%)

Query: 115 VPHPVHKLPLPTLPPGKQE----SPRSRPRANT------DGKDGTPKKQKQCNCRNSRCL 164
           VP     +  P + P +Q+     PR  P A T      + K+ TPKK+K CNC+NSRCL
Sbjct: 19  VPAAAGGMKAPVVAPQQQKPAAPVPRPWPVAFTPMKPAAEVKNVTPKKRKHCNCKNSRCL 78

Query: 165 KLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
           KLYCECFAA + CDGCNC NC NN E+E  R+ A+ ATL RNP AF+PKI + P      
Sbjct: 79  KLYCECFAARVSCDGCNCSNCGNNSENEKIRKGAIEATLLRNPLAFQPKIENGPSTVNVR 138

Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           ++++    L  KHNKGCHCKKSGCLKKYCECFQANILCS+NCRC+DCKNFEGSEERRAL 
Sbjct: 139 KDNSGAVPLRPKHNKGCHCKKSGCLKKYCECFQANILCSKNCRCMDCKNFEGSEERRALV 198

Query: 285 HGDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQ 343
            GD+      +Q+AAN A+ GA+GSSGY       K   E+ L  +           Q Q
Sbjct: 199 QGDNTSDRNNIQQAANTALNGAIGSSGYKCSPARRKISQEDSLSEM-----------QFQ 247

Query: 344 QGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSE 403
           Q N    S +  S +         S   S+K   RSPLA+ +   D+ +L   +V+   +
Sbjct: 248 QVNLSDVSQLTPSCTGFGGHNGCYSQSNSSKMIYRSPLANTIHLSDVNDLVKHVVIACRK 307

Query: 404 ATKTLTSI 411
           A +   +I
Sbjct: 308 AAEAFPTI 315


>gi|302801582|ref|XP_002982547.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
 gi|300149646|gb|EFJ16300.1| hypothetical protein SELMODRAFT_422006 [Selaginella moellendorffii]
          Length = 460

 Score =  234 bits (596), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 200/332 (60%), Gaps = 40/332 (12%)

Query: 87  HVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGK 146
           +  + +   SQA+P  Q  P       R   PV +L   ++ P        RP    + K
Sbjct: 42  YSSSRLSRTSQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELK 90

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           +   KK KQCNC+N+RCLKLYCECFA+G YCDGCNC+NC NNVE+E  RQEAV  TLERN
Sbjct: 91  EVAAKKCKQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERN 150

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           PNAFRPKIA++ H  +D+ +D R      KHNKGC+CKKSGCLKKYCECFQANILCS+NC
Sbjct: 151 PNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKSGCLKKYCECFQANILCSDNC 206

Query: 267 RCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 320
           +C+DCKN++ S+ERRAL+  D      H G      +A  + +  V SS     M + KR
Sbjct: 207 KCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATGS-LACVSSSP----MPAKKR 261

Query: 321 KSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
           ++ +L+F G + K  Q + R  Q  Q   +  S +P + S+            S K+   
Sbjct: 262 RTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSSL------------SAKTIQT 309

Query: 379 SPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
           S LA ++QP+ + +LC+LLVM S+EA K + +
Sbjct: 310 SLLAGVVQPETVSDLCNLLVMASAEAAKAMIA 341


>gi|302798629|ref|XP_002981074.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
 gi|300151128|gb|EFJ17775.1| hypothetical protein SELMODRAFT_420712 [Selaginella moellendorffii]
          Length = 461

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 200/332 (60%), Gaps = 40/332 (12%)

Query: 87  HVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGK 146
           +  + +   SQA+P  Q  P       R   PV +L   ++ P        RP    + K
Sbjct: 42  YSSSRLSRTSQALPSSQVPPAQPGEPGR--KPVRQLNFSSILP--------RPPV-YELK 90

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           +   KK KQCNC+N+RCLKLYCECFA+G YCDGCNC+NC NNVE+E  RQEAV  TLERN
Sbjct: 91  EVAAKKCKQCNCKNTRCLKLYCECFASGTYCDGCNCLNCCNNVENEAVRQEAVEVTLERN 150

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           PNAFRPKIA++ H  +D+ +D R      KHNKGC+CKKSGCLKKYCECFQANILCS+NC
Sbjct: 151 PNAFRPKIANTQHAIRDSADDMR----LGKHNKGCNCKKSGCLKKYCECFQANILCSDNC 206

Query: 267 RCLDCKNFEGSEERRALFHGD------HNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKR 320
           +C+DCKN++ S+ERRAL+  D      H G      +A  + +  V SS     M + KR
Sbjct: 207 KCVDCKNYDSSDERRALYQPDYSQSTYHGGPPPYTTSATGS-LACVSSSP----MPAKKR 261

Query: 321 KSEELLF-GVAAK-DQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
           ++ +L+F G + K  Q + R  Q  Q   +  S +P + S+            S K+   
Sbjct: 262 RTSDLVFVGESLKVQQPIYRTLQGIQDAKLSGSMLPPNSSL------------SAKTIQT 309

Query: 379 SPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
           S LA ++QP+ + +LC+LLVM S+EA K + +
Sbjct: 310 SLLAGVVQPETVSDLCNLLVMASAEAAKAMIA 341


>gi|297605751|ref|NP_001057547.2| Os06g0332700 [Oryza sativa Japonica Group]
 gi|255677011|dbj|BAF19461.2| Os06g0332700 [Oryza sativa Japonica Group]
          Length = 448

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 128/291 (43%), Positives = 174/291 (59%), Gaps = 14/291 (4%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
            P+  +PP      +S P         + K++K CNC+NS+CLKLYCECFAAG+YCDGC+
Sbjct: 7   WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLYCECFAAGLYCDGCH 57

Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
           C  C N V +E ARQEA+ +T +RNP AF+PKI +  +     ++DA       KHNKGC
Sbjct: 58  CKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKDDAGAPASLPKHNKGC 117

Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANA 300
           HCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +  GD++     +Q AAN 
Sbjct: 118 HCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANV 177

Query: 301 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVP 360
           A+ GA+GSSGY       KR  E+ L+     + S ++  Q Q+ N+V +S +  S  + 
Sbjct: 178 ALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL- 235

Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTSI 411
             +   ++    +    RS LA+ + P D   L   LV+V  +A     + 
Sbjct: 236 --EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLTT 284


>gi|242095900|ref|XP_002438440.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
 gi|241916663|gb|EER89807.1| hypothetical protein SORBIDRAFT_10g019660 [Sorghum bicolor]
          Length = 509

 Score =  231 bits (589), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 196/321 (61%), Gaps = 10/321 (3%)

Query: 95  PSQAMPQWQARPQHVRMVNRVPHPVHKL--PLPTLPPGKQESPRSRPRANTDGKDGTP-K 151
           P   +P+  A P+    + + P P  ++   +P L P   E   S  +   + K GTP K
Sbjct: 10  PPALVPEAVAEPRAAPQLQQQPKPAVQVQPSMPVLRPWPMEITLSM-KPVVEMKSGTPAK 68

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           K+K CNC+NS+CLKLYCECFAAG YCDGCNC  C N VE+E  RQ+A+  T  RNPNAF+
Sbjct: 69  KKKHCNCKNSQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQDAINNTKLRNPNAFQ 128

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           PKI + P      R+DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DC
Sbjct: 129 PKIENGPIPPS-VRKDAGALPSLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDC 187

Query: 272 KNFEGSEERRALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVA 330
           KN+EGSEE R+   GD++     +Q+AAN A+ GA+GSSGY       KR +E+  +   
Sbjct: 188 KNYEGSEELRSTIQGDNSCDRNNLQQAANVALNGAIGSSGYRFSPVRRKRPAEDPHYQRL 247

Query: 331 AKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDM 390
             + S+++  Q Q+ NNV  S +  S        +  +  G +K   RSPLA+ +   D+
Sbjct: 248 NVEGSMMQT-QFQEANNVDVSQITSSTG---RDGSTGNFQGRSKLVYRSPLANTIHVTDV 303

Query: 391 KELCSLLVMVSSEATKTLTSI 411
            +L + LV+V  +A +  T+I
Sbjct: 304 NDLANHLVIVCRKAAERFTTI 324


>gi|242064794|ref|XP_002453686.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
 gi|241933517|gb|EES06662.1| hypothetical protein SORBIDRAFT_04g010380 [Sorghum bicolor]
          Length = 497

 Score =  230 bits (587), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 127/276 (46%), Positives = 173/276 (62%), Gaps = 15/276 (5%)

Query: 135 PRSRPRA------NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
           PR  P A      +T+ K  TPKK+K CNCRNS+CLK+YCECF    YCDGCNC NC N 
Sbjct: 43  PRQWPMAFNPLKPSTEVKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNI 102

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
           V +E AR EA+ A  +RNP+AF+PKI + P+     ++++    + AKH+KGCHCKKSGC
Sbjct: 103 VGNENARNEAIEAIRQRNPSAFQPKIENGPNTLNVRKDNSGAVPVVAKHHKGCHCKKSGC 162

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVG 307
           LKKYCEC+QAN+ CS+NCRC+ CKN EG+E+       DH +  + +Q+AAN A  G VG
Sbjct: 163 LKKYCECYQANVFCSKNCRCMHCKNSEGNEDTETSSQRDHASDRSHIQQAANVAFNGTVG 222

Query: 308 SSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS--QQGNNVQNSAV-PCSQSVPVSQT 364
           SSGY  + + ++++S E   GV  K +  +   Q+  QQGN+   S + PCS        
Sbjct: 223 SSGY--ICSLSRKRSHEDALGVRIKSEGSMPETQTQYQQGNHADVSPLAPCSTGFDRHNA 280

Query: 365 ANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 400
           AN+    S     RSPLA+ +  +++ +L   LVMV
Sbjct: 281 ANSK---SNNPIYRSPLANTIHLREVNDLVKHLVMV 313


>gi|259489914|ref|NP_001159253.1| uncharacterized protein LOC100304343 [Zea mays]
 gi|223943031|gb|ACN25599.1| unknown [Zea mays]
 gi|413936375|gb|AFW70926.1| hypothetical protein ZEAMMB73_955245 [Zea mays]
          Length = 530

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 120/271 (44%), Positives = 167/271 (61%), Gaps = 7/271 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
           +++T+ K  TPKK+K CNCRNS+CLK+YCECF    YCDGCNC NC N V +E AR EA+
Sbjct: 56  KSSTEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAI 115

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
            A  +RNP+AF+PKI +  +     ++++    L AKH+KGCHCKKSGCLKKYCEC+QAN
Sbjct: 116 EAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQAN 175

Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
           + CS+NCRC+ CKN EG+++       DH +    +++AAN A  G+VGSSGY  + + +
Sbjct: 176 VFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGY--ICSLS 233

Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSAL 377
           +++S E    V    +  +   Q QQGN+   S + PCS        AN           
Sbjct: 234 RKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCSTGFDGHNAANTKPYNPN---Y 290

Query: 378 RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
           RSPLA+ +   ++ +L   LVMV   A  T+
Sbjct: 291 RSPLANTIHLSEVNDLVKQLVMVCRMAAATI 321


>gi|413936376|gb|AFW70927.1| hypothetical protein ZEAMMB73_955245, partial [Zea mays]
          Length = 485

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 120/271 (44%), Positives = 167/271 (61%), Gaps = 7/271 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
           +++T+ K  TPKK+K CNCRNS+CLK+YCECF    YCDGCNC NC N V +E AR EA+
Sbjct: 56  KSSTEEKSVTPKKKKHCNCRNSKCLKMYCECFQELQYCDGCNCSNCGNIVGNENARNEAI 115

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
            A  +RNP+AF+PKI +  +     ++++    L AKH+KGCHCKKSGCLKKYCEC+QAN
Sbjct: 116 EAIRQRNPSAFQPKIGNDSNTLNVRKDNSGAVPLVAKHHKGCHCKKSGCLKKYCECYQAN 175

Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
           + CS+NCRC+ CKN EG+++       DH +    +++AAN A  G+VGSSGY  + + +
Sbjct: 176 VFCSKNCRCMHCKNSEGNKDTETSIQRDHASDRNHIEQAANIAFNGSVGSSGY--ICSLS 233

Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAV-PCSQSVPVSQTANASVLGSTKSAL 377
           +++S E    V    +  +   Q QQGN+   S + PCS        AN           
Sbjct: 234 RKRSHEDTQCVRINSEGSMPETQYQQGNHADVSMLPPCSTGFDGHNAANTKPYNPN---Y 290

Query: 378 RSPLADILQPQDMKELCSLLVMVSSEATKTL 408
           RSPLA+ +   ++ +L   LVMV   A  T+
Sbjct: 291 RSPLANTIHLSEVNDLVKQLVMVCRMAAATI 321


>gi|50725423|dbj|BAD32896.1| tesmin-like [Oryza sativa Japonica Group]
 gi|50725495|dbj|BAD32965.1| tesmin-like [Oryza sativa Japonica Group]
          Length = 473

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 128/308 (41%), Positives = 174/308 (56%), Gaps = 32/308 (10%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKL--------------- 166
            P+  +PP      +S P         + K++K CNC+NS+CLKL               
Sbjct: 58  WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKLACSSILDLFTGQIDQ 108

Query: 167 ---YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
              YCECFAAG+YCDGC+C  C N V +E ARQEA+ +T +RNP AF+PKI +  +    
Sbjct: 109 LYGYCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNL 168

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            ++DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +
Sbjct: 169 RKDDAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLI 228

Query: 284 FHGDHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 342
             GD++     +Q AAN A+ GA+GSSGY       KR  E+ L+     + S ++  Q 
Sbjct: 229 TQGDNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QF 287

Query: 343 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 402
           Q+ N+V +S +  S  +   +   ++    +    RS LA+ + P D   L   LV+V  
Sbjct: 288 QEANHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCR 344

Query: 403 EATKTLTS 410
           +A     +
Sbjct: 345 KAADAFLT 352


>gi|293336528|ref|NP_001167886.1| uncharacterized protein LOC100381595 [Zea mays]
 gi|223944643|gb|ACN26405.1| unknown [Zea mays]
 gi|413944443|gb|AFW77092.1| hypothetical protein ZEAMMB73_278537 [Zea mays]
          Length = 499

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 121/252 (48%), Positives = 160/252 (63%), Gaps = 6/252 (2%)

Query: 161 SRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG 220
           S+CLKLYCECFAAG YCDGCNC  C N VE+E  RQEA+  T  RNPNAF+PKI + P  
Sbjct: 80  SQCLKLYCECFAAGDYCDGCNCKQCGNTVENEKGRQEAINNTKLRNPNAFQPKIENGPIP 139

Query: 221 AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
           +    +DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C+DCKN+EGSEE 
Sbjct: 140 SS-VWKDAGALPSHPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCMDCKNYEGSEEL 198

Query: 281 RALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN 339
           R+   GD++     +Q+AAN A+ GA+GSSGY       KR  E+        + S+++ 
Sbjct: 199 RSTTQGDNSCDRNNIQQAANVALNGAIGSSGYRFSPVRRKRPPEDPHCQTLNVEGSMMQT 258

Query: 340 PQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVM 399
            Q ++ NNV  S +  S        +  +  G++K   RSPLA+ +   D+ +L + LV+
Sbjct: 259 -QFRETNNVDVSQMTSSTG---RDGSTVNFQGTSKLVYRSPLANTIHITDVNDLANHLVI 314

Query: 400 VSSEATKTLTSI 411
              +A +  T+I
Sbjct: 315 ACRKAAERFTTI 326


>gi|357141227|ref|XP_003572141.1| PREDICTED: uncharacterized protein LOC100829506 [Brachypodium
           distachyon]
          Length = 540

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 116/266 (43%), Positives = 157/266 (59%), Gaps = 13/266 (4%)

Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
            +++  +GTP++ K C C+NS+CLKLYC+CFA+G YC D CNC NC N+V HE ARQ+A+
Sbjct: 137 TSSESINGTPRR-KNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAI 195

Query: 200 GATLERNPNAFRPKIASSP-HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
            A +ERNP AF PKI + P H AQ+    A E     KH KGC CK+S CLKKYCECFQ+
Sbjct: 196 NAVMERNPVAFMPKIGNIPRHAAQNREYRAAEGPRVGKHMKGCQCKRSECLKKYCECFQS 255

Query: 259 NILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSN 318
           N+LCSENC+C DCKN+E SE+ + +         ++    N A+ G +G S    ++   
Sbjct: 256 NVLCSENCKCTDCKNYESSEDMKEMRRMTQQHGVYVHHVQNLALKGMIGPSA---VLPRA 312

Query: 319 KRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALR 378
             K   L      +DQ +  N  SQ          P   SVP   T ++  LG      R
Sbjct: 313 AEKFSGLSVASLGRDQPINNNDSSQ-------VLSPLLTSVPTEDTESSVRLGRHGVTYR 365

Query: 379 SPLADILQPQDMKELCSLLVMVSSEA 404
           + LADI+Q +D+  LC +LV+VS +A
Sbjct: 366 TLLADIIQIEDVNVLCKVLVLVSRQA 391



 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 7/79 (8%)

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR---- 281
           E + E+       K C CK S CLK YC+CF +   C+++C C +C N    E  R    
Sbjct: 136 ETSSESINGTPRRKNCKCKNSKCLKLYCDCFASGRYCNDDCNCKNCCNDVSHETARQDAI 195

Query: 282 -ALFHGDHNGIAFMQRAAN 299
            A+   + N +AFM +  N
Sbjct: 196 NAVM--ERNPVAFMPKIGN 212


>gi|222635505|gb|EEE65637.1| hypothetical protein OsJ_21206 [Oryza sativa Japonica Group]
          Length = 437

 Score =  204 bits (518), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 112/246 (45%), Positives = 150/246 (60%), Gaps = 5/246 (2%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YCECFAAG+YCDGC+C  C N V +E ARQEA+ +T +RNP AF+PKI +  +     ++
Sbjct: 76  YCECFAAGLYCDGCHCKQCGNYVGNESARQEAINSTKQRNPKAFQPKIENGSNALNLRKD 135

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 286
           DA       KHNKGCHCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +  G
Sbjct: 136 DAGAPASLPKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQG 195

Query: 287 DHNGI-AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQG 345
           D++     +Q AAN A+ GA+GSSGY       KR  E+ L+     + S ++  Q Q+ 
Sbjct: 196 DNSSDRNNIQHAANVALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQ-TQFQEA 254

Query: 346 NNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
           N+V +S +  S  +   +   ++    +    RS LA+ + P D   L   LV+V  +A 
Sbjct: 255 NHVDSSEITSSTGL---EGCYSNYQSRSNVVYRSALANTISPTDATGLAKHLVIVCRKAA 311

Query: 406 KTLTSI 411
               + 
Sbjct: 312 DAFLTT 317



 Score = 45.8 bits (107), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           PK  K C+C+ S CLK YCECF A I C   C C +C N
Sbjct: 144 PKHNKGCHCKKSGCLKKYCECFQANILCSKNCKCQDCKN 182


>gi|307108583|gb|EFN56823.1| hypothetical protein CHLNCDRAFT_144374 [Chlorella variabilis]
          Length = 702

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 95/140 (67%), Positives = 107/140 (76%), Gaps = 11/140 (7%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC+NSRCLKLYCECFA+G YCDGCNC+NC+NN EHE  RQ AV A LERNPNAFRP
Sbjct: 153 KKHCNCKNSRCLKLYCECFASGRYCDGCNCLNCYNNREHEATRQSAVEAILERNPNAFRP 212

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KIA    GA + R         A+HNKGC+CKKSGCLKKYCECFQA I+CSE C+CLDCK
Sbjct: 213 KIAP---GADNRRAG------GARHNKGCNCKKSGCLKKYCECFQAGIVCSEICKCLDCK 263

Query: 273 NFEGSEERRALF--HGDHNG 290
           N+EGSE R AL    G+  G
Sbjct: 264 NYEGSEARDALVSPQGERQG 283


>gi|47848051|dbj|BAD21836.1| putative tesmin/TSO1-like CXC domain-containing protein [Oryza
           sativa Japonica Group]
          Length = 508

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 113/252 (44%), Positives = 149/252 (59%), Gaps = 18/252 (7%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
           K+ +PKK+K CNC+NS     YCECFAA +YCDGC+C  C N +E+E  R+EA+   L R
Sbjct: 115 KNASPKKRKHCNCKNS-----YCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLR 169

Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
           NP AF+PKI +SP+     ++++       KHNKGCHC+KS CLKKYCECFQANILCS+N
Sbjct: 170 NPLAFQPKIENSPNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKN 229

Query: 266 CRCLDCKNFEGSEERRALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
           CRC DCKNFEGSEER+AL       D N I   Q AAN A+ GA GS GY       K+ 
Sbjct: 230 CRCQDCKNFEGSEERKALVQMKNASDRNHI---QEAANFALNGATGSLGYKNSPVRRKKY 286

Query: 322 SEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPL 381
            E  L       + ++   Q Q  ++   S +  S +      A      S++   RSPL
Sbjct: 287 QENSL------GEQILSEAQFQPTDHADVSQLASSCTGFGGDIAINYQSKSSEMIYRSPL 340

Query: 382 ADILQPQDMKEL 393
           A+ +   ++ +L
Sbjct: 341 ANTIPLIEVNDL 352



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + N++     PK  K C+CR S CLK YCECF A I C   C C +C N
Sbjct: 189 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 237


>gi|412988366|emb|CCO17702.1| unnamed protein product [Bathycoccus prasinos]
          Length = 321

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 99/182 (54%), Positives = 119/182 (65%), Gaps = 23/182 (12%)

Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNC 185
           +PP +   P +     TD        +K CNCRNSRCLKLYCECFA+G YC   CNCV C
Sbjct: 2   IPPVEGGLPLNNANIQTD--------RKTCNCRNSRCLKLYCECFASGQYCLQNCNCVEC 53

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NNVE+ V RQEAV ATLERNPNAFRPK+  + +G +             KHNKGCHCKK
Sbjct: 54  KNNVENSVQRQEAVEATLERNPNAFRPKVLPTLNGGEGDE----------KHNKGCHCKK 103

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGA 305
           SGCLKKYCECFQA+I CS+ CRCLDCKNFEGS+  +A+         FM   ++A+++ A
Sbjct: 104 SGCLKKYCECFQASIFCSDMCRCLDCKNFEGSDAHKAIASQQE----FMPAGSDASLLVA 159

Query: 306 VG 307
            G
Sbjct: 160 PG 161


>gi|224166209|ref|XP_002338900.1| predicted protein [Populus trichocarpa]
 gi|222873866|gb|EEF10997.1| predicted protein [Populus trichocarpa]
          Length = 154

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 94/152 (61%), Positives = 114/152 (75%), Gaps = 4/152 (2%)

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           E+  E  +  KHNKGCHCKKSGCLKKYCECFQANILCSENC+C+DCKNFEGSEER+ALFH
Sbjct: 1   EETGEGLVFVKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFH 60

Query: 286 GDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQ 344
           GDH N +A++Q+AANAAI GA+GSSGY +L  S KRK +EL FG   KD S  R    QQ
Sbjct: 61  GDHGNNMAYIQQAANAAITGAIGSSGYASLPVSRKRKGQELFFGQTVKDPSFHRLGHFQQ 120

Query: 345 GNNVQNSAVPCS--QSVPVSQTANASVLGSTK 374
            ++++  A P S   S PV++  N + LGS+K
Sbjct: 121 ASHIR-PAAPSSPLPSNPVARAGNTTTLGSSK 151



 Score = 45.4 bits (106), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C+C+ S CLK YCECF A I C + C C++C N
Sbjct: 11  KHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 48


>gi|303274438|ref|XP_003056539.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226462623|gb|EEH59915.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 255

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 84/130 (64%), Positives = 99/130 (76%), Gaps = 6/130 (4%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K++K CNCRNSRCLKLYCECFA+G++CD CNCVNC NN+     R  AV +TLERNPNAF
Sbjct: 4   KRKKYCNCRNSRCLKLYCECFASGLHCDSCNCVNCSNNLASAAIRCTAVESTLERNPNAF 63

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           RPKIA      +  RE       A++HN+GCHCKKS CLKKYCECFQANI CS+ CRC++
Sbjct: 64  RPKIAPGVGPLEKRREG------ASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCME 117

Query: 271 CKNFEGSEER 280
           CKNF GS +R
Sbjct: 118 CKNFLGSSQR 127



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           ++G  +  + C+C+ S CLK YCECF A I+C D C C+ C N
Sbjct: 78  REGASRHNRGCHCKKSSCLKKYCECFQANIFCSDICRCMECKN 120



 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 27/39 (69%), Gaps = 1/39 (2%)

Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           +K  K C+C+ S CLK YCECF + + C ++C C++C N
Sbjct: 3   SKRKKYCNCRNSRCLKLYCECFASGLHC-DSCNCVNCSN 40


>gi|218190473|gb|EEC72900.1| hypothetical protein OsI_06724 [Oryza sativa Indica Group]
          Length = 274

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 94/173 (54%), Positives = 115/173 (66%), Gaps = 8/173 (4%)

Query: 159 RNSRCLK-LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
           R +R +K  YCECFAA +YCDGC+C  C N +E+E  R+EA+   L RNP AF+PKI +S
Sbjct: 79  RATRLVKSWYCECFAARVYCDGCHCSPCGNKIENENIRKEAIETILLRNPLAFQPKIENS 138

Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
           P+     ++++       KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGS
Sbjct: 139 PNTVTVRKDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGS 198

Query: 278 EERRALFH----GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 326
           EER+AL       D N I   Q AAN A+ GA GS GY +     K+  E  L
Sbjct: 199 EERKALVQMKNASDRNHI---QEAANFALSGATGSLGYKSSPVRRKKYQENSL 248



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + N++     PK  K C+CR S CLK YCECF A I C   C C +C N
Sbjct: 146 KDNSEAIPPIPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 194


>gi|91088685|ref|XP_974962.1| PREDICTED: similar to Lin54 protein [Tribolium castaneum]
          Length = 695

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 78/163 (47%), Positives = 103/163 (63%), Gaps = 10/163 (6%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ES    P +N +     P+K   CNC  S+CLKLYC+CFA G +C  CNC+NC NN+E+E
Sbjct: 464 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 521

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
             RQ A+   LERNPNAFRPKI  +   A D+           KH KGC+CK+SGCLK Y
Sbjct: 522 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 573

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 295
           CEC++A I CS NC+C+ C+N E S E+++L     +G+  +Q
Sbjct: 574 CECYEAKIACSNNCKCMGCRNIEDSMEKKSLRASIEDGLHNLQ 616


>gi|308814306|ref|XP_003084458.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
 gi|116056343|emb|CAL56726.1| tesmin/TSO1-like CXC domain-containing protein (ISS) [Ostreococcus
           tauri]
          Length = 766

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 79/143 (55%), Positives = 95/143 (66%), Gaps = 16/143 (11%)

Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVAR 195
           + RA     DG     K CNC+NSRCLKLYCECFAAG  C    C C NC N VEH   R
Sbjct: 5   KDRAQIQRGDG-----KGCNCKNSRCLKLYCECFAAGALCASARCRCANCMNTVEHAGER 59

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
             A+   L+RNPNAFRPKIA+    ++D         +A +HN+GCHCK+SGCLKKYCEC
Sbjct: 60  TAAIETVLDRNPNAFRPKIAAVASPSRD---------VALRHNRGCHCKRSGCLKKYCEC 110

Query: 256 FQANILCSENCRCLDCKNFEGSE 278
           FQA I C+E CRC+ C+N+EGS+
Sbjct: 111 FQAAIYCAETCRCVSCENYEGSD 133


>gi|270011672|gb|EFA08120.1| hypothetical protein TcasGA2_TC005724 [Tribolium castaneum]
          Length = 752

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 78/163 (47%), Positives = 103/163 (63%), Gaps = 10/163 (6%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           ES    P +N +     P+K   CNC  S+CLKLYC+CFA G +C  CNC+NC NN+E+E
Sbjct: 521 ESITMTPESNLESNGIRPRKP--CNCTKSQCLKLYCDCFANGEFCYMCNCMNCFNNLENE 578

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
             RQ A+   LERNPNAFRPKI  +   A D+           KH KGC+CK+SGCLK Y
Sbjct: 579 DHRQRAIKTCLERNPNAFRPKIGKAKDVAGDSS--------IRKHTKGCNCKRSGCLKNY 630

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQ 295
           CEC++A I CS NC+C+ C+N E S E+++L     +G+  +Q
Sbjct: 631 CECYEAKIACSNNCKCMGCRNIEDSMEKKSLRASIEDGLHNLQ 673


>gi|145356362|ref|XP_001422401.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582643|gb|ABP00718.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 151

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 82/141 (58%), Positives = 99/141 (70%), Gaps = 2/141 (1%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEV 193
           SPRS     +  +D +  ++K+CNC+NS+CLKLYCECFA+G YCD CNC  C NN EH  
Sbjct: 12  SPRSIAARASTPRD-SASQRKRCNCKNSKCLKLYCECFASGKYCDACNCAGCKNNDEHAH 70

Query: 194 ARQEAVGATLERNPNAFRPKIAS-SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
            RQ A+  TLERNPNAFRPKI + + +G  D       A   A+HN+GCHCK+SGCLKKY
Sbjct: 71  ERQSAIEQTLERNPNAFRPKIINQTTNGTLDGAGAGDGATTEARHNRGCHCKRSGCLKKY 130

Query: 253 CECFQANILCSENCRCLDCKN 273
           CECFQA I C E CRC +CKN
Sbjct: 131 CECFQAAIYCVERCRCAECKN 151


>gi|428168961|gb|EKX37899.1| hypothetical protein GUITHDRAFT_44838, partial [Guillardia theta
           CCMP2712]
          Length = 138

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 80/130 (61%), Positives = 100/130 (76%), Gaps = 4/130 (3%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERN 206
           GTP K+K CNC+NS+CLKLYCECFA+G+YC+  CNC +CHNNV  E  RQ AV +TLERN
Sbjct: 9   GTPAKRKPCNCKNSKCLKLYCECFASGLYCEVQCNCHSCHNNVRFEWDRQAAVVSTLERN 68

Query: 207 PNAFRPKIA-SSPHGAQDA-REDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCS 263
           P+AFRPKI  +   G  D+  + AR      A+H KGCHCKKSGCLKKYCECFQA ++CS
Sbjct: 69  PHAFRPKITQTGQQGIVDSPAKGARWLMFVKARHMKGCHCKKSGCLKKYCECFQAAVMCS 128

Query: 264 ENCRCLDCKN 273
           + C+C++C+N
Sbjct: 129 DTCKCIECRN 138


>gi|222640454|gb|EEE68586.1| hypothetical protein OsJ_27092 [Oryza sativa Japonica Group]
          Length = 612

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 130/368 (35%), Positives = 185/368 (50%), Gaps = 39/368 (10%)

Query: 54  PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVRMV 112
           PP  + QVQ   +   Q  P  +L++LP      +Q   RSP    P   Q  P+  R +
Sbjct: 82  PPAKKLQVQQSSSMLPQDFP--KLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMERPL 138

Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
              P     + L +  P  +          ++ KD T  ++  C+C+ S+CLKLYCECF 
Sbjct: 139 LPRPATFIDVMLSSQKPSSE--------MWSETKDVTLTRKTNCSCKYSKCLKLYCECFE 190

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
            G YC GCNC NC NNV HE  RQ+A+   LERNP AF PK+ +S   A +    A E  
Sbjct: 191 KGRYCIGCNCTNCCNNVNHENYRQDAINVALERNPAAFMPKVLNS--TAHNCESKAAEGD 248

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHN 289
           +  KH KGC CK++  LKKYCECF+A++ CSENCRC  CKNF+ +E+R   +   H  H 
Sbjct: 249 IVGKHTKGCKCKRTEYLKKYCECFKASVFCSENCRCTGCKNFKSNEDRISQKNTVHAHH- 307

Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
                    N A  G VG S    ++         L    +  D S+  N  +     +Q
Sbjct: 308 -------VQNPASSGMVGQS---VIIFHAAENDSSLSLAASVSDHSI--NNNTSHVILLQ 355

Query: 350 NS-------AVPCSQSVPVSQTANASVLGST--KSALRSPLADILQPQDMKELCSLLVMV 400
           ++       A+ CS +  V++    S++     +   R  LADI+Q +D+ ELC+LL++V
Sbjct: 356 HTFQISPFQALSCSLTSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILV 415

Query: 401 SSEATKTL 408
           S +A  TL
Sbjct: 416 SRQAGATL 423


>gi|302780747|ref|XP_002972148.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
 gi|302791403|ref|XP_002977468.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300154838|gb|EFJ21472.1| hypothetical protein SELMODRAFT_37438 [Selaginella moellendorffii]
 gi|300160447|gb|EFJ27065.1| hypothetical protein SELMODRAFT_37441 [Selaginella moellendorffii]
          Length = 116

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 78/116 (67%), Positives = 86/116 (74%), Gaps = 9/116 (7%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD--- 223
           YCECFA+G+YCDGCNC+NC NN E+E  RQEAV   LERNPNAFRPKI S      D   
Sbjct: 1   YCECFASGVYCDGCNCLNCWNNSENEKVRQEAVELALERNPNAFRPKIGSKSVSVSDFLI 60

Query: 224 -----AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                 +E  RE Q   A+HNKGC CKKSGCLKKYCECFQANILCSENCRC+DC+N
Sbjct: 61  CVFSLIQELVREPQATVARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116



 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
             +  K C C+ S CLK YCECF A I C + C CV+C N
Sbjct: 77  VARHNKGCQCKKSGCLKKYCECFQANILCSENCRCVDCRN 116


>gi|218201063|gb|EEC83490.1| hypothetical protein OsI_29022 [Oryza sativa Indica Group]
          Length = 586

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 126/359 (35%), Positives = 176/359 (49%), Gaps = 47/359 (13%)

Query: 54  PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVRMV 112
           PP  + QVQ   +   Q  P  +L++LP      +Q   RSP    P   Q  P+  R +
Sbjct: 82  PPAKKLQVQQSSSMLPQDFP--KLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMERPL 138

Query: 113 NRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFA 172
              P     + L +  P  +          ++ KD T  ++  C+C+ S+CLKLYCECF 
Sbjct: 139 LPRPATFIDVMLSSQKPSSE--------MWSETKDVTLTRKTNCSCKYSKCLKLYCECFE 190

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
            G YC GCNC NC NNV HE ARQ+A+   LERNP AF PK+++S   A +    A E  
Sbjct: 191 KGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVSNS--TAHNCESKAAEGD 248

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGDHN 289
           +  KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R   +   H  H 
Sbjct: 249 IVGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAHH- 307

Query: 290 GIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQ 349
                    N A  G VG S               ++F  A  D S+     +   +   
Sbjct: 308 -------VQNPASSGMVGQSV--------------IIFHAAENDSSLSLAASALSCSL-- 344

Query: 350 NSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTL 408
                 + SV      +       +   R  LADI+Q +D+ ELC+LL++VS +A  TL
Sbjct: 345 ------TSSVTREDAKSLVKYVPHEVTYRPSLADIIQIEDVNELCNLLILVSRQAGATL 397


>gi|157111432|ref|XP_001651563.1| hypothetical protein AaeL_AAEL005893 [Aedes aegypti]
 gi|108878391|gb|EAT42616.1| AAEL005893-PA [Aedes aegypti]
          Length = 757

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 75/146 (51%), Positives = 97/146 (66%), Gaps = 11/146 (7%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           G ++  R  P   T   D   KK + CNC  S+CLKLYC+CFA G +C  CNC +C+NN+
Sbjct: 467 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNL 525

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           ++E  RQ+A+ ATLERNP+AF+PKI     GA  A EDA        H KGC+CK+SGCL
Sbjct: 526 DNEEERQKAIRATLERNPSAFKPKI-----GAVSADEDALRL-----HTKGCNCKRSGCL 575

Query: 250 KKYCECFQANILCSENCRCLDCKNFE 275
           K YCEC++A I CS NC+C+ C+N E
Sbjct: 576 KNYCECYEAKIACSANCKCIGCRNTE 601



 Score = 47.0 bits (110), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+C KS CLK YC+CF     C  NC C DC N   +EE R
Sbjct: 492 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 531


>gi|328722237|ref|XP_001950656.2| PREDICTED: protein lin-54 homolog [Acyrthosiphon pisum]
          Length = 750

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 78/176 (44%), Positives = 107/176 (60%), Gaps = 23/176 (13%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
           ++P++ +D      K +K CNC  S+CLKLYC+CFA G +C  CNC +C+NN+EHE  R 
Sbjct: 525 TKPKSTSD----VNKTKKPCNCTRSQCLKLYCDCFANGEFCYQCNCNSCYNNMEHEEDRA 580

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
           +A+ + L+RNPNAFRPKI S               Q   +H KGC+CK+SGCLK YCECF
Sbjct: 581 QAIRSVLDRNPNAFRPKIKSFV------------GQAERQHTKGCNCKRSGCLKNYCECF 628

Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHGDHNGIA----FMQRAANAAIIGAVGS 308
           +A I CS+NC+C+ C+N E  ++    F GDH G         R  +A  + + GS
Sbjct: 629 EARIACSQNCKCIGCRNME--DDSHTSF-GDHPGSLSVDNLKSRTKSANFVTSFGS 681


>gi|427785019|gb|JAA57961.1| Putative dna-dependent [Rhipicephalus pulchellus]
          Length = 916

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/158 (52%), Positives = 102/158 (64%), Gaps = 17/158 (10%)

Query: 147 DGTP-----KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
           +GTP     + +K CNC  S+CLKLYC+CFA G +C  CNC NC NN+EHE  RQ+A+GA
Sbjct: 652 NGTPCEDNSRPRKPCNCTKSQCLKLYCDCFANGEFCHSCNCNNCFNNLEHEEERQKAIGA 711

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
            LERNPNAFRPKI     G  + R           H KGC+CK+SGCLK YCEC++A IL
Sbjct: 712 CLERNPNAFRPKIGKGKEGDHERR-----------HTKGCNCKRSGCLKNYCECYEAKIL 760

Query: 262 CSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
           CS  C+C+ CKNFE S ER+ L    D   +   Q+AA
Sbjct: 761 CSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQQQAA 798


>gi|323457208|gb|EGB13074.1| hypothetical protein AURANDRAFT_60657 [Aureococcus anophagefferens]
          Length = 1772

 Score =  156 bits (395), Expect = 2e-35,   Method: Composition-based stats.
 Identities = 93/268 (34%), Positives = 124/268 (46%), Gaps = 59/268 (22%)

Query: 54  PPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVN 113
           PP++    +  +A   +  P   + LLPP S     A  +    A P W +   + R   
Sbjct: 204 PPRAGTYKERAIARVHELVPATPVALLPPRS-----AGEKETKPAAPGWLS-GLYARAAE 257

Query: 114 RVPHPVHKLPLPTLPPGKQESPRSR-----------------------PRANTDGKDG-- 148
           +    +      T  P  + S  S+                       P A  D  DG  
Sbjct: 258 KEERVLQASATTTPEPPAKRSRTSKTSQGLRGAAAPPARGGRGGGAATPTAEVDDDDGGD 317

Query: 149 --------TPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAV 199
                     K+ K CNCRNSRCLKLYC+CFAAG +C+  C CV+CHN+  H   R +A+
Sbjct: 318 RPTAAKASATKRFKPCNCRNSRCLKLYCDCFAAGRFCEESCKCVDCHNDQAHARDRDDAI 377

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
            ATLE+NP AFR K+ +                 A  H  GCHCKK+ CLKKYCECF+A 
Sbjct: 378 KATLEKNPKAFRAKVDAE----------------ATTHQNGCHCKKTKCLKKYCECFEAG 421

Query: 260 ILCSENCRCLDCKNFEGSEE---RRALF 284
           I C   C+C DC+N+ GS +   RR L+
Sbjct: 422 ITCGAKCKCADCENYPGSLQLAARRGLY 449


>gi|403340562|gb|EJY69569.1| hypothetical protein OXYTRI_09693 [Oxytricha trifallax]
          Length = 596

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/164 (53%), Positives = 100/164 (60%), Gaps = 28/164 (17%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC+NS+CLKLYCECFA+G YC  CNC  C NN+E+E  R+E +   LERNPNAFRP
Sbjct: 33  KKNCNCKNSKCLKLYCECFASGEYCKNCNCNGCCNNIENESIRKETIAIILERNPNAFRP 92

Query: 213 KIAS----SPHGAQDA----------------------REDAREAQLAAKHNKGCHCKKS 246
           KIAS    SP   Q                             +   A KH KGC CKKS
Sbjct: 93  KIASLPPQSPQITQKVVGVTGGPINNPSQINIGLSTPIMTGVNDGVAAGKHAKGCACKKS 152

Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNG 290
           GCLKKYCECFQA I CS+NC+C DCKNF+GS +RR L   D NG
Sbjct: 153 GCLKKYCECFQAGIFCSDNCKCCDCKNFDGSLDRRILL--DSNG 194



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/38 (57%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C C+ S CLK YCECF AGI+C D C C +C N
Sbjct: 142 KHAKGCACKKSGCLKKYCECFQAGIFCSDNCKCCDCKN 179


>gi|302851277|ref|XP_002957163.1| TSO1-like protein [Volvox carteri f. nagariensis]
 gi|300257570|gb|EFJ41817.1| TSO1-like protein [Volvox carteri f. nagariensis]
          Length = 1133

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 72/128 (56%), Positives = 84/128 (65%), Gaps = 3/128 (2%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 207
           G P  Q Q     ++    YCECFA+G YC+ CNCV C NN EHE  RQ AV A LERNP
Sbjct: 128 GLPLFQAQAGNMTAQS---YCECFASGRYCENCNCVQCFNNREHEATRQSAVEAILERNP 184

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
           NAFRPKI S+      A      A    +H KGC+CKKS CLKKYCECFQA I CS+NC+
Sbjct: 185 NAFRPKIQSNEQAPAAAAAVVNNAAAPGRHLKGCNCKKSSCLKKYCECFQAGIYCSDNCK 244

Query: 268 CLDCKNFE 275
           C++CKN+E
Sbjct: 245 CVECKNYE 252



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/38 (65%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           K CNC+ S CLK YCECF AGIYC D C CV C N  E
Sbjct: 216 KGCNCKKSSCLKKYCECFQAGIYCSDNCKCVECKNYEE 253


>gi|281363313|ref|NP_001163143.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
 gi|272432466|gb|ACZ94416.1| Myb-interacting protein 120, isoform B [Drosophila melanogaster]
          Length = 952

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
           QA  R   Q +P  Q+ P  V      P       LPTLPPG + +  ++P         
Sbjct: 674 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 722

Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
                   D   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R
Sbjct: 723 TAAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 782

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
           + A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC
Sbjct: 783 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 830

Query: 256 FQANILCSENCRCLDCKNFE 275
           ++A I CS  C+C+ C+N E
Sbjct: 831 YEAKIPCSSICKCVGCRNME 850


>gi|449676261|ref|XP_002165629.2| PREDICTED: uncharacterized protein LOC100197918, partial [Hydra
           magnipapillata]
          Length = 530

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/171 (45%), Positives = 102/171 (59%), Gaps = 13/171 (7%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           +S +S  +AN      +   +K CNC  S+CLKLYCECFA G +C+ CNC  C NN+ HE
Sbjct: 298 DSNKSASKANVGPITESKILKKPCNCTKSQCLKLYCECFANGEFCNNCNCRICFNNIAHE 357

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
           V R +A+ + L+RNP AF PKI          R          +H KGC+C++SGCLK Y
Sbjct: 358 VERSKAIKSCLDRNPYAFHPKIGKGKIKGDTER----------RHTKGCNCRRSGCLKNY 407

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAANAAI 302
           CEC++A ILCS  C+C  CKNFE S +R+ L    D   +  +Q+  NAAI
Sbjct: 408 CECYEAKILCSTLCKCSGCKNFEESADRKTLMQLADAAEVRVLQQ--NAAI 456


>gi|24653408|ref|NP_610879.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|190359804|sp|A1Z9E2.1|LIN54_DROME RecName: Full=Protein lin-54 homolog; AltName: Full=Myb complex
           protein of 120 kDa
 gi|7303305|gb|AAF58365.1| Myb-interacting protein 120, isoform A [Drosophila melanogaster]
 gi|372466647|gb|AEX93136.1| FI18123p1 [Drosophila melanogaster]
          Length = 950

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
           QA  R   Q +P  Q+ P  V      P       LPTLPPG + +  ++P         
Sbjct: 672 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 720

Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
                   D   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R
Sbjct: 721 TAAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 780

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
           + A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC
Sbjct: 781 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 828

Query: 256 FQANILCSENCRCLDCKNFE 275
           ++A I CS  C+C+ C+N E
Sbjct: 829 YEAKIPCSSICKCVGCRNME 848


>gi|159884199|gb|ABX00778.1| RE32583p [Drosophila melanogaster]
          Length = 950

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 110/200 (55%), Gaps = 36/200 (18%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP--------- 139
           QA  R   Q +P  Q+ P  V      P       LPTLPPG + +  ++P         
Sbjct: 672 QAKQRIRQQQLPTEQSTPIKVE-----PK------LPTLPPGVKANVPAKPLFEVLKPPA 720

Query: 140 ----RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR 195
                   D   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R
Sbjct: 721 TAAAAGAVDPLGGMTSRRKHCNCSRSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVER 780

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
           + A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC
Sbjct: 781 ERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCEC 828

Query: 256 FQANILCSENCRCLDCKNFE 275
           ++A I CS  C+C+ C+N E
Sbjct: 829 YEAKIPCSSICKCVGCRNME 848


>gi|320163778|gb|EFW40677.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 491

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 73/122 (59%), Positives = 91/122 (74%), Gaps = 13/122 (10%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           +K CNC+NSRCLKLYCECFA+G YCD  C CV+C NN  ++ ARQ+AV ATLERNP AFR
Sbjct: 137 KKTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEARQQAVDATLERNPAAFR 196

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           PKI S+          A E+   A+H+KGCHC+KS C K+YCEC+QA I C+ +C+C+DC
Sbjct: 197 PKINSA----------ATESW--ARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDC 244

Query: 272 KN 273
           KN
Sbjct: 245 KN 246



 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNN 188
           RP+ N+   +   +  K C+CR S C K YCEC+ AGI C   C CV+C N+
Sbjct: 196 RPKINSAATESWARHSKGCHCRKSNCQKRYCECYQAGIQCTAHCKCVDCKNH 247



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 28/43 (65%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+CK S CLK YCECF +   C  +C+C+ C N E ++E R
Sbjct: 138 KTCNCKNSRCLKLYCECFASGQYCDPSCKCVSCSNNESNQEAR 180


>gi|194883323|ref|XP_001975752.1| GG20386 [Drosophila erecta]
 gi|190658939|gb|EDV56152.1| GG20386 [Drosophila erecta]
          Length = 944

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 80/198 (40%), Positives = 108/198 (54%), Gaps = 34/198 (17%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQ-----------ESPRS 137
           QA  R   Q +P  Q+ P  V      P       LPT PPG             + P +
Sbjct: 668 QAKQRIRQQQLPTEQSAPTKVE-----PK------LPTQPPGFNPVPTKPLFDILKQPAT 716

Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQE 197
              A  D   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R+ 
Sbjct: 717 GAAATVDALGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNMDYEVERER 776

Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
           A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++
Sbjct: 777 AIRSCLDRNPSAFQPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYE 824

Query: 258 ANILCSENCRCLDCKNFE 275
           A I C+  C+C+ C+N E
Sbjct: 825 AKIPCTSICKCVGCRNME 842


>gi|391347269|ref|XP_003747887.1| PREDICTED: uncharacterized protein LOC100905647 [Metaseiulus
           occidentalis]
          Length = 609

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 106/192 (55%), Gaps = 24/192 (12%)

Query: 97  QAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRP-RANTDGKD---GTPKK 152
           QA PQ Q  PQ++ +    P   H   +        +   S P +A+ D  D        
Sbjct: 341 QASPQSQ-HPQYLVVKQTTPQGQHSFAV--------QPQNSEPAQADYDAYDKQLSHTSS 391

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNCV C+NN+EHE  R  AV   LERNPNAF P
Sbjct: 392 KKPCNCTKSQCLKLYCDCFANGEFCQNCNCVQCYNNLEHEEERSLAVKLCLERNPNAFHP 451

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G ++ R           H KGC+CK+SGCLK YCEC++A ILCS  CRC+ C 
Sbjct: 452 KIGKYKPGDKERR-----------HTKGCNCKRSGCLKNYCECYEARILCSGVCRCVGCH 500

Query: 273 NFEGSEERRALF 284
           N E + +  +L 
Sbjct: 501 NIEENMDAESLL 512


>gi|268552505|ref|XP_002634235.1| C. briggsae CBR-LIN-54 protein [Caenorhabditis briggsae]
          Length = 437

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 114/217 (52%), Gaps = 22/217 (10%)

Query: 114 RVPHPVHKLPLPTLPPGKQESPRSRP------------RANTDGKDGTPKKQKQCNCRNS 161
           R   P++  P  + P G + +   R             R  T  K   P ++K CNC  S
Sbjct: 123 RASRPMYSRPGASRPAGGELTGEQRKAFIEMKEQVNQVRMKTKKKVYAPGQRKPCNCTKS 182

Query: 162 RCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
           +CLKLYC+CFA G +C  CNC +CHNN++++  R +A+  +LERNPNAF+PKI  +  G 
Sbjct: 183 QCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAIRQSLERNPNAFKPKIGIARGGT 242

Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
            D         +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R
Sbjct: 243 AD---------IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTR 293

Query: 282 ALFHGDHN-GIAFMQRAANAAIIGAVGSSGYGTLMTS 317
               G    G   +    NA    A   SG G++MT 
Sbjct: 294 FKNSGGPGAGANALMSLTNANSSTATPDSGPGSVMTD 330


>gi|195334093|ref|XP_002033719.1| GM21473 [Drosophila sechellia]
 gi|194125689|gb|EDW47732.1| GM21473 [Drosophila sechellia]
          Length = 949

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 71/165 (43%), Positives = 99/165 (60%), Gaps = 25/165 (15%)

Query: 124 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKLYC+C
Sbjct: 695 LPTLPPGAKANVPAKPLFEVLKPAATPAAAGAVDQLGGMTSRRKHCNCSKSQCLKLYCDC 754

Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
           FA G +C  C C +C NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D R     
Sbjct: 755 FANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMRL---- 808

Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
                 HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 809 ------HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 847


>gi|198433102|ref|XP_002124866.1| PREDICTED: similar to lin-54 homolog [Ciona intestinalis]
          Length = 605

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 70/133 (52%), Positives = 87/133 (65%), Gaps = 15/133 (11%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K +K CNC  S CLKLYCECFA G +CD CNC+NCHNN+E +  R +A+ + LERNP AF
Sbjct: 375 KIRKPCNCTKSMCLKLYCECFANGHFCDSCNCINCHNNLEFDTDRSKAIKSCLERNPMAF 434

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           RPKI       +DA            H KGC+CK+SGCLK YCEC++A I C+  C+C+ 
Sbjct: 435 RPKIGR----GRDANR---------THQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIG 481

Query: 271 CKNFEGSEERRAL 283
           CKN E  E+  AL
Sbjct: 482 CKNLE--EDSVAL 492



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/54 (48%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQE 197
           G+D     QK CNC+ S CLK YCEC+ A I C   C C+ C N  E  VA +E
Sbjct: 441 GRDANRTHQKGCNCKRSGCLKNYCECYEARIPCTSKCKCIGCKNLEEDSVALKE 494


>gi|341881213|gb|EGT37148.1| CBN-LIN-54 protein [Caenorhabditis brenneri]
          Length = 407

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/179 (43%), Positives = 104/179 (58%), Gaps = 11/179 (6%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
           R  T  K   P ++K CNC  S+CLKLYC+CFA G +C  CNC +CHNN++++  R +A+
Sbjct: 134 RMKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 193

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
             +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC++A 
Sbjct: 194 RQSLERNPNAFKPKIGIA-RGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 243

Query: 260 ILCSENCRCLDCKNFEGSEERRALFHG-DHNGIAFMQRAANAAIIGAVGSSGYGTLMTS 317
           + C++ C+C  C+N E     R    G    G   +    NAA   A   SG G++MT 
Sbjct: 244 VPCTDRCKCKGCQNTETYRMTRFKNSGVSVPGTNALISLTNAASSSATPDSGPGSVMTD 302


>gi|308492241|ref|XP_003108311.1| CRE-LIN-54 protein [Caenorhabditis remanei]
 gi|308249159|gb|EFO93111.1| CRE-LIN-54 protein [Caenorhabditis remanei]
          Length = 433

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/252 (34%), Positives = 126/252 (50%), Gaps = 21/252 (8%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
           R  T  K   P ++K CNC  S+CLKLYC+CFA G +C  CNC +CHNN++++  R +A+
Sbjct: 159 RLKTKKKVYAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIDYDSQRSKAI 218

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
             +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC++A 
Sbjct: 219 RQSLERNPNAFKPKIGIARGGTAD---------IERLHQKGCHCKKSGCLKNYCECYEAK 269

Query: 260 ILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNK 319
           + C++ C+C  C+N E     R    G       +    NA+   A   SG G++MT   
Sbjct: 270 VPCTDRCKCKCCQNTEAYRMTRFKNSGSLPSTNALMSLTNASST-ATPDSGPGSVMTDE- 327

Query: 320 RKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRS 379
                     A  D   +  PQ  +             +  V + A   ++   + AL +
Sbjct: 328 ----------AGDDYDDMMLPQKPKAEMDPRRFPWYYMTDEVVEAATMCMVAQAEEALNA 377

Query: 380 PLADILQPQDMK 391
             ++ +Q +D K
Sbjct: 378 SFSEKMQTEDDK 389


>gi|17541142|ref|NP_502544.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
 gi|75019534|sp|Q95QD7.1|LIN54_CAEEL RecName: Full=Protein lin-54; AltName: Full=Abnormal cell lineage
           protein 54
 gi|6425180|emb|CAB05229.1| Protein LIN-54, isoform a [Caenorhabditis elegans]
          Length = 435

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 109/195 (55%), Gaps = 17/195 (8%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
           S  R  T  K   P ++K CNC  S+CLKLYC+CFA G +C  CNC +CHNN+E++  R 
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 217

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
           +A+  +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC+
Sbjct: 218 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 268

Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHG----DHNGIAFMQRAANAAIIGAVGSSGYG 312
           +A + C++ C+C  C+N E     R    G    + N +  +  A++     A   SG G
Sbjct: 269 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASST----ATPDSGPG 324

Query: 313 TLMTSNKRKSEELLF 327
           +++T       E + 
Sbjct: 325 SVVTDEHGDDYEDML 339


>gi|116831214|gb|ABK28561.1| unknown [Arabidopsis thaliana]
          Length = 369

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)

Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
             C+CV+CHNN E   AR+ A+   L RNPNAF  K   S    Q  A  D +   L+  
Sbjct: 90  TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
             +GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN   SE  +      HN     +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
                                  R+  EL         LF +   D S+          N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229

Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
              S + CS  +P  +    S++G T + L           D+ +LCSLLV  S  AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277


>gi|79403647|ref|NP_188237.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75136698|sp|Q700D0.1|TCX8_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 8; Short=AtTCX8
 gi|45935045|gb|AAS79557.1| tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|46367484|emb|CAG25868.1| hypothetical protein [Arabidopsis thaliana]
 gi|332642257|gb|AEE75778.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 368

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)

Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
             C+CV+CHNN E   AR+ A+   L RNPNAF  K   S    Q  A  D +   L+  
Sbjct: 90  TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
             +GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN   SE  +      HN     +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
                                  R+  EL         LF +   D S+          N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229

Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
              S + CS  +P  +    S++G T + L           D+ +LCSLLV  S  AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277


>gi|17541144|ref|NP_502545.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
 gi|3924783|emb|CAB05228.1| Protein LIN-54, isoform b [Caenorhabditis elegans]
          Length = 429

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 109/195 (55%), Gaps = 17/195 (8%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
           S  R  T  K   P ++K CNC  S+CLKLYC+CFA G +C  CNC +CHNN+E++  R 
Sbjct: 152 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 211

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
           +A+  +LERNPNAF+PKI  +  G  D         +   H KGCHCKKSGCLK YCEC+
Sbjct: 212 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 262

Query: 257 QANILCSENCRCLDCKNFEGSEERRALFHG----DHNGIAFMQRAANAAIIGAVGSSGYG 312
           +A + C++ C+C  C+N E     R    G    + N +  +  A++     A   SG G
Sbjct: 263 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASST----ATPDSGPG 318

Query: 313 TLMTSNKRKSEELLF 327
           +++T       E + 
Sbjct: 319 SVVTDEHGDDYEDML 333


>gi|195583082|ref|XP_002081353.1| GD10970 [Drosophila simulans]
 gi|194193362|gb|EDX06938.1| GD10970 [Drosophila simulans]
          Length = 950

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 71/165 (43%), Positives = 98/165 (59%), Gaps = 25/165 (15%)

Query: 124 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
           LPTLPPG +    ++P                 D   G   ++K CNC  S+CLKLYC+C
Sbjct: 696 LPTLPPGVKADVPAKPLFEVLKPAATPAAAGAVDPLGGMTSRRKHCNCSKSQCLKLYCDC 755

Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
           FA G +C  C C +C NN+++EV R+ A+ + L+RNP+AF+PKI +   G  D R     
Sbjct: 756 FANGEFCQDCTCKDCCNNLDYEVERERAIRSCLDRNPSAFKPKITAPNSG--DMR----- 808

Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
                 HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C+N E
Sbjct: 809 -----LHNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGCRNME 848


>gi|146182846|ref|XP_001025406.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146143702|gb|EAS05161.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 546

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 66/120 (55%), Positives = 86/120 (71%), Gaps = 4/120 (3%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           K+CNC+ S+C+KLYCEC+AAG+YC+GC+CV+C NN + E  R +A+  TL +NP AF PK
Sbjct: 69  KKCNCKQSQCIKLYCECYAAGVYCNGCHCVSCKNNDQSEGWRNQAIQNTLVKNPTAFLPK 128

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           I+     A    +         KHNKGC CK+SGC+KKYCECFQA + CSENC+C DC+N
Sbjct: 129 ISQKQETASKVVDPLN----LGKHNKGCACKRSGCMKKYCECFQAGVPCSENCKCTDCQN 184



 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+CK+S C+K YCEC+ A + C+  C C+ CKN + SE  R
Sbjct: 69  KKCNCKQSQCIKLYCECYAAGVYCN-GCHCVSCKNNDQSEGWR 110



 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C C+ S C+K YCECF AG+ C + C C +C N
Sbjct: 147 KHNKGCACKRSGCMKKYCECFQAGVPCSENCKCTDCQN 184


>gi|198458221|ref|XP_001360959.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
 gi|198136263|gb|EAL25535.2| GA19331 [Drosophila pseudoobscura pseudoobscura]
          Length = 978

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 67/143 (46%), Positives = 92/143 (64%), Gaps = 12/143 (8%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           +SP+      TD   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++E
Sbjct: 745 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 804

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
           V R+ A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK Y
Sbjct: 805 VERERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNY 852

Query: 253 CECFQANILCSENCRCLDCKNFE 275
           CEC++A I CS  C+C+ C+N E
Sbjct: 853 CECYEAKIPCSSICKCVGCRNME 875


>gi|9294463|dbj|BAB02682.1| unnamed protein product [Arabidopsis thaliana]
          Length = 356

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 146/300 (48%), Gaps = 66/300 (22%)

Query: 118 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC 177
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKLYC+CFA+G+ C
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVC 89

Query: 178 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAK 236
             C+CV+CHNN E   AR+ A+   L RNPNAF  K   S    Q  A  D +   L+  
Sbjct: 90  TDCDCVDCHNNSEKCDAREAAMVNVLGRNPNAFSEKALGSLTDNQCKAAPDTKPGLLS-- 147

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQR 296
             +GC CK++ CLKKYCECFQAN+LCS+NC+C++CKN   SE  +      HN     +R
Sbjct: 148 --RGCKCKRTRCLKKYCECFQANLLCSDNCKCINCKNV--SEAFQPPAFSAHNSPQVYRR 203

Query: 297 AANAAIIGAVGSSGYGTLMTSNKRKSEEL---------LFGVAAKDQSVIRNPQSQQGNN 347
                                  R+  EL         LF +   D S+          N
Sbjct: 204 -----------------------RRDRELTEWNSCPAPLFSIP--DNSI---------QN 229

Query: 348 VQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKT 407
              S + CS  +P  +    S++G T + L           D+ +LCSLLV  S  AT T
Sbjct: 230 ALGSPMSCSPKLPYRK--KRSLMGYTSTLL----------PDLGDLCSLLVAASESATTT 277


>gi|359476108|ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera]
          Length = 1001

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 83/196 (42%), Positives = 109/196 (55%), Gaps = 20/196 (10%)

Query: 116 PHPVHKLP----------LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLK 165
           P  +H++P          + +L    Q SP+ R +  +   DG     K+CNC+ S+CLK
Sbjct: 521 PIELHEIPCDKRKFNSEHMDSLEEFNQPSPKKRRKKASSTNDG--DGCKRCNCKKSKCLK 578

Query: 166 LYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-D 223
           LYC+CFAAGIYC +GC CV C N  E++    E       RNP AF PKI    +G+  +
Sbjct: 579 LYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPIN 638

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           + ED R    +A+H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  CKN  G +E    
Sbjct: 639 SGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEEYGA 698

Query: 284 FHGDHNGIAFMQRAAN 299
           F         M R AN
Sbjct: 699 FK------EMMSRRAN 708


>gi|326680100|ref|XP_002666845.2| PREDICTED: protein lin-54 homolog [Danio rerio]
          Length = 460

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 71/165 (43%), Positives = 95/165 (57%), Gaps = 11/165 (6%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K ++ CNC  S+CLKLYCECFA G +C  C C NC NN EH   R +AV A L+RNP AF
Sbjct: 262 KSKRPCNCTKSQCLKLYCECFANGEFCSSCKCTNCFNNTEHVFERSQAVKACLDRNPGAF 321

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           RPKI S   G            +   H KGC+CK+SGCLK YCEC++A I+C+  C+C+ 
Sbjct: 322 RPKIGSRKQG-----------NVKGCHTKGCNCKRSGCLKNYCECYEAKIMCTSTCKCVG 370

Query: 271 CKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLM 315
           C+N++   + +     D + I F  R   + I   V  +  G L+
Sbjct: 371 CRNYDEGCDMKKSSEQDRHDIYFPTRCPVSVITRDVVEATCGCLL 415


>gi|158289963|ref|XP_559078.3| AGAP010380-PA [Anopheles gambiae str. PEST]
 gi|157018411|gb|EAL41036.3| AGAP010380-PA [Anopheles gambiae str. PEST]
          Length = 725

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 96/156 (61%), Gaps = 15/156 (9%)

Query: 133 ESPRSRPRANTDGKDGTPK---KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           ++ +  PRA T    G P    K++ CNC  S+CLKLYC+CFA G YC  CNC +C N  
Sbjct: 432 DTSKDSPRALTTTVYGFPDEAYKKRPCNCTKSQCLKLYCDCFANGEYCYNCNCKDCFNTF 491

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           +H+  RQ+A+ +TLERNPNAF+PKI     G+  + +D         H KGC+CK+SGCL
Sbjct: 492 DHDNERQKAIRSTLERNPNAFKPKI-----GSIGSTDDGTRL-----HTKGCNCKRSGCL 541

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           K YCEC++  I CS NC+C+   +F   +    LFH
Sbjct: 542 KNYCECYEGKIACSSNCKCVGTPSFHLLQSH--LFH 575


>gi|195124896|ref|XP_002006919.1| GI21332 [Drosophila mojavensis]
 gi|193911987|gb|EDW10854.1| GI21332 [Drosophila mojavensis]
          Length = 879

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 64/124 (51%), Positives = 86/124 (69%), Gaps = 12/124 (9%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R+EA+ + LERNP+AF+
Sbjct: 663 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKREEAIRSCLERNPSAFK 722

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C
Sbjct: 723 PKITAPNSG--DIRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSTMCKCVGC 770

Query: 272 KNFE 275
           +N E
Sbjct: 771 RNME 774


>gi|218198100|gb|EEC80527.1| hypothetical protein OsI_22807 [Oryza sativa Indica Group]
          Length = 426

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 101/290 (34%), Positives = 142/290 (48%), Gaps = 43/290 (14%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCN 181
            P+  +PP      +S P         + K++K CNC+NS+CLKL               
Sbjct: 58  WPVAFIPPKPVAEIKSTP---------STKRKKHCNCKNSQCLKL--------------- 93

Query: 182 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 241
                         QEA+           + KI +  +     ++DA       KHNKGC
Sbjct: 94  --------------QEAITVRSNGIQKPSKLKIENGSNALNLRKDDAGAPASLPKHNKGC 139

Query: 242 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI-AFMQRAANA 300
           HCKKSGCLKKYCECFQANILCS+NC+C DCKNFEGSEE R +  GD++     +Q AAN 
Sbjct: 140 HCKKSGCLKKYCECFQANILCSKNCKCQDCKNFEGSEELRLITQGDNSSDRNNIQHAANV 199

Query: 301 AIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVP 360
           A+ GA+GSSGY       KR  E+ L+     + S ++  Q Q+ N+V +S +  S  + 
Sbjct: 200 ALNGAIGSSGYRYSPVRRKRPPEDHLYQKLNGEGSTMQT-QFQEANHVDSSEITSSTGL- 257

Query: 361 VSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEATKTLTS 410
             +   ++    +    RS LA+ + P D   L   LV+V  +A     +
Sbjct: 258 --EGCYSNYQCRSNVVYRSALANTISPTDATGLAKHLVIVCRKAADAFLT 305


>gi|165972329|ref|NP_766302.2| protein lin-54 homolog [Mus musculus]
 gi|169234729|ref|NP_001108482.1| protein lin-54 homolog [Mus musculus]
 gi|190359887|sp|Q571G4.2|LIN54_MOUSE RecName: Full=Protein lin-54 homolog
 gi|74141051|dbj|BAE22096.1| unnamed protein product [Mus musculus]
 gi|148688344|gb|EDL20291.1| expressed sequence AI461788, isoform CRA_a [Mus musculus]
          Length = 749

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RN
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 576 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624

Query: 267 RCLDCKNFEGSEERRALFH 285
           +C+ CKNFE S ER+ L H
Sbjct: 625 KCIGCKNFEESPERKTLMH 643


>gi|195484960|ref|XP_002090893.1| GE12548 [Drosophila yakuba]
 gi|194176994|gb|EDW90605.1| GE12548 [Drosophila yakuba]
          Length = 950

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 110/203 (54%), Gaps = 35/203 (17%)

Query: 85  QPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANT- 143
           Q   QA  R   Q +P  Q  P  V      P       LPTLP G + +  ++P  +  
Sbjct: 669 QLQAQAKQRIRQQQLPTEQNAPTKVE-----PK------LPTLPTGLKANAPAKPLFDIL 717

Query: 144 -----------DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
                      D   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++E
Sbjct: 718 KPPATGAAAAVDALGGMTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 777

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
           V R+ A+ + L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK Y
Sbjct: 778 VERERAIRSCLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNY 825

Query: 253 CECFQANILCSENCRCLDCKNFE 275
           CEC++A I C+  C+C+ C+N E
Sbjct: 826 CECYEAKIPCTSICKCVGCRNME 848


>gi|297834488|ref|XP_002885126.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330966|gb|EFH61385.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 76/176 (43%), Positives = 105/176 (59%), Gaps = 18/176 (10%)

Query: 124 LPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCV 183
           +P  PP  +E   ++   + +G       +K+C C+ S+CLKLYC+CFA+G+ C  C+CV
Sbjct: 38  IPPFPPISREHSETKDTTDQEGITW----RKRCRCKQSKCLKLYCDCFASGVLCTDCDCV 93

Query: 184 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCH 242
           +CHNN ++  AR  AV   L RNPNAF  K+ SS +  Q  A  D R   L+    +GC 
Sbjct: 94  DCHNNSDNYDARDAAVVNVLGRNPNAFNEKLFSSINDKQCKAAPDTRPGLLS----RGCK 149

Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKN---------FEGSEERRALFHGDHN 289
           CK++ CLKKYCECFQAN+LCS+NC+C++CKN         F      R L+ G  N
Sbjct: 150 CKRTKCLKKYCECFQANVLCSDNCKCINCKNVSEAFQPSVFAWGLNSRKLYEGFDN 205


>gi|301094740|ref|XP_002896474.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109449|gb|EEY67501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 72/153 (47%), Positives = 89/153 (58%), Gaps = 31/153 (20%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K+  CNC+ S+CLKLYCECFA+G YCD  CNC++C N    E  RQ+A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVRQQAIASRLEKNPNAF 619

Query: 211 RPKIASSPH-------GAQDAREDAR---------------------EAQLAAK--HNKG 240
           +PKI ++         GA+    D R                      A +A    H  G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMATTKMHKHG 679

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVSCGENCRCIDCKN 712



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 28/41 (68%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+CKKS CLK YCECF +   C E+C CLDC N   +E+ R
Sbjct: 564 CNCKKSKCLKLYCECFASGGYCDESCNCLDCANTTAAEDVR 604


>gi|60359862|dbj|BAD90150.1| mKIAA2037 protein [Mus musculus]
          Length = 793

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RN
Sbjct: 560 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 619

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 620 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 668

Query: 267 RCLDCKNFEGSEERRALFH 285
           +C+ CKNFE S ER+ L H
Sbjct: 669 KCIGCKNFEESPERKTLMH 687


>gi|428168842|gb|EKX37782.1| hypothetical protein GUITHDRAFT_52378, partial [Guillardia theta
           CCMP2712]
          Length = 118

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 77/124 (62%), Positives = 92/124 (74%), Gaps = 9/124 (7%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K+KQCNC+NS+CLKLYCECFA+G YC+  CNC  C NN  +E  R+ A+ ATLERNP AF
Sbjct: 2   KRKQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAF 61

Query: 211 RPKIASS-PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
           +PKIASS P G +D+    R       H KGCHCKKSGCLKKYCECFQA +LCS  C+C 
Sbjct: 62  QPKIASSAPPGIEDSPVQGR-------HTKGCHCKKSGCLKKYCECFQAGVLCSAQCKCH 114

Query: 270 DCKN 273
           +C+N
Sbjct: 115 ECRN 118



 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 3/66 (4%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE-ERRALFHG--DHNGIAFMQ 295
           K C+CK S CLK YCECF +   C+ +C C  C+N E  E +R++      + N +AF  
Sbjct: 4   KQCNCKNSKCLKLYCECFASGSYCNLSCNCQLCQNNENYEKQRKSAIDATLERNPLAFQP 63

Query: 296 RAANAA 301
           + A++A
Sbjct: 64  KIASSA 69


>gi|390332563|ref|XP_792482.3| PREDICTED: protein lin-54 homolog [Strongylocentrotus purpuratus]
          Length = 705

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 78/142 (54%), Positives = 96/142 (67%), Gaps = 13/142 (9%)

Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATL 203
           DG    P+K   CNC  S+CLKLYC+CFA G +C  CNC NC NN++HE  R +AV A L
Sbjct: 475 DGSGSRPRKP--CNCTKSQCLKLYCDCFANGEFCRNCNCNNCLNNLDHEDERTKAVKACL 532

Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
           +RNP+AF PKI    HG+Q  R          +HNKGC+CK+SGCLK YCEC++A ILCS
Sbjct: 533 DRNPHAFHPKIGKG-HGSQTNR----------RHNKGCNCKRSGCLKNYCECYEAKILCS 581

Query: 264 ENCRCLDCKNFEGSEERRALFH 285
             C+C+ CKNFE S ER+ L H
Sbjct: 582 NFCKCVGCKNFEESPERKTLMH 603


>gi|194376810|dbj|BAG57551.1| unnamed protein product [Homo sapiens]
          Length = 709

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 580 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 628

Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
           CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 629 CKNFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|410903926|ref|XP_003965444.1| PREDICTED: protein lin-54 homolog [Takifugu rubripes]
          Length = 807

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/161 (47%), Positives = 100/161 (62%), Gaps = 12/161 (7%)

Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEA 198
           P   T   +  P+ +K CNC  S+CLKLYC+CFA G +C+ CNC NC NN+EHE  R +A
Sbjct: 566 PLNGTSTTETAPRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKA 625

Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
           +   L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A
Sbjct: 626 IKTCLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 674

Query: 259 NILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
            I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 675 KIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 715


>gi|325183843|emb|CCA18301.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183968|emb|CCA18426.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 983

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 71/152 (46%), Positives = 89/152 (58%), Gaps = 24/152 (15%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K+  CNC+ S+CLKLYCECFA G YCD  CNC++C N +E    RQ AV + LE+NPNAF
Sbjct: 583 KKAPCNCKKSKCLKLYCECFANGGYCDENCNCLDCSNTLERIDERQAAVASRLEKNPNAF 642

Query: 211 RPKIASSPHGAQDAREDAREAQLAAK-----------------------HNKGCHCKKSG 247
           +PKI  S   A  +  +A  +  +                         H  GCHCKKS 
Sbjct: 643 KPKILPSQSSAVKSLYEALSSTKSTSSGVFPMRRSTADGHRTPTGKKRMHKDGCHCKKSA 702

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           C KKYCECFQA +LC +NCRC+DC+N E  +E
Sbjct: 703 CQKKYCECFQAGVLCGDNCRCIDCRNVEKKKE 734


>gi|348671359|gb|EGZ11180.1| hypothetical protein PHYSODRAFT_520155 [Phytophthora sojae]
          Length = 999

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 78/186 (41%), Positives = 99/186 (53%), Gaps = 44/186 (23%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K+  CNC+ S+CLKLYCECFA+G YCD  CNC++C N    E  RQ+A+ A LE+NPNAF
Sbjct: 563 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCSNTPATEEVRQQAIAARLEKNPNAF 622

Query: 211 RPKIASSPH-----------GAQDAREDAREAQLAAK---------------------HN 238
           +PKI ++P            GA      +    L+                       H 
Sbjct: 623 KPKIGATPGMTPGSARRLSVGASGGSHASPGTFLSPSGRLNLQQQHQLLSAGLLSTKMHK 682

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
            GCHCKKS C KKYCECFQA + C ENCRC+DCKN       +A      NG+A     A
Sbjct: 683 HGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN-------QAPCVAHANGVA----TA 731

Query: 299 NAAIIG 304
            +A++G
Sbjct: 732 GSAVVG 737


>gi|324510685|gb|ADY44467.1| Protein lin-54 [Ascaris suum]
          Length = 448

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 67/134 (50%), Positives = 87/134 (64%), Gaps = 11/134 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C  CNC +CHNN+++E  R  A+ A+LERNPNAF+P
Sbjct: 221 KKPCNCTKSMCLKLYCDCFANGEFCKDCNCKDCHNNLDNEAERSRAIKASLERNPNAFKP 280

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI  +  G  D+            H KGCHCKKS CLK YCEC++A + C+E C+C  C+
Sbjct: 281 KIGVASRGRVDSER---------LHQKGCHCKKSNCLKNYCECYEAKVPCTERCKCSSCR 331

Query: 273 NFEGSEERRALFHG 286
           N E   +R + F G
Sbjct: 332 NTES--DRASKFRG 343


>gi|195400539|ref|XP_002058874.1| GJ19678 [Drosophila virilis]
 gi|194156225|gb|EDW71409.1| GJ19678 [Drosophila virilis]
          Length = 980

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 63/124 (50%), Positives = 85/124 (68%), Gaps = 12/124 (9%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R+ A+ + LERNP+AF+
Sbjct: 765 RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAIRSCLERNPSAFK 824

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I CS  C+C+ C
Sbjct: 825 PKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSAMCKCVGC 872

Query: 272 KNFE 275
           +N E
Sbjct: 873 RNME 876


>gi|301089591|ref|XP_002895080.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102431|gb|EEY60483.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 972

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 72/153 (47%), Positives = 89/153 (58%), Gaps = 31/153 (20%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K+  CNC+ S+CLKLYCECFA+G YCD  CNC++C N    E  RQ+A+ + LE+NPNAF
Sbjct: 560 KKAPCNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVRQQAIASRLEKNPNAF 619

Query: 211 RPKIASSPH-------GAQDAREDAR---------------------EAQLA--AKHNKG 240
           +PKI ++         GA+    D R                      A +A    H  G
Sbjct: 620 KPKIGATSTMVTVTSGGARRLSADRRASTGTFLSPPGQLSLQQRQLLSAGMANTKMHKHG 679

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           CHCKKS C KKYCECFQA + C ENCRC+DCKN
Sbjct: 680 CHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 28/41 (68%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+CKKS CLK YCECF +   C E+C CLDC N   +E+ R
Sbjct: 564 CNCKKSKCLKLYCECFASGGYCDESCNCLDCTNTTAAEDVR 604



 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           G   T   +  C+C+ S C K YCECF AG+ C + C C++C N
Sbjct: 669 GMANTKMHKHGCHCKKSACQKKYCECFQAGVPCGENCRCIDCKN 712


>gi|34365308|emb|CAE45981.1| hypothetical protein [Homo sapiens]
          Length = 749

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|190689471|gb|ACE86510.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|195431948|ref|XP_002063989.1| GK15963 [Drosophila willistoni]
 gi|194160074|gb|EDW74975.1| GK15963 [Drosophila willistoni]
          Length = 587

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 110/188 (58%), Gaps = 17/188 (9%)

Query: 89  QAHMRSPSQAMPQWQ-ARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKD 147
           Q H++   +A PQ + ++      V  VP P +K     L P    S  S P    D + 
Sbjct: 304 QQHLQQLQKAKPQLKGSQLNQSANVKPVPAPANKPNFEILKP--PSSAASTPAV--DAQL 359

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNP 207
               ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R++A+ + L+RNP
Sbjct: 360 NLASRRKHCNCSKSQCLKLYCDCFANGEFCKDCTCKDCFNNLDNEVEREKAIRSCLDRNP 419

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
           +AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I CS  C+
Sbjct: 420 SAFKPKITAPASG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCSSMCK 467

Query: 268 CLDCKNFE 275
           C+ C+N E
Sbjct: 468 CVGCRNME 475


>gi|149701464|ref|XP_001494323.1| PREDICTED: protein lin-54 homolog isoform 2 [Equus caballus]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|402869363|ref|XP_003898732.1| PREDICTED: protein lin-54 homolog isoform 2 [Papio anubis]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|291401518|ref|XP_002717112.1| PREDICTED: lin-54 homolog [Oryctolagus cuniculus]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|426344811|ref|XP_004038946.1| PREDICTED: protein lin-54 homolog isoform 1 [Gorilla gorilla
           gorilla]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|344284817|ref|XP_003414161.1| PREDICTED: protein lin-54 homolog isoform 1 [Loxodonta africana]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 579

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 580 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 628

Query: 271 CKNFEGSEERRALFH 285
           CKNFE S ER+ L H
Sbjct: 629 CKNFEESPERKTLMH 643


>gi|329664578|ref|NP_001192410.1| protein lin-54 homolog [Bos taurus]
 gi|296486396|tpg|DAA28509.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
 gi|440898648|gb|ELR50096.1| Protein lin-54-like protein [Bos grunniens mutus]
          Length = 748

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 521 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 580

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 581 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 629

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 630 NFEESPERKTLMH 642


>gi|169234719|ref|NP_919258.2| protein lin-54 homolog isoform a [Homo sapiens]
 gi|114593921|ref|XP_517185.2| PREDICTED: protein lin-54 homolog isoform 3 [Pan troglodytes]
 gi|397524648|ref|XP_003832301.1| PREDICTED: protein lin-54 homolog isoform 1 [Pan paniscus]
 gi|313104222|sp|Q6MZP7.3|LIN54_HUMAN RecName: Full=Protein lin-54 homolog; AltName: Full=CXC
           domain-containing protein 1
 gi|119626322|gb|EAX05917.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|119626324|gb|EAX05919.1| hypothetical protein DKFZp686L1814, isoform CRA_a [Homo sapiens]
 gi|355687283|gb|EHH25867.1| CXC domain-containing protein 1 [Macaca mulatta]
 gi|355762609|gb|EHH62025.1| CXC domain-containing protein 1 [Macaca fascicularis]
 gi|383415851|gb|AFH31139.1| protein lin-54 homolog isoform a [Macaca mulatta]
 gi|410222154|gb|JAA08296.1| lin-54 homolog [Pan troglodytes]
 gi|410249630|gb|JAA12782.1| lin-54 homolog [Pan troglodytes]
 gi|410296420|gb|JAA26810.1| lin-54 homolog [Pan troglodytes]
 gi|410329877|gb|JAA33885.1| lin-54 homolog [Pan troglodytes]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|165972337|ref|NP_001094034.1| protein lin-54 homolog [Rattus norvegicus]
 gi|190359881|sp|Q641Z1.2|LIN54_RAT RecName: Full=Protein lin-54 homolog
 gi|149046795|gb|EDL99569.1| similar to hypothetical protein, isoform CRA_b [Rattus norvegicus]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|332233371|ref|XP_003265875.1| PREDICTED: protein lin-54 homolog isoform 1 [Nomascus leucogenys]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|426231958|ref|XP_004010003.1| PREDICTED: protein lin-54 homolog isoform 1 [Ovis aries]
          Length = 748

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 521 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 580

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 581 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 629

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 630 NFEESPERKTLMH 642


>gi|118404776|ref|NP_001072590.1| protein lin-54 homolog [Xenopus (Silurana) tropicalis]
 gi|123905701|sp|Q0IHV2.1|LIN54_XENTR RecName: Full=Protein lin-54 homolog
 gi|114107615|gb|AAI22958.1| lin-54 homolog [Xenopus (Silurana) tropicalis]
          Length = 741

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 101/153 (66%), Gaps = 14/153 (9%)

Query: 136 RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           +++PR   +G    +   + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHE 553

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
             RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK Y
Sbjct: 554 NERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNY 602

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           CEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 603 CECYEAKIMCSSICKCIGCKNFEESPERKTLMH 635


>gi|452819860|gb|EME26911.1| transcription factor [Galdieria sulphuraria]
          Length = 284

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 67/127 (52%), Positives = 91/127 (71%), Gaps = 11/127 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C  S+CLK+YC+CF +G+YC   C CV C+NN +H+V R+ A+ + LE+NP+AF+P
Sbjct: 13  KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVEREAAIKSVLEKNPDAFQP 72

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+A+     +D+        L  KHNKGC+C+K+ CLKKYCECFQ+ +LCSE C+C  CK
Sbjct: 73  KVAT----FRDS------GGLIVKHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCK 122

Query: 273 NFEGSEE 279
           NFEGS E
Sbjct: 123 NFEGSLE 129



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           KGC C KS CLK YC+CF + + C  +C C+ C N E  +  R
Sbjct: 13  KGCKCGKSKCLKMYCDCFNSGVYCGRHCICVGCYNNEQHQVER 55



 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K CNCR + CLK YCECF +G+ C + C C  C N
Sbjct: 86  KHNKGCNCRKTACLKKYCECFQSGVLCSELCKCSGCKN 123


>gi|354499573|ref|XP_003511883.1| PREDICTED: protein lin-54 homolog [Cricetulus griseus]
 gi|344242817|gb|EGV98920.1| Protein lin-54-like [Cricetulus griseus]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|190690833|gb|ACE87191.1| lin-54 homolog (C. elegans) protein [synthetic construct]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 96/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQRAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|403263377|ref|XP_003924012.1| PREDICTED: protein lin-54 homolog isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|296082007|emb|CBI21012.3| unnamed protein product [Vitis vinifera]
          Length = 1094

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/161 (47%), Positives = 97/161 (60%), Gaps = 12/161 (7%)

Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
           ++T+  DG     K+CNC+ S+CLKLYC+CFAAGIYC +GC CV C N  E++    E  
Sbjct: 565 SSTNDGDGC----KRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETR 620

Query: 200 GATLERNPNAFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
                RNP AF PKI    +G+  ++ ED R    +A+H +GC+CKKS CLKKYCEC+QA
Sbjct: 621 KQIESRNPLAFAPKIVPPVNGSPINSGEDGRSTPSSARHKRGCNCKKSMCLKKYCECYQA 680

Query: 259 NILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAAN 299
           N+ CS  CRC  CKN  G +E    F         M R AN
Sbjct: 681 NVGCSAGCRCEGCKNVYGRKEEYGAFK------EMMSRRAN 715



 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 44/106 (41%), Gaps = 21/106 (19%)

Query: 196 QEAVGATLERNPN----AFRPKIASSPHGAQDAREDAREAQL------------AAKHNK 239
           + + G    RN N    A     + S HG Q  +    +A L            A+  N 
Sbjct: 510 ENSAGTEDRRNENKASIAMSSATSQSSHGIQPTKPKKEKASLSFHVISHNFDFSASSTND 569

Query: 240 GCHCKKSGC-----LKKYCECFQANILCSENCRCLDCKNFEGSEER 280
           G  CK+  C     LK YC+CF A I C+E C C+ C N    ++R
Sbjct: 570 GDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDR 615


>gi|301753331|ref|XP_002912513.1| PREDICTED: protein lin-54 homolog [Ailuropoda melanoleuca]
 gi|281346804|gb|EFB22388.1| hypothetical protein PANDA_000261 [Ailuropoda melanoleuca]
          Length = 749

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 631 NFEESPERKTLMHLADAAEVRVQQQTA 657


>gi|359323632|ref|XP_003640147.1| PREDICTED: protein lin-54 homolog isoform 2 [Canis lupus
           familiaris]
          Length = 749

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|410957317|ref|XP_003985276.1| PREDICTED: protein lin-54 homolog isoform 1 [Felis catus]
          Length = 749

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|431916156|gb|ELK16408.1| Protein lin-54 like protein [Pteropus alecto]
          Length = 827

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 600 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 659

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 660 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 708

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 709 NFEESPERKTLMH 721


>gi|297292592|ref|XP_001083154.2| PREDICTED: protein lin-54 homolog, partial [Macaca mulatta]
          Length = 813

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 584 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 643

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 644 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 692

Query: 271 CKNFEGSEERRALFH 285
           CKNFE S ER+ L H
Sbjct: 693 CKNFEESPERKTLMH 707


>gi|145539662|ref|XP_001455521.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423329|emb|CAK88124.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 147/298 (49%), Gaps = 50/298 (16%)

Query: 7   ISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKS------QPQ 60
           I D +P+++ ++L+FT C  S N    + S+ S +S+ Q +SPL  Q  K        P 
Sbjct: 41  IKDQSPEQVCKKLEFTPCLKSPN----QYSIFSINSRFQPYSPLNHQTTKKANIFVISPA 96

Query: 61  VQSKVASPSQSQPLVRLQ-----LLPP-----------SSQPHVQAHMRSPSQAMPQWQA 104
            Q         Q LV +      L  P           S    + +  RSP ++  +   
Sbjct: 97  RQINFDHTKTEQRLVNISSPLSSLKVPDDLLCEEEEILSDDQDIHSQFRSPPKSPHRKLE 156

Query: 105 RPQHVRMVNRVPHPVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTPKKQKQ-----CNC 158
           R   ++ +N V          TL  P K  S +   R  T  K   P  +++     CNC
Sbjct: 157 RATKLKFINFV----------TLKKPLKFTSDQVNNRKRTSSKRRKPLTEQKPKIIVCNC 206

Query: 159 RNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASS 217
           + S+CLKLYC+CFAAG+ C   CNC +CHNN +H   R+  +   +ERNP AFRPK+ S 
Sbjct: 207 KKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKERENIIQQIMERNPQAFRPKVDSR 266

Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
            +      ED  E     +H KGC+CKKS CLKKYCEC+Q  + CSE C+C DCKN E
Sbjct: 267 SNS-----ED--EIDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDDCKNCE 317



 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+CKKS CLK YC+CF A + C ++C C  C N +   + R
Sbjct: 204 CNCKKSKCLKLYCDCFAAGVPCGKDCNCCSCHNNDDHTKER 244


>gi|410038272|ref|XP_003950370.1| PREDICTED: protein lin-54 homolog [Pan troglodytes]
 gi|426344815|ref|XP_004038948.1| PREDICTED: protein lin-54 homolog isoform 3 [Gorilla gorilla
           gorilla]
 gi|80475938|gb|AAI09279.1| LIN54 protein [Homo sapiens]
          Length = 660

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|441625230|ref|XP_004089055.1| PREDICTED: protein lin-54 homolog [Nomascus leucogenys]
          Length = 660

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|311262890|ref|XP_003129406.1| PREDICTED: protein lin-54 homolog [Sus scrofa]
          Length = 763

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 536 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 595

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 596 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 644

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 645 NFEESPERKTLMH 657


>gi|402869365|ref|XP_003898733.1| PREDICTED: protein lin-54 homolog isoform 3 [Papio anubis]
          Length = 660

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|149701467|ref|XP_001494305.1| PREDICTED: protein lin-54 homolog isoform 1 [Equus caballus]
          Length = 660

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|426231960|ref|XP_004010004.1| PREDICTED: protein lin-54 homolog isoform 2 [Ovis aries]
          Length = 659

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 432 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 491

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 492 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 540

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 541 NFEESPERKTLMH 553


>gi|403263381|ref|XP_003924014.1| PREDICTED: protein lin-54 homolog isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 660

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|345795675|ref|XP_003434061.1| PREDICTED: protein lin-54 homolog isoform 1 [Canis lupus
           familiaris]
          Length = 660

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 542 NFEESPERKTLMHLADAAEVRVQQQTA 568


>gi|145479557|ref|XP_001425801.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392873|emb|CAK58403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 150/298 (50%), Gaps = 36/298 (12%)

Query: 7   ISDFAPKKLARQLDFTICRASANGLLPEQSVKSQSSQPQSHSPLQPQPPKSQPQVQSKVA 66
           I D +P+++ ++L+FT C  S N   P   +  Q+++  +   + P    +    +++  
Sbjct: 41  IKDQSPEQVCKKLEFTPCLKSPNQFQPYSPLNRQTTKKANIFVISPARQINFDHSKTEQR 100

Query: 67  SPSQSQPLVRLQLLPP---------SSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPH 117
             + S PL  L++            S    + +  RSP ++  +   R   ++ +N V  
Sbjct: 101 LVNISSPLSSLKVPDDLLCDEEEIFSDDQEIHSQFRSPPKSPHRKLERATKLKFINFV-- 158

Query: 118 PVHKLPLPTLP-PGKQESPRSRPRANTDGKDGTPKKQKQ-----CNCRNSRCLKLYCECF 171
                   TL  P K  S +   R  T  K   P+ +++     CNC+ S+CLKLYC+CF
Sbjct: 159 --------TLKKPLKFTSDQVNNRKRTSSKRRKPQTEQKPKIIVCNCKKSKCLKLYCDCF 210

Query: 172 AAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
           AAG+ C   CNC +CHNN +H   R+  +   +ERNP AFRPK+ S  +   +  +  R 
Sbjct: 211 AAGVPCGKDCNCCSCHNNEDHVKERENIIKQIMERNPQAFRPKVESRSNSEDEIDQKPR- 269

Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFH 285
                 H KGC+CKKS CLKKYCEC+Q  + CSE C+C +CKN E     E R+ + H
Sbjct: 270 ------HFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKNCEMPVKKESRKRVKH 321


>gi|117306452|gb|AAI25581.1| Lin54 protein [Mus musculus]
          Length = 515

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RN
Sbjct: 282 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 341

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 342 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 390

Query: 267 RCLDCKNFEGSEERRALFH 285
           +C+ CKNFE S ER+ L H
Sbjct: 391 KCIGCKNFEESPERKTLMH 409


>gi|194757683|ref|XP_001961092.1| GF13699 [Drosophila ananassae]
 gi|190622390|gb|EDV37914.1| GF13699 [Drosophila ananassae]
          Length = 921

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 85/124 (68%), Gaps = 12/124 (9%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R+ A+ + L+RNP+AF+
Sbjct: 706 RRKHCNCSKSQCLKLYCDCFANGEFCQNCTCKDCFNNLDYEVERERAIRSCLDRNPSAFK 765

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I C+  C+C+ C
Sbjct: 766 PKITAPNSG--DMRL----------HNKGCNCKRSGCLKNYCECYEAKIPCTSICKCVGC 813

Query: 272 KNFE 275
           +N E
Sbjct: 814 RNME 817


>gi|31873949|emb|CAD97902.1| hypothetical protein [Homo sapiens]
          Length = 621

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 73/135 (54%), Positives = 93/135 (68%), Gaps = 11/135 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 392 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 451

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 452 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 500

Query: 271 CKNFEGSEERRALFH 285
           CKNFE S ER+ L H
Sbjct: 501 CKNFEESPERKTLMH 515


>gi|410957319|ref|XP_003985277.1| PREDICTED: protein lin-54 homolog isoform 2 [Felis catus]
          Length = 660

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 433 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 492

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 493 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 541

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 542 NFEESPERKTLMH 554


>gi|363733235|ref|XP_003641221.1| PREDICTED: protein lin-54 homolog isoform 1 [Gallus gallus]
          Length = 804

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 577 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 636

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 637 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 685

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 686 NFEESPERKTLMH 698


>gi|380799221|gb|AFE71486.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 560

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 333 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 392

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 393 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 441

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 442 NFEESPERKTLMHLADAAEVRVQQQTA 468


>gi|327272952|ref|XP_003221248.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Anolis
           carolinensis]
          Length = 749

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 522 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCANCYNNLDHENDRQKAIKACLDRNPEAFKP 581

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 582 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 630

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 631 NFEESPERKTLMH 643


>gi|432885357|ref|XP_004074681.1| PREDICTED: protein lin-54 homolog [Oryzias latipes]
          Length = 821

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 72/139 (51%), Positives = 93/139 (66%), Gaps = 11/139 (7%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S+CLKLYC+CFA G +C+ CNC NC NN+EHE  R +A+   L+RN
Sbjct: 588 ETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNMCNCNNCFNNLEHETERLKAIKTCLDRN 647

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 648 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 696

Query: 267 RCLDCKNFEGSEERRALFH 285
           +C+ CKNFE S ER+ L H
Sbjct: 697 KCIGCKNFEESPERKTLMH 715


>gi|224049388|ref|XP_002189980.1| PREDICTED: protein lin-54 homolog [Taeniopygia guttata]
          Length = 751

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 524 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 583

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 584 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 632

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 633 NFEESPERKTLMH 645


>gi|326918692|ref|XP_003205622.1| PREDICTED: protein lin-54 homolog isoform 1 [Meleagris gallopavo]
          Length = 756

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 529 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 588

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 589 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 637

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 638 NFEESPERKTLMH 650


>gi|126330910|ref|XP_001376980.1| PREDICTED: protein lin-54 homolog [Monodelphis domestica]
          Length = 756

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 91/133 (68%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 529 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKP 588

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 589 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 637

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 638 NFEESPERKTLMH 650


>gi|118090184|ref|XP_420554.2| PREDICTED: protein lin-54 homolog isoform 2 [Gallus gallus]
          Length = 752

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 525 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 584

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 585 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 633

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 634 NFEESPERKTLMH 646


>gi|169234721|ref|NP_001108479.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|169234723|ref|NP_001108480.1| protein lin-54 homolog isoform b [Homo sapiens]
 gi|197098666|ref|NP_001125469.1| protein lin-54 homolog [Pongo abelii]
 gi|332233373|ref|XP_003265876.1| PREDICTED: protein lin-54 homolog isoform 2 [Nomascus leucogenys]
 gi|332233375|ref|XP_003265877.1| PREDICTED: protein lin-54 homolog isoform 3 [Nomascus leucogenys]
 gi|332233377|ref|XP_003265878.1| PREDICTED: protein lin-54 homolog isoform 4 [Nomascus leucogenys]
 gi|397524650|ref|XP_003832302.1| PREDICTED: protein lin-54 homolog isoform 2 [Pan paniscus]
 gi|402869361|ref|XP_003898731.1| PREDICTED: protein lin-54 homolog isoform 1 [Papio anubis]
 gi|75042058|sp|Q5RBN8.1|LIN54_PONAB RecName: Full=Protein lin-54 homolog
 gi|34364698|emb|CAE45799.1| hypothetical protein [Homo sapiens]
 gi|55728144|emb|CAH90822.1| hypothetical protein [Pongo abelii]
 gi|158258975|dbj|BAF85458.1| unnamed protein product [Homo sapiens]
 gi|190689473|gb|ACE86511.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|190690835|gb|ACE87192.1| lin-54 homolog (C. elegans) protein [synthetic construct]
 gi|410249632|gb|JAA12783.1| lin-54 homolog [Pan troglodytes]
 gi|410329875|gb|JAA33884.1| lin-54 homolog [Pan troglodytes]
          Length = 528

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|449276576|gb|EMC85038.1| Protein lin-54 like protein, partial [Columba livia]
          Length = 757

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 530 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 589

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 590 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 638

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 639 NFEESPERKTLMH 651


>gi|426344813|ref|XP_004038947.1| PREDICTED: protein lin-54 homolog isoform 2 [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|348535421|ref|XP_003455199.1| PREDICTED: protein lin-54 homolog [Oreochromis niloticus]
          Length = 775

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/147 (49%), Positives = 94/147 (63%), Gaps = 11/147 (7%)

Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEA 198
           P       + T + +K CNC  S+CLKLYC+CFA G +C+ CNC NC NN+EHE  R +A
Sbjct: 534 PLNGLSAAETTSRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFNNLEHETERLKA 593

Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
           +   L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A
Sbjct: 594 IKTCLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEA 642

Query: 259 NILCSENCRCLDCKNFEGSEERRALFH 285
            I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 643 KIMCSSICKCIGCKNFEESPERKTLMH 669


>gi|426231962|ref|XP_004010005.1| PREDICTED: protein lin-54 homolog isoform 3 [Ovis aries]
          Length = 528

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|417402343|gb|JAA48022.1| Putative protein lin-54 [Desmodus rotundus]
          Length = 527

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 300 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 359

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 360 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 408

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 409 NFEESPERKTLMH 421


>gi|338723424|ref|XP_003364724.1| PREDICTED: protein lin-54 homolog [Equus caballus]
          Length = 528

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|403263379|ref|XP_003924013.1| PREDICTED: protein lin-54 homolog isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 528

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|380799223|gb|AFE71487.1| protein lin-54 homolog isoform a, partial [Macaca mulatta]
          Length = 521

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 294 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 353

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 354 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 402

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 403 NFEESPERKTLMH 415


>gi|344284819|ref|XP_003414162.1| PREDICTED: protein lin-54 homolog isoform 2 [Loxodonta africana]
          Length = 528

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF
Sbjct: 299 RPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAF 358

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 359 KPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIG 407

Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
           CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 408 CKNFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|359323634|ref|XP_003640148.1| PREDICTED: protein lin-54 homolog isoform 3 [Canis lupus
           familiaris]
 gi|410957321|ref|XP_003985278.1| PREDICTED: protein lin-54 homolog isoform 3 [Felis catus]
          Length = 528

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|340376737|ref|XP_003386888.1| PREDICTED: protein lin-54 homolog [Amphimedon queenslandica]
          Length = 408

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/161 (47%), Positives = 99/161 (61%), Gaps = 19/161 (11%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
           T+PP   +  +S+P++  +     P+K   CNC+NS+CLKLYC+CFA G +C D CNC N
Sbjct: 155 TIPPPSHQ--QSKPKSGANLASNKPRKP--CNCKNSQCLKLYCDCFANGEFCRDSCNCQN 210

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHG-AQDAREDAREAQLAAKHNKGCHC 243
           C N+ +HE  R  AV A LERNPNAF+PK+    HG   D R          +H KGC+C
Sbjct: 211 CKNSFQHEGDRSRAVKACLERNPNAFKPKVG---HGRVGDER----------RHIKGCNC 257

Query: 244 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           KKS CLK YCEC++A I CS  CRC+ C+N     E + L 
Sbjct: 258 KKSSCLKNYCECYEAKIPCSHLCRCVGCQNLADRPEGKGLM 298


>gi|115497062|ref|NP_001070035.1| protein lin-54 homolog [Danio rerio]
 gi|123905351|sp|Q08CM4.1|LIN54_DANRE RecName: Full=Protein lin-54 homolog
 gi|115313049|gb|AAI24176.1| Zgc:152921 [Danio rerio]
          Length = 771

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/149 (51%), Positives = 98/149 (65%), Gaps = 12/149 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S+CLKLYC+CFA G +C+ CNCVNC NN++HE  R +A+ A L+RNP AF
Sbjct: 543 RPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKAIKACLDRNPVAF 602

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G  D R           H+KGC+CKKSGCLK YCEC++A I+CS  C+C+ 
Sbjct: 603 KPKIGKGKEGESDRR-----------HSKGCNCKKSGCLKNYCECYEAKIMCSSICKCMG 651

Query: 271 CKNFEGSEERRALFH-GDHNGIAFMQRAA 298
           CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 652 CKNFEESPERKTLMHLADAAEVRVQQQTA 680


>gi|326918694|ref|XP_003205623.1| PREDICTED: protein lin-54 homolog isoform 2 [Meleagris gallopavo]
          Length = 667

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN++HE  RQ+A+ A L+RNP AF+P
Sbjct: 440 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLDHENDRQKAIKACLDRNPEAFKP 499

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 500 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCK 548

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 549 NFEESPERKTLMH 561


>gi|37360936|dbj|BAC98377.1| KIAA2037 protein [Homo sapiens]
          Length = 494

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 267 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 326

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 327 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 375

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 376 NFEESPERKTLMH 388


>gi|291243053|ref|XP_002741422.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Saccoglossus kowalevskii]
          Length = 946

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 77/153 (50%), Positives = 95/153 (62%), Gaps = 24/153 (15%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           E+P +RPR             K CNC  S+CLKLYC+CFA G +C  CNC NC NN+EHE
Sbjct: 495 EAPGTRPR-------------KPCNCTKSQCLKLYCDCFANGEFCSNCNCTNCFNNLEHE 541

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
             R +A+ + LERNP AF PKI     G  D R           HNKGC+CK+SGCLK Y
Sbjct: 542 SERAKAIKSCLERNPLAFHPKIGKGKEGQADRR-----------HNKGCNCKRSGCLKNY 590

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           CEC++A I+CS  C+C+ CKNFE S +R+ L H
Sbjct: 591 CECYEAKIMCSSICKCVGCKNFEESPDRKTLMH 623


>gi|26325238|dbj|BAC26373.1| unnamed protein product [Mus musculus]
 gi|148688345|gb|EDL20292.1| expressed sequence AI461788, isoform CRA_b [Mus musculus]
 gi|219520492|gb|AAI45026.1| Lin54 protein [Mus musculus]
          Length = 419

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 95/139 (68%), Gaps = 11/139 (7%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RN
Sbjct: 186 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 245

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C
Sbjct: 246 PEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSIC 294

Query: 267 RCLDCKNFEGSEERRALFH 285
           +C+ CKNFE S ER+ L H
Sbjct: 295 KCIGCKNFEESPERKTLMH 313


>gi|51980517|gb|AAH82043.1| Lin54 protein [Rattus norvegicus]
          Length = 495

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 268 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 327

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 328 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 376

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 377 NFEESPERKTLMH 389


>gi|449455360|ref|XP_004145421.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
 gi|449494974|ref|XP_004159699.1| PREDICTED: CRC domain-containing protein TSO1-like [Cucumis
           sativus]
          Length = 602

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 66/132 (50%), Positives = 86/132 (65%), Gaps = 1/132 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ SRCLKLYCECFAAG+YC + C+C +C N   HE    E       RNP AF P
Sbjct: 308 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 367

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  +     + R+D+ +   +A+H +GC+CKKSGCLKKYCEC+Q  + CS NCRC  CK
Sbjct: 368 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 427

Query: 273 NFEGSEERRALF 284
           N  G ++  AL 
Sbjct: 428 NAFGRKDESALL 439



 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           R +++    + + ++ CNC+ S CLK YCEC+  G+ C   C C  C N
Sbjct: 380 RDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKN 428


>gi|449471001|ref|XP_004153179.1| PREDICTED: CRC domain-containing protein TSO1-like, partial
           [Cucumis sativus]
          Length = 581

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 66/132 (50%), Positives = 86/132 (65%), Gaps = 1/132 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ SRCLKLYCECFAAG+YC + C+C +C N   HE    E       RNP AF P
Sbjct: 287 KRCNCKKSRCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEAMVLETRRQIESRNPLAFAP 346

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  +     + R+D+ +   +A+H +GC+CKKSGCLKKYCEC+Q  + CS NCRC  CK
Sbjct: 347 KVIMNCDLISELRDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCK 406

Query: 273 NFEGSEERRALF 284
           N  G ++  AL 
Sbjct: 407 NAFGRKDESALL 418



 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           R +++    + + ++ CNC+ S CLK YCEC+  G+ C   C C  C N
Sbjct: 359 RDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKN 407


>gi|355699397|gb|AES01114.1| lin-54-like protein [Mustela putorius furo]
          Length = 748

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 101/260 (38%), Positives = 138/260 (53%), Gaps = 32/260 (12%)

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
           P+P    PQ+   VAS   SQP  RL ++P +  P +Q ++           +P      
Sbjct: 417 PKPINPTPQI---VAS---SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 469

Query: 102 WQARP-QHVRMVNRVPHPVHKLPLP-TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
           +   P Q+V  + +  +         T  PG Q   R  P       +   + +K CNC 
Sbjct: 470 YAVLPAQYVTQLQQSSYVSIASSSSFTGTPGIQTQAR-LPFNGIIPSESASRPRKPCNCT 528

Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     
Sbjct: 529 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 588

Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S E
Sbjct: 589 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 637

Query: 280 RRALFH-GDHNGIAFMQRAA 298
           R+ L H  D   +   Q+ A
Sbjct: 638 RKTLMHLADAAEVRVQQQTA 657


>gi|115476122|ref|NP_001061657.1| Os08g0369600 [Oryza sativa Japonica Group]
 gi|38637224|dbj|BAD03490.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|38637277|dbj|BAD03541.1| transcription factor-like [Oryza sativa Japonica Group]
 gi|113623626|dbj|BAF23571.1| Os08g0369600 [Oryza sativa Japonica Group]
          Length = 369

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 100/257 (38%), Positives = 142/257 (55%), Gaps = 21/257 (8%)

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMRSPSQAMPQW-QARPQHVR 110
           P PP ++ ++Q + +S    Q   +L++LP      +Q   RSP    P   Q  P+  R
Sbjct: 79  PSPPPAK-KLQVQQSSSMLPQDFPKLEVLPVEKPTKLQVR-RSPQHVSPALKQDSPKMER 136

Query: 111 MVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCEC 170
            +  +P P       T       S +      ++ KD T  ++  C+C+ S+CLKLYCEC
Sbjct: 137 PL--LPRPA------TFIDVMLSSQKPSSEMWSETKDVTLTRKTNCSCKYSKCLKLYCEC 188

Query: 171 FAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDARE 230
           F  G YC GCNC NC NNV HE ARQ+A+   LERNP AF PK+ +S   A +    A E
Sbjct: 189 FEKGRYCIGCNCTNCCNNVNHENARQDAINVALERNPAAFMPKVLNS--TAHNCESKAAE 246

Query: 231 AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER---RALFHGD 287
             +  KH KGC CK++ CLKKYCECF+A++ CSENCRC  CKN++ + +R   +   H  
Sbjct: 247 GDIVGKHTKGCKCKRTECLKKYCECFKASVFCSENCRCTGCKNYKSNADRISQKNTVHAH 306

Query: 288 HNGIAFMQRAANAAIIG 304
           H     +Q  A++ ++G
Sbjct: 307 H-----VQNPASSGMVG 318


>gi|297830990|ref|XP_002883377.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329217|gb|EFH59636.1| hypothetical protein ARALYDRAFT_479789 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 684

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 69/135 (51%), Positives = 89/135 (65%), Gaps = 9/135 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
           K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N   HE    E V AT +    RNP 
Sbjct: 390 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 445

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+  S     +A +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC
Sbjct: 446 AFAPKVIRSADSIMEAGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSVNCRC 505

Query: 269 LDCKNFEGSEERRAL 283
             CKN  G +E   L
Sbjct: 506 EGCKNVFGRKEGSLL 520


>gi|71051367|gb|AAH99076.1| Lin54 protein [Rattus norvegicus]
          Length = 438

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 211 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 270

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 271 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 319

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 320 NFEESPERKTLMH 332


>gi|156544375|ref|XP_001607403.1| PREDICTED: protein lin-54 homolog isoform 1 [Nasonia vitripennis]
 gi|345480219|ref|XP_003424108.1| PREDICTED: protein lin-54 homolog isoform 2 [Nasonia vitripennis]
          Length = 818

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/139 (52%), Positives = 91/139 (65%), Gaps = 12/139 (8%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQ 196
           SR   N+D     P+K   CNC  S+CLKLYC+CFA G +C  CNC NC NN+ +E  RQ
Sbjct: 583 SRSFNNSDANGIRPRKP--CNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQ 640

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
            A+ + LERNPNAFRPKI       +D R          +HNKGC+CK+SGCLK YCEC+
Sbjct: 641 RAIKSCLERNPNAFRPKIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECY 690

Query: 257 QANILCSENCRCLDCKNFE 275
           +A I CSENC+C+ C+N E
Sbjct: 691 EAKIPCSENCKCIGCRNVE 709



 Score = 47.8 bits (112), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
           K CNC+ S CLK YCEC+ A I C + C C+ C N  E  + +Q
Sbjct: 673 KGCNCKRSGCLKNYCECYEAKIPCSENCKCIGCRNVEEPALHKQ 716


>gi|260803175|ref|XP_002596466.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
 gi|229281723|gb|EEN52478.1| hypothetical protein BRAFLDRAFT_270843 [Branchiostoma floridae]
          Length = 458

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 78/158 (49%), Positives = 100/158 (63%), Gaps = 15/158 (9%)

Query: 132 QESPRSRPRANTDGKDGT----PKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
           Q +  ++PR   +G   T     + +K CNC  S+CLKLYC+CFA G +C+ CNC NC N
Sbjct: 210 QATTATQPRQQVNGTLNTEQVGTRPRKPCNCTKSQCLKLYCDCFANGEFCNNCNCNNCFN 269

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
           N+EHE  R +++ A LERNP AF PKI     G  D R           HNKGC+CK+SG
Sbjct: 270 NLEHEAERAKSIKACLERNPYAFHPKIGKGKEGEGDRR-----------HNKGCNCKRSG 318

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           CLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 319 CLKNYCECYEAKIMCSSICKCVGCKNFEESPERKTLMH 356


>gi|296196136|ref|XP_002745695.1| PREDICTED: protein lin-54 homolog isoform 1 [Callithrix jacchus]
          Length = 749

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/246 (38%), Positives = 132/246 (53%), Gaps = 31/246 (12%)

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
           P+P  S  Q+ +       SQP  RL ++P +  P +Q ++           +P      
Sbjct: 417 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 469

Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
           +   P Q+V  + +  +  +      T  PG Q   R  P       +   + +K CNC 
Sbjct: 470 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 528

Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     
Sbjct: 529 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 588

Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S E
Sbjct: 589 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 637

Query: 280 RRALFH 285
           R+ L H
Sbjct: 638 RKTLMH 643


>gi|443711997|gb|ELU05498.1| hypothetical protein CAPTEDRAFT_229042 [Capitella teleta]
          Length = 614

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/140 (52%), Positives = 93/140 (66%), Gaps = 10/140 (7%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLER 205
           ++   + +K CNC  S+CLKLYC+CFA G +C  CNCVNC NN+EHE  R  ++   LER
Sbjct: 386 QNAAARPRKACNCTKSQCLKLYCDCFANGEFCSNCNCVNCFNNLEHEEERSRSIKTCLER 445

Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
           NP AF PKI  S  G  D R          +HNKGC+CK+SGCLK YCEC++A ILC+  
Sbjct: 446 NPQAFHPKIGKSKAGQGDIR----------RHNKGCNCKRSGCLKNYCECYEAKILCTSL 495

Query: 266 CRCLDCKNFEGSEERRALFH 285
           C+C+ CKNFE S +R+ L H
Sbjct: 496 CKCVGCKNFEESPDRKTLMH 515


>gi|149046796|gb|EDL99570.1| similar to hypothetical protein, isoform CRA_c [Rattus norvegicus]
          Length = 419

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 192 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 251

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 252 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 300

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 301 NFEESPERKTLMH 313


>gi|296196138|ref|XP_002745696.1| PREDICTED: protein lin-54 homolog isoform 2 [Callithrix jacchus]
          Length = 660

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/246 (38%), Positives = 132/246 (53%), Gaps = 31/246 (12%)

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
           P+P  S  Q+ +       SQP  RL ++P +  P +Q ++           +P      
Sbjct: 328 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 380

Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
           +   P Q+V  + +  +  +      T  PG Q   R  P       +   + +K CNC 
Sbjct: 381 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 439

Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     
Sbjct: 440 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 499

Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S E
Sbjct: 500 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 548

Query: 280 RRALFH 285
           R+ L H
Sbjct: 549 RKTLMH 554


>gi|395850121|ref|XP_003797647.1| PREDICTED: LOW QUALITY PROTEIN: protein lin-54 homolog [Otolemur
           garnettii]
          Length = 544

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 82/132 (62%), Gaps = 11/132 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           K C C  S CLKL C+CFA G + + CNC + +N++EHE  RQ+A+ A L+RNP AF+PK
Sbjct: 345 KPCECVISLCLKLDCDCFANGEFYNNCNCTSYYNSLEHENERQKAIKACLDRNPEAFKPK 404

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                          +E      H+KGC+CK SGCLK YCEC++A I+CS  C+C  CKN
Sbjct: 405 TG-----------KGKEEDSDXHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453

Query: 274 FEGSEERRALFH 285
           FE   ER+ L H
Sbjct: 454 FEEXPERKTLMH 465



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           +D      K CNC+ S CLK YCEC+ A I C   C C  C N
Sbjct: 411 EDSDXHHSKGCNCKXSGCLKNYCECYEAKIMCSSICKCKGCKN 453


>gi|298708780|emb|CBJ30741.1| tesmin-like, animal TCX proteins are associated with development of
           both male and female reproductiv [Ectocarpus
           siliculosus]
          Length = 735

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/126 (51%), Positives = 78/126 (61%), Gaps = 14/126 (11%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
           CNC+ S+CLKLYCECF    YC+GCNCV C N    E  RQ A+  T+ERNP AF  K  
Sbjct: 440 CNCKKSKCLKLYCECFQRQQYCNGCNCVECLNTERTEDLRQLAIQGTIERNPQAFVSKF- 498

Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
                  + R   R       HN GC+CKKS CLKKYCECFQA + C  NC+C++CKN+E
Sbjct: 499 -------ERRAGKRS------HNAGCNCKKSACLKKYCECFQAGVACGTNCKCVNCKNYE 545

Query: 276 GSEERR 281
           G+   R
Sbjct: 546 GAAGGR 551


>gi|66812974|ref|XP_640666.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|60468739|gb|EAL66741.1| tesmin/TSO1-like, CXC domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 902

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 69/137 (50%), Positives = 87/137 (63%), Gaps = 11/137 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           K+C+C+NS+CLK+YCECFAA + C+GC C  C NN  +    + A   TLERNP+AF PK
Sbjct: 360 KKCHCKNSKCLKMYCECFAAKVLCNGCLCFGCQNNEANIEKVERARFQTLERNPDAFNPK 419

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           I  S               +  KH+KGCHC+KS CLKKYCECFQA+I C+ENC+C DCKN
Sbjct: 420 IKPS-------NSKINNQIIIDKHSKGCHCRKSSCLKKYCECFQASIPCNENCKCYDCKN 472

Query: 274 FEGSEERRALFHGDHNG 290
               +    L H D+ G
Sbjct: 473 ----QAHHHLHHPDNGG 485


>gi|14587305|dbj|BAB61216.1| P0460E08.26 [Oryza sativa Japonica Group]
          Length = 553

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 66/102 (64%), Positives = 82/102 (80%), Gaps = 2/102 (1%)

Query: 131 KQESPRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           K ESP+  PRA   +GKDGTP K+K CNC++S+CLKLYCECFA+G+YCDGCNC NC NNV
Sbjct: 137 KPESPKP-PRARLYEGKDGTPTKKKCCNCKHSKCLKLYCECFASGVYCDGCNCSNCFNNV 195

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREA 231
           ++E AR+EA+ ATLERNP+AFRPKI SSPH  ++  + A  A
Sbjct: 196 KNETARREAIDATLERNPDAFRPKIGSSPHANRNNMQAANAA 237



 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 65/184 (35%), Positives = 91/184 (49%), Gaps = 18/184 (9%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-KNFEGSEERRALFHG--DHNGIAFMQ 295
           K C+CK S CLK YCECF + + C + C C +C  N +    RR       + N  AF  
Sbjct: 160 KCCNCKHSKCLKLYCECFASGVYC-DGCNCSNCFNNVKNETARREAIDATLERNPDAFRP 218

Query: 296 R-------------AANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQS 342
           +             AANAA+ GA+G++ + +   S KRK  +     +AK+    R    
Sbjct: 219 KIGSSPHANRNNMQAANAAVNGAIGAAAFSSPSASRKRKHIDPSLDHSAKENGAHRTNHV 278

Query: 343 QQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSS 402
            Q    Q + +P    +P++Q  +   LG  K   R  LADI+QP D+KELC LLV+VS 
Sbjct: 279 PQVF-YQKNGMPSDGPLPINQPVHPPTLGPFKVTYRPLLADIVQPGDVKELCKLLVVVSG 337

Query: 403 EATK 406
            A K
Sbjct: 338 NAAK 341


>gi|55729585|emb|CAH91522.1| hypothetical protein [Pongo abelii]
          Length = 528

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/147 (51%), Positives = 96/147 (65%), Gaps = 12/147 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+C A G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 301 RKPCNCTKSLCLKLYCDCCANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 360

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 361 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 409

Query: 273 NFEGSEERRALFH-GDHNGIAFMQRAA 298
           NFE S ER+ L H  D   +   Q+ A
Sbjct: 410 NFEESPERKTLMHLADAAEVRVQQQTA 436


>gi|452822459|gb|EME29478.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 542

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 79/177 (44%), Positives = 100/177 (56%), Gaps = 14/177 (7%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K++K C C+NS CLKLYCECFAAG  C  C+C NC N+ +HE    EA  A L RNP AF
Sbjct: 228 KERKSCKCKNSMCLKLYCECFAAGQLCSNCSCQNCLNDEDHEKEVSEARNAILLRNPAAF 287

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
            PK+ + P      +ED     LA KH KGC+CK+SGC K YCECF A + CS  C+C D
Sbjct: 288 EPKMTAVP------KED---GALAIKHQKGCNCKRSGCQKNYCECFHAGVFCSNICKCND 338

Query: 271 CKNFE--GSEERRALFH-GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEE 324
           C N E   SE+   L H    +GI    R A + + G      +   M S +++  E
Sbjct: 339 CGNREDDNSEKSTNLSHILSDSGIGI--RGALSPLRGETNGYQHLLFMESPRKRWRE 393


>gi|312078614|ref|XP_003141815.1| hypothetical protein LOAG_06231 [Loa loa]
          Length = 495

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 84/244 (34%), Positives = 126/244 (51%), Gaps = 37/244 (15%)

Query: 82  PSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRA 141
           PS    +Q    SP  A    QA  Q     NRV  PV    L         +P+ RP  
Sbjct: 216 PSKSRLLQTGPLSPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLG 262

Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
           +          +K C+C  S CLKLYC+CFA G +C+ C+C +C N +E+E+ R  A+  
Sbjct: 263 S----------KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRL 312

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
           +LERNPNAF+PKI  + +         R+ +    H KGCHCKKS CLK YCEC++A + 
Sbjct: 313 SLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVP 363

Query: 262 CSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
           C++ C+C+ C+N   +E  RA    +      +    +AA +G+  +  + T + S++  
Sbjct: 364 CTDRCKCVCCRN---TESDRAARLANTKSATALTDIRSAATMGS--ADFHKTTLYSDEDS 418

Query: 322 SEEL 325
            +E+
Sbjct: 419 DDEV 422


>gi|219126634|ref|XP_002183557.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404794|gb|EEC44739.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1852

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/191 (40%), Positives = 105/191 (54%), Gaps = 28/191 (14%)

Query: 115  VPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAG 174
             P  + K PL    PGK+         N +G       + +CNC+ S+CLKLYCECF+A 
Sbjct: 1481 FPSSIVKTPLAV--PGKE---------NPNGITPMSTPRNRCNCKKSQCLKLYCECFSAE 1529

Query: 175  IYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
            + C+GCNC +C N       R +A+   L +N +AF+P+IA++P GA D +         
Sbjct: 1530 VMCEGCNCTDCRNLPAFAAVRAKAMTDCLNKNAHAFKPRIAAAP-GAADLQ--------- 1579

Query: 235  AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS----EERRALFHGDHNG 290
              H+ GC CKKS CLKKYCECFQA +LC   C+C  CKN  GS    ++RR +   D +G
Sbjct: 1580 -GHSMGCKCKKSECLKKYCECFQAGVLCGLKCKCESCKNIAGSQKLIDKRRQM--KDTSG 1636

Query: 291  IAFMQRAANAA 301
              F  R +  A
Sbjct: 1637 AEFAMRVSQEA 1647


>gi|390460682|ref|XP_003732523.1| PREDICTED: protein lin-54 homolog [Callithrix jacchus]
          Length = 528

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 98/260 (37%), Positives = 137/260 (52%), Gaps = 32/260 (12%)

Query: 52  PQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQAHMR----------SPSQAMPQ 101
           P+P  S  Q+ +       SQP  RL ++P +  P +Q ++           +P      
Sbjct: 196 PKPINSTSQIVTT------SQPQQRL-IMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVG 248

Query: 102 WQARP-QHVRMVNRVPH-PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCR 159
           +   P Q+V  + +  +  +      T  PG Q   R  P       +   + +K CNC 
Sbjct: 249 YAVLPAQYVTQLQQSSYVSIASNSTFTGTPGIQTQARP-PFNGIIPSESASRPRKPCNCT 307

Query: 160 NSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     
Sbjct: 308 KSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKE 367

Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S E
Sbjct: 368 GESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNFEESPE 416

Query: 280 RRALFH-GDHNGIAFMQRAA 298
           R+ L H  D   +   Q+ A
Sbjct: 417 RKTLMHLADAAEVRVQQQTA 436


>gi|194389404|dbj|BAG61668.1| unnamed protein product [Homo sapiens]
          Length = 348

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 92/133 (69%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 121 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKP 180

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 181 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 229

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 230 NFEESPERKTLMH 242


>gi|393905053|gb|EFO22256.2| hypothetical protein LOAG_06231 [Loa loa]
          Length = 452

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 84/244 (34%), Positives = 126/244 (51%), Gaps = 37/244 (15%)

Query: 82  PSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRA 141
           PS    +Q    SP  A    QA  Q     NRV  PV    L         +P+ RP  
Sbjct: 173 PSKSRLLQTGPLSPPLAYRAAQASIQR----NRVFTPVLAAKL---------APKKRPLG 219

Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 201
           +          +K C+C  S CLKLYC+CFA G +C+ C+C +C N +E+E+ R  A+  
Sbjct: 220 S----------KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRL 269

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
           +LERNPNAF+PKI  + +         R+ +    H KGCHCKKS CLK YCEC++A + 
Sbjct: 270 SLERNPNAFKPKIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVP 320

Query: 262 CSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
           C++ C+C+ C+N   +E  RA    +      +    +AA +G+  +  + T + S++  
Sbjct: 321 CTDRCKCVCCRN---TESDRAARLANTKSATALTDIRSAATMGS--ADFHKTTLYSDEDS 375

Query: 322 SEEL 325
            +E+
Sbjct: 376 DDEV 379


>gi|334186533|ref|NP_193213.5| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
 gi|395455099|sp|F4JIF5.1|TCX2_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 2; Short=AtTCX2;
           AltName: Full=Protein TSO1-like 2; Short=Protein SOL2
 gi|332658096|gb|AEE83496.1| TESMIN/TSO1-like CXC 2 [Arabidopsis thaliana]
          Length = 674

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 70/159 (44%), Positives = 95/159 (59%), Gaps = 6/159 (3%)

Query: 129 PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
           P  QE   S P   R   D  +G  +  K+CNC+ S+CLKLYCECFAAG+YC + C+C++
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCID 406

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
           C N   HE            RNP AF PK+  +    Q+  +DA +   +A+H +GC+CK
Sbjct: 407 CFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQETGDDASKTPASARHKRGCNCK 466

Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           KS CLKKYCEC+Q  + CS NCRC  CKN  G ++  ++
Sbjct: 467 KSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSSI 505


>gi|297830988|ref|XP_002883376.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329216|gb|EFH59635.1| hypothetical protein ARALYDRAFT_898758 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 623

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 73/179 (40%), Positives = 104/179 (58%), Gaps = 9/179 (5%)

Query: 110 RMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCE 169
           +++  VP     + L  + P K+     R ++   G+ G+    K+CNC+ S+CLKLYCE
Sbjct: 302 KVIEVVPKSSGLVELTPISPKKK-----RRKSEQSGEGGS--SCKRCNCKKSKCLKLYCE 354

Query: 170 CFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDA 228
           CFAAG+YC + C+C++C N   HE            RNP AF PK+  +     +  EDA
Sbjct: 355 CFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDA 414

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
            +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 415 SKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 472


>gi|328865133|gb|EGG13519.1| tesmin/TSO1-like protein [Dictyostelium fasciculatum]
          Length = 597

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 65/129 (50%), Positives = 84/129 (65%), Gaps = 13/129 (10%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K C+C+NS+CLKLYCECFA  + C+GC+C  CHNN  +    Q +    LERNP AF P
Sbjct: 341 RKGCHCKNSKCLKLYCECFANKLLCNGCHCFGCHNNDANIEVVQRSRSNILERNPEAFNP 400

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K           +E+ ++     KH KGCHC+KS CLKKYCECFQA I C E+C+C+DC+
Sbjct: 401 KFKQ--------KEETKQ----HKHTKGCHCRKSECLKKYCECFQAGIPCGEHCKCIDCR 448

Query: 273 NF-EGSEER 280
           N  E  +ER
Sbjct: 449 NTGEDIKER 457


>gi|449015362|dbj|BAM78764.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 707

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/296 (33%), Positives = 145/296 (48%), Gaps = 71/296 (23%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC+NS+CLKLYC+CFAAG YC+GC+C NC N   ++  RQ A+  +LERNP+AF
Sbjct: 99  RPRKPCNCKNSKCLKLYCDCFAAGTYCNGCHCTNCLNLPAYDTLRQNAIRTSLERNPHAF 158

Query: 211 RPKIASSPHGA-------------------------QDAREDARE--------------- 230
           R K+ +  +                           +D+ E  +E               
Sbjct: 159 RSKVVTLEYDTSSKDTASRQNCAATAAAAKSADVAPEDSSEPVQEQVPNEEASSTSTQPR 218

Query: 231 -----AQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE----RR 281
                 Q  A+H KGC C +S CLKKYCECFQ  + CS +CRC +C+NFEGS E    R 
Sbjct: 219 TTGTTTQRHAQHIKGCSCLRSLCLKKYCECFQNGVYCSASCRCSNCRNFEGSAELSQARE 278

Query: 282 ALFHGDHNGIAFM-QRAAN-----AAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQS 335
            L   +  GI+F  Q+ A+     ++++G V S   G L   + R     L G+  K QS
Sbjct: 279 RLVLAEL-GISFSHQKGASGTPTKSSLVGVVES---GLLSEESVRNIG--LEGILEKVQS 332

Query: 336 V-IRNPQSQQGNNVQNSA------VPCSQSVPVSQTANASVLGSTKSALRSPLADI 384
           +  R P      +  +S          S+    +QT+N S +   +S+ R+P A +
Sbjct: 333 LRKRTPGESNAQSTGSSKSHRAKRSETSRRTSYAQTSNGSTV---ESSPRTPTAKL 385



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
           +A E     +  K C+CK S CLK YC+CF A   C+  C C +C N 
Sbjct: 90  NALEGGSVTRPRKPCNCKNSKCLKLYCDCFAAGTYCN-GCHCTNCLNL 136


>gi|300121347|emb|CBK21727.2| unnamed protein product [Blastocystis hominis]
          Length = 305

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 67/145 (46%), Positives = 89/145 (61%), Gaps = 17/145 (11%)

Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVG 200
           +++DG    P     CNCR SRCLKLYCECFA+ ++C+GCNCVNC NN EH   R+E   
Sbjct: 174 SSSDGSPHDPNHSHTCNCRKSRCLKLYCECFASDVFCNGCNCVNCKNNPEHLKEREEVKQ 233

Query: 201 ATLERNPNAFRPKIA-SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQA 258
           A ++R+P+ F P +   +P               A + N +GC C+K+ C+KKYC+CFQ 
Sbjct: 234 ALIQRDPHVFEPHVVWDTP---------------ATRTNLRGCRCQKTRCVKKYCDCFQN 278

Query: 259 NILCSENCRCLDCKNFEGSEERRAL 283
            I C  NCRC+DCKN E +   RAL
Sbjct: 279 GICCGPNCRCVDCKNTEEAALNRAL 303


>gi|170582258|ref|XP_001896048.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
 gi|158596825|gb|EDP35103.1| Tesmin/TSO1-like CXC domain containing protein [Brugia malayi]
          Length = 469

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 59/129 (45%), Positives = 84/129 (65%), Gaps = 9/129 (6%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K C+C  S CLKLYC+CFA G +C+ C+C +C N +E+E+ R  A+  +LERNPNAF+P
Sbjct: 238 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 297

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI  + +         R+ +    H KGCHCKKS CLK YCEC++A + C++ C+C+ C+
Sbjct: 298 KIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCR 348

Query: 273 NFEGSEERR 281
           N E     R
Sbjct: 349 NTESDRAAR 357


>gi|321459097|gb|EFX70154.1| hypothetical protein DAPPUDRAFT_217492 [Daphnia pulex]
          Length = 209

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 68/123 (55%), Positives = 83/123 (67%), Gaps = 12/123 (9%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K +K CNC  S+CLKLYC+CFA G +C GCNCV C NN+EHE  R  A+ + L+RNP+AF
Sbjct: 20  KPRKPCNCTKSQCLKLYCDCFANGEFCHGCNCVCCANNLEHEELRLRAIRSCLDRNPHAF 79

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI     G     E  R       HNKGCHCK+SGCLK YCEC++A I CS  C+C+ 
Sbjct: 80  KPKI-----GVGWGPEPRR-------HNKGCHCKRSGCLKNYCECYEAKIACSAICKCIG 127

Query: 271 CKN 273
           CKN
Sbjct: 128 CKN 130


>gi|395542051|ref|XP_003772948.1| PREDICTED: protein lin-54 homolog, partial [Sarcophilus harrisii]
          Length = 360

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/133 (54%), Positives = 91/133 (68%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYC+CFA G +C+ CNC NC NN+EHE  RQ+A+ A L+RNP AF+P
Sbjct: 133 RKPCNCTKSLCLKLYCDCFANGEFCNNCNCNNCFNNLEHENERQKAIKACLDRNPEAFKP 192

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 193 KIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCK 241

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 242 NFEESPERKTLMH 254


>gi|380022721|ref|XP_003695187.1| PREDICTED: protein lin-54 homolog [Apis florea]
          Length = 827

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 639

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 640 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 689

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 719


>gi|340722920|ref|XP_003399847.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus terrestris]
          Length = 829

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 641

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 642 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 691

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 692 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 721


>gi|340722922|ref|XP_003399848.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus terrestris]
          Length = 835

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 647

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 648 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 697

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 698 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 727


>gi|328785527|ref|XP_392991.3| PREDICTED: protein lin-54 homolog isoform 1 [Apis mellifera]
          Length = 804

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 580 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 639

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 640 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 689

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 719


>gi|350403514|ref|XP_003486824.1| PREDICTED: protein lin-54 homolog isoform 2 [Bombus impatiens]
          Length = 829

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 582 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 641

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 642 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 691

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 692 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 721


>gi|350403512|ref|XP_003486823.1| PREDICTED: protein lin-54 homolog isoform 1 [Bombus impatiens]
          Length = 835

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 93/150 (62%), Gaps = 10/150 (6%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNC 185
           TL  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC
Sbjct: 588 TLSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNC 647

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+
Sbjct: 648 SNNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKR 697

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           SGCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 698 SGCLKNYCECYEAKIPCSANCKCIGCRNVE 727


>gi|30686784|ref|NP_566718.2| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75274232|sp|Q9LUI3.1|TSO1_ARATH RecName: Full=CRC domain-containing protein TSO1
 gi|9279696|dbj|BAB01253.1| DNA binding protein-like [Arabidopsis thaliana]
 gi|332643155|gb|AEE76676.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 695

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 65/135 (48%), Positives = 88/135 (65%), Gaps = 9/135 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
           K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N   HE    E V AT +    RNP 
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 456

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+  +     +A +DA +   +A+H +GC+CKKS C+KKYCEC+Q  + CS NCRC
Sbjct: 457 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 516

Query: 269 LDCKNFEGSEERRAL 283
             C N  G ++   L
Sbjct: 517 EGCTNVFGRKDGSLL 531


>gi|6739821|gb|AAF27433.1|AF206324_1 putative DNA binding protein [Arabidopsis thaliana]
 gi|7767425|gb|AAF69124.1|AF204059_1 CXC domain protein TSO1 [Arabidopsis thaliana]
          Length = 695

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 65/135 (48%), Positives = 88/135 (65%), Gaps = 9/135 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
           K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N   HE    E V AT +    RNP 
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 456

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+  +     +A +DA +   +A+H +GC+CKKS C+KKYCEC+Q  + CS NCRC
Sbjct: 457 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 516

Query: 269 LDCKNFEGSEERRAL 283
             C N  G ++   L
Sbjct: 517 EGCTNVFGRKDGSLL 531


>gi|223995127|ref|XP_002287247.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976363|gb|EED94690.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1174

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/174 (40%), Positives = 94/174 (54%), Gaps = 25/174 (14%)

Query: 145  GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLE 204
            GK+ + + +  C C+NS+CLKLYC+CF+A  YC GC C++C N    E  R +A+  T  
Sbjct: 852  GKENS-ENESTCKCKNSKCLKLYCDCFSAEKYCVGCKCIDCQNTPTFESIRAKAIADTRA 910

Query: 205  RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
            RNPNAF+ K+ +S                  +H  GC CKKS CLKKYCECF   ++C E
Sbjct: 911  RNPNAFKTKLNAS----------------VTEHATGCKCKKSECLKKYCECFANLVVCGE 954

Query: 265  NCRCLDCKNFEGS----EERRALFHGDHNGIAFMQRAANAAIIGAVGSS--GYG 312
            +C+C +CKN+ GS    E RR L   D  G     R       G++  S   YG
Sbjct: 955  SCKCSNCKNYVGSQALIERRRKL--KDAKGTELAMRPVARGWKGSMSDSKVAYG 1006


>gi|145502865|ref|XP_001437410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404560|emb|CAK70013.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 89/122 (72%), Gaps = 9/122 (7%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+NS+C+KLYCEC+   ++CD  C+C NC NN  +   RQ+A+  TLE+NP+AF+P
Sbjct: 16  KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQRQKAMQYTLEKNPSAFQP 75

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI +S     D++ D        KHNKGC CKKSGC+KKYCECFQA + CS+ C+C++C+
Sbjct: 76  KITTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECR 127

Query: 273 NF 274
           N+
Sbjct: 128 NY 129



 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+CK S C+K YCEC+Q  + C  NC C +C N   + ++R
Sbjct: 16  KKCNCKNSKCIKLYCECYQNKVFCDNNCHCHNCFNNSSNAKQR 58



 Score = 40.4 bits (93), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C C+ S C+K YCECF A + C D C C+ C N
Sbjct: 91  KHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 128


>gi|413952362|gb|AFW85011.1| hypothetical protein ZEAMMB73_286736 [Zea mays]
          Length = 235

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 64/91 (70%), Positives = 74/91 (81%), Gaps = 1/91 (1%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNN 188
           P K +SP+   R   +GKD TP K+K CNCRNSRCLKLYCECFA+G +CDGCNC NC NN
Sbjct: 131 PLKPDSPKPWARL-YEGKDSTPTKKKCCNCRNSRCLKLYCECFASGTHCDGCNCTNCFNN 189

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            E+EVAR+EA+ ATLERNP+AFRPKI SSPH
Sbjct: 190 PENEVARREAIEATLERNPDAFRPKIGSSPH 220



 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+C+ S CLK YCECF +   C + C C +C N   +E  R
Sbjct: 155 KCCNCRNSRCLKLYCECFASGTHC-DGCNCTNCFNNPENEVAR 196


>gi|402586637|gb|EJW80574.1| hypothetical protein WUBG_08516, partial [Wuchereria bancrofti]
          Length = 391

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 59/129 (45%), Positives = 84/129 (65%), Gaps = 9/129 (6%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K C+C  S CLKLYC+CFA G +C+ C+C +C N +E+E+ R  A+  +LERNPNAF+P
Sbjct: 234 KKPCHCTRSMCLKLYCDCFANGEFCNDCDCKDCKNTIEYEIERTRAIRLSLERNPNAFKP 293

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI  + +         R+ +    H KGCHCKKS CLK YCEC++A + C++ C+C+ C+
Sbjct: 294 KIGVATN---------RQVEPERLHQKGCHCKKSNCLKNYCECYEAKVPCTDRCKCICCR 344

Query: 273 NFEGSEERR 281
           N E     R
Sbjct: 345 NTESDRAAR 353


>gi|414877954|tpg|DAA55085.1| TPA: hypothetical protein ZEAMMB73_985885 [Zea mays]
          Length = 772

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/154 (44%), Positives = 92/154 (59%), Gaps = 4/154 (2%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN  
Sbjct: 443 QGSPKKK-RHKFDSGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCMNNQS 499

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           H             RNP AF PK+  +     + R+ + +   +A+H +GC+CKKS CLK
Sbjct: 500 HTETVLSTRQQIESRNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLK 559

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           KYCECFQ  + CS +CRC  CKN  G  E  A+ 
Sbjct: 560 KYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 593


>gi|18403510|ref|NP_566717.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75153799|sp|Q8L548.1|TCX3_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 3; Short=AtTCX3;
           AltName: Full=Protein TSO1-like 1; Short=Protein SOL1
 gi|20466374|gb|AAM20504.1| unknown protein [Arabidopsis thaliana]
 gi|22136318|gb|AAM91237.1| unknown protein [Arabidopsis thaliana]
 gi|332643153|gb|AEE76674.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 609

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           SP+ + R +    +G     K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N   H+
Sbjct: 309 SPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHK 367

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
                       RNP AF PK+  +     +  EDA +   +A+H +GC+CKKS CLKKY
Sbjct: 368 DVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKY 427

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
           CEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 428 CECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461


>gi|242019702|ref|XP_002430298.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515413|gb|EEB17560.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 995

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 70/125 (56%), Positives = 90/125 (72%), Gaps = 11/125 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + +K CNC  S+CLKLYCECFA G +C  CNC NC+NN+EHE ARQ+++ + LERNP AF
Sbjct: 779 RMKKPCNCTRSQCLKLYCECFANGEFCFQCNCNNCYNNIEHEEARQKSIKSCLERNPCAF 838

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           RPKI        +  ED R      +HNKGCHCK+SGCLK YCEC++A I CS++C+C+ 
Sbjct: 839 RPKIQ-----IGETEEDER------RHNKGCHCKRSGCLKNYCECYEAKITCSKSCKCIG 887

Query: 271 CKNFE 275
           C+N E
Sbjct: 888 CRNTE 892


>gi|7767427|gb|AAF69125.1|AF205142_1 CXC domain containing TSO1-like protein 1 [Arabidopsis thaliana]
          Length = 609

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           SP+ + R +    +G     K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N   H+
Sbjct: 309 SPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHK 367

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
                       RNP AF PK+  +     +  EDA +   +A+H +GC+CKKS CLKKY
Sbjct: 368 DVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKY 427

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGD 287
           CEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 428 CECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461


>gi|168046141|ref|XP_001775533.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673088|gb|EDQ59616.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1223

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 63/127 (49%), Positives = 81/127 (63%), Gaps = 1/127 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAA  +C G C C NC N  E+E            RNP AF P
Sbjct: 737 KRCNCKKSKCLKLYCECFAAREFCVGSCACRNCFNKPEYEATVLNTRQQIESRNPLAFAP 796

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI  +P  +    ++A +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC  C+
Sbjct: 797 KIVQAPESSPIPGDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCR 856

Query: 273 NFEGSEE 279
           N  G +E
Sbjct: 857 NMYGRKE 863


>gi|255542340|ref|XP_002512233.1| tso1, putative [Ricinus communis]
 gi|223548194|gb|EEF49685.1| tso1, putative [Ricinus communis]
          Length = 729

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 63/126 (50%), Positives = 81/126 (64%), Gaps = 1/126 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYC+CFAAG+YC + C+C +C N   HE    E       RNP AF P
Sbjct: 450 KRCNCKRSKCLKLYCDCFAAGLYCIEPCSCQDCFNKPAHEDTVLETRKQIESRNPLAFAP 509

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     D  ++  +   +A+H +GC+CKKS CLKKYCECFQ  + CS NCRC  CK
Sbjct: 510 KVIRSTDYVSDFGDETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCK 569

Query: 273 NFEGSE 278
           N  GS+
Sbjct: 570 NSFGSK 575



 Score = 45.8 bits (107), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 4/48 (8%)

Query: 144 DGKDGTP---KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           D  + TP   + ++ CNC+ S CLK YCECF  G+ C   C C  C N
Sbjct: 523 DETNKTPASARHKRGCNCKKSNCLKKYCECFQGGVGCSSNCRCEGCKN 570


>gi|383858991|ref|XP_003704982.1| PREDICTED: protein lin-54 homolog isoform 2 [Megachile rotundata]
          Length = 802

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 72/149 (48%), Positives = 92/149 (61%), Gaps = 10/149 (6%)

Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCH 186
           L  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC 
Sbjct: 580 LSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCS 639

Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 246
           NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+S
Sbjct: 640 NNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKRS 689

Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFE 275
           GCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 GCLKNYCECYEAKIPCSANCKCIGCRNVE 718


>gi|15028139|gb|AAK76693.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 65/136 (47%), Positives = 88/136 (64%), Gaps = 9/136 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
           K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N   HE    E V AT +    RNP 
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 287

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+  +     +A +DA +   +A+H +GC+CKKS C+KKYCEC+Q  + CS NCRC
Sbjct: 288 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 347

Query: 269 LDCKNFEGSEERRALF 284
             C N  G ++   L 
Sbjct: 348 EGCTNVFGRKDGSLLV 363


>gi|23297130|gb|AAN13100.1| putative DNA binding protein [Arabidopsis thaliana]
          Length = 526

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 65/136 (47%), Positives = 88/136 (64%), Gaps = 9/136 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLE----RNPN 208
           K+CNC+ S+CLKLYCECFAAG+YC + C+C++C N   HE    E V AT +    RNP 
Sbjct: 232 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHE----ETVLATRKQIESRNPL 287

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+  +     +A +DA +   +A+H +GC+CKKS C+KKYCEC+Q  + CS NCRC
Sbjct: 288 AFAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRC 347

Query: 269 LDCKNFEGSEERRALF 284
             C N  G ++   L 
Sbjct: 348 EGCTNVFGRKDGSLLV 363


>gi|383858989|ref|XP_003704981.1| PREDICTED: protein lin-54 homolog isoform 1 [Megachile rotundata]
          Length = 826

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 72/149 (48%), Positives = 92/149 (61%), Gaps = 10/149 (6%)

Query: 127 LPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCH 186
           L  G     +S  R+ T  +    + +K CNC  S+CLKLYC+CFA G +C  CNC NC 
Sbjct: 580 LSVGSVIESKSSQRSYTSVEPNGIRPRKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCS 639

Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKS 246
           NN+ +E  RQ A+ + LERNPNAFRPKI        D R          +HNKGC+CK+S
Sbjct: 640 NNLGNEEERQRAIKSCLERNPNAFRPKIGKGRETGDDIR----------RHNKGCNCKRS 689

Query: 247 GCLKKYCECFQANILCSENCRCLDCKNFE 275
           GCLK YCEC++A I CS NC+C+ C+N E
Sbjct: 690 GCLKNYCECYEAKIPCSANCKCIGCRNVE 718


>gi|307206123|gb|EFN84203.1| Protein lin-54-like protein [Harpegnathos saltator]
          Length = 825

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 67/123 (54%), Positives = 83/123 (67%), Gaps = 10/123 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRP
Sbjct: 605 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 664

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI        D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 665 KIGKGRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR 714

Query: 273 NFE 275
           N E
Sbjct: 715 NIE 717



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           D   +  K CNC+ S CLK YCEC+ A I C   C C+ C N
Sbjct: 674 DDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRN 715


>gi|307182142|gb|EFN69485.1| Protein lin-54-like protein [Camponotus floridanus]
          Length = 822

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 67/123 (54%), Positives = 83/123 (67%), Gaps = 10/123 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRP
Sbjct: 602 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 661

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI        D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 662 KIGKGRETGDDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCR 711

Query: 273 NFE 275
           N E
Sbjct: 712 NIE 714



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           D   +  K CNC+ S CLK YCEC+ A I C   C C+ C N
Sbjct: 671 DDIRRHNKGCNCKRSGCLKNYCECYEAKIPCSANCKCIGCRN 712


>gi|414883728|tpg|DAA59742.1| TPA: hypothetical protein ZEAMMB73_893320 [Zea mays]
          Length = 800

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/148 (45%), Positives = 87/148 (58%), Gaps = 3/148 (2%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
           G+  SP+ + R   +G DG  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N 
Sbjct: 460 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCMNK 517

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
             HE            RNP AF PK+        +  ED      +A+H +GC+CKKS C
Sbjct: 518 PIHEETVLSTRKQIEFRNPLAFAPKVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSC 577

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEG 276
           LKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 578 LKKYCECYQGGVGCSSNCRCESCKNAFG 605


>gi|326500406|dbj|BAK06292.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 782

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/142 (45%), Positives = 86/142 (60%), Gaps = 1/142 (0%)

Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQE 197
           P+      +G  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N   HE     
Sbjct: 458 PKKRRKSDNGDDESCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHEEIVLS 517

Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
                  RNP AF PK+       Q+A+ED +    +A+H +GC+CKKS CLKKYCEC+Q
Sbjct: 518 TRKHIEFRNPLAFAPKVIRMSEAGQEAQEDPKNTPASARHKRGCNCKKSSCLKKYCECYQ 577

Query: 258 ANILCSENCRCLDCKNFEGSEE 279
             + CS NCRC  CKN  G+ +
Sbjct: 578 GGVGCSNNCRCETCKNTFGTRD 599


>gi|242086180|ref|XP_002443515.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
 gi|241944208|gb|EES17353.1| hypothetical protein SORBIDRAFT_08g020800 [Sorghum bicolor]
          Length = 767

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 91/154 (59%), Gaps = 4/154 (2%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN  
Sbjct: 438 QGSPKKK-RQKFDNGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCLNNQS 494

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           H             RNP AF PK+  +     +  + + +   +A+H +GC+CKKS CLK
Sbjct: 495 HTETVLSTRQQIESRNPLAFAPKVIHTSESGLELGDFSNKTPASARHKRGCNCKKSSCLK 554

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           KYCECFQ  + CS +CRC  CKN  G  E  A+ 
Sbjct: 555 KYCECFQGGVGCSISCRCEGCKNAFGKREGAAVL 588


>gi|156378675|ref|XP_001631267.1| predicted protein [Nematostella vectensis]
 gi|156218304|gb|EDO39204.1| predicted protein [Nematostella vectensis]
          Length = 226

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 75/133 (56%), Positives = 90/133 (67%), Gaps = 11/133 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNCVNC NN+EHE  R +A+ A LERNP+AF P
Sbjct: 4   RKPCNCTKSQCLKLYCDCFAQGEFCSNCNCVNCSNNIEHERERSKAIKACLERNPHAFHP 63

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI     G  + R           HNKGCHCK+SGCLK YCEC++A ILC+  C+C  CK
Sbjct: 64  KIGKGKVGESERR-----------HNKGCHCKRSGCLKNYCECYEAKILCTSLCKCTGCK 112

Query: 273 NFEGSEERRALFH 285
           NFE S ER+ L H
Sbjct: 113 NFEESPERKTLMH 125


>gi|343959662|dbj|BAK63688.1| hypothetical protein [Pan troglodytes]
          Length = 348

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 70/129 (54%), Positives = 89/129 (68%), Gaps = 11/129 (8%)

Query: 157 NCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 216
           NC  S CL+LYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI  
Sbjct: 125 NCTKSLCLRLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGK 184

Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
              G  D R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE 
Sbjct: 185 GKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEE 233

Query: 277 SEERRALFH 285
           S ER+ L H
Sbjct: 234 SPERKTLMH 242


>gi|322788388|gb|EFZ14059.1| hypothetical protein SINV_04654 [Solenopsis invicta]
          Length = 834

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/122 (54%), Positives = 83/122 (68%), Gaps = 10/122 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRP
Sbjct: 614 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 673

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI       +D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 674 KIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCR 723

Query: 273 NF 274
           N 
Sbjct: 724 NI 725


>gi|414868877|tpg|DAA47434.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
 gi|414868878|tpg|DAA47435.1| TPA: hypothetical protein ZEAMMB73_892276, partial [Zea mays]
          Length = 587

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 68/149 (45%), Positives = 89/149 (59%), Gaps = 4/149 (2%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C NN  
Sbjct: 441 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCIGCQNNQS 497

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           H             RNP AF PK+  +     D  + + +   +A+H +GC+CKKS CLK
Sbjct: 498 HMETVLSTRQQIESRNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLK 557

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEE 279
           KYCECFQ  + CS +CRC  CKN  G  E
Sbjct: 558 KYCECFQGGVGCSVSCRCEGCKNAFGKRE 586


>gi|242043084|ref|XP_002459413.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
 gi|241922790|gb|EER95934.1| hypothetical protein SORBIDRAFT_02g004250 [Sorghum bicolor]
          Length = 776

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 67/148 (45%), Positives = 87/148 (58%), Gaps = 3/148 (2%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
           G+  SP+ + R   +G DG  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N 
Sbjct: 442 GQSVSPQKKRRKTDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNK 499

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
             HE            RNP AF PK+        +  ED      +A+H +GC+CKKS C
Sbjct: 500 PIHEEIVLSTRKQIEFRNPLAFAPKVIRMSDAGLETGEDPNNTPASARHKRGCNCKKSSC 559

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEG 276
           LKKYCEC+Q  + CS NCRC  CKN  G
Sbjct: 560 LKKYCECYQGGVGCSTNCRCESCKNTFG 587


>gi|167536200|ref|XP_001749772.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771699|gb|EDQ85361.1| predicted protein [Monosiga brevicollis MX1]
          Length = 429

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 62/121 (51%), Positives = 79/121 (65%), Gaps = 10/121 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           + CNC+NS+CLKLYCECFA G YC   CNCV+C N    E  R++A+  TL +NP+AFRP
Sbjct: 216 RTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLRKQAIEVTLAKNPDAFRP 275

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K      GA  +R+ A      + H KGC C +S C+K+YCECFQA I C+  C+C DC 
Sbjct: 276 KF-----GANKSRQSA----APSAHVKGCRCARSKCIKRYCECFQAGIPCTSACQCRDCH 326

Query: 273 N 273
           N
Sbjct: 327 N 327



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 26/47 (55%)

Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           A   + C+CK S CLK YCECF     C  NC C+ C N E +E  R
Sbjct: 212 ASGGRTCNCKNSKCLKLYCECFANGEYCGPNCNCVSCGNREATETLR 258


>gi|255548982|ref|XP_002515547.1| tso1, putative [Ricinus communis]
 gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis]
          Length = 873

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 83/133 (62%), Gaps = 2/133 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ ++CLKLYC+CFAAGIYC D C C +C N  E+E    E       RNP AF P
Sbjct: 453 KRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNPLAFAP 512

Query: 213 KIASSPHGAQDAREDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           KI         +RED   +  + ++H +GC+CKKS CLKKYCEC+QAN+ CS  CRC  C
Sbjct: 513 KIVQHAKEFAASREDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGC 572

Query: 272 KNFEGSEERRALF 284
           KN  G +E   + 
Sbjct: 573 KNGYGRKEEYGII 585



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 30/51 (58%)

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           D  E+ +   +     K C+CKK+ CLK YC+CF A I C++ C C DC N
Sbjct: 437 DNFEEKKSLSIDGDGCKRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFN 487



 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLE 204
           + ++ CNC+ S CLK YCEC+ A + C   C C  C N       R+E  G   E
Sbjct: 537 RHKRGCNCKKSMCLKKYCECYQANVGCSSECRCEGCKNG----YGRKEEYGIIEE 587


>gi|332021648|gb|EGI62007.1| Protein lin-54-like protein [Acromyrmex echinatior]
          Length = 859

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/122 (54%), Positives = 83/122 (68%), Gaps = 10/122 (8%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S+CLKLYC+CFA G +C  CNC NC NN+ +E  RQ A+ + LERNPNAFRP
Sbjct: 639 RKPCNCTKSQCLKLYCDCFANGEFCHMCNCNNCSNNLGNEEERQRAIKSCLERNPNAFRP 698

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI       +D R          +HNKGC+CK+SGCLK YCEC++A I CS NC+C+ C+
Sbjct: 699 KIGKGRETGEDIR----------RHNKGCNCKRSGCLKNYCECYEAKIPCSGNCKCIGCR 748

Query: 273 NF 274
           N 
Sbjct: 749 NI 750


>gi|357111377|ref|XP_003557490.1| PREDICTED: uncharacterized protein LOC100836677 [Brachypodium
           distachyon]
          Length = 787

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 87/143 (60%), Gaps = 4/143 (2%)

Query: 135 PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEV 193
           P+ + R + +G D   +  K+C+C+ S+CLKLYCECFAAG+YC + C+C+ C N   HE 
Sbjct: 459 PQKKRRKSDNGDD---ESCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCLGCLNKPIHEE 515

Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
                      RNP AF PK+       Q+ +ED      +A+H +GC+CKKS CLKKYC
Sbjct: 516 IVLSTRKQIEFRNPLAFAPKVIRLSDAGQETQEDPNNTPASARHKRGCNCKKSSCLKKYC 575

Query: 254 ECFQANILCSENCRCLDCKNFEG 276
           EC+Q  + CS NCRC  CKN  G
Sbjct: 576 ECYQGGVGCSSNCRCESCKNTFG 598


>gi|115470839|ref|NP_001059018.1| Os07g0176200 [Oryza sativa Japonica Group]
 gi|34393829|dbj|BAC83433.1| putative CXC domain protein TSO1 [Oryza sativa Japonica Group]
 gi|113610554|dbj|BAF20932.1| Os07g0176200 [Oryza sativa Japonica Group]
          Length = 782

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           SP+ + R + +G DG  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N   HE
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 507

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
                       RNP AF PK+       QD   ED      +A+H +GC+CKKS CLKK
Sbjct: 508 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 567

Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
           YCEC+Q  + CS NCRC  CKN  G  +
Sbjct: 568 YCECYQGGVGCSSNCRCEGCKNTFGKRD 595


>gi|222636534|gb|EEE66666.1| hypothetical protein OsJ_23296 [Oryza sativa Japonica Group]
          Length = 782

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           SP+ + R + +G DG  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N   HE
Sbjct: 438 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 495

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
                       RNP AF PK+       QD   ED      +A+H +GC+CKKS CLKK
Sbjct: 496 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 555

Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
           YCEC+Q  + CS NCRC  CKN  G  +
Sbjct: 556 YCECYQGGVGCSSNCRCEGCKNTFGKRD 583


>gi|218199174|gb|EEC81601.1| hypothetical protein OsI_25091 [Oryza sativa Indica Group]
          Length = 794

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/148 (45%), Positives = 88/148 (59%), Gaps = 4/148 (2%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           SP+ + R + +G DG  +  K+C+C+ S+CLKLYCECFAAG+YC + C+C  C N   HE
Sbjct: 450 SPQKKRRKSDNGDDG--EACKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKPIHE 507

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSGCLKK 251
                       RNP AF PK+       QD   ED      +A+H +GC+CKKS CLKK
Sbjct: 508 EIVLSTRKQIEFRNPLAFAPKVIRMSDAGQDITGEDPNNTPASARHKRGCNCKKSSCLKK 567

Query: 252 YCECFQANILCSENCRCLDCKNFEGSEE 279
           YCEC+Q  + CS NCRC  CKN  G  +
Sbjct: 568 YCECYQGGVGCSSNCRCEGCKNTFGKRD 595


>gi|297737091|emb|CBI26292.3| unnamed protein product [Vitis vinifera]
          Length = 717

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 90/154 (58%), Gaps = 3/154 (1%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           Q SP+ + R    G  G  +  K+CNC+ S+CLKLYCECFAAG+YC + C+C  C N   
Sbjct: 403 QSSPKKKRRRLEHG--GETEGCKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPI 460

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           HE            RNP AF PK+  S     +  ++  +   +A+H +GC+CKKS CLK
Sbjct: 461 HEDTVLATRKQIESRNPLAFAPKVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLK 520

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           KYCEC+Q  + CS NCRC  CKN  G ++  AL 
Sbjct: 521 KYCECYQGGVGCSINCRCEGCKNAFGRKDGSALI 554


>gi|40241245|emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus]
 gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus]
          Length = 897

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 63/128 (49%), Positives = 81/128 (63%), Gaps = 2/128 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYC+CFAAG++C D C+C +C N  E+     E       RNP AF P
Sbjct: 509 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 568

Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           KI  S   A    ED       +A+H +GC+CK+S CLKKYCEC+Q+N+ CS  CRC  C
Sbjct: 569 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 628

Query: 272 KNFEGSEE 279
           KN  G +E
Sbjct: 629 KNVYGKKE 636


>gi|51949820|gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus]
          Length = 821

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 63/128 (49%), Positives = 81/128 (63%), Gaps = 2/128 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYC+CFAAG++C D C+C +C N  E+     E       RNP AF P
Sbjct: 433 KRCNCKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAP 492

Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           KI  S   A    ED       +A+H +GC+CK+S CLKKYCEC+Q+N+ CS  CRC  C
Sbjct: 493 KIVKSATNAPSNMEDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGC 552

Query: 272 KNFEGSEE 279
           KN  G +E
Sbjct: 553 KNVYGKKE 560


>gi|348567366|ref|XP_003469470.1| PREDICTED: protein lin-54 homolog [Cavia porcellus]
          Length = 749

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 65/150 (43%), Positives = 81/150 (54%), Gaps = 41/150 (27%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV---------------EHEVAR 195
           + +K CNC  S CLKLYC+CFA G                              EHE  R
Sbjct: 520 RPRKPCNCTKSLCLKLYCDCFANG---------------EFCNNCNCTNCYNNLEHENER 564

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
           Q+A+ A L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC
Sbjct: 565 QKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCEC 613

Query: 256 FQANILCSENCRCLDCKNFEGSEERRALFH 285
           ++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 614 YEAKIMCSSICKCIGCKNFEESPERKTLMH 643


>gi|359477548|ref|XP_002279154.2| PREDICTED: uncharacterized protein LOC100261336 [Vitis vinifera]
          Length = 738

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 62/132 (46%), Positives = 81/132 (61%), Gaps = 1/132 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C  C N   HE            RNP AF P
Sbjct: 444 KRCNCKKSKCLKLYCECFAAGVYCVEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAP 503

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  ++  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CK
Sbjct: 504 KVIRSSDSLPEVGDEPSKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCK 563

Query: 273 NFEGSEERRALF 284
           N  G ++  AL 
Sbjct: 564 NAFGRKDGSALI 575



 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + ++ CNC+ S CLK YCEC+  G+ C   C C  C N
Sbjct: 527 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKN 564


>gi|356538172|ref|XP_003537578.1| PREDICTED: uncharacterized protein LOC100815907 [Glycine max]
          Length = 774

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/152 (43%), Positives = 91/152 (59%), Gaps = 3/152 (1%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           Q SPR + R +     G  +  K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   
Sbjct: 466 QNSPRMKRRKSE--TPGDTEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPI 523

Query: 191 HEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLK 250
           HE    +       RNP AF PK+  +     +  +D  +   +A+H +GC+CKKS CLK
Sbjct: 524 HEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSSCLK 583

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           KYCEC+Q  + CS +CRC  CKN  G ++  A
Sbjct: 584 KYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615


>gi|298204891|emb|CBI34198.3| unnamed protein product [Vitis vinifera]
          Length = 829

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   HE    +       RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  +++ +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CK
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 622

Query: 273 NFEGSEE 279
           N  G ++
Sbjct: 623 NTFGRKD 629



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 491 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|147840870|emb|CAN68781.1| hypothetical protein VITISV_018991 [Vitis vinifera]
          Length = 806

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   HE    +       RNP AF P
Sbjct: 503 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 562

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  +++ +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CK
Sbjct: 563 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 622

Query: 273 NFEGSEE 279
           N  G ++
Sbjct: 623 NTFGRKD 629



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 491 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 537


>gi|356497442|ref|XP_003517569.1| PREDICTED: uncharacterized protein LOC100804447 [Glycine max]
          Length = 774

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/155 (43%), Positives = 92/155 (59%), Gaps = 9/155 (5%)

Query: 132 QESPRSRPRANTDGKDGTPKKQ---KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           Q SP+ + R     K  TP +    K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N
Sbjct: 466 QNSPKKKRR-----KSETPGETEGCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFN 520

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
              HE    +       RNP AF PK+  +     +  +D  +   +A+H +GC+CKKS 
Sbjct: 521 KPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASARHKRGCNCKKSS 580

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           CLKKYCEC+Q  + CS +CRC  CKN  G ++  A
Sbjct: 581 CLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 615


>gi|359487901|ref|XP_002272709.2| PREDICTED: uncharacterized protein LOC100265943 [Vitis vinifera]
          Length = 777

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 82/127 (64%), Gaps = 1/127 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   HE    +       RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFNKPVHEDTVLQTRKQIESRNPLAFAP 533

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  +++ +   +A+H +GC+CKKS CLKKYCECFQ  + CS +CRC  CK
Sbjct: 534 KVIRSSDYVPEFGDESNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCK 593

Query: 273 NFEGSEE 279
           N  G ++
Sbjct: 594 NTFGRKD 600



 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 2/47 (4%)

Query: 229 REAQLAAKHN--KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           R ++ AA++   K C+CKKS CLK YCECF A + C E C C DC N
Sbjct: 462 RRSEQAAENEACKRCNCKKSKCLKLYCECFAAGLYCVEPCSCQDCFN 508


>gi|255552089|ref|XP_002517089.1| tso1, putative [Ricinus communis]
 gi|223543724|gb|EEF45252.1| tso1, putative [Ricinus communis]
          Length = 727

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 80/130 (61%), Gaps = 1/130 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C  C N   HE            RNP AF P
Sbjct: 435 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLATRKQIESRNPLAFAP 494

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  ++  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CK
Sbjct: 495 KVIRSSEPVTEIGDEFSKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCK 554

Query: 273 NFEGSEERRA 282
           N  G ++  A
Sbjct: 555 NAFGRKDGSA 564


>gi|118384765|ref|XP_001025522.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307289|gb|EAS05277.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1084

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/118 (50%), Positives = 78/118 (66%), Gaps = 1/118 (0%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
           CNC+ ++CLKLYCECFAA   C+GC C  C N  E E  R+EA  A LERN +AF PKI 
Sbjct: 533 CNCKRTKCLKLYCECFAASRMCEGCTCQGCFNKPEFEAMRKEARQAILERNNSAFDPKID 592

Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           S     Q   E+  + +    H+KGC+CKKS CLKKYCEC+Q  + C++ C+C +C+N
Sbjct: 593 SQSKLKQLCNEEGVDNEQKV-HSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRN 649



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 142 NTDGKDGTPK-KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQE 197
           N +G D   K   K CNC+ S CLK YCEC+  G+ C   C C NC NN+ H  + +E
Sbjct: 602 NEEGVDNEQKVHSKGCNCKKSNCLKKYCECYQLGVKCTKLCKCDNCRNNINHPHSHEE 659


>gi|357436899|ref|XP_003588725.1| Lin-54-like protein [Medicago truncatula]
 gi|355477773|gb|AES58976.1| Lin-54-like protein [Medicago truncatula]
          Length = 921

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 84/149 (56%), Gaps = 11/149 (7%)

Query: 136 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA 194
           R + + +TD   G     K CNC+ S+CLKLYC+CF AGI+C DGC C  C N VE +  
Sbjct: 506 RKKKKTSTDDASGP----KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDK 561

Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKGCHCKKSGCLK 250
             E       RNP AF PKI   P  A     +  +  +    +A+H +GC+CK+S C K
Sbjct: 562 VVETKQQIESRNPQAFAPKIV--PCAADVPPNNMEDVNMTTPASARHKRGCNCKRSKCTK 619

Query: 251 KYCECFQANILCSENCRCLDCKNFEGSEE 279
           KYCECFQAN+ CS  CRC  C N  G  E
Sbjct: 620 KYCECFQANVGCSTGCRCDGCMNAFGKRE 648


>gi|193237591|dbj|BAG50072.1| transcription factor CPP [Lotus japonicus]
          Length = 764

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 94/167 (56%), Gaps = 14/167 (8%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQ-------------KQCNCRNSRCLKLYCECFAAGIY 176
           G Q +  S   A+ D    +PKK+             K+CNC+ S+CLKLYCECFAAG+Y
Sbjct: 439 GVQPAEDSSQAASEDFNQNSPKKKRRKVEPAGDTEGCKRCNCKKSKCLKLYCECFAAGVY 498

Query: 177 C-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAA 235
           C + C+C +C N   HE    +       RNP AF PK+  +     +  +D  +   +A
Sbjct: 499 CIEPCSCQDCFNKPIHEDTVLQTRKQIESRNPLAFAPKVIRNSDSVPEIGDDPNKTPASA 558

Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           +H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CKN  G ++  A
Sbjct: 559 RHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCEGCKNAFGRKDGSA 605


>gi|357157337|ref|XP_003577764.1| PREDICTED: uncharacterized protein LOC100829642 [Brachypodium
           distachyon]
          Length = 771

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 26/226 (11%)

Query: 81  PPSSQPHVQAHMRS----PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGKQESP 135
           PP + P +  H+ +    P   +  + A+   ++  N  P  V   P  + P    ++S 
Sbjct: 375 PPCALPGIGLHLNALATIPKDKIVPYAAQSSTIQASN-FPSAVSSSPATSEPNTANEDSS 433

Query: 136 RSRPRANTD-GKDGTPKKQ------------KQCNCRNSRCLKLYCECFAAGIYC-DGCN 181
           ++   AN D    G+PKK+            K+C+C+ S+CLKLYCECF AG++C + C+
Sbjct: 434 QAIVVANADESGQGSPKKKRHKFDNGDGTSCKRCSCKKSKCLKLYCECFHAGVFCSEPCS 493

Query: 182 CVNCHN---NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN 238
           C  C N   N+E  ++ +E + +   RNP AF PK+  +    Q+  E + +   +A+H 
Sbjct: 494 CQGCLNKPSNMETVLSTREQIES---RNPLAFAPKVIRTSEPGQELGEQSNKTPASARHK 550

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           +GC+CKKS CLKKYCECFQ  + CS +CRC  CKN  G  E   L 
Sbjct: 551 RGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKNAFGRREGVVLL 596


>gi|356575873|ref|XP_003556061.1| PREDICTED: uncharacterized protein LOC100811662 [Glycine max]
          Length = 774

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 64/150 (42%), Positives = 89/150 (59%), Gaps = 3/150 (2%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
           +Q SP+ + R      +G  +  K+CNC+ S+CLKLYCECFAAG+YC + C C +C N  
Sbjct: 467 QQNSPKKKRRKLEPAGEG--ESCKRCNCKKSKCLKLYCECFAAGVYCIEPCACRDCFNKP 524

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
            H     +       RNP AF PK+  S     +  +D  +   +A+H +GC+CKKS CL
Sbjct: 525 IHVETVLQTRQQIESRNPLAFAPKVIRSSDSVSEIGDDLNKTPASARHKRGCNCKKSSCL 584

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEE 279
           KKYCEC+Q  + CS +CRC  CKN  G ++
Sbjct: 585 KKYCECYQGGVGCSVSCRCEGCKNTYGRKD 614


>gi|358332933|dbj|GAA51516.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 638

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 56/121 (46%), Positives = 77/121 (63%), Gaps = 13/121 (10%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           ++C+C  S CLKLYCECFAAG++C  C+CV C+N  +HE  RQ+A+   + R P+AF+ K
Sbjct: 163 RRCSCSRSFCLKLYCECFAAGVFCSDCSCVGCYNLAQHENYRQKAIMRIVNRKPDAFQSK 222

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           IA S              + A  H +GC+CK+SGCLK YCEC++A + C+  CRC  C N
Sbjct: 223 IAHS-------------LENATVHARGCNCKRSGCLKNYCECYEARVRCTSRCRCQYCYN 269

Query: 274 F 274
            
Sbjct: 270 M 270


>gi|297833182|ref|XP_002884473.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330313|gb|EFH60732.1| hypothetical protein ARALYDRAFT_317350 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 611

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 100/179 (55%), Gaps = 9/179 (5%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+C C+ S+CLKLYCECFAAG+YC + C+C NC N   HE    ++      RNP AF P
Sbjct: 431 KRCKCKKSQCLKLYCECFAAGLYCVEPCSCQNCFNKPIHEDLVMKSREVIEARNPLAFAP 490

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+ S+   A D   +  +   +A+H +GC+C+KSGCLKKYCEC    + CS +CRC+ CK
Sbjct: 491 KVVSTSDTAIDLWVENSKTPASARHTRGCNCRKSGCLKKYCECNLMGVRCSSSCRCIGCK 550

Query: 273 NFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAA 331
           N  G    +    G+ + +     A +       G S      ++++R++  LL G  A
Sbjct: 551 NVFGHTNEKCA--GESDAVTINDEAKHC------GDSRQNEETSNSERRNRLLLPGSVA 601


>gi|218187206|gb|EEC69633.1| hypothetical protein OsI_39028 [Oryza sativa Indica Group]
          Length = 780

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 69/157 (43%), Positives = 99/157 (63%), Gaps = 11/157 (7%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECF AG++C + C+C  C N   
Sbjct: 458 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPS 514

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
           N+E  ++ +E + +   RNP AF PK+  +  G Q+  +D+ +   +++H +GC+CKKS 
Sbjct: 515 NMETVLSTREQIES---RNPLAFAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSS 570

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           CLKKYCEC+Q  + CS +CRC  CKN  G  E  AL 
Sbjct: 571 CLKKYCECYQGGVGCSVSCRCEGCKNAFGRREGVALL 607


>gi|89953384|gb|ABD83288.1| Fgenesh protein 45 [Beta vulgaris]
          Length = 665

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 67/145 (46%), Positives = 88/145 (60%), Gaps = 10/145 (6%)

Query: 140 RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEA 198
           +A+T   D T K+   CNC+ ++CLKLYC+CFAAG YC + C C  C N  E+E    E 
Sbjct: 476 KASTSTDDDTTKR---CNCKKTKCLKLYCDCFAAGFYCGESCACQGCFNRPEYEDTVLET 532

Query: 199 VGATLERNPNAFRPKI----ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
                 RNP AF PK+     +SP   +   + +  +  +A+H +GC+CKKS CLKKYCE
Sbjct: 533 RQQIESRNPLAFAPKVMQTVTASPTTGKVVEKLSTPS--SARHKRGCNCKKSMCLKKYCE 590

Query: 255 CFQANILCSENCRCLDCKNFEGSEE 279
           C+QAN+ CSE CRC  CKN  G  E
Sbjct: 591 CYQANVGCSEGCRCEGCKNIYGVRE 615


>gi|115489476|ref|NP_001067225.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|77557032|gb|ABA99828.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|77557033|gb|ABA99829.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649732|dbj|BAF30244.1| Os12g0605500 [Oryza sativa Japonica Group]
 gi|215707264|dbj|BAG93724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 760

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 69/157 (43%), Positives = 99/157 (63%), Gaps = 11/157 (7%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECF AG++C + C+C  C N   
Sbjct: 438 QGSPKKK-RHKFDNGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPS 494

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
           N+E  ++ +E + +   RNP AF PK+  +  G Q+  +D+ +   +++H +GC+CKKS 
Sbjct: 495 NMETVLSTREQIES---RNPLAFAPKVIRTEPG-QELADDSNKTPASSRHKRGCNCKKSS 550

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           CLKKYCEC+Q  + CS +CRC  CKN  G  E  AL 
Sbjct: 551 CLKKYCECYQGGVGCSVSCRCEGCKNAFGRREGVALL 587


>gi|351726712|ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max]
 gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max]
          Length = 896

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 77/133 (57%), Gaps = 2/133 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYC+CFAAG YC D C C  C N  E+     E       RNP AF P
Sbjct: 474 KRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESRNPIAFAP 533

Query: 213 KIASSPHGAQDAREDAR-EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           KI           +D       +A+H +GC+CK+S CLKKYCEC+QAN+ CS  CRC  C
Sbjct: 534 KIVQPTTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGC 593

Query: 272 KNFEGSEERRALF 284
           KN  G +E    F
Sbjct: 594 KNVHGKKEDYVAF 606



 Score = 45.1 bits (105), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           + ++ CNC+ S CLK YCEC+ A + C  GC C  C N
Sbjct: 558 RHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGCKN 595


>gi|42563508|ref|NP_187136.3| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
 gi|75146756|sp|Q84JZ8.1|TCX4_ARATH RecName: Full=Protein tesmin/TSO1-like CXC 4; Short=AtTCX4
 gi|28393078|gb|AAO41973.1| unknown protein [Arabidopsis thaliana]
 gi|28827398|gb|AAO50543.1| unknown protein [Arabidopsis thaliana]
 gi|332640623|gb|AEE74144.1| Tesmin/TSO1-like CXC domain-containing protein [Arabidopsis
           thaliana]
          Length = 639

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 59/124 (47%), Positives = 79/124 (63%), Gaps = 1/124 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+C CR S+CLKLYCECF+AG++C + C+C NC N   HE    ++      RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+ S+     D   +  +   +A+H +GC+C+KSGC KKYCECF   + CS NCRC+ CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572

Query: 273 NFEG 276
           N  G
Sbjct: 573 NTFG 576



 Score = 45.8 bits (107), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEH 191
            + + ++ CNCR S C K YCECF  G+ C   C C+ C N   H
Sbjct: 533 ASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGH 577


>gi|255553069|ref|XP_002517577.1| tso1, putative [Ricinus communis]
 gi|223543209|gb|EEF44741.1| tso1, putative [Ricinus communis]
          Length = 348

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 76/124 (61%), Gaps = 1/124 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC D C C +C N  E+E    +       RNP AF P
Sbjct: 20  KRCNCKRSKCLKLYCECFAAGVYCLDSCACEDCFNRPEYEDTVLDTRQQIEARNPLAFAP 79

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+      +    E       +A+H +GC+CKKS CLKKYCEC+QA + CS  CRC  CK
Sbjct: 80  KVVKQATNSPANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCK 139

Query: 273 NFEG 276
           N  G
Sbjct: 140 NSYG 143



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 33/53 (62%), Gaps = 4/53 (7%)

Query: 141 ANTDGKDGTP---KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
           AN +G   TP   + ++ CNC+ S+CLK YCEC+ AG+ C  GC C  C N+ 
Sbjct: 90  ANEEGNWTTPSSARHKRGCNCKKSKCLKKYCECYQAGVGCSSGCRCEGCKNSY 142


>gi|452819762|gb|EME26815.1| tesmin/TSO1-like CXC domain-containing protein [Galdieria
           sulphuraria]
          Length = 444

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 61/119 (51%), Positives = 78/119 (65%), Gaps = 5/119 (4%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYCECFA+G YC   C C  C NN ++    + A   TLERNP AF PKI
Sbjct: 318 CNCKRSQCLKLYCECFASGSYCTSKCKCNGCKNNGDNANHVKTARERTLERNPRAFSPKI 377

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           + S     +  +  R     A H++GC+CK+S C KKYCECFQA + C +NC+C+DCKN
Sbjct: 378 SESTAAVTEEGQVLR----VAAHHRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432



 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
            + CNC+ S C K YCECF AG+ C D C CV+C N
Sbjct: 397 HRGCNCKRSNCRKKYCECFQAGVPCGDNCKCVDCKN 432


>gi|403352890|gb|EJY75972.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 1042

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 79/132 (59%), Gaps = 20/132 (15%)

Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGA 201
           +D K   P K   CNC+ S+CLKLYC+CFA G+ C   CNC +C N  E    R++A+  
Sbjct: 642 SDNKSTQPLKTT-CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCAN-TEGNEERKQAMDQ 699

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
             ERNPNAF+PKI                 Q    H+KGCHCKKSGCLKKYCEC+Q+ ++
Sbjct: 700 ITERNPNAFKPKI-----------------QEKGFHSKGCHCKKSGCLKKYCECYQSGVV 742

Query: 262 CSENCRCLDCKN 273
           C+  C C  CKN
Sbjct: 743 CTNLCACEGCKN 754



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+CKKS CLK YC+CF   + CS +C C DC N EG+EER+
Sbjct: 654 CNCKKSKCLKLYCDCFAYGLGCSPDCNCADCANTEGNEERK 694


>gi|449522630|ref|XP_004168329.1| PREDICTED: LOW QUALITY PROTEIN: protein tesmin/TSO1-like CXC 3-like
           [Cucumis sativus]
          Length = 788

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 58/132 (43%), Positives = 83/132 (62%), Gaps = 1/132 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   HE            RNP AF P
Sbjct: 493 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAP 552

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  +     +  +++ +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CK
Sbjct: 553 KVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 612

Query: 273 NFEGSEERRALF 284
           N  G ++  ++ 
Sbjct: 613 NAFGRKDGSSIL 624


>gi|449432678|ref|XP_004134126.1| PREDICTED: protein tesmin/TSO1-like CXC 3-like [Cucumis sativus]
          Length = 788

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 58/132 (43%), Positives = 83/132 (62%), Gaps = 1/132 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C+C +C N   HE            RNP AF P
Sbjct: 493 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKPIHEDTVLATRKQIESRNPLAFAP 552

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  +     +  +++ +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CK
Sbjct: 553 KVIRNSDSLPEPGDESNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 612

Query: 273 NFEGSEERRALF 284
           N  G ++  ++ 
Sbjct: 613 NAFGRKDGSSIL 624


>gi|344295768|ref|XP_003419583.1| PREDICTED: tesmin [Loxodonta africana]
          Length = 503

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 31/194 (15%)

Query: 99  MPQWQARPQHVRMVNRVPHPVHKL-----PLPT---LPPGKQESPRSRPRANTDGKDGTP 150
           +PQ+Q +  + +  + VP P+  L     P+P    L   + E+    P  N+    G P
Sbjct: 238 LPQYQDQSSYPQ--SDVPKPMTTLVGRFLPVPAKLNLITQQLENGALSPVVNSSALPGPP 295

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K            L  YC+CFA+G +C+ C+  NC NN+ HE+ R +A+ A L+RNP AF
Sbjct: 296 KLT----------LAGYCDCFASGDFCNNCSRHNCCNNLRHEIERFKAIKACLDRNPEAF 345

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI +   G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ 
Sbjct: 346 QPKIGTGNLG-----------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIG 394

Query: 271 CKNFEGSEERRALF 284
           CKN++ S ER+ L 
Sbjct: 395 CKNYDESPERKTLM 408


>gi|357606575|gb|EHJ65118.1| hypothetical protein KGM_22197 [Danaus plexippus]
          Length = 844

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 74/151 (49%), Positives = 100/151 (66%), Gaps = 12/151 (7%)

Query: 133 ESPRSRPRANTD---GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           +SP +   + TD    + G  + +K CNC  S+CLKLYC+CFA G +C+ CNC NCHNN+
Sbjct: 605 QSPTTPALSETDQVTTELGGLRPRKACNCTKSQCLKLYCDCFANGEFCNRCNCNNCHNNL 664

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           E+E  RQ+A+   L+RNPNAFRPKI  S  G           ++  +HNKGC+CK+SGCL
Sbjct: 665 ENEELRQKAIRGCLDRNPNAFRPKIGKSKAGG---------PEIIRRHNKGCNCKRSGCL 715

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEER 280
           K YCEC++A I CS  C+C+ C+N E + ER
Sbjct: 716 KNYCECYEAKIACSSICKCVGCRNVEETLER 746


>gi|356534189|ref|XP_003535640.1| PREDICTED: uncharacterized protein LOC100820491 [Glycine max]
          Length = 760

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 59/127 (46%), Positives = 79/127 (62%), Gaps = 1/127 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG+YC + C C +C N   H     +       RNP AF P
Sbjct: 475 KRCNCKKSKCLKLYCECFAAGVYCIEPCACHDCFNKPIHVETVLQTRQQIESRNPLAFAP 534

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K+  S     +  +D  +   +A+H +GC+CKKS CLKKYCEC+Q  + CS +CRC  CK
Sbjct: 535 KVIRSSDSVSEIGDDPNKTPASARHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCK 594

Query: 273 NFEGSEE 279
           N  G ++
Sbjct: 595 NTYGRKD 601


>gi|302843447|ref|XP_002953265.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
 gi|300261362|gb|EFJ45575.1| hypothetical protein VOLCADRAFT_94037 [Volvox carteri f.
           nagariensis]
          Length = 1987

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 85/148 (57%), Gaps = 25/148 (16%)

Query: 145 GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE---VARQEAVGA 201
           G+  +    K C C+ S+CLKLYC+CFAAG YC  C+C++CHN  EH    + R+E + A
Sbjct: 779 GRRTSENSSKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEHADRVLQRREDIAA 838

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
              R+P AF  KI  +P+G               KH +GC+C+KS CLKKYCEC+Q  + 
Sbjct: 839 ---RDPQAFTRKIQLAPNG-------------NGKHKRGCNCRKSHCLKKYCECYQGGVK 882

Query: 262 CSENCRCLDCKNF------EGSEERRAL 283
           C   C C++C+N       EG+  R AL
Sbjct: 883 CGIQCTCMECENMDVGSSQEGAGARGAL 910



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 200 GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
           G     NPNA R   A             R  + +   +K C CKKS CLK YC+CF A 
Sbjct: 749 GINANANPNAHRSSGAGGGLTQIQGGGPNRGRRTSENSSKSCRCKKSQCLKLYCDCFAAG 808

Query: 260 ILCSENCRCLDCKN 273
             C  +C C+ C N
Sbjct: 809 QYCG-SCSCISCHN 821


>gi|359482667|ref|XP_003632803.1| PREDICTED: uncharacterized protein LOC100855293 [Vitis vinifera]
          Length = 613

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 85/143 (59%), Gaps = 5/143 (3%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNP 207
           T   +K CNC+ ++CLKLYC+C AAG+YC D C C NC N  E+E   Q        R+P
Sbjct: 282 TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 341

Query: 208 NAFRPKIASSPHGAQD--AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
            AF P+I +      D   +ED       +A+H +GC+CKKS C KKYCEC+QA I CS+
Sbjct: 342 LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQAGIGCSD 401

Query: 265 NCRCLDCKN-FEGSEERRALFHG 286
            CRC DC+N F    +RRA   G
Sbjct: 402 GCRCEDCRNSFGIKADRRAERWG 424


>gi|405973431|gb|EKC38148.1| lin-54-like protein [Crassostrea gigas]
          Length = 764

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/119 (52%), Positives = 79/119 (66%), Gaps = 11/119 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA G +C  CNC NC NN++HE  R  A+ + L+RNP AF PKI     G +D R 
Sbjct: 529 YCDCFANGEFCHNCNCNNCANNLDHEEERSRAIKSCLDRNPMAFHPKIGKGRDGERDRR- 587

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
                     HNKGC+CK+SGCLK YCEC++A I+CS +C+C+ CKNFE S ER+ L H
Sbjct: 588 ----------HNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKNFEESPERKTLMH 636



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           G+DG   ++  K CNC+ S CLK YCEC+ A I C   C C+ C N
Sbjct: 579 GRDGERDRRHNKGCNCKRSGCLKNYCECYEAKIMCSSSCKCIGCKN 624


>gi|326427032|gb|EGD72602.1| hypothetical protein PTSG_04338 [Salpingoeca sp. ATCC 50818]
          Length = 584

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 58/111 (52%), Positives = 74/111 (66%), Gaps = 12/111 (10%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           + CNC  SRCLKLYCEC+AAG YC   C C NC NN+E+E  RQ AV +TL++NP+AFRP
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQLAVLSTLDKNPDAFRP 466

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
           KI       Q+   + +     A H +GC C KS CL++YCECFQA + C+
Sbjct: 467 KIK------QETTTNRK-----ANHLRGCKCTKSKCLQRYCECFQAGVPCT 506



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 28/45 (62%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           + C+C KS CLK YCEC+ A   C   C+C +C+N   +E RR L
Sbjct: 407 RSCNCGKSRCLKLYCECYAAGTYCGPQCKCTNCQNNLENEGRRQL 451


>gi|118352258|ref|XP_001009402.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89291169|gb|EAR89157.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 805

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 94/148 (63%), Gaps = 7/148 (4%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER 205
           +G+ KK + CNC+ ++CLKLYC+CFAAG +C   CNC  C N V ++ AR + V   LE+
Sbjct: 471 NGSNKKTRICNCKKTKCLKLYCDCFAAGEFCGAECNCCECSNTVANKEARNKIVEGLLEK 530

Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
           NP AF  K       +Q + +  +  +LA+K  KGC+C++SGCLKKYC+C+Q  + C E+
Sbjct: 531 NPFAFNVKDIEIEEPSQLSLKAQK--KLASK--KGCNCRRSGCLKKYCQCYQDGLQCGEH 586

Query: 266 CRCLDCKNFEGSEERRALFHGDHNGIAF 293
           C+C  C+N E  +  + LF  + N ++ 
Sbjct: 587 CKCNGCENCE--KPAKKLFLAEQNSLSL 612


>gi|224143100|ref|XP_002324849.1| predicted protein [Populus trichocarpa]
 gi|222866283|gb|EEF03414.1| predicted protein [Populus trichocarpa]
          Length = 561

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/161 (44%), Positives = 94/161 (58%), Gaps = 16/161 (9%)

Query: 132 QESPRSRPR----ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCH 186
           Q SP+ + +    ++TDG DG     K+CNC+ +RCLK YC+CFAAGIYC + C C  C 
Sbjct: 405 QSSPKKKRQDHKVSSTDG-DGC----KRCNCKKTRCLKRYCDCFAAGIYCAETCACQGCF 459

Query: 187 NNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKGCH 242
           N +E+E    EA       NP AF PKI    H  +    +  +A L    + +H  GC+
Sbjct: 460 NRLEYEDTVLEARQQKESHNPLAFAPKIVQ--HVTEFHAINVEDASLFTPSSGRHKTGCN 517

Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           CK+S C+KKYCEC+QAN+ CS  CRC  CKN  G +E   L
Sbjct: 518 CKRSMCVKKYCECYQANVGCSNACRCEGCKNIHGRKEGEKL 558


>gi|414871812|tpg|DAA50369.1| TPA: hypothetical protein ZEAMMB73_080577 [Zea mays]
          Length = 662

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/190 (37%), Positives = 100/190 (52%), Gaps = 34/190 (17%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQ-----------CNCRNSRCLKLYCECFAAGIYCDG-CN 181
           S ++ P A+  G+    KK+++           CNCR S+CLKLYC CFAA +YC   C+
Sbjct: 334 SQKTMPNADKSGQQIHKKKRRKLQKDDGESCRHCNCRRSKCLKLYCPCFAAKVYCSSLCS 393

Query: 182 CVNCHNNVEHEVARQEAVGATLER----NPNAFRPKIASSPHGAQDAREDAREAQLAAKH 237
           C  C NN  HE    E V  T +R    NP AF P +  +     +  +D+ +   +A+H
Sbjct: 394 CQGCSNNHAHE----ELVSCTRKRTESHNPLAFAPTVTHTYGTVLEFGDDSNKTPASARH 449

Query: 238 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF------------- 284
            +GC+C+KS CLKKYCECFQ+ + C  +CRC+ CKN  G  E   L              
Sbjct: 450 KRGCNCRKSSCLKKYCECFQSGVGCCISCRCMSCKNSFGKREGVLLLTTEKLEKGGKAKG 509

Query: 285 -HGDHNGIAF 293
            HG+ + +AF
Sbjct: 510 THGEEDKLAF 519


>gi|422292803|gb|EKU20105.1| tesmin tso1-like cxc domain-containing protein [Nannochloropsis
           gaditana CCMP526]
          Length = 365

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 108/227 (47%), Gaps = 56/227 (24%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
           Q   +S + + P   +RP  V    R  +P             QE+ RS P   + G  G
Sbjct: 98  QRRNQSSNNSHPGGSSRPVRVEAAIRRQNP------------SQEARRSPPLNGSGG--G 143

Query: 149 TP-----------KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
           TP           +  K+CNCR S+CLKLYCECFA G YC D C C++C+N +  E  R 
Sbjct: 144 TPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRT 203

Query: 197 EAVGATLERNPNAFRPKI-----------ASSPH------GAQDAREDAREAQL------ 233
           +AV + L RNP+AF  KI           A   H      G Q +  D  E  +      
Sbjct: 204 QAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQG 263

Query: 234 -------AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                   A H +GC+CKKS C+KKYCECF A + C ENC+C  C+N
Sbjct: 264 TAGGALPTAAHVRGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQN 310



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 30/48 (62%)

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           A + +K C+C+KS CLK YCECF     C ++C CLDC N    E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           + CNC+ S C+K YCECF AG+YC + C C  C NN E
Sbjct: 276 RGCNCKKSLCIKKYCECFFAGVYCGENCQCEGCQNNPE 313


>gi|89111949|ref|NP_001034746.1| tesmin isoform a [Mus musculus]
 gi|205371774|sp|Q9WTJ6.2|MTL5_MOUSE RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|15617422|dbj|BAB64935.1| tesmin [Mus musculus]
 gi|19353778|gb|AAH24377.1| Metallothionein-like 5, testis-specific (tesmin) [Mus musculus]
 gi|148700990|gb|EDL32937.1| mCG3887, isoform CRA_d [Mus musculus]
          Length = 475

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 275 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 379

Query: 284 FHGDH 288
               H
Sbjct: 380 MSTPH 384



 Score = 45.1 bits (105), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 372


>gi|47228462|emb|CAG05282.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 798

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/122 (51%), Positives = 80/122 (65%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA G +C+ CNC NC NN+EHE  R +A+   L+RNP AF+PKI     G  D
Sbjct: 587 LNGYCDCFANGEFCNNCNCNNCFNNLEHETERLKAIKTCLDRNPEAFKPKIGKGKEGESD 646

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L
Sbjct: 647 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 695

Query: 284 FH 285
            H
Sbjct: 696 MH 697



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           GK+G   ++  K CNC+ S CLK YCEC+ A I C   C C+ C N
Sbjct: 640 GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 685


>gi|15617420|dbj|BAB64934.1| tesmin [Mus musculus]
          Length = 475

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 275 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 379

Query: 284 FHGDH 288
               H
Sbjct: 380 MSTPH 384



 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 372


>gi|118349584|ref|XP_001008073.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89289840|gb|EAR87828.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 471

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 63/144 (43%), Positives = 80/144 (55%), Gaps = 21/144 (14%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNA 209
           KKQK CNC+ ++CLKLYC+CFAAG  C G CNC  C N    E  R  A+ + ++R P+A
Sbjct: 127 KKQKTCNCKKTKCLKLYCDCFAAGELCGGECNCFGCFNTSNSE-KRAGAIVSIMDRQPDA 185

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
           F PK+  + H                   KGC+C +SGCLKKYCEC+   + C E C+C 
Sbjct: 186 FGPKVQQNTHK------------------KGCNCTRSGCLKKYCECYTLGVNCGEYCKCT 227

Query: 270 DCKNFE-GSEERRALFHGDHNGIA 292
            CKN +    ER AL     NGI 
Sbjct: 228 QCKNVDLNHSERLALSMNGSNGIG 251


>gi|37718857|gb|AAR01728.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 766

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
           +P      G++   R R +   D  D      K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 443 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 498

Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
           +C  C N+  HE            RNP AF PK+  +     +  ED+     +++H +G
Sbjct: 499 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 558

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           C+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 559 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 594



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 471 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 511


>gi|108710010|gb|ABF97805.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 757

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
           +P      G++   R R +   D  D      K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 434 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 489

Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
           +C  C N+  HE            RNP AF PK+  +     +  ED+     +++H +G
Sbjct: 490 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 549

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           C+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 550 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 585



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 462 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 502


>gi|358340153|dbj|GAA48108.1| protein lin-54 homolog [Clonorchis sinensis]
          Length = 450

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 71/132 (53%), Positives = 82/132 (62%), Gaps = 14/132 (10%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYCECFA G  C  CNC NC NN+ +E  R  A+  TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCHNCMNNLAYEEERGRAIKMTLERNPTAFHP 203

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI           E  R      KH KGC+CK+SGCLK YCEC++A I CS+ CRC  C+
Sbjct: 204 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249

Query: 273 NFEGSEERRALF 284
           N E S ERR+L 
Sbjct: 250 NTEDSVERRSLM 261


>gi|387219095|gb|AFJ69256.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 365

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 108/227 (47%), Gaps = 56/227 (24%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
           Q   +S + + P   +RP  V    R  +P             QE+ RS P   + G  G
Sbjct: 98  QRRNQSSNNSHPGGSSRPVRVEAAIRRQNP------------SQEARRSPPLNGSGG--G 143

Query: 149 TP-----------KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
           TP           +  K+CNCR S+CLKLYCECFA G YC D C C++C+N +  E  R 
Sbjct: 144 TPQGRGTSGLAATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVRT 203

Query: 197 EAVGATLERNPNAFRPKI-----------ASSPH------GAQDAREDAREAQL------ 233
           +AV + L RNP+AF  KI           A   H      G Q +  D  E  +      
Sbjct: 204 QAVESILSRNPHAFTEKIQFIETGATGPVAMGTHRRTSLPGGQCSVGDNGETTMMGGLQG 263

Query: 234 -------AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                   A H +GC+CKKS C++KYCECF A + C ENC+C  C+N
Sbjct: 264 TAGGALPTAAHVRGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQN 310



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 30/48 (62%)

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           A + +K C+C+KS CLK YCECF     C ++C CLDC N    E+ R
Sbjct: 155 ATRSSKKCNCRKSKCLKLYCECFAMGAYCRDDCGCLDCNNTLAFEQVR 202



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           + CNC+ S C++ YCECF AG+YC + C C  C NN E
Sbjct: 276 RGCNCKKSLCIQKYCECFFAGVYCGENCQCEGCQNNPE 313


>gi|222625427|gb|EEE59559.1| hypothetical protein OsJ_11844 [Oryza sativa Japonica Group]
          Length = 644

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 61/156 (39%), Positives = 89/156 (57%), Gaps = 5/156 (3%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
           +P      G++   R R +   D  D      K+C+C+ S+CLKLYCECFA+ +YC + C
Sbjct: 321 MPTSAGNSGQENPKRKRQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESC 376

Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
           +C  C N+  HE            RNP AF PK+  +     +  ED+     +++H +G
Sbjct: 377 SCRGCFNDHSHEETVLSTRNRIESRNPLAFAPKVIRTCGPGLEFGEDSNATPASSRHKRG 436

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           C+CK+S C+KKYCECFQ+ + CS +CRC +CKN  G
Sbjct: 437 CNCKRSYCVKKYCECFQSGVGCSMSCRCENCKNSFG 472



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 349 KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 389


>gi|148700988|gb|EDL32935.1| mCG3887, isoform CRA_b [Mus musculus]
          Length = 360

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 160 LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 215

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 216 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 264

Query: 284 FHGDH 288
               H
Sbjct: 265 MSTPH 269



 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 220 KGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 257


>gi|349501047|ref|NP_001012069.2| tesmin [Rattus norvegicus]
 gi|205829210|sp|Q5XHX9.2|MTL5_RAT RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific
          Length = 475

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 79/125 (63%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 275 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CKN+E S ER+ L
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKML 379

Query: 284 FHGDH 288
               H
Sbjct: 380 MSTPH 384



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEE 372


>gi|387192746|gb|AFJ68672.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 510

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 72/122 (59%), Gaps = 15/122 (12%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ SRCLKLYC+CFAA  +C   CNC NC N       R EA+   +ERNPNAF+ K 
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
              P+G     EDA        H  GC C+KS CLKKYCECF A   CS+ C C+ C+N 
Sbjct: 285 --RPNG-----EDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330

Query: 275 EG 276
            G
Sbjct: 331 AG 332



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           GC+CK+S CLK YC+CF A   C  +C C +C N  G    RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266


>gi|53733621|gb|AAH83920.1| Metallothionein-like 5, testis-specific (tesmin) [Rattus
           norvegicus]
          Length = 467

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 79/125 (63%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 267 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 322

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CKN+E S ER+ L
Sbjct: 323 LR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKML 371

Query: 284 FHGDH 288
               H
Sbjct: 372 MSTPH 376



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 324 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEE 364


>gi|354496627|ref|XP_003510427.1| PREDICTED: tesmin-like [Cricetulus griseus]
          Length = 380

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 56/125 (44%), Positives = 79/125 (63%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C  C+C    NN+ HE+ R +A+ A ++RNP AF+PK+     G   
Sbjct: 180 LAGYCDCFASGDFCTSCSC----NNLRHELERFKAIKACIDRNPEAFQPKMGKGRLGVSK 235

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 236 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 284

Query: 284 FHGDH 288
            +  H
Sbjct: 285 MNTPH 289



 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           G+ G  K +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 229 GRLGVSKLRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 277


>gi|299472731|emb|CBN80299.2| TSO1; transcription factor (Partial) [Ectocarpus siliculosus]
          Length = 792

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 77/143 (53%), Gaps = 19/143 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYC+CFA  + C   CNC +C+NN   E  R +A+   LERNP AF    
Sbjct: 26  CNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMRLQAINGVLERNPAAF---- 81

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                   DA+    E   A  H  GC C+KS CLKKYCECF   + CSE C C+ C+N 
Sbjct: 82  --------DAKFRETEKHDAVVHKTGCKCRKSACLKKYCECFNKGVACSEKCNCVGCRNT 133

Query: 275 EGSEERRALFHGDHNGIAFMQRA 297
                  AL+     G A  +RA
Sbjct: 134 V------ALYSEYRGGGAAAERA 150



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           GC+CK+S CLK YC+CF   ++C+ +C C  C N    E  R
Sbjct: 25  GCNCKRSKCLKLYCQCFAMQVVCTASCNCHHCYNNANQESMR 66


>gi|422294067|gb|EKU21367.1| tesmin tso1-like cxc domain-containing protein, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 511

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 72/122 (59%), Gaps = 15/122 (12%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ SRCLKLYC+CFAA  +C   CNC NC N       R EA+   +ERNPNAF+ K 
Sbjct: 225 CNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRAEAIRQIMERNPNAFQTKF 284

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
              P+G     EDA        H  GC C+KS CLKKYCECF A   CS+ C C+ C+N 
Sbjct: 285 --RPNG-----EDA-------VHKMGCKCRKSACLKKYCECFNAGARCSDKCSCVGCQNV 330

Query: 275 EG 276
            G
Sbjct: 331 AG 332



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           GC+CK+S CLK YC+CF A   C  +C C +C N  G    RA
Sbjct: 224 GCNCKRSRCLKLYCQCFAAQAFCIPSCNCQNCLNTAGQNHLRA 266


>gi|4581559|gb|AAD24666.1|U67176_1 tesmin-1 [Mus musculus]
 gi|4581561|gb|AAD24667.1|U77383_1 tesmin-2 [Mus musculus]
 gi|21464732|gb|AAM54491.1|AF329359_1 tesmin [Mus musculus]
 gi|21464734|gb|AAM54492.1|AF329360_1 tesmin [Mus musculus]
 gi|148700989|gb|EDL32936.1| mCG3887, isoform CRA_c [Mus musculus]
 gi|148700991|gb|EDL32938.1| mCG3887, isoform CRA_c [Mus musculus]
          Length = 295

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 15/125 (12%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 95  LSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 150

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 151 LR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKML 199

Query: 284 FHGDH 288
               H
Sbjct: 200 MSTPH 204



 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           G+ G  K +  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 144 GRLGAAKLRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEE 192


>gi|256089961|ref|XP_002581002.1| tesmin-related [Schistosoma mansoni]
          Length = 364

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 71/133 (53%), Positives = 82/133 (61%), Gaps = 14/133 (10%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYCECFA G  C  CNC NC NN+ +E  R  A+  TLERNP AF P
Sbjct: 145 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 204

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI           E  R      KH KGC+CK+SGCLK YCEC++A I CS+ CRC  C+
Sbjct: 205 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 250

Query: 273 NFEGSEERRALFH 285
           N E S ERR+L  
Sbjct: 251 NTEDSVERRSLMR 263



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE----HEVARQEAVGA 201
           +G  K  K CNC+ S CLK YCEC+ A I C D C C  C N  +      + R  A+GA
Sbjct: 210 EGERKHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLMRLAAMGA 269

Query: 202 TLERNPNAFRPKIASSP 218
              R P++F+  +   P
Sbjct: 270 LRSRQPSSFKKHLVKPP 286


>gi|360044492|emb|CCD82040.1| tesmin-related [Schistosoma mansoni]
          Length = 363

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 71/133 (53%), Positives = 82/133 (61%), Gaps = 14/133 (10%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +K CNC  S CLKLYCECFA G  C  CNC NC NN+ +E  R  A+  TLERNP AF P
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQGQLCQNCNCNNCMNNLAYEEERGRAIKMTLERNPTAFHP 203

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI           E  R      KH KGC+CK+SGCLK YCEC++A I CS+ CRC  C+
Sbjct: 204 KIGRG--------EGER------KHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCR 249

Query: 273 NFEGSEERRALFH 285
           N E S ERR+L  
Sbjct: 250 NTEDSVERRSLMR 262



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE----HEVARQEAVGA 201
           +G  K  K CNC+ S CLK YCEC+ A I C D C C  C N  +      + R  A+GA
Sbjct: 209 EGERKHTKGCNCKRSGCLKNYCECYEAKISCSDLCRCQGCRNTEDSVERRSLMRLAAMGA 268

Query: 202 TLERNPNAFRPKIASSP 218
              R P++F+  +   P
Sbjct: 269 LRSRQPSSFKKHLVKPP 285


>gi|326495114|dbj|BAJ85653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 766

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 65/152 (42%), Positives = 94/152 (61%), Gaps = 10/152 (6%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN--- 187
           Q SP+ + R   D  DGT    K+C+C+ S+CLKLYCECF AG++C + C+C +C N   
Sbjct: 440 QGSPKKK-RHKFDDGDGT--SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQDCLNKPS 496

Query: 188 NVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
           N+E  ++ ++ + +   RNP AF PK+  +    Q+  E +     +A+H +GC+CKKS 
Sbjct: 497 NMEIVLSTRKQIES---RNPLAFAPKVIRTSEPGQELGEYSNRTPASARHKRGCNCKKSS 553

Query: 248 CLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           CLKKYCEC+Q  + CS +CR   CKN  G  E
Sbjct: 554 CLKKYCECYQGGVGCSISCRYEGCKNAFGRRE 585


>gi|428180550|gb|EKX49417.1| hypothetical protein GUITHDRAFT_43196, partial [Guillardia theta
           CCMP2712]
          Length = 113

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/125 (48%), Positives = 78/125 (62%), Gaps = 12/125 (9%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
           +  ++K C+C+ SRCLKLYCECFAAG  C GC CV+C N+ +HE  R +AV    +RN N
Sbjct: 1   SDSRKKVCSCKKSRCLKLYCECFAAGEICSGCKCVDCANDGDHEDMRLQAVDTIKQRNNN 60

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PKI                 Q    H +GC CKKS CLKKYCEC+QA + C++ C+C
Sbjct: 61  AFAPKIVDEIQ------------QDKGMHARGCRCKKSHCLKKYCECYQAGVQCTDKCKC 108

Query: 269 LDCKN 273
            +C+N
Sbjct: 109 EECQN 113



 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C CKKS CLK YCECF A  +CS  C+C+DC N    E+ R
Sbjct: 6   KVCSCKKSRCLKLYCECFAAGEICS-GCKCVDCANDGDHEDMR 47


>gi|224099825|ref|XP_002311635.1| predicted protein [Populus trichocarpa]
 gi|222851455|gb|EEE89002.1| predicted protein [Populus trichocarpa]
          Length = 749

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/143 (42%), Positives = 83/143 (58%), Gaps = 3/143 (2%)

Query: 144 DGKDGTPKKQKQCN--CRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
           D    +PKK++      R  + L+LYCECFAAG+YC + C C +C N   HE        
Sbjct: 445 DFSQTSPKKKRHVGRRSRGLQALQLYCECFAAGVYCIEPCACQDCFNKPIHEDTVLATRK 504

Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
               RNP AF PK+  S   A +  +++ +   +A+H +GC+CKKS CLKKYCEC+Q  +
Sbjct: 505 QIESRNPLAFAPKVIRSSEPAPEIGDESSKTPASARHKRGCNCKKSSCLKKYCECYQGGV 564

Query: 261 LCSENCRCLDCKNFEGSEERRAL 283
            CS NCRC  CKN  G ++  AL
Sbjct: 565 GCSLNCRCEGCKNAFGRKDGSAL 587


>gi|118383712|ref|XP_001025010.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89306777|gb|EAS04765.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1040

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 37/180 (20%)

Query: 136 RSRPRANTDGKDG------------TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNC 182
           R R   N    DG            T K++ +CNC+ S+CLKLYC+CF  G +C+  CNC
Sbjct: 682 REREEGNLKDLDGIVEYEQTLRPQTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNC 741

Query: 183 VNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED--------------- 227
             C N   ++  R+ A+    ERNP+AF+PKI +     Q                    
Sbjct: 742 TECSNTENNKAERESAIKQLQERNPDAFKPKIQTKEQIIQQYLSKLQIFMLIISSLLSKN 801

Query: 228 ---------AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
                     ++ +    HNKGC+CKKSGC KKYCEC+   + CS+ C+C  C+N + SE
Sbjct: 802 QIQYLIPKIIKDDEEILIHNKGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSE 861



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 28/50 (56%)

Query: 232 QLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           Q  +K    C+CKKS CLK YC+CF     C+  C C +C N E ++  R
Sbjct: 705 QTTSKQKVKCNCKKSKCLKLYCDCFNQGQFCNSECNCTECSNTENNKAER 754



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
           K CNC+ S C K YCEC+  G+ C D C C  C N    E+ +
Sbjct: 822 KGCNCKKSGCEKKYCECYNTGVKCSDQCKCEGCRNRDPSEIVK 864


>gi|302789393|ref|XP_002976465.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
 gi|300156095|gb|EFJ22725.1| hypothetical protein SELMODRAFT_443191 [Selaginella moellendorffii]
          Length = 710

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 90/176 (51%), Gaps = 35/176 (19%)

Query: 136 RSRPRANTDGKDG----TPKKQ------------KQCNCRNSRCLKLYCECFAAGIYC-D 178
           R R    +D  DG    TPKK+            K+C+C+ S+CLKLYCECFAAG+YC D
Sbjct: 356 RKRLSMQSDSMDGEMGRTPKKRRKSLGEKAGDGCKRCHCKKSKCLKLYCECFAAGVYCLD 415

Query: 179 GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS--------------SPHGAQDA 224
            C C +C N  E              RNP AF PKI                S H   ++
Sbjct: 416 SCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEGVGIWFSGHAMTNS 475

Query: 225 ----REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
               + +  +   +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C  CKN  G
Sbjct: 476 LFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 531


>gi|301095347|ref|XP_002896774.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108657|gb|EEY66709.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 394

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 64/160 (40%), Positives = 89/160 (55%), Gaps = 22/160 (13%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
           P++ K C+C+ S CLKLYCEC AA   CD  CNC  C N  E    R+ AV A L+RNP 
Sbjct: 45  PERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAILDRNPL 104

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF+PK+AS                 +++H +GC+C+KSGC+K YCEC QA + C+  C C
Sbjct: 105 AFQPKVASG----------------SSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCAC 148

Query: 269 LDCKNFEG--SEERRALFHG---DHNGIAFMQRAANAAII 303
             C+N E   S ++  +F G   D N +    R +   ++
Sbjct: 149 HQCRNTETFVSAKKMLVFAGVSADDNDLRVTIRPSQRKLV 188



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 3/75 (4%)

Query: 230 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 286
           + Q   +  KGC CKKS CLK YCEC  A  +C   C C  CKN        ER      
Sbjct: 40  QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHRCNCEGCKNCSETLAERERAVAAIL 99

Query: 287 DHNGIAFMQRAANAA 301
           D N +AF  + A+ +
Sbjct: 100 DRNPLAFQPKVASGS 114



 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERN 206
           G+ +  + CNCR S C+K YCEC  AG+ C   C C  C N      A++  V A +  +
Sbjct: 113 GSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSAD 172

Query: 207 PNAFRPKI 214
            N  R  I
Sbjct: 173 DNDLRVTI 180


>gi|2244834|emb|CAB10256.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268183|emb|CAB78519.1| hypothetical protein [Arabidopsis thaliana]
          Length = 658

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 54/120 (45%), Positives = 74/120 (61%), Gaps = 1/120 (0%)

Query: 165 KLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           K YCECFAAG+YC + C+C++C N   HE            RNP AF PK+  +    Q+
Sbjct: 370 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSDSVQE 429

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
             +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  ++
Sbjct: 430 TGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDGSSI 489



 Score = 44.7 bits (104), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + ++ CNC+ S CLK YCEC+  G+ C   C C  C N
Sbjct: 442 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 479


>gi|348686835|gb|EGZ26649.1| hypothetical protein PHYSODRAFT_473582 [Phytophthora sojae]
          Length = 407

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/140 (44%), Positives = 81/140 (57%), Gaps = 19/140 (13%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
           P++ K C+C+ S CLKLYCEC AA   CD  CNC  C N  E    R+ AV A LERNP 
Sbjct: 45  PERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAILERNPL 104

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF PK+AS                 +++H +GC+C+KSGC+K YCEC QA + C+  C C
Sbjct: 105 AFLPKVASG----------------SSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCAC 148

Query: 269 LDCKNFEG--SEERRALFHG 286
             C+N E   S ++  +F G
Sbjct: 149 HQCRNTETFVSAKKMLVFAG 168



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 230 EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE---GSEERRALFHG 286
           + Q   +  KGC CKKS CLK YCEC  A  +C   C C  CKN +      ER      
Sbjct: 40  QVQTQPERTKGCSCKKSNCLKLYCECLAAQRMCDHLCNCEGCKNCQETLAERERAVAAIL 99

Query: 287 DHNGIAFMQRAANAA 301
           + N +AF+ + A+ +
Sbjct: 100 ERNPLAFLPKVASGS 114



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERN 206
           G+ +  + CNCR S C+K YCEC  AG+ C   C C  C N      A++  V A +  +
Sbjct: 113 GSSQHLRGCNCRKSGCMKNYCECHQAGVACTSRCACHQCRNTETFVSAKKMLVFAGVSAD 172

Query: 207 PNAFR 211
            ++ R
Sbjct: 173 DSSLR 177


>gi|219109951|ref|XP_002176728.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411263|gb|EEC51191.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 490

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 71/122 (58%), Gaps = 11/122 (9%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+ +RCLKLYC+CF   IYC   C C++CHN    E ARQ A+   L RNP+AF  K 
Sbjct: 217 CTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNVPAQEDARQNAMKVILSRNPHAFDTKF 276

Query: 215 ASSPHGAQDAREDAREAQLAAK---HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
             +P        D    +  +K   H  GC C+KS C+KKYCEC+  ++ C+ +CRC  C
Sbjct: 277 QKTP-------VDGATVETPSKLLTHKLGCKCRKSACMKKYCECYAGHVYCNTHCRCTGC 329

Query: 272 KN 273
           KN
Sbjct: 330 KN 331



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 37/67 (55%), Gaps = 6/67 (8%)

Query: 221 AQDAREDARE-AQLAAKHNK-----GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
           A D RE A   +Q+   H +     GC CKK+ CLK YC+CF   I C  NCRCLDC N 
Sbjct: 191 ANDGREGATSPSQIETDHLEFVQAVGCTCKKTRCLKLYCQCFGVKIYCGPNCRCLDCHNV 250

Query: 275 EGSEERR 281
              E+ R
Sbjct: 251 PAQEDAR 257


>gi|313238970|emb|CBY13958.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/111 (46%), Positives = 73/111 (65%), Gaps = 11/111 (9%)

Query: 164 LKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           LKLYCECF  G YC   C+C  CHN  ++   RQ+A+ + L+RN +AF+PK+A S     
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                 ++     KH +GC+CK+SGCLKKYCEC+QA + C++ C C++CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKN 305



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 35/58 (60%), Gaps = 3/58 (5%)

Query: 138 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           +P+    GK  +   K  + CNC+ S CLK YCEC+ AG+ C D CNCV C N  +H+
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVECKNRDDHD 310


>gi|9279694|dbj|BAB01251.1| DNA binding protein-like [Arabidopsis thaliana]
          Length = 593

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 52/106 (49%), Positives = 69/106 (65%), Gaps = 1/106 (0%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+CNC+ S+CLKLYCECFAAG YC + C+C+NC N   H+            RNP AF P
Sbjct: 328 KRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIESRNPLAFAP 387

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
           K+  +     +  EDA +   +A+H +GC+CKKS CLKKYCEC+Q+
Sbjct: 388 KVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECYQS 433



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 190 EHEVARQEAVGA----TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
           +H++ R+   G      +E  P +    +  +P   +  R  + ++       K C+CKK
Sbjct: 276 QHDIVRENESGKDSGQIIEVVPKSL-ASVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKK 334

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKN 273
           S CLK YCECF A   C E C C++C N
Sbjct: 335 SKCLKLYCECFAAGFYCIEPCSCINCFN 362


>gi|297800740|ref|XP_002868254.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314090|gb|EFH44513.1| hypothetical protein ARALYDRAFT_493420 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 651

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 53/116 (45%), Positives = 71/116 (61%), Gaps = 1/116 (0%)

Query: 165 KLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           K YCECFAAG+YC + C+C++C N   HE            RNP AF PK+  +     +
Sbjct: 363 KSYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVIRNSESVLE 422

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
             +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  G ++
Sbjct: 423 TGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD 478



 Score = 45.1 bits (105), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + ++ CNC+ S CLK YCEC+  G+ C   C C  C N
Sbjct: 435 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKN 472


>gi|313245432|emb|CBY40166.1| unnamed protein product [Oikopleura dioica]
          Length = 394

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 52/111 (46%), Positives = 72/111 (64%), Gaps = 11/111 (9%)

Query: 164 LKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           LKLYCECF  G YC   C+C  CHN  ++   RQ+A+ + L+RN +AF+PK+A S     
Sbjct: 205 LKLYCECFRRGEYCHSYCSCSQCHNQEQYNDLRQKAIKSALDRNEDAFKPKVAKS----- 259

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                 ++     KH +GC+CK+SGCLKKYCEC+QA + C++ C C+ CKN
Sbjct: 260 -----GKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKN 305



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 35/58 (60%), Gaps = 3/58 (5%)

Query: 138 RPRANTDGK--DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           +P+    GK  +   K  + CNC+ S CLK YCEC+ AG+ C D CNCV C N  +H+
Sbjct: 253 KPKVAKSGKQVNNKGKHLRGCNCKRSGCLKKYCECYQAGVPCNDICNCVGCKNRDDHD 310


>gi|118399021|ref|XP_001031837.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89286171|gb|EAR84174.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 742

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 57/128 (44%), Positives = 75/128 (58%), Gaps = 19/128 (14%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLER 205
           + +P + K CNCRNS+CLK YC+CFAAG+YC   C C  CHN  E+E  RQ+A+    +R
Sbjct: 493 EKSPTEAKPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDERQKAIQKKKKR 552

Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
             +AF P I ++ H                   KGCHCK S C KKYC C Q  +LCS  
Sbjct: 553 IKDAFLPVIQNNSHI------------------KGCHCKNSHCQKKYCVCHQNGVLCSSL 594

Query: 266 CRCLDCKN 273
           C+C++C+N
Sbjct: 595 CQCVECEN 602



 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+C+ S CLK+YC+CF A + C   C+C +C N    E+ R
Sbjct: 500 KPCNCRNSKCLKRYCDCFAAGLYCQAECKCEECHNKPEYEDER 542



 Score = 42.0 bits (97), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV 199
           K C+C+NS C K YC C   G+ C   C CV C N +E     Q ++
Sbjct: 568 KGCHCKNSHCQKKYCVCHQNGVLCSSLCQCVECENKIESMKKEQSSI 614


>gi|323456808|gb|EGB12674.1| hypothetical protein AURANDRAFT_60658 [Aureococcus anophagefferens]
          Length = 1430

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 58/138 (42%), Positives = 78/138 (56%), Gaps = 14/138 (10%)

Query: 145  GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVAR-----QEAV 199
            G+DG+    K CNC+ S+CLKLYCEC+AAG++C  CNC NC N  E   A        A+
Sbjct: 1003 GRDGSGGGFKPCNCKKSKCLKLYCECYAAGVFCRDCNCSNCLNKPEDGEAHPPYPATHAL 1062

Query: 200  GATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
            G  L     A    + +S           R+ Q   +   GC CKKS CLKKYCECF+A 
Sbjct: 1063 G-RLRHGDVAVPTAVLTSAQ--------KRKPQSKQRQESGCFCKKSKCLKKYCECFEAG 1113

Query: 260  ILCSENCRCLDCKNFEGS 277
            + C  +C+C +C+N++GS
Sbjct: 1114 VHCEASCKCENCENYDGS 1131


>gi|224131306|ref|XP_002328506.1| predicted protein [Populus trichocarpa]
 gi|222838221|gb|EEE76586.1| predicted protein [Populus trichocarpa]
          Length = 414

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 69/114 (60%), Gaps = 1/114 (0%)

Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           +K YCECFAAG+YC + C+C+ C NN  HE    E       RNP AF PK+  +     
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHEDTVLETRRQIESRNPLAFAPKVIRNSDSVS 197

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           +  E+  +   +A+H +GC+CKKS CLKKYCECFQ  + CS  CRC  CKN  G
Sbjct: 198 EFGEETNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFG 251



 Score = 45.8 bits (107), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 7/55 (12%)

Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGI------YCDGC-NCVNCHNNVE 190
           T+    + + ++ CNC+ S CLK YCECF  G+       C+GC N   C N VE
Sbjct: 203 TNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSSYCRCEGCKNTFGCKNGVE 257



 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           +K YCECF A + C E C CL+C N    E+
Sbjct: 138 MKSYCECFAAGLYCIEPCSCLECSNNPAHED 168


>gi|118373056|ref|XP_001019722.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89301489|gb|EAR99477.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 495

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 75/130 (57%), Gaps = 3/130 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLER 205
           D   K ++ C C+NS+CLKLYCECFA G YC D C C+ C N  E E   +EA   TL R
Sbjct: 247 DSEQKAREGCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECENEIKEARKVTLTR 306

Query: 206 NPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN 265
           NP+AF  K+       QD   +A    L  K  KGC CK++ C KKYCEC+ A + CS  
Sbjct: 307 NPDAFTSKLEVVGTIVQDEDIEANRGMLGYK--KGCKCKRTYCKKKYCECYNAGVKCSYL 364

Query: 266 CRCLDCKNFE 275
           C C +C N E
Sbjct: 365 CICENCHNQE 374



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 29/56 (51%), Gaps = 9/56 (16%)

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
           D+ + ARE         GC CK S CLK YCECF     C + C+CL C N E  E
Sbjct: 247 DSEQKARE---------GCRCKNSKCLKLYCECFAKGAYCRDICKCLQCSNTEECE 293


>gi|255084730|ref|XP_002504796.1| tso1-like transcription factor [Micromonas sp. RCC299]
 gi|226520065|gb|ACO66054.1| tso1-like transcription factor [Micromonas sp. RCC299]
          Length = 620

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 54/110 (49%), Positives = 64/110 (58%), Gaps = 13/110 (11%)

Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           LYCECFAAG++C  C+C NC N  ++    Q        RNPNAF  KI +S  G     
Sbjct: 277 LYCECFAAGVFCHECSCQNCQNTSDNAGLVQMTRQQIELRNPNAFADKIVASGDGG---- 332

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
                     +H KGCHCKKS CLKKYCECFQA +LC + C+C  CKN E
Sbjct: 333 ---------GQHKKGCHCKKSACLKKYCECFQAGVLCQDYCKCDGCKNKE 373


>gi|145350962|ref|XP_001419860.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580092|gb|ABO98153.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 510

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 54/108 (50%), Positives = 64/108 (59%), Gaps = 13/108 (12%)

Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           LYCECFAAG +C  C+C  C N  E+E    +      +RNP AF  KI +      DA 
Sbjct: 207 LYCECFAAGAFCKDCSCQQCQNTTENEAIVTKTRQQIEQRNPYAFESKIMA------DAG 260

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           +DAR       H KGCHCKKS CLKKYCECFQA + C + C+C  CKN
Sbjct: 261 DDAR-------HTKGCHCKKSACLKKYCECFQAGVKCQDYCKCEGCKN 301


>gi|297721065|ref|NP_001172895.1| Os02g0274400 [Oryza sativa Japonica Group]
 gi|255670792|dbj|BAH91624.1| Os02g0274400, partial [Oryza sativa Japonica Group]
          Length = 140

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/95 (63%), Positives = 65/95 (68%), Gaps = 7/95 (7%)

Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH----GDHNGI 291
           KHNKGCHC+KS CLKKYCECFQANILCS+NCRC DCKNFEGSEER+AL       D N I
Sbjct: 23  KHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKNFEGSEERKALVQMKNASDRNHI 82

Query: 292 AFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELL 326
              Q AAN A+ GA GS GY       K+  E  L
Sbjct: 83  ---QEAANFALNGATGSLGYKNSPVRRKKYQENSL 114



 Score = 45.4 bits (106), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
            PK  K C+CR S CLK YCECF A I C   C C +C N
Sbjct: 21  IPKHNKGCHCRKSECLKKYCECFQANILCSKNCRCQDCKN 60


>gi|426252144|ref|XP_004019777.1| PREDICTED: tesmin [Ovis aries]
          Length = 512

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/122 (48%), Positives = 82/122 (67%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC N++ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 308 LAGYCDCFASGDFCNNCNCNNCCNDLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 364

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 365 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 416

Query: 284 FH 285
            +
Sbjct: 417 MN 418



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 365 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 409


>gi|118355040|ref|XP_001010781.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292548|gb|EAR90536.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 978

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 55/119 (46%), Positives = 73/119 (61%), Gaps = 9/119 (7%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ ++CLKLYCECFA   YC +GC+C  C N  E+E  RQ A+     RN +AF+PK 
Sbjct: 405 CNCKKTKCLKLYCECFAKLGYCGEGCHCHECKNRPENEDERQNAIQEAKSRNNDAFQPKT 464

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S     Q+ +  A   Q+     KGC+C+K+ CLKKYCECF A   C+  C+C  C+N
Sbjct: 465 ESE----QERKNKANNEQMK----KGCNCRKTKCLKKYCECFNAGTYCNNMCKCDSCEN 515


>gi|307110312|gb|EFN58548.1| hypothetical protein CHLNCDRAFT_140658 [Chlorella variabilis]
          Length = 664

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 72/128 (56%), Gaps = 19/128 (14%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNA 209
           ++Q+ CNC+ S CLK+YCECFAAG +C   C+C++C N          A    L +NPNA
Sbjct: 260 REQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPAEMGVVMAAREVVLAKNPNA 319

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 268
           F  K+ +                 A  H +GC CK+S CLKKYCECF A   C+ E C+C
Sbjct: 320 FEVKVTA-----------------ATGHRRGCRCKRSKCLKKYCECFHAGARCNPEVCQC 362

Query: 269 LDCKNFEG 276
            DC+N EG
Sbjct: 363 EDCRNTEG 370



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 26/39 (66%)

Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           A+  + C+CK+S CLK YCECF A   C+ +C CL C N
Sbjct: 259 AREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSN 297


>gi|224000241|ref|XP_002289793.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975001|gb|EED93330.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 712

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/147 (40%), Positives = 75/147 (51%), Gaps = 29/147 (19%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+ S+CLKLYC CF+A   C   C C  C N  E    R++AV ATL+RNPNAFR K 
Sbjct: 338 CTCKKSKCLKLYCHCFSANSMCSSFCRCTECMNTTEAGTVREDAVKATLQRNPNAFRSKF 397

Query: 215 ASSPHGAQDARE-----DAREAQLA-----------------------AKHNKGCHCKKS 246
                G + A+E      A++ QL                          H  GC+C+KS
Sbjct: 398 VKVTGGEEFAKEWEKNERAKQMQLLNVGDSPAKTKTTGVEMAAVAIANPTHKLGCNCRKS 457

Query: 247 GCLKKYCECFQANILCSENCRCLDCKN 273
            CLK+YCECF A   C  NC+CL C+N
Sbjct: 458 FCLKRYCECFGAQTPCGLNCKCLGCQN 484


>gi|118355648|ref|XP_001011083.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89292850|gb|EAR90838.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 974

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 76/124 (61%), Gaps = 7/124 (5%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ ++CLKLYC+CFA G  C   C CV C NN      R +A+   L+RNP AF  K 
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLRNQAIEGILQRNPFAFNVK- 694

Query: 215 ASSPHGAQDAREDAREAQLAAKHNK--GCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
              P  A+ ++++   + L   ++K  GC+C+KS CLKKYC C+ + + CSE C+C +CK
Sbjct: 695 ---PEEAKKSKKELNNSILQNSNSKVIGCNCRKSNCLKKYCLCYHSGMKCSEACKCTECK 751

Query: 273 NFEG 276
           N E 
Sbjct: 752 NLEN 755



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 28/41 (68%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+CKK+ CLK YC+CF    LC ++C+C++C N E +   R
Sbjct: 636 CNCKKTKCLKLYCDCFAVGELCGQDCKCVECCNNEATLNLR 676


>gi|311247083|ref|XP_003122474.1| PREDICTED: tesmin [Sus scrofa]
          Length = 511

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 82/122 (67%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 307 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415

Query: 284 FH 285
            +
Sbjct: 416 MN 417



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408


>gi|395851609|ref|XP_003798345.1| PREDICTED: tesmin [Otolemur garnettii]
          Length = 509

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 83/122 (68%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 305 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 361

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER++L
Sbjct: 362 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKSL 413

Query: 284 FH 285
            +
Sbjct: 414 IN 415



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 362 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 406


>gi|340505892|gb|EGR32168.1| hypothetical protein IMG5_093910 [Ichthyophthirius multifiliis]
          Length = 260

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 60/153 (39%), Positives = 88/153 (57%), Gaps = 15/153 (9%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           +Q+QCNC+ S+CLKLYC+CFA G YC+  C+C  C NN  ++  R+ A+    ERNPNAF
Sbjct: 60  QQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREMALKQNKERNPNAF 119

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
             +I              +++QL   H +GC+CKKSGC KKYC+C+   + CS  CRC+ 
Sbjct: 120 TSRII-------------QQSQLNN-HQRGCNCKKSGCQKKYCQCYLDGMECSRYCRCIG 165

Query: 271 CKNFEGSEERRALFHGDHNGIAFMQRAANAAII 303
           C+N    E ++   +  +      Q+  N+  I
Sbjct: 166 CENCNIFENKKICLNNKNIEKQIQQQKGNSDYI 198



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           L A   + C+CKKS CLK YC+CF     C+  C C +CKN E ++E+R +
Sbjct: 56  LKALQQRQCNCKKSKCLKLYCDCFAIGEYCNPICHCFECKNNESNKEKREM 106


>gi|334332647|ref|XP_001371812.2| PREDICTED: tesmin [Monodelphis domestica]
          Length = 728

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+PKI     G      
Sbjct: 527 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG------ 580

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
                 +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 581 -----DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 634



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 581 DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 625


>gi|403353185|gb|EJY76132.1| Tso1, putative [Oxytricha trifallax]
          Length = 712

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 60/150 (40%), Positives = 78/150 (52%), Gaps = 28/150 (18%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K CNC+ S+CLKLYCECFA   +C   C C  C N+ EH+  R  A    L RNP AFRP
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDERLHAKEQILMRNPLAFRP 508

Query: 213 KIASSPHGAQDAREDAR----------EAQLAA-----------------KHNKGCHCKK 245
           K+ +         +D +          +A L+                  +H +GC+CKK
Sbjct: 509 KVETQQDETSLFSKDIQITKINIQEQSDALLSGLSKPQTFGALAVSDKDKRHFRGCNCKK 568

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFE 275
           S C KKYCECFQ  + CS+ C+C +CKN E
Sbjct: 569 SNCQKKYCECFQQGVNCSDLCKCEECKNNE 598



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 25/43 (58%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K C+CKKS CLK YCECF  N  C  NC C  C N    ++ R
Sbjct: 449 KSCNCKKSKCLKLYCECFANNKFCGANCACNGCSNHAEHDDER 491


>gi|345309503|ref|XP_001516587.2| PREDICTED: tesmin-like [Ornithorhynchus anatinus]
          Length = 243

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 62/125 (49%), Positives = 84/125 (67%), Gaps = 11/125 (8%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ CNC NC+NN  HE  R +A+ A L RNP AF+PKI     G      
Sbjct: 52  YCDCFASGDFCNNCNCNNCYNNQSHEAERCKAIKACLNRNPEAFQPKIGKGKLG------ 105

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG 286
                ++  +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +RR L++G
Sbjct: 106 -----EVKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEESPQRRILWNG 160

Query: 287 DHNGI 291
            H  +
Sbjct: 161 PHPAV 165



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNC+ S CLK YCECF A I C   C C++C N  E
Sbjct: 109 PRHNKGCNCKRSGCLKNYCECFEAKIMCSSTCKCISCKNYEE 150


>gi|146174407|ref|XP_001019363.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|146144792|gb|EAR99118.2| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1114

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 59/126 (46%), Positives = 77/126 (61%), Gaps = 9/126 (7%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC  ++CLKLYC CF  G+ C D C C +C+N + +   R+ A+ A  +R P+AF  K+
Sbjct: 714 CNCGKTKCLKLYCVCFQQGLMCSDQCRCTDCYNKIAYLEERERAIKAIKQRYPDAFEKKV 773

Query: 215 ASSPHGA---QDA-REDAR---EAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
              P  A   +DA R DA    + QL A H KGC+CKKS C KKYCECF A + C+  CR
Sbjct: 774 NFLPELARKIEDAERLDADLELQNQLLA-HKKGCNCKKSACKKKYCECFLAGVKCTYLCR 832

Query: 268 CLDCKN 273
           C  C+N
Sbjct: 833 CEQCQN 838



 Score = 42.7 bits (99), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           +K CNC+ S C K YCECF AG+ C   C C  C N
Sbjct: 803 KKGCNCKKSACKKKYCECFLAGVKCTYLCRCEQCQN 838


>gi|145545444|ref|XP_001458406.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426226|emb|CAK91009.1| unnamed protein product [Paramecium tetraurelia]
          Length = 218

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/119 (46%), Positives = 72/119 (60%), Gaps = 15/119 (12%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYC+CFAAG  C   CNC  C NN  + + R + +   +ERNP AF  K+
Sbjct: 85  CNCKKSKCLKLYCDCFAAGKLCSSKCNCCGCFNNSSNLLERNQFIEKMVERNPEAFNQKV 144

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                         +E      H+KGC+C+KSGC KKYCEC+Q  I CS+NC+C  C+N
Sbjct: 145 --------------KEVDYKLAHSKGCNCRKSGCKKKYCECYQLGIECSDNCKCDGCQN 189


>gi|332249871|ref|XP_003274078.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Nomascus leucogenys]
          Length = 517

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI     G   
Sbjct: 317 LARYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 373

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC C++SGCLK YCEC++A I+CS  C+C+DCKN+E S ER+ L
Sbjct: 374 --------DVKPRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEESPERKTL 425

Query: 284 FH 285
            +
Sbjct: 426 MN 427



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K C+CR S CLK YCEC+ A I C   C C++C N  E
Sbjct: 374 DVKPRHNKGCSCRRSGCLKNYCECYEAQIMCSSICKCIDCKNYEE 418


>gi|118384767|ref|XP_001025523.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89307290|gb|EAS05278.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 1022

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 53/118 (44%), Positives = 70/118 (59%), Gaps = 6/118 (5%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 215
           C C+ ++CLKLYCECF A  YC G NC  C N  E+E  R+ A+    +RN +AF PK  
Sbjct: 766 CGCKKTKCLKLYCECFQAMKYCVGTNCQGCLNKPEYEDQRKHAMELIQQRNSSAFDPKAD 825

Query: 216 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S    +       ++   A H+KGC+CKKS C KKYCEC+Q    C+  C+C +CKN
Sbjct: 826 KSDQYYK------SDSGTKAIHSKGCNCKKSDCRKKYCECYQMGAKCTHLCKCYNCKN 877


>gi|145489787|ref|XP_001430895.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397996|emb|CAK63497.1| unnamed protein product [Paramecium tetraurelia]
          Length = 276

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/245 (31%), Positives = 111/245 (45%), Gaps = 37/245 (15%)

Query: 32  LPEQSVKSQSSQPQSHSPLQPQPP-KSQPQVQSKVASPS-QSQPLVRLQLLPPSSQPHVQ 89
           +P+  +KS   Q      L+   P KS   +Q++  S S       R+ L  P +  ++ 
Sbjct: 43  IPQIVIKSNFQQELICFMLELHKPNKSHSIIQNRNHSRSFNHNQQFRISLNYPQANLNLS 102

Query: 90  AHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGT 149
             +++          +P H +  N   + + ++  P L    QE                
Sbjct: 103 PSIKNKGMYYYSIHLQP-HTQYYNLNLNIIFEILQPNLFYLNQE---------------- 145

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPN 208
               K CNCR S+CLKLYC+CFA G  C   CNC  C NN  + + R   +   +ERNP 
Sbjct: 146 ---HKPCNCRKSKCLKLYCDCFAVGKLCSSKCNCCGCFNNSSNLLERNSFIQKMIERNPQ 202

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF  K+              +E +    H KGC+C+KSGC KKYCEC+Q  I CS+NC+C
Sbjct: 203 AFNQKV--------------QEVESKMTHAKGCNCRKSGCQKKYCECYQMGIECSDNCKC 248

Query: 269 LDCKN 273
             C N
Sbjct: 249 DGCLN 253


>gi|301787847|ref|XP_002929338.1| PREDICTED: tesmin-like [Ailuropoda melanoleuca]
 gi|281340625|gb|EFB16209.1| hypothetical protein PANDA_019494 [Ailuropoda melanoleuca]
          Length = 510

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/137 (45%), Positives = 86/137 (62%), Gaps = 22/137 (16%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 306 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIQRFKAIKACLDRNPEAFQPKIGKGKLG--- 362

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 363 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 414

Query: 284 -----------FHGDHN 289
                      F G+H+
Sbjct: 415 MNMPNYMEIGGFEGNHH 431



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 363 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 407


>gi|410974670|ref|XP_003993766.1| PREDICTED: tesmin [Felis catus]
          Length = 502

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/132 (45%), Positives = 83/132 (62%), Gaps = 15/132 (11%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C  CNC NC NN+ HEV R +A+ A L+RNP AF+PKI         
Sbjct: 299 LAGYCDCFASGDFCHNCNCNNCRNNLHHEVQRFKAMKACLDRNPEAFQPKIGK------- 351

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                R   +  +HN+GC+C++SGCLK YCEC++A I+CS  C+C+ CKN++   ER+ L
Sbjct: 352 ----GRPGDVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDEGPERKTL 407

Query: 284 ----FHGDHNGI 291
               +H D  G 
Sbjct: 408 MSMPYHPDAGGF 419



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D TP+  + CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 356 DVTPRHNRGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYDE 400


>gi|395544681|ref|XP_003774236.1| PREDICTED: tesmin [Sarcophilus harrisii]
          Length = 548

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA G +C+ CNC NC+NN+ HE+ R +A+ A L+RNP AF+PKI     G      
Sbjct: 347 YCDCFANGDFCNNCNCNNCYNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG------ 400

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
                 +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 401 -----DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLLN 454



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 401 DIKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 445


>gi|291415497|ref|XP_002723989.1| PREDICTED: abnormal cell LINeage family member (lin-54)-like
           [Oryctolagus cuniculus]
          Length = 450

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 81/122 (66%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN  HE+ R +A+ A L+RNP AF+PKI     G+  
Sbjct: 247 LSGYCDCFASGDFCNNCNCNNCCNNWRHEIERFKAIKACLDRNPEAFQPKIGKGRPGSAK 306

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
            R           HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 307 PR-----------HNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEESPERKTL 355

Query: 284 FH 285
            +
Sbjct: 356 MN 357



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 145 GKDGT--PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           G+ G+  P+  K CNC+ S CLK YCEC+ A I C   C CV C N  E
Sbjct: 300 GRPGSAKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCVGCKNYEE 348


>gi|440894221|gb|ELR46727.1| Tesmin [Bos grunniens mutus]
          Length = 511

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 307 LAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415

Query: 284 FH 285
            +
Sbjct: 416 MN 417



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408


>gi|329663862|ref|NP_001192832.1| tesmin [Bos taurus]
 gi|296471372|tpg|DAA13487.1| TPA: abnormal cell LINeage family member (lin-54)-like [Bos taurus]
          Length = 511

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 81/122 (66%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 307 LAGYCDCFANGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 363

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 364 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 415

Query: 284 FH 285
            +
Sbjct: 416 MN 417



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 364 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 408


>gi|145535353|ref|XP_001453415.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421126|emb|CAK86018.1| unnamed protein product [Paramecium tetraurelia]
          Length = 348

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 71/120 (59%), Gaps = 1/120 (0%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECF     C   CNC  C N +++   R +A+   L RNP AF P 
Sbjct: 167 QCNCKKSKCLKLYCECFTNNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFAPI 226

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           + ++    Q  +E   +  +  +  KGC+CKKS C KKYCEC+  N  C+E C+C +C N
Sbjct: 227 LTNNGQQPQVIQEQKSQKDIQKETKKGCNCKKSECKKKYCECYSINQKCTELCKCENCLN 286


>gi|330843039|ref|XP_003293472.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
 gi|325076199|gb|EGC30006.1| hypothetical protein DICPUDRAFT_158333 [Dictyostelium purpureum]
          Length = 478

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 55/120 (45%), Positives = 69/120 (57%), Gaps = 31/120 (25%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           K+C+C+NS+CLK+YCECFA  I C+ C C  C NN     A  E +G             
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCNNCQCFGCQNNE----ANIEKIGND----------- 147

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                           ++ L  KH++GCHC+KSGCLKKYCECFQA I C+ENC+C DCKN
Sbjct: 148 ----------------QSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCKCYDCKN 191



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
           K CHCK S CLK YCECF   ILC+ NC+C  C+N E + E+
Sbjct: 103 KKCHCKNSKCLKMYCECFANKILCN-NCQCFGCQNNEANIEK 143



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
           G Q +  +  +   D      K  + C+CR S CLK YCECF AGI C + C C +C N 
Sbjct: 133 GCQNNEANIEKIGNDQSVLIDKHSRGCHCRKSGCLKKYCECFQAGIPCNENCKCYDCKNQ 192

Query: 189 VEH 191
             H
Sbjct: 193 SHH 195


>gi|194218566|ref|XP_001492479.2| PREDICTED: tesmin-like [Equus caballus]
          Length = 383

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/122 (49%), Positives = 82/122 (67%), Gaps = 11/122 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G   
Sbjct: 179 LAGYCDCFASGDFCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQPKIGKGKLG--- 235

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 236 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 287

Query: 284 FH 285
            +
Sbjct: 288 MN 289



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 236 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 280


>gi|357480917|ref|XP_003610744.1| Lin-54-like protein [Medicago truncatula]
 gi|355512079|gb|AES93702.1| Lin-54-like protein [Medicago truncatula]
          Length = 808

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 81/160 (50%), Gaps = 31/160 (19%)

Query: 154 KQCNCRNSRCLKL-------------------YCECFAAGIYC-DGCNCVNCHNNVEHEV 193
           K+CNC+ S+CLKL                   YCECFAAG+YC + C+C  C N   HE 
Sbjct: 488 KRCNCKKSKCLKLIDLKMTILVLELTILFFFSYCECFAAGVYCIEPCSCQECFNKPIHED 547

Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
              +       RNP AF PK+  S     +   D  +   +A+H +GC+CKKS CLKKYC
Sbjct: 548 TVLQTRKQIESRNPLAFAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYC 607

Query: 254 ECFQANIL-----------CSENCRCLDCKNFEGSEERRA 282
           EC+Q  IL           CS +CRC  CKN  G ++  A
Sbjct: 608 ECYQVLILTISCLPQGGVGCSISCRCEGCKNAFGRKDGSA 647


>gi|242033599|ref|XP_002464194.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
 gi|241918048|gb|EER91192.1| hypothetical protein SORBIDRAFT_01g013900 [Sorghum bicolor]
          Length = 641

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 83/161 (51%), Gaps = 9/161 (5%)

Query: 136 RSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA 194
           R R   N DGK       + C+C+ S+CLKLYC CFAA +YC + C+C  C NN  HE A
Sbjct: 344 RRRKLQNDDGKSC-----RHCSCKKSKCLKLYCACFAAKVYCSEFCSCQGCSNNHMHEEA 398

Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
                  T  RNP AF P +        +  +D+     +A+H +GC+C+KS CLKKYCE
Sbjct: 399 VSHIRKQTESRNPLAFAPTVTRKCGSVSELGDDSNNTPASARHKRGCNCRKSSCLKKYCE 458

Query: 255 CFQANILCS--ENCRCLDCKNFEGSEERRALFHGDHNGIAF 293
           CFQ  +L    E    L  +  +   E +   HG    +AF
Sbjct: 459 CFQVLMLLFSLEGVLLLTTEKLKKGGEAKGT-HGKEEKLAF 498


>gi|402892588|ref|XP_003909492.1| PREDICTED: tesmin [Papio anubis]
          Length = 515

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 60/121 (49%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI     G   
Sbjct: 311 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 367

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 368 --------DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 419

Query: 284 F 284
            
Sbjct: 420 M 420



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 57/137 (41%), Gaps = 19/137 (13%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C  N E    R+  +      
Sbjct: 368 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 426

Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
               LE + +    K +  P  +QD R       +  EA  A    +G   +K  C K  
Sbjct: 427 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 484

Query: 253 CECFQANILCSENCRCL 269
             C    ++  E  RCL
Sbjct: 485 --CLAEQMILEEFGRCL 499


>gi|145478369|ref|XP_001425207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392276|emb|CAK57809.1| unnamed protein product [Paramecium tetraurelia]
          Length = 373

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 89/146 (60%), Gaps = 10/146 (6%)

Query: 155 QCNCRNSRCLKLYCECFAA-GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECFA  G+    CNC +C N +++   R +A+   L RN +AF  +
Sbjct: 187 QCNCKKSKCLKLYCECFANNGVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNTDAFV-Q 245

Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
             S+  GAQ  ++D +  +  +K N     KGC+CKKSGC KKYCEC+  NI C+E C+C
Sbjct: 246 CFSTKGGAQFVQQD-KPLKEPSKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKC 304

Query: 269 LDCKNFEGSE--ERRALFHGDHNGIA 292
             C N   +E   ++ L   DHN ++
Sbjct: 305 EHCLNKTDAEIQAQQDLGQEDHNNLS 330



 Score = 46.6 bits (109), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 6/77 (7%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNV 189
           +Q+ P   P      KD +   +K CNC+ S C K YCEC++  I C+  C C +C N  
Sbjct: 257 QQDKPLKEP-----SKDNSSVIRKGCNCKKSGCKKKYCECYSQNIRCNELCKCEHCLNKT 311

Query: 190 EHEVARQEAVGATLERN 206
           + E+  Q+ +G     N
Sbjct: 312 DAEIQAQQDLGQEDHNN 328


>gi|412990962|emb|CCO18334.1| predicted protein [Bathycoccus prasinos]
          Length = 671

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 54/109 (49%), Positives = 65/109 (59%), Gaps = 15/109 (13%)

Query: 166 LYCECFAAGIYCDGCNCVNCHNNV-EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
           LYC+CFAAG++C  C+C +C N   + EV RQ        RNPNAF  KI          
Sbjct: 358 LYCDCFAAGVFCRDCSCQSCSNTEGDLEVVRQTRY-QIESRNPNAFANKIVD-------- 408

Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                +  + AKH KGCHCKKS CLKKYCECFQAN+ C + C+C  CKN
Sbjct: 409 -----DDSVDAKHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKN 452



 Score = 45.8 bits (107), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           K  K C+C+ S CLK YCECF A + C D C C  C N  +
Sbjct: 415 KHAKGCHCKKSACLKKYCECFQANVRCQDYCKCEGCKNTTD 455


>gi|302753274|ref|XP_002960061.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
 gi|300171000|gb|EFJ37600.1| hypothetical protein SELMODRAFT_75102 [Selaginella moellendorffii]
          Length = 155

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 80/144 (55%), Gaps = 14/144 (9%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C+C+ S+CLKLYCECFA+G YC D C C NC N  EHE    E       RNP AF P+
Sbjct: 1   RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60

Query: 214 IASSPHGAQDAREDAREAQL-------------AAKHNKGCHCKKSGCLKKYCECFQANI 260
           + S     +  +       L              +KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61  VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120

Query: 261 LCSENCRCLDCKNFEGSEERRALF 284
            CS+ C+C  CKN  G+++    F
Sbjct: 121 GCSDACKCRGCKNTFGAKQGDDFF 144



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNN 188
           T K ++ CNC+ S CLK YCECF + + C D C C  C N 
Sbjct: 94  TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNT 134


>gi|300175264|emb|CBK20575.2| unnamed protein product [Blastocystis hominis]
          Length = 396

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 71/121 (58%), Gaps = 17/121 (14%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ SRCLKLYCECF +  YC   CNC+NC+NN +HE  R+EAV     RNP+ F   +
Sbjct: 151 CNCKKSRCLKLYCECFKSQEYCSSECNCLNCYNNRKHESEREEAVQRMKARNPHCFENHV 210

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                   D ++   ++        GC C+++ C KKYCECF   + C + C+C DCKN 
Sbjct: 211 --------DEKKGVNKS--------GCRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNE 254

Query: 275 E 275
           E
Sbjct: 255 E 255



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 10/75 (13%)

Query: 217 SPHGAQDAREDAREAQLAAKHNKG----------CHCKKSGCLKKYCECFQANILCSENC 266
           S   ++DA E A   + +AK  +           C+CKKS CLK YCECF++   CS  C
Sbjct: 117 SAASSRDALESAFPREKSAKSTRATQPSIPGLVTCNCKKSRCLKLYCECFKSQEYCSSEC 176

Query: 267 RCLDCKNFEGSEERR 281
            CL+C N    E  R
Sbjct: 177 NCLNCYNNRKHESER 191



 Score = 44.7 bits (104), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 3/66 (4%)

Query: 142 NTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
           + D K G  K    C CR + C K YCECF+ G+ C D C C +C N  +     +  VG
Sbjct: 209 HVDEKKGVNKSG--CRCRRTHCDKKYCECFSHGVPCGDQCQCKDCKNEEQWNNGMESMVG 266

Query: 201 ATLERN 206
              E N
Sbjct: 267 DGEETN 272


>gi|145526438|ref|XP_001449030.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416596|emb|CAK81633.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 75/127 (59%), Gaps = 1/127 (0%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECFA    C   CNC  C N +++   R +A+   L RNP AF   
Sbjct: 163 QCNCKKSKCLKLYCECFANNWVCSQSCNCTECKNRIDNPNERSKAIEEALLRNPEAFSTI 222

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           + ++    Q   E   + + + +  KGC+CKKS C KKYCEC+  N  C++ C+C +C N
Sbjct: 223 LTNNGQQPQIIPEPKSQKEQSKETKKGCNCKKSECKKKYCECYSINQKCTDLCKCENCLN 282

Query: 274 FEGSEER 280
            E  +E+
Sbjct: 283 KEEPQEQ 289


>gi|12322849|gb|AAG51411.1|AC009465_11 hypothetical protein; 69435-72676 [Arabidopsis thaliana]
          Length = 601

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 68/111 (61%), Gaps = 1/111 (0%)

Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YCECF+AG++C + C+C NC N   HE    ++      RNP AF PK+ S+     D  
Sbjct: 428 YCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLW 487

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
            +  +   +A+H +GC+C+KSGC KKYCECF   + CS NCRC+ CKN  G
Sbjct: 488 VENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 538



 Score = 45.8 bits (107), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEH 191
           + ++ CNCR S C K YCECF  G+ C   C C+ C N   H
Sbjct: 498 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFGH 539


>gi|224054140|ref|XP_002298111.1| predicted protein [Populus trichocarpa]
 gi|222845369|gb|EEE82916.1| predicted protein [Populus trichocarpa]
          Length = 781

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/128 (49%), Positives = 79/128 (61%), Gaps = 3/128 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           ++CNC+ SRCLKLYCECFAAGIYC D C+CVNC N  E+E    +    T  RNP AF P
Sbjct: 441 RRCNCKKSRCLKLYCECFAAGIYCLDTCSCVNCINKPEYEDTVLDMRQQTEARNPLAFAP 500

Query: 213 KIASSPHG--AQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           K+ ++     A    E       +++H KGC+CKKS C KKYCECFQ  + C   CRC  
Sbjct: 501 KVVNNATNSPANMMEEGKWMKTSSSRHKKGCNCKKSKCSKKYCECFQGGVGCCNGCRCEG 560

Query: 271 CKNFEGSE 278
           C N  G++
Sbjct: 561 CYNPYGNK 568


>gi|355566221|gb|EHH22600.1| Metallothionein-like 5, testis-specific [Macaca mulatta]
          Length = 508

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/121 (48%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI     G   
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 361 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 412

Query: 284 F 284
            
Sbjct: 413 M 413



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 19/137 (13%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
           D  P+  K CNCR S CLK YCEC+ A I C   C C+ C  N E    R+  +      
Sbjct: 361 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 419

Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
               LE + +    K +  P  +QD R       +  EA  A    +G   +K  C K  
Sbjct: 420 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 477

Query: 253 CECFQANILCSENCRCL 269
             C    ++  E  RCL
Sbjct: 478 --CLAEQMILEEFGRCL 492


>gi|348564752|ref|XP_003468168.1| PREDICTED: tesmin-like [Cavia porcellus]
          Length = 391

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 58/119 (48%), Positives = 79/119 (66%), Gaps = 11/119 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ CNC NC NN+ HE+ R  A+   L+RNP AF+PKI            
Sbjct: 186 YCDCFASGDFCNNCNCNNCCNNLHHEMERFNAIKVCLDRNPEAFQPKIGK---------- 235

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
             R   +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L +
Sbjct: 236 -GRSGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLMN 293



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 240 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 284


>gi|355751890|gb|EHH56010.1| Metallothionein-like 5, testis-specific, partial [Macaca
           fascicularis]
          Length = 490

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/121 (48%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L RNP AF+PKI     G   
Sbjct: 286 LAGYCDCFASGDFCNNCNCNNCCNNLHHEIERFKAIKACLGRNPEAFQPKIGKGQLG--- 342

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 343 --------DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTL 394

Query: 284 F 284
            
Sbjct: 395 M 395



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 57/137 (41%), Gaps = 19/137 (13%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAV------ 199
           D  P+  K CNCR S CLK YCEC+ A I C   C C+ C  N E    R+  +      
Sbjct: 343 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGC-KNYEESPERKTLMSMPNYV 401

Query: 200 -GATLERNPNAFRPKIASSPHGAQDAR------EDAREAQLAAKHNKGCHCKKSGCLKKY 252
               LE + +    K +  P  +QD R       +  EA  A    +G   +K  C K  
Sbjct: 402 QAGGLEGSRHLPPTKFSGLPKFSQDRRPSSCISWEVVEATCACLLAQGEEAEKEHCSK-- 459

Query: 253 CECFQANILCSENCRCL 269
             C    ++  E  RCL
Sbjct: 460 --CLAEQMILEEFGRCL 474


>gi|345784018|ref|XP_854573.2| PREDICTED: tesmin [Canis lupus familiaris]
          Length = 516

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 85/134 (63%), Gaps = 22/134 (16%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+PKI     G      
Sbjct: 315 YCDCFASGDFCNNCNCNNCCNNLHHEIQRFKAIKACLDRNPEAFQPKIGKGKLG------ 368

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL--- 283
                 +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S E++ L   
Sbjct: 369 -----DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPEQKTLMNM 423

Query: 284 --------FHGDHN 289
                   F G+H+
Sbjct: 424 PNYMEIGGFEGNHH 437



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 369 DVKPRHNKGCNCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 413


>gi|301631659|ref|XP_002944915.1| PREDICTED: protein lin-54 homolog, partial [Xenopus (Silurana)
           tropicalis]
          Length = 183

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/125 (52%), Positives = 84/125 (67%), Gaps = 11/125 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K +K CNC  S+CLKLYC+CFA G  C+ CNC NC+NN EHE+ R +AV A L RNP+AF
Sbjct: 9   KTKKPCNCTRSQCLKLYCDCFANGDLCNNCNCSNCYNNAEHELERFKAVKACLYRNPDAF 68

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +PKI           E  +   +  +HNKGC+CK+SGC K YCEC++A I CS  C+C+ 
Sbjct: 69  QPKI-----------EKGKIGDVKPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVG 117

Query: 271 CKNFE 275
           CKN +
Sbjct: 118 CKNVK 122



 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           D  P+  K CNC+ S C K YCEC+ A I C   C CV C N
Sbjct: 79  DVKPRHNKGCNCKRSGCFKNYCECYEAKIACSSVCKCVGCKN 120


>gi|145552635|ref|XP_001461993.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429830|emb|CAK94620.1| unnamed protein product [Paramecium tetraurelia]
          Length = 384

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 54/130 (41%), Positives = 73/130 (56%), Gaps = 6/130 (4%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECFA    C   CNC +C N +++   R +A+   L RNP+AF   
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250

Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
                      ++  +  +   K N     KGC+CKKSGC KKYCEC+  N+ CS+ C+C
Sbjct: 251 FQQKGQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310

Query: 269 LDCKNFEGSE 278
             C N   +E
Sbjct: 311 EQCLNRTDAE 320



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
           C+CKKS CLK YCECF  N +CS++C C DCKN  +  +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232



 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATL 203
           KD +   +K CNC+ S C K YCEC++  + C D C C  C N  + E+  Q+ +   L
Sbjct: 273 KDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAEIQAQQDIAQVL 331


>gi|403301068|ref|XP_003941221.1| PREDICTED: tesmin [Saimiri boliviensis boliviensis]
          Length = 263

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/118 (50%), Positives = 80/118 (67%), Gaps = 11/118 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ CNC NC NN++HE+ R +A+ A L RNP AF+PKI +S  G    R 
Sbjct: 53  YCDCFASGDFCNNCNCNNCCNNLQHEIERFKAIKACLGRNPEAFQPKIGTSQLGDVKPR- 111

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
                     H+KGC C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 112 ----------HSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEESPERKTLM 159



 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K C+CR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 107 DVKPRHSKGCSCRRSGCLKNYCECYEAKIMCSSICKCIGCKNYEE 151


>gi|145498644|ref|XP_001435309.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402440|emb|CAK67912.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 55/119 (46%), Positives = 69/119 (57%), Gaps = 15/119 (12%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYC+CFAA   C   CNC  C NN  + + R       +ERNP AF  K+
Sbjct: 109 CNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLYREKMVERNPEAFNQKV 168

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                         +E      H+KGC+C+KSGC KKYCEC+Q  I CS+NC+C  CKN
Sbjct: 169 --------------KEVDQKMAHSKGCNCRKSGCKKKYCECYQMGIECSDNCKCDGCKN 213



 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 28/46 (60%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
           K C+CKKS CLK YC+CF AN LCS  C C  C N   +   R L+
Sbjct: 107 KPCNCKKSKCLKLYCDCFAANKLCSSKCNCCGCFNNSSNLIERNLY 152


>gi|302804648|ref|XP_002984076.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
 gi|300148428|gb|EFJ15088.1| hypothetical protein SELMODRAFT_27247 [Selaginella moellendorffii]
          Length = 141

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 79/139 (56%), Gaps = 14/139 (10%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C+C+ S+CLKLYCECFA+G YC D C C NC N  EHE    E       RNP AF P+
Sbjct: 1   RCSCKKSKCLKLYCECFASGSYCLDSCACGNCSNRQEHEDVVSETRQLIQSRNPLAFAPR 60

Query: 214 IASSPHGAQDAREDAREAQL-------------AAKHNKGCHCKKSGCLKKYCECFQANI 260
           + S     +  +       L              +KH +GC+CKKS CLKKYCECFQ+ +
Sbjct: 61  VISPAEVVRVCQAFVSSNSLIFVSPAAAAIPAATSKHKRGCNCKKSLCLKKYCECFQSEV 120

Query: 261 LCSENCRCLDCKNFEGSEE 279
            CS+ C+C  CKN  G+++
Sbjct: 121 GCSDACKCRGCKNTFGAKQ 139



 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
           T K ++ CNC+ S CLK YCECF + + C D C C  C N  
Sbjct: 94  TSKHKRGCNCKKSLCLKKYCECFQSEVGCSDACKCRGCKNTF 135


>gi|307103614|gb|EFN51873.1| hypothetical protein CHLNCDRAFT_139785 [Chlorella variabilis]
          Length = 929

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/117 (44%), Positives = 66/117 (56%), Gaps = 14/117 (11%)

Query: 166 LYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA 224
           LYC+CFA G YC   C+C +C N VE+         A  +RNPNAF  KI +        
Sbjct: 540 LYCDCFANGGYCGPACSCSSCANKVENRAMVAAQREAIKQRNPNAFVQKIEA-------- 591

Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
                +A L  +H +GC+CKKS C+KKYCECFQA + C E+C+C  C N  G   RR
Sbjct: 592 -----DAALGGQHRRGCNCKKSHCMKKYCECFQAGVPCGEHCKCESCHNTAGHSGRR 643


>gi|145478035|ref|XP_001425040.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392108|emb|CAK57642.1| unnamed protein product [Paramecium tetraurelia]
          Length = 386

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/130 (41%), Positives = 73/130 (56%), Gaps = 6/130 (4%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECFA    C   CNC +C N +++   R +A+   L RNP+AF   
Sbjct: 191 QCNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQERSKAIEEALLRNPDAFAQC 250

Query: 214 IASSPHGAQDAREDAREAQLAAKHN-----KGCHCKKSGCLKKYCECFQANILCSENCRC 268
                      ++  +  +   K N     KGC+CKKSGC KKYCEC+  N+ CS+ C+C
Sbjct: 251 FQQKSQTQFSVQQQDKPLKEPTKDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKC 310

Query: 269 LDCKNFEGSE 278
             C N   +E
Sbjct: 311 EQCLNRTDAE 320



 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 30/41 (73%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
           C+CKKS CLK YCECF  N +CS++C C DCKN  +  +ER
Sbjct: 192 CNCKKSKCLKLYCECFANNWVCSQSCNCQDCKNRIDNPQER 232



 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATL 203
           KD +   +K CNC+ S C K YCEC++  + C D C C  C N  + E+  Q+ +   L
Sbjct: 273 KDNSNITRKGCNCKKSGCKKKYCECYSQNLKCSDLCKCEQCLNRTDAEIQAQQDIAQVL 331


>gi|351709791|gb|EHB12710.1| Tesmin [Heterocephalus glaber]
          Length = 283

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/119 (49%), Positives = 81/119 (68%), Gaps = 11/119 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ C+C NC NN+ HE+ R  A+ A L+RNP AF+PKI  S         
Sbjct: 77  YCDCFASGDFCNNCHCNNCCNNLHHEIERFNAIKACLDRNPEAFQPKIGKS--------- 127

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
             R   +  +HNKGC+CK+SGCLK YCEC++A I+CS  C+C+DC N+E S ER+ L +
Sbjct: 128 --RLGDVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEESPERKTLMN 184



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCEC+ A I C   C C++C+N  E
Sbjct: 131 DVKPRHNKGCNCKRSGCLKNYCECYEAKIMCSSICKCIDCNNYEE 175


>gi|327260145|ref|XP_003214896.1| PREDICTED: tesmin-like [Anolis carolinensis]
          Length = 377

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/137 (45%), Positives = 84/137 (61%), Gaps = 21/137 (15%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA G +C  CNC NC+NN +HE+ R  A+ A L+RNP AF PKI     G      
Sbjct: 176 YCDCFANGDFCCNCNCNNCYNNPQHEIERFNAIKACLDRNPEAFLPKIGKGKLG------ 229

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF-- 284
                 +  +HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S ER+ L   
Sbjct: 230 -----DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEESPERKTLINM 284

Query: 285 --------HGDHNGIAF 293
                   H +++ ++F
Sbjct: 285 PNDMEIKSHEENDAVSF 301



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P+  K CNC+ S CLK YCECF A I C   C CV C N  E
Sbjct: 230 DIKPRHNKGCNCKRSGCLKNYCECFEAKIMCSSICKCVGCKNYEE 274


>gi|119595124|gb|EAW74718.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_a
           [Homo sapiens]
          Length = 529

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 325 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 381

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 382 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 433

Query: 284 F 284
            
Sbjct: 434 M 434



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 385 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 426


>gi|196011968|ref|XP_002115847.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
 gi|190581623|gb|EDV21699.1| hypothetical protein TRIADDRAFT_59682 [Trichoplax adhaerens]
          Length = 316

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/133 (48%), Positives = 82/133 (61%), Gaps = 13/133 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YCECFA G +C  CNC NCHNN+EHE  R  A+ A LERNP+AFRPKI  +  G +    
Sbjct: 138 YCECFANGEFCKNCNCTNCHNNIEHENERSRAIKACLERNPHAFRPKIGKAKEGNE---- 193

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH- 285
                    +H+KGC+CK+SGCLK YCEC++A I C+  C+C +CKN E   ER+ L   
Sbjct: 194 --------RRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKNREDCLERKNLMQL 245

Query: 286 GDHNGIAFMQRAA 298
            D   I   Q+ A
Sbjct: 246 ADAADIRVKQQTA 258



 Score = 45.1 bits (105), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           RP+     +    +  K CNC+ S CLK YCEC+ A I C   C C NC N
Sbjct: 182 RPKIGKAKEGNERRHSKGCNCKRSGCLKNYCECYEAKIPCTVLCKCTNCKN 232


>gi|40806200|ref|NP_004914.2| tesmin isoform a [Homo sapiens]
 gi|205371773|sp|Q9Y4I5.2|MTL5_HUMAN RecName: Full=Tesmin; AltName: Full=Metallothionein-like 5,
           testis-specific; AltName: Full=Testis-specific
           metallothionein-like protein
 gi|40555823|gb|AAH64579.1| Metallothionein-like 5, testis-specific (tesmin) [Homo sapiens]
 gi|119595125|gb|EAW74719.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_b
           [Homo sapiens]
          Length = 508

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 361 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 412

Query: 284 F 284
            
Sbjct: 413 M 413



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 405


>gi|426369505|ref|XP_004051728.1| PREDICTED: tesmin [Gorilla gorilla gorilla]
          Length = 508

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 361 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 412

Query: 284 F 284
            
Sbjct: 413 M 413



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 405


>gi|119595126|gb|EAW74720.1| metallothionein-like 5, testis-specific (tesmin), isoform CRA_c
           [Homo sapiens]
          Length = 463

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 58/121 (47%), Positives = 80/121 (66%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 259 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 315

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 316 --------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTL 367

Query: 284 F 284
            
Sbjct: 368 M 368



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 319 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 360


>gi|403340785|gb|EJY69688.1| Tesmin/TSO1-like CXC domain containing protein [Oxytricha
           trifallax]
          Length = 710

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 54/123 (43%), Positives = 72/123 (58%), Gaps = 13/123 (10%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K+C C  S+CLKLYCECFAAG+ C   C C +C N  +   A Q+A    ++R+P AF  
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSDAIQKAKDEIMKRDPLAFEI 584

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K++      Q++  D  +      H KGC CKKSGC K YCECFQ  + C++ C+C  C 
Sbjct: 585 KVS------QNSGTDKLQ------HRKGCTCKKSGCQKGYCECFQLGVPCTDFCKCSGCA 632

Query: 273 NFE 275
           N E
Sbjct: 633 NCE 635



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 26/41 (63%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C C KS CLK YCECF A ++C  +C C DC N E S +
Sbjct: 525 KKCTCSKSKCLKLYCECFAAGLVCGLDCGCKDCCNTEDSSD 565


>gi|444723239|gb|ELW63898.1| Protein lin-54 like protein [Tupaia chinensis]
          Length = 783

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/99 (47%), Positives = 60/99 (60%), Gaps = 12/99 (12%)

Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
           A L+RNP AF+PKI     G  D R           H+KGC+CK+SGCLK YCEC++A I
Sbjct: 604 ACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKRSGCLKNYCECYEAKI 652

Query: 261 LCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 298
           +CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 653 MCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 691



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           GK+G   ++  K CNC+ S CLK YCEC+ A I C   C C+ C N
Sbjct: 620 GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 665


>gi|4581563|gb|AAD24668.1|U86074_1 tesmin [Homo sapiens]
          Length = 299

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/118 (48%), Positives = 79/118 (66%), Gaps = 11/118 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G      
Sbjct: 98  YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG------ 151

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALF 284
                 +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CKN+E S ER+ L 
Sbjct: 152 -----NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 204



 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 155 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEE 196


>gi|145511942|ref|XP_001441893.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409154|emb|CAK74496.1| unnamed protein product [Paramecium tetraurelia]
          Length = 374

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 73/121 (60%), Gaps = 3/121 (2%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYCECFA    C   CNC  C N +++   R +A+   L RNP+AF   +
Sbjct: 169 CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKAIEEALLRNPDAFAAIL 228

Query: 215 ASSPHGAQDAREDA--REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
            ++    Q  +E+   +E+Q      KGC+CKKS C KKYCEC+  N  C++ C+C +C 
Sbjct: 229 TNNGQQPQIIQEEKSQKESQKDITTRKGCNCKKSECKKKYCECYSINQRCTDLCKCENCL 288

Query: 273 N 273
           N
Sbjct: 289 N 289



 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVE 190
           K CNC+ S C K YCEC++    C D C C NC N V+
Sbjct: 255 KGCNCKKSECKKKYCECYSINQRCTDLCKCENCLNKVQ 292


>gi|281211685|gb|EFA85847.1| hypothetical protein PPL_01079 [Polysphondylium pallidum PN500]
          Length = 735

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 61/94 (64%), Gaps = 9/94 (9%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           K C+C+NS+CLK+YCECFA+   C+GC C  CHNN +H  + + A  +TLERNP+AF PK
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCNGCQCFGCHNNEDHIESVENARASTLERNPDAFGPK 385

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSG 247
              S +G         E   A KH KGCHC++SG
Sbjct: 386 FKQSFNG---------EKPKAEKHLKGCHCRRSG 410



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K CHCK S CLK YCECF +  LC+  C+C  C N E   E
Sbjct: 326 KMCHCKNSKCLKMYCECFASKQLCN-GCQCFGCHNNEDHIE 365


>gi|326920244|ref|XP_003206384.1| PREDICTED: tesmin-like [Meleagris gallopavo]
          Length = 622

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 11/116 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YCECFA+G +C+ CNC NC+NN  HE  R +A+   L+RNP AF PKI  S  G      
Sbjct: 427 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSKLG------ 480

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
                 +   HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +++ 
Sbjct: 481 -----DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT 531



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P   K CNC+ S CLK YCECF A I C   C C+ C N  E
Sbjct: 481 DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 525


>gi|159478621|ref|XP_001697401.1| hypothetical protein CHLREDRAFT_105644 [Chlamydomonas reinhardtii]
 gi|158274559|gb|EDP00341.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 91

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 52/92 (56%), Positives = 61/92 (66%), Gaps = 1/92 (1%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YCECFA+  YC+ CNC+ C NN E+E  RQ AV A +ERNPNAF+PKI    H       
Sbjct: 1   YCECFASSRYCEMCNCMQCFNNRENEAVRQSAVEAIMERNPNAFKPKITVRGHETHTPVV 60

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
            A  A  + +H KGC+CKKS CLKKYCECFQ 
Sbjct: 61  VA-AAGASGRHLKGCNCKKSFCLKKYCECFQV 91


>gi|145534959|ref|XP_001453218.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420929|emb|CAK85821.1| unnamed protein product [Paramecium tetraurelia]
          Length = 265

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 76/135 (56%), Gaps = 11/135 (8%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC+ S+CLKLYCECFA    C   CNC  C N +++   R +A    L RNP+AF   +
Sbjct: 52  CNCKKSKCLKLYCECFANNWVCSQNCNCCECKNRIDNPNERSKATEEALLRNPDAFAAIL 111

Query: 215 ASS---PHGAQDA-------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
            ++   P   QD         +  +E+Q+     KGC+CKKS C KKYCEC+  N  C++
Sbjct: 112 TNNGQQPQIIQDKTLILPLEEKSQKESQIDITTRKGCNCKKSECKKKYCECYSINQKCTD 171

Query: 265 NCRCLDCKNFEGSEE 279
            C+C +C N   S+E
Sbjct: 172 LCKCENCLNKAESQE 186


>gi|118091374|ref|XP_001231493.1| PREDICTED: tesmin [Gallus gallus]
          Length = 503

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 76/116 (65%), Gaps = 11/116 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YCECFA+G +C+ CNC NC+NN  HE  R +A+   L+RNP AF PKI  S  G      
Sbjct: 307 YCECFASGDFCNNCNCNNCYNNPLHETERFKAIKVCLDRNPEAFLPKIGQSELG------ 360

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
                 +   HNKGC+CK+SGCLK YCECF+A I+CS  C+C+ CKN+E S +++ 
Sbjct: 361 -----DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEESPDKKT 411



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           D  P   K CNC+ S CLK YCECF A I C   C C+ C N  E
Sbjct: 361 DIKPHHNKGCNCKRSGCLKNYCECFEAKIVCSSICKCIGCKNYEE 405


>gi|357133064|ref|XP_003568148.1| PREDICTED: uncharacterized protein LOC100845078 [Brachypodium
           distachyon]
          Length = 377

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 64/165 (38%), Positives = 78/165 (47%), Gaps = 13/165 (7%)

Query: 116 PHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGI 175
           P   ++L L T P       R R     DG        K C C+ S+CLKLYC CFA G 
Sbjct: 62  PSAANQLALATTPK------RQRVEEAADGNGC-----KHCACKKSKCLKLYCPCFAGGG 110

Query: 176 YC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
           YC + C CV C N  ++    Q      L R       KI        +A EDA  +  +
Sbjct: 111 YCSEKCGCVPCFNKSDYAETVQTTRKVLLSRQKR-MSLKINRRSEANAEAMEDAHNSSSS 169

Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
               +GC+CKKS CLKKYC+C+Q    CS  CRC DCKN  G  E
Sbjct: 170 TPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCEDCKNPYGKNE 214


>gi|332837107|ref|XP_001154282.2| PREDICTED: tesmin [Pan troglodytes]
          Length = 508

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 57/121 (47%), Positives = 79/121 (65%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+N E S ER+ L
Sbjct: 361 --------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTL 412

Query: 284 F 284
            
Sbjct: 413 M 413



 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCEN 402


>gi|397517193|ref|XP_003828803.1| PREDICTED: tesmin [Pan paniscus]
          Length = 508

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 57/121 (47%), Positives = 79/121 (65%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+PKI     G   
Sbjct: 304 LAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIGKGQLG--- 360

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                    +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ C+N E S ER+ L
Sbjct: 361 --------NVKPRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCENCEESPERKTL 412

Query: 284 F 284
            
Sbjct: 413 M 413



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           P+  K CNCR S CLK YCEC+ A I C   C C+ C N
Sbjct: 364 PRHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCEN 402


>gi|168010127|ref|XP_001757756.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691032|gb|EDQ77396.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 113

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 54/111 (48%), Positives = 67/111 (60%), Gaps = 1/111 (0%)

Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YCECFAA ++C G C C NC N  E+E            R+P AF PKI  +       R
Sbjct: 1   YCECFAARLFCVGSCACRNCFNKPEYEATVLNTRQQIEARDPLAFAPKIVQAAEPTPIPR 60

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           +DA +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC  CKN  G
Sbjct: 61  DDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCKNMYG 111



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 123 PLPTLPPGKQES-PRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
           PL   P   Q + P   PR +      + + ++ CNC+ S CLK YCEC+ AG+ C +GC
Sbjct: 42  PLAFAPKIVQAAEPTPIPRDDALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGC 101

Query: 181 NCVNCHN 187
            C  C N
Sbjct: 102 RCEGCKN 108


>gi|326491307|dbj|BAK05753.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326502882|dbj|BAJ99069.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 375

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 64/161 (39%), Positives = 78/161 (48%), Gaps = 21/161 (13%)

Query: 149 TPKKQK---------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEA 198
           TPK+QK          C C+ SRCLKLYC CFA G YC + C CV C N  +     Q  
Sbjct: 66  TPKRQKVEDAADGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCVPCFNKADFAETVQTT 125

Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
               L R       KI        +  EDA  +  +    +GC+CKKS CLKKYC+C+Q 
Sbjct: 126 RKVLLSRQKR-MSLKINRRLEANAETMEDAHNSSSSTPPRRGCNCKKSSCLKKYCDCYQD 184

Query: 259 NILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 289
              CS  CRC DCKN  G        E +R L+ G   DH+
Sbjct: 185 GTGCSLFCRCEDCKNPFGKNEGIMADESKRFLYTGADLDHS 225


>gi|297743393|emb|CBI36260.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 52/116 (44%), Positives = 68/116 (58%), Gaps = 4/116 (3%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNP 207
           T   +K CNC+ ++CLKLYC+C AAG+YC D C C NC N  E+E   Q        R+P
Sbjct: 10  TESDRKHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDP 69

Query: 208 NAFRPKIASSPHGAQD--AREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANI 260
            AF P+I +      D   +ED       +A+H +GC+CKKS C KKYCEC+Q  I
Sbjct: 70  LAFAPRIVNPDTDTTDNVLQEDGNWTTPSSARHKRGCNCKKSMCQKKYCECYQVYI 125



 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 27/40 (67%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
           K C+CK++ CLK YC+C  A + C+++C C +C N   +E
Sbjct: 15  KHCNCKRTQCLKLYCDCLAAGVYCTDSCACSNCLNKSENE 54


>gi|403344595|gb|EJY71645.1| Tesmin/TSO1-like CXC domain-containing protein [Oxytricha
           trifallax]
          Length = 600

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 54/115 (46%), Positives = 71/115 (61%), Gaps = 10/115 (8%)

Query: 162 RCLKLYCECFAAGIYCD-GCNCVNCHNNVEH-EVARQEAVGATLERNPNAFRPKIAS-SP 218
           +CLKLYC+CFA G  C   C C+NC N  ++ E    EA    + RNP AF+PK+    P
Sbjct: 454 QCLKLYCDCFALGGVCGPDCGCLNCENKEQNQESVVLEARKKIIGRNPEAFQPKVVEVGP 513

Query: 219 HGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
             AQ ++ DA +     +H KGC+CK SGC K YCECFQ  + C++NCRC+ C N
Sbjct: 514 --AQMSQNDADK-----QHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561



 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           D   + +K CNC+NS C K YCECF  G+ C+  C C+ C N
Sbjct: 520 DADKQHRKGCNCKNSGCQKNYCECFQFGLECNKNCRCIGCSN 561


>gi|339238897|ref|XP_003381003.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976020|gb|EFV59373.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 492

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 12/113 (10%)

Query: 173 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQ 232
           A +    C+C NC NN+ +E+ R  A+ + LERNP AF+PKI          R D     
Sbjct: 275 APVLKSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQPKIG-------KGRADTERL- 326

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
               HNKGC+CKKS CLK YCEC++A + C+  C+C+ C+N E     R   H
Sbjct: 327 ----HNKGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRNTEADRNHRNRGH 375



 Score = 45.1 bits (105), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           K CNC+ S CLK YCEC+ A + C   C CV C N
Sbjct: 329 KGCNCKKSSCLKNYCECYEARVSCTVRCKCVGCRN 363


>gi|348678286|gb|EGZ18103.1| hypothetical protein PHYSODRAFT_504652 [Phytophthora sojae]
          Length = 713

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/142 (40%), Positives = 75/142 (52%), Gaps = 14/142 (9%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+   CLK+YC CF++  +C  GC C +C N  +H+  R EA+   L  +P AF    
Sbjct: 536 CGCKTG-CLKMYCMCFSSRGFCHAGCACDDCKNGRKHQTERVEAIQNYLANDPRAF--SF 592

Query: 215 ASSPHGAQDAREDAREAQLA-AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           AS P  A          Q + A   +GC CKKS CLKKYCECFQ  I C+ +CRC+DC N
Sbjct: 593 ASLPQSASTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSN 652

Query: 274 FEGS---------EERRALFHG 286
              S         ++R A  HG
Sbjct: 653 HSESTAAHQHPHLQKRSATAHG 674


>gi|119626323|gb|EAX05918.1| hypothetical protein DKFZp686L1814, isoform CRA_b [Homo sapiens]
          Length = 223

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 48/100 (48%), Positives = 60/100 (60%), Gaps = 12/100 (12%)

Query: 187 NNVEHEVA-RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKK 245
            NV + V   Q    A L+RNP AF+PKI     G  D R           H+KGC+CK+
Sbjct: 29  GNVGYAVLPAQYVTQACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGCNCKR 77

Query: 246 SGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 78  SGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMH 117



 Score = 46.2 bits (108), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 145 GKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           GK+G   ++  K CNC+ S CLK YCEC+ A I C   C C+ C N  E
Sbjct: 60  GKEGESDRRHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEE 108


>gi|226502887|ref|NP_001148761.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|195621962|gb|ACG32811.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|413949786|gb|AFW82435.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 356

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 72/147 (48%), Gaps = 12/147 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ SRCLKLYC CFA G YC D C C  C N        Q      L R       
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 120

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI   P    +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 180

Query: 273 NFEG-------SEERRALFHG---DHN 289
           N  G        + +R L+ G   DH+
Sbjct: 181 NPFGKNDGIIADDSKRYLYTGADLDHS 207



 Score = 39.3 bits (90), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 9/58 (15%)

Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           RP AN +           TP + + CNC+ S CLK YC+C+  G  C   C C +C N
Sbjct: 125 RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 181


>gi|242091037|ref|XP_002441351.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
 gi|241946636|gb|EES19781.1| hypothetical protein SORBIDRAFT_09g025040 [Sorghum bicolor]
          Length = 359

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/141 (40%), Positives = 69/141 (48%), Gaps = 9/141 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ SRCLKLYC CF+ G YC D C C  C N        Q      L R       
Sbjct: 67  KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 125

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI   P    +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+
Sbjct: 126 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185

Query: 273 NFEGSEE-------RRALFHG 286
           N  G  E       +R L+ G
Sbjct: 186 NPFGKNEGIIADDSKRYLYTG 206


>gi|301097298|ref|XP_002897744.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
 gi|262106765|gb|EEY64817.1| exosome complex exonuclease RRP41-like protein [Phytophthora
           infestans T30-4]
          Length = 777

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/123 (43%), Positives = 68/123 (55%), Gaps = 11/123 (8%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+   CLK+YC CF++  +C  GC C +C N    +  R EA+ + L  +P AF    
Sbjct: 540 CGCKTG-CLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTERVEAIQSYLANDPRAF--SF 596

Query: 215 ASSPHGAQDAREDAREAQLAAKHN----KGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           AS P   QD         L  K +    +GC CKKS CLKKYCECFQ  I C+ +CRC+D
Sbjct: 597 ASLP---QDTNTTGFLHLLPQKSSAVVMRGCRCKKSKCLKKYCECFQNGIACTSHCRCMD 653

Query: 271 CKN 273
           C N
Sbjct: 654 CSN 656



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQ 196
           + C C+ S+CLK YCECF  GI C   C C++C N+ E   A Q
Sbjct: 622 RGCRCKKSKCLKKYCECFQNGIACTSHCRCMDCSNHSESTAAHQ 665



 Score = 45.4 bits (106), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C CK +GCLK YC CF +   C   C C DCKN   S+  R
Sbjct: 540 CGCK-TGCLKMYCMCFSSRGFCHAGCACDDCKNGRNSQTER 579


>gi|115464805|ref|NP_001056002.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|48475201|gb|AAT44270.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579553|dbj|BAF17916.1| Os05g0509400 [Oryza sativa Japonica Group]
 gi|215766036|dbj|BAG98264.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218197077|gb|EEC79504.1| hypothetical protein OsI_20571 [Oryza sativa Indica Group]
          Length = 374

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/163 (38%), Positives = 78/163 (47%), Gaps = 23/163 (14%)

Query: 149 TPKKQK-----------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
           TPK+QK            C C+ SRCLKLYC CFA G YC + C C  C N   +    Q
Sbjct: 70  TPKRQKVEESADGNGCKHCACKKSRCLKLYCPCFAGGGYCSEKCGCQPCFNKALYAETVQ 129

Query: 197 EAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
                 L R       KI        +A EDA  +  +    +GC+CKKS CLKKYC+C+
Sbjct: 130 TTRKVLLSRQKR-MSLKINRRSEANAEAVEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCY 188

Query: 257 QANILCSENCRCLDCKNFEG-------SEERRALFHG---DHN 289
           Q    CS  CRC DC+N  G        E +R L+ G   DH+
Sbjct: 189 QDGTGCSLFCRCEDCRNPFGKNEGIMAEESKRFLYTGADLDHS 231


>gi|224035483|gb|ACN36817.1| unknown [Zea mays]
          Length = 356

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/147 (39%), Positives = 72/147 (48%), Gaps = 12/147 (8%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ SRCLKLYC CFA G YC D C C  C +        Q      L R       
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFSKEAFSETVQTTRKVLLSRQ-KRMSM 120

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI   P    +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 180

Query: 273 NFEG-------SEERRALFHG---DHN 289
           N  G        + +R L+ G   DH+
Sbjct: 181 NPFGKNDGIIADDSKRYLYTGADLDHS 207



 Score = 39.3 bits (90), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 9/58 (15%)

Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           RP AN +           TP + + CNC+ S CLK YC+C+  G  C   C C +C N
Sbjct: 125 RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 181


>gi|226507366|ref|NP_001150621.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|195640640|gb|ACG39788.1| tesmin/TSO1-like CXC domain containing protein [Zea mays]
 gi|219884557|gb|ACL52653.1| unknown [Zea mays]
 gi|414868874|tpg|DAA47431.1| TPA: Tesmin/TSO1-like CXC domain containing protein [Zea mays]
          Length = 394

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 74/141 (52%), Gaps = 16/141 (11%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           S R R R   DG DG     K C+C+ SRCLKLYC CFA+G++C + C C  CHN  + +
Sbjct: 72  SDRKRRRKAEDG-DGC----KTCSCKKSRCLKLYCVCFASGLHCTESCGCEPCHNKPQLQ 126

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
              + A    L         K   +    QD  E    + +     K C CKKSGCLKKY
Sbjct: 127 GTLRMATVLPL---------KPVQTFEAGQDIVEQVIRSPMDLIRRK-CTCKKSGCLKKY 176

Query: 253 CECFQANILCSENCRCLDCKN 273
           C+C+Q    CS NC+C DC+N
Sbjct: 177 CDCYQGGAGCSINCKCDDCRN 197


>gi|226500562|ref|NP_001150787.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|194699308|gb|ACF83738.1| unknown [Zea mays]
 gi|195641826|gb|ACG40381.1| CXC domain containing TSO1-like protein 1 [Zea mays]
 gi|224028923|gb|ACN33537.1| unknown [Zea mays]
 gi|413945974|gb|AFW78623.1| CXC domain containing TSO1-like protein 1 [Zea mays]
          Length = 359

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/141 (39%), Positives = 68/141 (48%), Gaps = 9/141 (6%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ SRCLKLYC CF+ G YC D C C  C N               L R       
Sbjct: 67  KHCACKKSRCLKLYCPCFSGGGYCSDKCGCQPCFNKEAFAETVHTTRKVLLSRQ-KRMSM 125

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           KI   P    +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+
Sbjct: 126 KINRRPEANTEPMEDAHHSSSSTPPKRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQ 185

Query: 273 NFEGSEE-------RRALFHG 286
           N  G  E       +R L+ G
Sbjct: 186 NPFGKNEGIMADDSKRYLYTG 206



 Score = 42.0 bits (97), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 9/58 (15%)

Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           RP ANT+           TP K + CNC+ S CLK YC+C+  G  C   C C +C N
Sbjct: 130 RPEANTEPMEDAHHSSSSTPPK-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQN 186


>gi|145513118|ref|XP_001442470.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409823|emb|CAK75073.1| unnamed protein product [Paramecium tetraurelia]
          Length = 176

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 89/169 (52%), Gaps = 22/169 (13%)

Query: 89  QAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDG 148
           ++  RSP ++  +   R   ++ +N V     K PL      K  S +   R  T  K  
Sbjct: 22  RSQFRSPPKSPHRKLERATKLKFINFV---TLKKPL------KYTSDQITNRKRTSSKRR 72

Query: 149 TPKKQKQ-----CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGAT 202
            P+ +++     CNC+ S+CLKLYC+CF AG+ C   CNC +CHNN EH   R+  +   
Sbjct: 73  KPQTEQKPKIIVCNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREIVIKQI 132

Query: 203 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKK 251
           +ERNP+AFRPK+ S  +      ED  E     +H KGC+CKKS   KK
Sbjct: 133 MERNPSAFRPKVESKSNS-----ED--EQDHKPRHFKGCNCKKSNWSKK 174



 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 27/43 (62%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           C+CKKS CLK YC+CF A + C ++C C  C N +   + R +
Sbjct: 85  CNCKKSKCLKLYCDCFTAGVTCGKDCNCCSCHNNDEHPKEREI 127


>gi|390470886|ref|XP_003734371.1| PREDICTED: LOW QUALITY PROTEIN: tesmin [Callithrix jacchus]
          Length = 466

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 78/121 (64%), Gaps = 11/121 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  +C+CFA+G + + CNC NC NN+ HE+ R +A+ A L R+P AF+P+I     G   
Sbjct: 305 LAGHCDCFASGDFGNNCNCNNCCNNLHHEIERFKAIKACLGRDPEAFQPQIGKGQLG--- 361

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
                   ++  +H+KGC+C++ GCL+ YCEC +A I+CS  C+C+ CKN+E S ER+ L
Sbjct: 362 --------EVRPRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEESPERKTL 413

Query: 284 F 284
            
Sbjct: 414 M 414



 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           P+  K CNCR   CL+ YCEC  A I C   C C+ C N  E
Sbjct: 365 PRHSKGCNCRRLGCLRNYCECQEAKIMCSSICKCIGCKNYEE 406


>gi|449274469|gb|EMC83611.1| Tesmin, partial [Columba livia]
          Length = 289

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 86/156 (55%), Gaps = 18/156 (11%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CFA G +C  CNC NC NN  HE  R  A+ A LERNP AF PKI     G      
Sbjct: 151 YCDCFANGDFCKNCNCDNCCNNQLHETERLTAIKACLERNPEAFLPKIGKRKLG------ 204

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS-EERRALFH 285
                ++   HNKGC+CK+SGCL+ YCECF+A I+CS  C+C  CKN+E S +E + L  
Sbjct: 205 -----EIKHHHNKGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEESPDENKQL-- 257

Query: 286 GDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRK 321
              N I +M    N      V +S +  L  S K +
Sbjct: 258 ---NVINYMD-IGNKEGNSPVLTSAFNMLPKSRKDR 289



 Score = 41.2 bits (95), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVE 190
           K CNC+ S CL+ YCECF A I C   C C  C N  E
Sbjct: 212 KGCNCKRSGCLENYCECFEAKIVCSSICKCTGCKNYEE 249


>gi|222632736|gb|EEE64868.1| hypothetical protein OsJ_19725 [Oryza sativa Japonica Group]
          Length = 223

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 73/137 (53%), Gaps = 15/137 (10%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C+C  S C +LYC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
           I        A+ P  +  A  D +    A       KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 260 ILCSENCRCLDCKNFEG 276
           + C+  C+CL+C N  G
Sbjct: 129 VGCTAKCKCLECSNSFG 145



 Score = 40.0 bits (92), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|384246119|gb|EIE19610.1| hypothetical protein COCSUDRAFT_9740, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 90

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 63/95 (66%), Positives = 67/95 (70%), Gaps = 9/95 (9%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS----SPHGAQ 222
           YCECFA+G YCDGCNCVNC NN E E  RQ AV A LERNPNAFRPKI         G  
Sbjct: 1   YCECFASGRYCDGCNCVNCCNNKESEQVRQAAVEAILERNPNAFRPKIQVLEPFCTTGTH 60

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
            A + A     AA+HNKGC+CKKSGCLKKYCECFQ
Sbjct: 61  VAIQGA-----AARHNKGCNCKKSGCLKKYCECFQ 90


>gi|403347003|gb|EJY72913.1| Transcription factor, putative [Oxytricha trifallax]
          Length = 286

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/138 (37%), Positives = 74/138 (53%), Gaps = 12/138 (8%)

Query: 157 NCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIAS 216
           NC N+ C+K YC+CF    YC  CN   C NN E+E  RQ+A+   L +NP AF  K  +
Sbjct: 144 NCSNTYCIKRYCQCFKLNKYCTNCNHPECKNNQENEEERQKAIDYILSKNPQAFMTKNKT 203

Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC--LDCKNF 274
             +  + +++D+            C+C KSGC  KYC C +  ILCS+NC+C    CKN 
Sbjct: 204 RKNFIKKSKKDSNLIHT-------CNCSKSGCNNKYCVCLKEGILCSKNCKCNHRKCKNK 256

Query: 275 EGS---EERRALFHGDHN 289
           +G    ++R  +   D N
Sbjct: 257 QGDRNQKQRSVIISNDEN 274


>gi|340507878|gb|EGR33739.1| tesmin tso1-like cxc domain protein [Ichthyophthirius multifiliis]
          Length = 142

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 54/128 (42%), Positives = 72/128 (56%), Gaps = 18/128 (14%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNC----HNNVEHEVARQEAVGATLERNPNAF 210
           C C+ SRCL+LYCECF  GI+C+  C C NC    +N  + E A+QEA      RNP+AF
Sbjct: 15  CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQIESAKQEAKM----RNPDAF 70

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
             K         +             H KGC+C K+ C KKYCECF A I C+ENC+C +
Sbjct: 71  SQKFLVVKQNQYEG---------IVSHKKGCNCTKTQCTKKYCECFNAGIKCTENCKCEN 121

Query: 271 CKNFEGSE 278
           C+N++  +
Sbjct: 122 CENYKDEQ 129



 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 35/70 (50%), Gaps = 9/70 (12%)

Query: 127 LPPGKQESPRSRPRANT--------DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC- 177
           +   KQE+    P A +        +  +G    +K CNC  ++C K YCECF AGI C 
Sbjct: 55  IESAKQEAKMRNPDAFSQKFLVVKQNQYEGIVSHKKGCNCTKTQCTKKYCECFNAGIKCT 114

Query: 178 DGCNCVNCHN 187
           + C C NC N
Sbjct: 115 ENCKCENCEN 124



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 28/40 (70%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER 280
           C CKKS CL+ YCECF   I C+++C C +C N E ++++
Sbjct: 15  CKCKKSRCLQLYCECFVNGIFCNKSCICTNCGNTENNKKQ 54


>gi|48475095|gb|AAT44164.1| hypothetical protein, contains Tesmin/TSO1-like CXC domain, PF03638
           [Oryza sativa Japonica Group]
          Length = 194

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 73/137 (53%), Gaps = 15/137 (10%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C+C  S C +LYC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
           I        A+ P  +  A  D +    A       KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 260 ILCSENCRCLDCKNFEG 276
           + C+  C+CL+C N  G
Sbjct: 129 VGCTAKCKCLECSNSFG 145



 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|118356010|ref|XP_001011264.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89293031|gb|EAR91019.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 617

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/126 (36%), Positives = 66/126 (52%), Gaps = 14/126 (11%)

Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YCECF  G YC D C C  C NN ++   R++A+     RN +AF  K   + +      
Sbjct: 357 YCECFLRGNYCNDQCICTECGNNDKNLEEREKAIDEAKTRNQDAFNLKFQITSNST---- 412

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
                      H KGCHCK++ CLKKYCECF A + C+ NC+C +C+N +   +   +  
Sbjct: 413 ---------VTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRNTQDGLKTMGISG 463

Query: 286 GDHNGI 291
           G  +G+
Sbjct: 464 GGTSGL 469



 Score = 44.7 bits (104), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           + T   +K C+C+ + CLK YCECF AG+ C + C C  C N
Sbjct: 410 NSTVTHKKGCHCKRTHCLKKYCECFNAGLKCTNNCKCEECRN 451


>gi|219112185|ref|XP_002177844.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410729|gb|EEC50658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 455

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 51/126 (40%), Positives = 69/126 (54%), Gaps = 22/126 (17%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVARQEAVGATLERNP 207
           P  Q  CNC  SRCLKLYC CF AG  C    C C+ C N+VE    R++A+  TL++ P
Sbjct: 332 PAGQVACNCIRSRCLKLYCTCFQAGKMCQPGICTCIACANSVEDHPERKQAIKHTLQKRP 391

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
           +AF+ K    P G                   GC CK + C++KYCECF+  + C++ C 
Sbjct: 392 DAFQTK--DRPVGL------------------GCACKNNKCIRKYCECFRNGLSCADKCC 431

Query: 268 CLDCKN 273
           CL+C+N
Sbjct: 432 CLNCEN 437



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/133 (38%), Positives = 66/133 (49%), Gaps = 19/133 (14%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEH---EVARQEAVGATLERNPNAF 210
           C+C  S+C+ LYC+CF AG  C+   CNC  C N +     + AR  A+ A L RNP AF
Sbjct: 260 CSCPKSKCVALYCDCFKAGRRCNPINCNCTACKNTINESGPQGARTNAIRAILARNPRAF 319

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCL 269
                    G   A +     Q+A      C+C +S CLK YC CFQA  +C    C C+
Sbjct: 320 ------VTAGVASAVQKLPAGQVA------CNCIRSRCLKLYCTCFQAGKMCQPGICTCI 367

Query: 270 DCKN-FEGSEERR 281
            C N  E   ER+
Sbjct: 368 ACANSVEDHPERK 380



 Score = 39.7 bits (91), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSE-NCRCLDCKN 273
           GC C KS C+  YC+CF+A   C+  NC C  CKN
Sbjct: 259 GCSCPKSKCVALYCDCFKAGRRCNPINCNCTACKN 293


>gi|224007227|ref|XP_002292573.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971435|gb|EED89769.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 886

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 52/127 (40%), Positives = 65/127 (51%), Gaps = 17/127 (13%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVA---RQEAVGATLERN 206
           KK   C C  ++CL LYC+CF   I+C + C C++C N VE+  A   R  AV   LER 
Sbjct: 6   KKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLERR 65

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
           P AF              RE    A+   K + GC C K+ CLKKYC CF A + C   C
Sbjct: 66  PEAF-------------GRESGAFAERRNKKDGGCTCAKNSCLKKYCVCFGAKMQCRARC 112

Query: 267 RCLDCKN 273
            C +CKN
Sbjct: 113 NCSNCKN 119



 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 46/126 (36%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNA 209
           KK   C C  + CLK YC CF A + C   CNC NC N     +  ++ V  T+E     
Sbjct: 82  KKDGGCTCAKNSCLKKYCVCFGAKMQCRARCNCSNCKNPF-GTIRIEDGVTCTIEGGKVN 140

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
                A  P          R+    AK N  C C ++ CLK YC+CF+ +++C++NC CL
Sbjct: 141 ALKGAAFVP---------IRKLANLAKGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCL 191

Query: 270 DCKNFE 275
           +CKN E
Sbjct: 192 ECKNSE 197



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 24/129 (18%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEV---ARQEAVGATLERN 206
           K   +C+C  + CLKLYC+CF   + C D C+C+ C N+ E+      R   +   LE+ 
Sbjct: 158 KGNTKCSCTRAACLKLYCDCFRQSMVCNDNCSCLECKNSEEYSGPMGERTLVIKDILEKR 217

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-EN 265
           P+ F+ K A    G                   GC C+K+ C   YC+C  +   C    
Sbjct: 218 PHIFQ-KTAKRKAGG------------------GCSCQKNKCRAGYCDCKGSGGKCDPAV 258

Query: 266 CRCLDCKNF 274
           C C++C N 
Sbjct: 259 CTCINCDNM 267



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 38/77 (49%), Gaps = 15/77 (19%)

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--------------FEGSEE 279
           AAK    C C K+ CL  YC+CF  NI C E+C C+DCKN               EG  E
Sbjct: 4   AAKKAPSCKCTKTKCLLLYCDCFHHNIFCKESCTCIDCKNTVEYNGAGGERTLAVEGVLE 63

Query: 280 RRALFHGDHNGIAFMQR 296
           RR    G  +G AF +R
Sbjct: 64  RRPEAFGRESG-AFAER 79


>gi|428163094|gb|EKX32185.1| hypothetical protein GUITHDRAFT_148793 [Guillardia theta CCMP2712]
          Length = 496

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 63/113 (55%), Gaps = 11/113 (9%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           ++ YC CF A + C+GC+CV+C+N+ +HE  R  A+      +P AF  ++         
Sbjct: 297 VRQYCVCFRAALLCEGCDCVDCYNDGQHEQERLAAIEHIKTSDPLAFADRVR-------- 348

Query: 224 AREDAREAQLAAK---HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           A  DA    + +K   H +GC CK S CLKKYCECF+  + CS  C C DC N
Sbjct: 349 AEGDAASVSVESKPKQHVRGCKCKNSKCLKKYCECFEFGVSCSSKCDCKDCMN 401


>gi|242086178|ref|XP_002443514.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
 gi|241944207|gb|EES17352.1| hypothetical protein SORBIDRAFT_08g020790 [Sorghum bicolor]
          Length = 409

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 73/141 (51%), Gaps = 16/141 (11%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           S R R R   DG DG     K C+C+ SRC+KLYC CFA+G  C + C C  C N    +
Sbjct: 87  SDRKRRRKAEDG-DGC----KTCSCKKSRCMKLYCVCFASGSRCTESCGCEPCENKQPLQ 141

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
           VA + A+   L         K   +    QD  E    + +     K C CKKSGCLKKY
Sbjct: 142 VAPRTALILPL---------KPVQTSEAGQDIVEQVIRSPMDLIRRK-CTCKKSGCLKKY 191

Query: 253 CECFQANILCSENCRCLDCKN 273
           C+C+Q    CS NC+C DC+N
Sbjct: 192 CDCYQGGAGCSINCKCDDCRN 212


>gi|145496358|ref|XP_001434170.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401293|emb|CAK66773.1| unnamed protein product [Paramecium tetraurelia]
          Length = 387

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 74/132 (56%), Gaps = 15/132 (11%)

Query: 155 QCNCRNSRCLKLYCECFAA-GIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           QCNC+ S+CLKLYCECFA  G+    CNC  C N +++   R +A+   L RN +AF   
Sbjct: 194 QCNCKKSKCLKLYCECFANNGVCSQSCNCQECKNRIDNPQERSKAIEEALLRNSDAFAQC 253

Query: 214 IAS--SPHGAQDA----------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
             +  +P   Q            +E +++       +KGC+CKKSGC KKYCEC+  N+ 
Sbjct: 254 FTTKGAPQFVQQGIFNFIQDKPIKEPSKDNSSVV--HKGCNCKKSGCKKKYCECYSQNLK 311

Query: 262 CSENCRCLDCKN 273
           C++ C+C  C N
Sbjct: 312 CNDLCKCEHCLN 323



 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
           KD +    K CNC+ S C K YCEC++  + C D C C +C N  + ++   +  G
Sbjct: 281 KDNSSVVHKGCNCKKSGCKKKYCECYSQNLKCNDLCKCEHCLNKTDAQIQASQEQG 336


>gi|428176775|gb|EKX45658.1| hypothetical protein GUITHDRAFT_152657 [Guillardia theta CCMP2712]
          Length = 434

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 131/307 (42%), Gaps = 48/307 (15%)

Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGA 201
           TD      +K+K C C  ++CLK YC CF   + C + C CV+CHN+ +HE AR  AV  
Sbjct: 4   TDNAQKKSEKEKGCRCGRTKCLKQYCACFRNDVRCTNDCVCVDCHNDGKHEQARMMAVRL 63

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
               +P AF+          +   ++        K  +GC C++S C KKYCECF A + 
Sbjct: 64  VRLNDPMAFKGT------SLELENQEVHTPNGTLKTVRGCRCRRSKCQKKYCECFGAGLK 117

Query: 262 CSENCRCLDCKNFEGSEERRALF--HGDHNGIAFMQRA--ANAAIIGAVGSSGY-----G 312
           CS NC C  C N  G++  +  +   G  + +A M  A  A  ++    G + +      
Sbjct: 118 CSTNCVCEGCLN--GNDPSKPFYPSGGSASSVARMAPAIPAGFSVSNMAGKTHFDRTSKA 175

Query: 313 TLMTSNKRKSEELLFG---------VAAKDQSV----------IRNPQSQQGN------- 346
           +L TS   +  +L F           A K QS+          +RNP S +         
Sbjct: 176 SLDTSRASRPPQLSFSRTSAPSAEQKAEKSQSLASRRPPISVEVRNPTSMRVACAVTEIP 235

Query: 347 -NVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVSSEAT 405
            +   S  P +Q V +      +      S+L +P AD++     +E  SL   +SS   
Sbjct: 236 LHDSRSRAPVAQEVKLEDKVEVNKENKPLSSLDNPAADLVL---ERESSSLNYYMSSTPQ 292

Query: 406 KTLTSIY 412
             LT  +
Sbjct: 293 LNLTPTW 299


>gi|218197350|gb|EEC79777.1| hypothetical protein OsI_21187 [Oryza sativa Indica Group]
          Length = 214

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 72/137 (52%), Gaps = 15/137 (10%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C+C  S C +LYC CF +  +C D CNC  C+N   +E A +E       +NPNAF P+
Sbjct: 9   KCSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYEDAIEEISDMIQMKNPNAFDPR 68

Query: 214 I--------ASSPHGAQDAREDAREAQLAA------KHNKGCHCKKSGCLKKYCECFQAN 259
           I        A+ P  +  A  D +    A       KH KGC C+KS C K YCECF+ +
Sbjct: 69  IIVSVQDATAADPQSSTSAISDPKNTSDAMPGNEQRKHAKGCSCRKSKCSKLYCECFKNS 128

Query: 260 ILCSENCRCLDCKNFEG 276
           + C+  C+C +C N  G
Sbjct: 129 VGCTAKCKCQECSNSFG 145



 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           C C +S C + YC CF++   CS+NC C  C N +  E+
Sbjct: 10  CSCVQSSCTQLYCRCFRSRYFCSDNCNCSGCYNIKYYED 48


>gi|326504624|dbj|BAK06603.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 405

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 77/147 (52%), Gaps = 20/147 (13%)

Query: 139 PRANTDGKDGTPKKQK--------QCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
           P A  +G     KK+K         C+C+ SRCLKLYC CFA+G +C + C C  C+N  
Sbjct: 72  PPAEANGSSDRRKKRKAEDGEGCRTCSCKKSRCLKLYCVCFASGSHCSELCGCDPCYNKP 131

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
            H V +    G  L+         +  +    Q++ E AR  Q+     K C CKKSGCL
Sbjct: 132 IHGVQQNTPPGLPLQ---------VVRTAEAGQNSAEFAR-YQMDFFRRK-CTCKKSGCL 180

Query: 250 KKYCECFQANILCSENCRCLDCKNFEG 276
           KKYC+C+Q    CS NC+C DCKN  G
Sbjct: 181 KKYCDCYQGGAGCSINCKCDDCKNPYG 207


>gi|149061864|gb|EDM12287.1| rCG47760, isoform CRA_b [Rattus norvegicus]
          Length = 371

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 43/96 (44%), Positives = 62/96 (64%), Gaps = 15/96 (15%)

Query: 164 LKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 223
           L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA  
Sbjct: 275 LAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAK 330

Query: 224 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
            R           H+KGC+CK+SGCLK YCEC++A+
Sbjct: 331 LR-----------HSKGCNCKRSGCLKNYCECYEAS 355


>gi|145506479|ref|XP_001439200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406384|emb|CAK71803.1| unnamed protein product [Paramecium tetraurelia]
          Length = 175

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 64/121 (52%), Gaps = 25/121 (20%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K CNC+ S+CLK YC+CFA G  C + CNCV C NN  +   R+EA      R+P AF+ 
Sbjct: 76  KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKEAKLQIQTRDPRAFKQ 135

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
                                     KGC+CKKSGC KKYCECFQ N+ C+  CRC  C 
Sbjct: 136 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 171

Query: 273 N 273
           N
Sbjct: 172 N 172


>gi|145518329|ref|XP_001445042.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412475|emb|CAK77645.1| unnamed protein product [Paramecium tetraurelia]
          Length = 190

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 52/121 (42%), Positives = 64/121 (52%), Gaps = 25/121 (20%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K CNC+ S+CLK YC+CFA G  C + CNCV C NN  +   R++A      R+P AF+ 
Sbjct: 91  KTCNCKKSQCLKQYCDCFANGQVCSENCNCVGCFNNSLNNSQRKDAKLQIQARDPGAFKQ 150

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
                                     KGC+CKKSGC KKYCECFQ N+ C+  CRC  C 
Sbjct: 151 AF------------------------KGCNCKKSGCQKKYCECFQNNLQCTHQCRCEGCV 186

Query: 273 N 273
           N
Sbjct: 187 N 187


>gi|170044124|ref|XP_001849708.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867389|gb|EDS30772.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 586

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/82 (50%), Positives = 56/82 (68%), Gaps = 1/82 (1%)

Query: 130 GKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNV 189
           G ++  R  P   T   D   KK + CNC  S+CLKLYC+CFA G +C  CNC +C+NN+
Sbjct: 506 GSEDETRKGPAVGTIFPDEAYKK-RPCNCTKSQCLKLYCDCFANGEFCYNCNCRDCYNNL 564

Query: 190 EHEVARQEAVGATLERNPNAFR 211
           ++E  RQ+A+ ATLERNP+AF+
Sbjct: 565 DNEEERQKAIRATLERNPSAFK 586



 Score = 47.8 bits (112), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+C KS CLK YC+CF     C  NC C DC N   +EE R
Sbjct: 531 CNCTKSQCLKLYCDCFANGEFC-YNCNCRDCYNNLDNEEER 570


>gi|168049832|ref|XP_001777365.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671214|gb|EDQ57769.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 120

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/118 (45%), Positives = 67/118 (56%), Gaps = 8/118 (6%)

Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA----SSPH-- 219
           YCECFAAG+YC G C C +C N  E+             RNP AF PKI     SSP   
Sbjct: 1   YCECFAAGLYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPG 60

Query: 220 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
             +    ++  +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CRC  C N  G
Sbjct: 61  VSSNQKLDEVLDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMNKYG 118



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 10/93 (10%)

Query: 105 RPQHVRMVNRVPHPVH-KLPL---PTLPPGKQESPRSRPRAN--TDGKDGTP---KKQKQ 155
           +P+++  V      +  + PL   P +  G + SP     +N   D    TP   + ++ 
Sbjct: 23  KPEYIETVINTRQQIESRNPLAFAPKIVQGAESSPVPGVSSNQKLDEVLDTPASARHKRG 82

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           CNC+ S CLK YCEC+ AG+ C +GC C  C N
Sbjct: 83  CNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 115


>gi|149061865|gb|EDM12288.1| rCG47760, isoform CRA_c [Rattus norvegicus]
 gi|149061866|gb|EDM12289.1| rCG47760, isoform CRA_c [Rattus norvegicus]
          Length = 191

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 61/93 (65%), Gaps = 15/93 (16%)

Query: 167 YCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+PK+     GA   R 
Sbjct: 98  YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 152

Query: 227 DAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
                     H+KGC+CK+SGCLK YCEC++A+
Sbjct: 153 ----------HSKGCNCKRSGCLKNYCECYEAS 175


>gi|145525306|ref|XP_001448475.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416019|emb|CAK81078.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/121 (42%), Positives = 64/121 (52%), Gaps = 25/121 (20%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K CNC+ S+CLK YC+CFA G  C + CNCV C NN  +   R+EA    + R+P AF+ 
Sbjct: 89  KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFKQ 148

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
                                     KGC+CKKSGC KKYCECF + + C+  CRC  C 
Sbjct: 149 SF------------------------KGCNCKKSGCQKKYCECFLSGLACTHLCRCDGCL 184

Query: 273 N 273
           N
Sbjct: 185 N 185



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 31/45 (68%)

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           + K C+CKKS CLK+YC+CF    +CSENC C+ C N   + E+R
Sbjct: 87  NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 131


>gi|145536949|ref|XP_001454192.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421945|emb|CAK86795.1| unnamed protein product [Paramecium tetraurelia]
          Length = 196

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/122 (40%), Positives = 65/122 (53%), Gaps = 25/122 (20%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFR 211
           ++ CNC+ S+CLK YC+CFA G  C + CNCV C NN  +   R+EA    + R+P AF+
Sbjct: 96  ERSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQRKEAKVQIINRDPGAFK 155

Query: 212 PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
                                      KGC+CKKSGC KKYCECF + + C+  CRC  C
Sbjct: 156 QSF------------------------KGCNCKKSGCQKKYCECFLSGLACTHLCRCDGC 191

Query: 272 KN 273
            N
Sbjct: 192 LN 193



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 9/99 (9%)

Query: 186 HNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAK---HNKGCH 242
            N +E E   +E V  +++     F+  ++++       +   +  Q+ +K   + + C+
Sbjct: 47  QNRIEME---EETVETSIQL---VFKEYLSTNQDYYFLLQISQKSLQIQSKLFINERSCN 100

Query: 243 CKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           CKKS CLK+YC+CF    +CSENC C+ C N   + E+R
Sbjct: 101 CKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNMEQR 139


>gi|168057921|ref|XP_001780960.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667594|gb|EDQ54220.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 159

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 70/132 (53%), Gaps = 22/132 (16%)

Query: 167 YCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDA- 224
           YCECFA G+YC G C C +C N  E+             RNP AF PKI     GA+ + 
Sbjct: 1   YCECFALGVYCVGSCACRDCFNKPEYIETVINTRQQIESRNPLAFAPKIV---QGAEPSP 57

Query: 225 -----------------REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
                            +++A +   +A+H +GC+CKKS CLKKYCEC+QA + CSE CR
Sbjct: 58  VPGVTLPAAEACVQLVVQDEALDTPASARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCR 117

Query: 268 CLDCKNFEGSEE 279
           C  C N  G +E
Sbjct: 118 CEGCMNKYGKKE 129



 Score = 43.1 bits (100), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           + + ++ CNC+ S CLK YCEC+ AG+ C +GC C  C N
Sbjct: 84  SARHKRGCNCKKSLCLKKYCECYQAGVGCSEGCRCEGCMN 123


>gi|195153923|ref|XP_002017873.1| GL17406 [Drosophila persimilis]
 gi|194113669|gb|EDW35712.1| GL17406 [Drosophila persimilis]
          Length = 554

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 58/94 (61%)

Query: 133 ESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHE 192
           +SP+      TD   G   ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++E
Sbjct: 456 KSPQLASAPTTDALAGMASRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYE 515

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDARE 226
           V R+ A+ + L+RNP+AF  K     + A+D + 
Sbjct: 516 VERERAIRSCLDRNPSAFNLKHGEDSNNAKDVKR 549



 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 38/84 (45%), Gaps = 6/84 (7%)

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           +I  SP  A     DA      A   K C+C KS CLK YC+CF     C ++C C DC 
Sbjct: 453 EILKSPQLASAPTTDALAGM--ASRRKHCNCSKSQCLKLYCDCFANGEFC-QDCTCKDCF 509

Query: 273 N---FEGSEERRALFHGDHNGIAF 293
           N   +E   ER      D N  AF
Sbjct: 510 NNLDYEVERERAIRSCLDRNPSAF 533


>gi|145484384|ref|XP_001428202.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395286|emb|CAK60804.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 75/141 (53%), Gaps = 31/141 (21%)

Query: 136 RSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
           R + R  T+     P  +KQ+QC C+ S+CLK+YC+C A G YC   C+C NCHN+ +H 
Sbjct: 20  RQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHNDDKHP 79

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
             R + +   +E+N +A                      QL       C+C+KS C KKY
Sbjct: 80  EQRSKVLEQMVEKNSSA----------------------QLK------CNCRKSNCQKKY 111

Query: 253 CECFQANILCSENCRCLDCKN 273
           CEC+ A + CSE C+C DCKN
Sbjct: 112 CECYNAGVKCSELCKCDDCKN 132


>gi|302824819|ref|XP_002994049.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
 gi|300138103|gb|EFJ04883.1| hypothetical protein SELMODRAFT_138111 [Selaginella moellendorffii]
          Length = 144

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 65/118 (55%), Gaps = 8/118 (6%)

Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH------ 219
           YCECFAAG+YC D C C +C N  E              RNP AF PKI  +        
Sbjct: 1   YCECFAAGVYCLDSCACRDCFNKPEFADTVMGTRQQIETRNPLAFAPKIVQTSDNTPTEG 60

Query: 220 -GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
             +   + +  +   +A+H +GC+CKKS CLKKYCEC+QA + CS+ C+C  CKN  G
Sbjct: 61  VASFALQPNCPDTTGSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKNIYG 118



 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           G+ + ++ CNC+ S CLK YCEC+ AG+ C DGC C  C N
Sbjct: 75  GSARHKRGCNCKKSLCLKKYCECYQAGVGCSDGCKCNGCKN 115


>gi|145479163|ref|XP_001425604.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392675|emb|CAK58206.1| unnamed protein product [Paramecium tetraurelia]
          Length = 164

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/145 (37%), Positives = 77/145 (53%), Gaps = 31/145 (21%)

Query: 132 QESPRSRPRANTDGKDGTP--KKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNN 188
           Q   R + R  T+     P  +KQ+QC C+ S+CLK+YC+C A G YC   C+C NCHN+
Sbjct: 16  QTPTRQQKRVITETTIKKPFQQKQRQCTCKKSQCLKMYCDCLAFGEYCGSECSCQNCHND 75

Query: 189 VEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
            +H   R + +   +E+N          +P            AQ+       C+C+KS C
Sbjct: 76  DKHPEQRNKVLEQMVEKN----------TP------------AQIK------CNCRKSNC 107

Query: 249 LKKYCECFQANILCSENCRCLDCKN 273
            KKYCEC+ A + CSE C+C DCKN
Sbjct: 108 QKKYCECYNAGVKCSELCKCDDCKN 132


>gi|348532345|ref|XP_003453667.1| PREDICTED: hypothetical protein LOC100710529 [Oreochromis
           niloticus]
          Length = 310

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 11/77 (14%)

Query: 203 LERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILC 262
           L  NP+AFRPKIA    GA           +   HNKGC+CK+SGCLK+YCEC++ANI+C
Sbjct: 172 LGHNPDAFRPKIAGVKSGA-----------VRGWHNKGCNCKRSGCLKRYCECYEANIMC 220

Query: 263 SENCRCLDCKNFEGSEE 279
           + +C+C+ C+N++   +
Sbjct: 221 TSSCKCVGCRNYDAGSQ 237



 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K CNC+ S CLK YCEC+ A I C   C CV C N
Sbjct: 197 KGCNCKRSGCLKRYCECYEANIMCTSSCKCVGCRN 231


>gi|223974815|gb|ACN31595.1| unknown [Zea mays]
          Length = 391

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 78/159 (49%), Gaps = 18/159 (11%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           S R R R   DG DG     K C+C+ SRCLKLYC C+A+G +C + C C  CHN     
Sbjct: 61  SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 114

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
                  GA         +P    +    Q+  E    + +     K C CKKSGCLKKY
Sbjct: 115 -------GAPWTAPVLPLKP--VQTLEAGQEIVEQLIRSPMDLIRRK-CTCKKSGCLKKY 164

Query: 253 CECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI 291
           C+C+Q    CS NC+C DC+N  G  +   +  G  +G+
Sbjct: 165 CDCYQGGAGCSINCKCDDCRNPFG-RKVGVILDGKSSGL 202


>gi|339238879|ref|XP_003380994.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316976043|gb|EFV59390.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 238

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 72/136 (52%), Gaps = 20/136 (14%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           ++ CNC  S+CLKLYC+CFA G +C  C+C NC NN+ +E+ R  A+ + LERNP AF+P
Sbjct: 55  RRPCNCTKSQCLKLYCDCFANGEFCSNCHCTNCLNNLTNELDRSRAIKSCLERNPMAFQP 114

Query: 213 KIASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCL 269
           KI +            + +++ AK      GC   +   +K       A + C+  C+C+
Sbjct: 115 KIGN------------KTSEIMAKDELTLNGCT-TRDAIVKSQ----AARVSCTVRCKCV 157

Query: 270 DCKNFEGSEERRALFH 285
            C+N E     R   H
Sbjct: 158 GCRNTEADRNHRNRGH 173


>gi|219362923|ref|NP_001137104.1| uncharacterized protein LOC100217281 [Zea mays]
 gi|194698372|gb|ACF83270.1| unknown [Zea mays]
 gi|194708474|gb|ACF88321.1| unknown [Zea mays]
 gi|414877953|tpg|DAA55084.1| TPA: hypothetical protein ZEAMMB73_574510 [Zea mays]
          Length = 394

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 81/162 (50%), Gaps = 24/162 (14%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           S R R R   DG DG     K C+C+ SRCLKLYC C+A+G +C + C C  CHN     
Sbjct: 63  SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 116

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHN---KGCHCKKSGCL 249
                  GA          P +   P    +A ++  E  + +  +   + C CKKSGCL
Sbjct: 117 -------GAPWTA------PVLPLKPVQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCL 163

Query: 250 KKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGI 291
           KKYC+C+Q    CS NC+C DC+N  G  +   +  G  +G+
Sbjct: 164 KKYCDCYQGGAGCSINCKCDDCRNPFG-RKVGVILDGKSSGL 204


>gi|303283039|ref|XP_003060811.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
 gi|226458282|gb|EEH55580.1| tso1-like transcription factor [Micromonas pusilla CCMP1545]
          Length = 625

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 166 LYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           LYCECFAAG +C  C+C NC N  ++    Q        RNP AF  KI ++        
Sbjct: 280 LYCECFAAGAFCQDCSCQNCQNTPDNAALVQMTRTQIELRNPQAFANKIVAT-------- 331

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            D  E      H K     KS CLKKYCECFQA +LC E C+C  CKN
Sbjct: 332 -DGEEKHKKGCHCK-----KSACLKKYCECFQAGVLCQEYCKCEGCKN 373


>gi|145501814|ref|XP_001436887.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404032|emb|CAK69490.1| unnamed protein product [Paramecium tetraurelia]
          Length = 159

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 62/121 (51%), Gaps = 25/121 (20%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K CNC+ S+CLK YC+CFA G  C + CNCV C NN  +   R+ A    + R+P AF+ 
Sbjct: 60  KSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKVAKVQIVNRDPGAFKQ 119

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
                                     KGC+CKKSGC KKYCECF   + C+  CRC  C 
Sbjct: 120 SF------------------------KGCNCKKSGCQKKYCECFLNGLTCTHLCRCDGCL 155

Query: 273 N 273
           N
Sbjct: 156 N 156



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%)

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           + K C+CKKS CLK+YC+CF    +CSENC C+ C N   + E+R +
Sbjct: 58  NTKSCNCKKSQCLKQYCDCFANGQMCSENCNCVGCFNNTLNIEQRKV 104


>gi|428185670|gb|EKX54522.1| hypothetical protein GUITHDRAFT_63344, partial [Guillardia theta
           CCMP2712]
          Length = 125

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 51/130 (39%), Positives = 69/130 (53%), Gaps = 21/130 (16%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K++ C C  ++CLK YC+CF   I C   C C +CHN+ +HE  R EA+      NP+AF
Sbjct: 9   KERGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKRIEAIRHIRMNNPSAF 68

Query: 211 R-------PKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
           +        +  ++P G +             K  +GC CK+S C KKYCECF A I C+
Sbjct: 69  KGTALELEDQEVTTPKGGK-------------KTVRGCRCKRSKCQKKYCECFSAGIPCT 115

Query: 264 ENCRCLDCKN 273
            NC C DC N
Sbjct: 116 SNCVCTDCAN 125



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 30/43 (69%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           +GC C ++ CLK+YC+CF+ +I C+ +C C DC N    EE+R
Sbjct: 11  RGCRCGRTKCLKQYCQCFRNDIRCTSDCVCSDCHNDGKHEEKR 53



 Score = 38.5 bits (88), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 148 GTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           G  K  + C C+ S+C K YCECF+AGI C   C C +C N
Sbjct: 85  GGKKTVRGCRCKRSKCQKKYCECFSAGIPCTSNCVCTDCAN 125


>gi|413949787|gb|AFW82436.1| hypothetical protein ZEAMMB73_293494 [Zea mays]
          Length = 341

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 53/105 (50%), Gaps = 2/105 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ SRCLKLYC CFA G YC D C C  C N        Q      L R       
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQ-KRMSM 120

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
           KI   P    +  EDA  +  +    +GC+CKKS CLKKYC+C+Q
Sbjct: 121 KINRRPEANAEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQ 165



 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 22/41 (53%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C CKKS CLK YC CF     CS+ C C  C N E   E
Sbjct: 62  KHCACKKSRCLKLYCPCFAGGGYCSDKCGCQPCFNKEAFSE 102


>gi|308808189|ref|XP_003081405.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
 gi|116059867|emb|CAL55574.1| Metallothionein-like protein (ISS) [Ostreococcus tauri]
          Length = 466

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/91 (47%), Positives = 50/91 (54%), Gaps = 14/91 (15%)

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
           C N VE+    ++       RNPNAF  KI       +D  +DAR       H KGCHCK
Sbjct: 171 CQNTVENVSVVEKTRSQIEARNPNAFLSKI-------EDDGDDAR-------HTKGCHCK 216

Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNFE 275
           KS CLKKYCECFQA + C E C+C  C N E
Sbjct: 217 KSACLKKYCECFQAGVKCQEYCKCEGCMNKE 247



 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 3/45 (6%)

Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           DG D   +  K C+C+ S CLK YCECF AG+ C + C C  C N
Sbjct: 203 DGDDA--RHTKGCHCKKSACLKKYCECFQAGVKCQEYCKCEGCMN 245


>gi|224092440|ref|XP_002309611.1| predicted protein [Populus trichocarpa]
 gi|222855587|gb|EEE93134.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 55/103 (53%), Gaps = 6/103 (5%)

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL----AAKHNKG 240
           C N  E+E    E       RNP AF PKI    H  +    D  +  L    + +H  G
Sbjct: 444 CFNRPEYEDTVLETRQQIESRNPLAFAPKIVQ--HVTEFQAIDVEDVDLFTPYSGRHKTG 501

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           C+CK+S C+KKYCEC+QAN+ CS  CRC  C+N  G +E  A+
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAM 544



 Score = 42.4 bits (98), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           CNC+ S C+K YCEC+ A + C + C C  C N
Sbjct: 502 CNCKRSMCVKKYCECYQANVGCSNACRCEGCRN 534


>gi|219363019|ref|NP_001136646.1| uncharacterized protein LOC100216775 [Zea mays]
 gi|194696492|gb|ACF82330.1| unknown [Zea mays]
          Length = 278

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 50/80 (62%)

Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
           RNP AF PK+  +     + R+ + +   +A+H +GC+CKKS CLKKYCECFQ  + CS 
Sbjct: 20  RNPLAFAPKVIHTSEPGLELRDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSI 79

Query: 265 NCRCLDCKNFEGSEERRALF 284
           +CRC  CKN  G  E  A+ 
Sbjct: 80  SCRCEGCKNAFGKREGAAVL 99



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + ++ CNC+ S CLK YCECF  G+ C   C C  C N
Sbjct: 51  RHKRGCNCKKSSCLKKYCECFQGGVGCSISCRCEGCKN 88


>gi|50787704|emb|CAH04409.1| hypothetical protein [Euplotes vannus]
          Length = 482

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/125 (41%), Positives = 65/125 (52%), Gaps = 7/125 (5%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           K C C+ S+C+KLYCECF +  +C   C C +C N  E+E    +     L RNP AF  
Sbjct: 357 KGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDCENMEENEEEIVKIREKILSRNPKAFEK 416

Query: 213 KI----ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           K+    AS P G       A  + L  +H KGC CK+S C   YCEC Q    C+E C C
Sbjct: 417 KLQIEEASQPLGMGPKAASAPSSNL--RHIKGCTCKRSKCSNNYCECHQHGAKCTELCSC 474

Query: 269 LDCKN 273
            DC N
Sbjct: 475 TDCNN 479



 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDC 271
           +EA L+    KGC CK+S C+K YCECF +   C  +C C DC
Sbjct: 349 KEANLS--ECKGCTCKRSKCIKLYCECFLSKGFCGPSCTCNDC 389


>gi|431910169|gb|ELK13242.1| Tesmin [Pteropus alecto]
          Length = 484

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 63/165 (38%), Positives = 90/165 (54%), Gaps = 34/165 (20%)

Query: 99  MPQWQARPQHVRMVNRVPHPVHKLPLPTLP-PGK-----QESPRSRPRANTDGKDGTPKK 152
           +PQ+Q +  +++  + VP P+  L    LP P K     Q+SP     ++ D   G  K 
Sbjct: 241 VPQYQDQNNYLQ--SDVPKPMTTLVGRFLPVPAKLNLITQQSPAEVTESSMDSDVGVRK- 297

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
                         YC+CFA+G  C+ CNC NC NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 298 --------------YCDCFASGDVCNNCNCNNCCNNLRHEIERFKAIKACLDRNPEAFQP 343

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
           KI   P    DA+          +HN+GC+CK+SGCLK YCEC++
Sbjct: 344 KIG--PGSLGDAK---------PRHNRGCNCKRSGCLKNYCECYE 377


>gi|293336269|ref|NP_001170528.1| uncharacterized protein LOC100384542 [Zea mays]
 gi|238005872|gb|ACR33971.1| unknown [Zea mays]
          Length = 272

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/80 (45%), Positives = 49/80 (61%)

Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
           RNP AF PK+  +     D  + + +   +A+H +GC+CKKS CLKKYCECFQ  + CS 
Sbjct: 14  RNPLAFAPKVIHTSEPGMDLGDFSNKTPASARHKRGCNCKKSSCLKKYCECFQGGVGCSV 73

Query: 265 NCRCLDCKNFEGSEERRALF 284
           +CRC  CKN  G  E  ++ 
Sbjct: 74  SCRCEGCKNAFGKREGASVL 93



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           + ++ CNC+ S CLK YCECF  G+ C   C C  C N
Sbjct: 45  RHKRGCNCKKSSCLKKYCECFQGGVGCSVSCRCEGCKN 82


>gi|427797635|gb|JAA64269.1| Putative dna-dependent, partial [Rhipicephalus pulchellus]
          Length = 184

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 1/64 (1%)

Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFM 294
           +H KGC+CK+SGCLK YCEC++A ILCS  C+C+ CKNFE S ER+ L    D   +   
Sbjct: 3   RHTKGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKNFEDSSERKTLMQLADAAEVRVQ 62

Query: 295 QRAA 298
           Q+AA
Sbjct: 63  QQAA 66



 Score = 45.4 bits (106), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 1/35 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           K CNC+ S CLK YCEC+ A I C   C CV C N
Sbjct: 6   KGCNCKRSGCLKNYCECYEAKILCSSMCKCVGCKN 40


>gi|159475976|ref|XP_001696090.1| hypothetical protein CHLREDRAFT_104804 [Chlamydomonas reinhardtii]
 gi|158275261|gb|EDP01039.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 84

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 42/95 (44%), Positives = 53/95 (55%), Gaps = 12/95 (12%)

Query: 163 CLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           CLKLYC+CFAAG +C  C+C +C N  E+    Q+       R+P AF  KI  +P    
Sbjct: 1   CLKLYCDCFAAGQFCGACSCASCQNRPEYADRVQQRREDIAARDPQAFTRKIMDAP---- 56

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
                        KH +GC+CK+S CLKKYCECFQ
Sbjct: 57  --------GGGGGKHKRGCNCKRSHCLKKYCECFQ 83


>gi|145523856|ref|XP_001447761.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415283|emb|CAK80364.1| unnamed protein product [Paramecium tetraurelia]
          Length = 573

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 8/66 (12%)

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
           AF+PKI++S     D++ D        KHNKGC CKKSGC+KKYCECFQA + CS+ C+C
Sbjct: 433 AFQPKISTS-----DSKPDPLNF---GKHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKC 484

Query: 269 LDCKNF 274
           ++C+N+
Sbjct: 485 IECRNY 490



 Score = 44.7 bits (104), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K  K C C+ S C+K YCECF A + C D C C+ C N
Sbjct: 452 KHNKGCQCKKSGCMKKYCECFQAKVPCSDQCKCIECRN 489


>gi|299115549|emb|CBN75752.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 313

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 50/119 (42%), Positives = 62/119 (52%), Gaps = 14/119 (11%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+  RCLKLYC+CFA   +C   C C  C N +   V  Q A  A LERNP+AF    
Sbjct: 201 CTCKKCRCLKLYCDCFAQSRFCGSACLCKQCCNRL-GAVVVQAARRAVLERNPDAF---- 255

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
               H A     + R   + +    GC C +S CLKKYC CFQ ++LC   C+C  C N
Sbjct: 256 ----HRAHSL--ETRPGSVTS--GAGCGCTRSHCLKKYCVCFQKSLLCGAGCKCEGCLN 306


>gi|413949791|gb|AFW82440.1| hypothetical protein ZEAMMB73_340549 [Zea mays]
          Length = 298

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 61/132 (46%), Gaps = 9/132 (6%)

Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
           C+  YC CFA G YC D C C  C N        Q      L R       KI   P   
Sbjct: 13  CICSYCPCFAGGGYCSDKCGCQPCFNKEAFSETVQTTRKVLLSRQKR-MSMKINRRPEAN 71

Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG----- 276
            +  EDA  +  +    +GC+CKKS CLKKYC+C+Q    CS  CRC DC+N  G     
Sbjct: 72  AEPMEDAHHSSSSTPPRRGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPFGKNDGI 131

Query: 277 --SEERRALFHG 286
              + +R L+ G
Sbjct: 132 IADDSKRYLYTG 143



 Score = 38.1 bits (87), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 40/96 (41%), Gaps = 16/96 (16%)

Query: 138 RPRANTD-------GKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN-- 187
           RP AN +           TP + + CNC+ S CLK YC+C+  G  C   C C +C N  
Sbjct: 67  RPEANAEPMEDAHHSSSSTPPR-RGCNCKKSSCLKKYCDCYQDGTGCSLFCRCDDCQNPF 125

Query: 188 -----NVEHEVARQEAVGATLERNPNAFRPKIASSP 218
                 +  +  R    GA L+ +       +  SP
Sbjct: 126 GKNDGIIADDSKRYLYTGADLDHSEGEHDFVVERSP 161


>gi|194703140|gb|ACF85654.1| unknown [Zea mays]
          Length = 204

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 56/141 (39%), Positives = 71/141 (50%), Gaps = 17/141 (12%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHE 192
           S R R R   DG DG     K C+C+ SRCLKLYC C+A+G +C + C C  CHN     
Sbjct: 63  SDRKRRRKADDG-DGC----KACSCKKSRCLKLYCVCYASGSHCTELCGCEPCHNKPPQ- 116

Query: 193 VARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKY 252
                  GA         +P    +    Q+  E    + +     K C CKKSGCLKKY
Sbjct: 117 -------GAPWTAPVLPLKP--VQTLEAGQEIVEQLIRSPMDLIRRK-CTCKKSGCLKKY 166

Query: 253 CECFQANILCSENCRCLDCKN 273
           C+C+Q    CS NC+C DC+N
Sbjct: 167 CDCYQGGAGCSINCKCDDCRN 187


>gi|384253407|gb|EIE26882.1| hypothetical protein COCSUDRAFT_9768, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 81

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 40/95 (42%), Positives = 53/95 (55%), Gaps = 17/95 (17%)

Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 221
           CLKLYC+CFA G++C D C C +C N  +   A  +A    + ++P AF+PK+  +  G 
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCENRTDTLNAVFKARKFIMVKDPTAFKPKVLDASGG- 59

Query: 222 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECF 256
                          H KGC C+KS CLKKYCECF
Sbjct: 60  ---------------HVKGCACRKSRCLKKYCECF 79



 Score = 38.9 bits (89), Expect = 5.3,   Method: Composition-based stats.
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 248 CLKKYCECFQANILCSENCRCLDCKN 273
           CLK YC+CF   + C++ C C DC+N
Sbjct: 1   CLKLYCDCFATGLFCNDACMCKDCEN 26


>gi|145546570|ref|XP_001458968.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426790|emb|CAK91571.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 59/123 (47%), Gaps = 16/123 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC  S CLK YC CF +G  C D C C +C N  E    R EA+    ++         
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTEFAEERDEAINHVYKK--------- 201

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                  +D +    E  L+ + + GC CK +GC KKYCECF+   +C E C C DC N 
Sbjct: 202 -----CHRDKKVPVNEL-LSLQMSYGCKCKSTGCQKKYCECFKRGQICGEQCACEDCLNI 255

Query: 275 EGS 277
             S
Sbjct: 256 PFS 258


>gi|145505956|ref|XP_001438944.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406117|emb|CAK71547.1| unnamed protein product [Paramecium tetraurelia]
          Length = 272

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 62/129 (48%), Gaps = 16/129 (12%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC  S CLK YC CF +G  C D C C +C N  +    R EA+    ++         
Sbjct: 151 CNCSQSNCLKRYCACFHSGRMCLDECQCKDCKNCTDFYEERDEAINHVYKK--------- 201

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                  +D +    E  L+ + + GC CK +GC KKYCECF+   +C E C C  C N 
Sbjct: 202 -----CHRDKKVPVNEL-LSLQMSYGCKCKTTGCQKKYCECFKRGQICGEQCSCEGCLNI 255

Query: 275 EGSEERRAL 283
             ++ ++ L
Sbjct: 256 PSAQNQKEL 264


>gi|224014361|ref|XP_002296843.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
 gi|220968480|gb|EED86827.1| hypothetical protein THAPSDRAFT_264396 [Thalassiosira pseudonana
           CCMP1335]
          Length = 141

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/118 (38%), Positives = 64/118 (54%), Gaps = 12/118 (10%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C  SRC+KLYC+CF  G  C+  CNC +C N  E    R+  +   L+ NP AF    
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREEREWKMKEVLKLNPKAF---- 90

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
                 ++D+ +   + Q  ++ N GC C  S CLKKYC CF A+  C++ C C DC+
Sbjct: 91  ------SEDSDKFNTKRQRMSRGN-GCACPSSHCLKKYCSCFGADAGCTDKCSCNDCE 141



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEER 280
           C C KS C+K YC+CF    LC+  C C DCKN  E  EER
Sbjct: 35  CKCSKSRCIKLYCDCFHGGNLCNSLCNCTDCKNTTEFREER 75


>gi|115489472|ref|NP_001067223.1| Os12g0605300 [Oryza sativa Japonica Group]
 gi|77557030|gb|ABA99826.1| Tesmin/TSO1-like CXC domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649730|dbj|BAF30242.1| Os12g0605300 [Oryza sativa Japonica Group]
          Length = 407

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YC CFA+G +C + C C  C+N   H V R             A   K   +    QD+ 
Sbjct: 114 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 163

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DCKN  G
Sbjct: 164 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 213


>gi|125537321|gb|EAY83809.1| hypothetical protein OsI_39026 [Oryza sativa Indica Group]
 gi|125579998|gb|EAZ21144.1| hypothetical protein OsJ_36791 [Oryza sativa Japonica Group]
          Length = 421

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 55/111 (49%), Gaps = 12/111 (10%)

Query: 167 YCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YC CFA+G +C + C C  C+N   H V R             A   K   +    QD+ 
Sbjct: 128 YCVCFASGSHCSESCGCDPCYNKSIHGVPRSTP----------ALPLKAVQTLETGQDSV 177

Query: 226 EDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           E    + +     K C CKKSGCLKKYC+C+Q    CS NC+C DCKN  G
Sbjct: 178 EQLIRSPMDFGRRK-CTCKKSGCLKKYCDCYQGGAGCSINCKCDDCKNPYG 227



 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 11/93 (11%)

Query: 106 PQHVRMVNRVPHPVHKLPLP---TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSR 162
           P + + ++ VP     LPL    TL  G Q+S     R+  D        +++C C+ S 
Sbjct: 146 PCYNKSIHGVPRSTPALPLKAVQTLETG-QDSVEQLIRSPMDFG------RRKCTCKKSG 198

Query: 163 CLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVA 194
           CLK YC+C+  G  C   C C +C N    +V 
Sbjct: 199 CLKKYCDCYQGGAGCSINCKCDDCKNPYGRKVG 231


>gi|449016427|dbj|BAM79829.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 932

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 15/117 (12%)

Query: 164 LKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           LK YC+CFAAG +C   C C  C N+ +++   + A  +  +R+         S+   A 
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQVERARQSASQRS---------STSGEAL 869

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           +    A E     + ++GC CK++GCLK+YCECFQA   C+  C C  C N  G  E
Sbjct: 870 NGEPSASE-----RSSRGCRCKRTGCLKRYCECFQAGRECTAQCACEGCLNCRGLSE 921



 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSEER 280
           LKKYC+CF A   C   C C  C N + ++E+
Sbjct: 819 LKKYCDCFAAGSFCGPECECEGCGNHQDNQEQ 850


>gi|432959452|ref|XP_004086297.1| PREDICTED: uncharacterized protein LOC101167937 [Oryzias latipes]
          Length = 377

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 28/49 (57%), Positives = 38/49 (77%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
           + +K C+C  SRCLKLYC+CF++G+ C  CNC+NCHNN E+E  R EA+
Sbjct: 325 RPRKTCHCTKSRCLKLYCDCFSSGMMCSSCNCINCHNNAENESQRHEAI 373



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           K CHC KS CLK YC+CF + ++CS +C C++C N   +E +R
Sbjct: 328 KTCHCTKSRCLKLYCDCFSSGMMCS-SCNCINCHNNAENESQR 369


>gi|195058418|ref|XP_001995449.1| GH22623 [Drosophila grimshawi]
 gi|193899655|gb|EDV98521.1| GH22623 [Drosophila grimshawi]
          Length = 916

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 59/110 (53%), Gaps = 6/110 (5%)

Query: 90  AHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGT 149
           +  +SP Q +P  Q+        N++  P   L    L P    +P     A+ D   G+
Sbjct: 697 SQTKSPQQKLPILQSSSTQD---NKLCVPAKPL-FDILKPAPMPTPSPAASASLDVLGGS 752

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAV 199
             ++K CNC  S+CLKLYC+CFA G +C  C C +C NN+++EV R+ A+
Sbjct: 753 --RRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVKRERAI 800



 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 35/76 (46%), Gaps = 8/76 (10%)

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
           +  +P    +P  A  A  D     L     K C+C KS CLK YC+CF     C ++C 
Sbjct: 728 DILKPAPMPTPSPAASASLDV----LGGSRRKHCNCSKSQCLKLYCDCFANGEFC-QDCT 782

Query: 268 CLDCKN---FEGSEER 280
           C DC N   +E   ER
Sbjct: 783 CKDCFNNLDYEVKRER 798


>gi|145522672|ref|XP_001447180.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414680|emb|CAK79783.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 61/122 (50%), Gaps = 17/122 (13%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCV-NCHNNVEHEVARQEAVGATLERNPNA 209
           +Q+ CNC+NS CLK YC CF +G  C   C C  +C NN +H+  R  A+   +  +   
Sbjct: 144 QQQPCNCKNSGCLKRYCRCFHSGRMCLKECQCSEDCQNNEQHQEQRNNAI---IHVDQKC 200

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
           +R             R   R+A        GC C KS C K+YCEC+  N  C++ C+C 
Sbjct: 201 YR------------NRRMPRDALFKLDVIYGCSCTKSKCRKRYCECYLRNQKCTDKCKCF 248

Query: 270 DC 271
           DC
Sbjct: 249 DC 250


>gi|125580000|gb|EAZ21146.1| hypothetical protein OsJ_36793 [Oryza sativa Japonica Group]
          Length = 710

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 86/197 (43%), Gaps = 46/197 (23%)

Query: 92  MRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPK 151
           +R+P  A+P       H+  +  VP       +  +    Q SP+ + R   D  DGT  
Sbjct: 383 LRTPPCALPGIGL---HLNALATVPKDKMTAVVAYVGESSQGSPKKK-RHKFDNGDGT-- 436

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN---NVEHEVARQEAVGATLERNP 207
             K+C+C+ S+CLKLYCECF AG++C + C+C  C N   N+E  ++ +E + +   RNP
Sbjct: 437 SCKRCSCKKSKCLKLYCECFHAGVFCSEPCSCQGCLNMPSNMETVLSTREQIES---RNP 493

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
            AF PK+  +  G +                                     + CS +CR
Sbjct: 494 LAFAPKVIRTEPGQE---------------------------------LAGGVGCSVSCR 520

Query: 268 CLDCKNFEGSEERRALF 284
           C  CKN  G  E  AL 
Sbjct: 521 CEGCKNAFGRREGVALL 537


>gi|145548172|ref|XP_001459767.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427593|emb|CAK92370.1| unnamed protein product [Paramecium tetraurelia]
          Length = 303

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 57/119 (47%), Gaps = 18/119 (15%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC  S CLKLYC+CF     C + C C++C N   H   RQ A+           + K+
Sbjct: 183 CNCSKSNCLKLYCQCFHQNKQCTELCKCLDCKNCDYHTQVRQTALE----------KIKM 232

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S      +   D  +         GC C+KS C K YCECF+ N  C+ +CRC DC N
Sbjct: 233 KSQRQKHDNDLFDLSKVW-------GCKCQKSQCQKNYCECFRRNQKCNSSCRCKDCAN 284



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 6/77 (7%)

Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG----CHCKKSGCLKKYCECF 256
           +TLE+     R +  ++    Q   E  +E+ L ++ N+     C+C KS CLK YC+CF
Sbjct: 141 STLEQRSRRIRKQNINA--NLQRNYEYNQESDLNSQSNEHTMNPCNCSKSNCLKLYCQCF 198

Query: 257 QANILCSENCRCLDCKN 273
             N  C+E C+CLDCKN
Sbjct: 199 HQNKQCTELCKCLDCKN 215


>gi|300176906|emb|CBK25475.2| unnamed protein product [Blastocystis hominis]
          Length = 123

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 72/153 (47%), Gaps = 40/153 (26%)

Query: 122 LPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGC 180
           LP PTLPP     P  R                 C C  + CLKLYCEC  AG  C D C
Sbjct: 3   LPEPTLPP-----PSIR-----------------CTCTKTHCLKLYCECLKAGRTCTDLC 40

Query: 181 NCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKG 240
            CV+C N  +    R++A+    + NP AF     S+  G +D+R               
Sbjct: 41  ACVDCLNRAKFP-ERKKALKHIKKTNPKAFLR--VSNLKGHEDSR--------------C 83

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           C C+KS C KKYC C+Q+   C+  C+C+DCKN
Sbjct: 84  CACQKSLCNKKYCSCYQSGRYCTPKCKCVDCKN 116



 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEER-RALFH-GDHNGIAFMQ 295
           C C K+ CLK YCEC +A   C++ C C+DC N     ER +AL H    N  AF++
Sbjct: 15  CTCTKTHCLKLYCECLKAGRTCTDLCACVDCLNRAKFPERKKALKHIKKTNPKAFLR 71


>gi|145484801|ref|XP_001428410.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395495|emb|CAK61012.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 83/202 (41%), Gaps = 46/202 (22%)

Query: 83  SSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRAN 142
           SS P +  H +  S  M Q       +++ + +P  +   P P+   G    P   P   
Sbjct: 22  SSSPDMDEH-KDSSLDMSQL------IKLQHTIPDNILISPEPSKKKGSILLPSDDP--- 71

Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGA 201
                        CNC  S C K+YCEC A G  C   C C NCHN   +++     V  
Sbjct: 72  -------------CNCSRSGCRKMYCECLAKGRLCSSFCRCQNCHNKTSNKLVLN--VIE 116

Query: 202 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 261
            L++N                      R+   + +   GC C+KS CLKKYCECF +   
Sbjct: 117 ELDQNR--------------------NRKRFRSRRFKDGCTCQKSMCLKKYCECFNSGKS 156

Query: 262 CSENCRCLDCKNFEGSEERRAL 283
           C   C+C++C+N+   E  + +
Sbjct: 157 CGSGCKCVNCENYVLDEVEKII 178


>gi|340507965|gb|EGR33791.1| hypothetical protein IMG5_037680 [Ichthyophthirius multifiliis]
          Length = 96

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 41/81 (50%), Gaps = 9/81 (11%)

Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
           AV     RNPNAF  K         D             H KGC CKKS CLKKYCECF 
Sbjct: 2   AVNDAKNRNPNAFNIKFQIIEEKQDDK---------IVIHKKGCTCKKSNCLKKYCECFN 52

Query: 258 ANILCSENCRCLDCKNFEGSE 278
           A +LC++NC C  CKN +  E
Sbjct: 53  AGVLCNQNCVCDSCKNMQKEE 73



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           +K C C+ S CLK YCECF AG+ C+  C C +C N
Sbjct: 33  KKGCTCKKSNCLKKYCECFNAGVLCNQNCVCDSCKN 68


>gi|145488037|ref|XP_001430023.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397118|emb|CAK62625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 255

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 54/119 (45%), Gaps = 18/119 (15%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C  S CLKLYC CF     C D C C  C N  EH   R  A+             K+
Sbjct: 136 CRCTKSNCLKLYCLCFHQNQKCSDLCKCFECKNKAEHFEIRFNAL------------EKV 183

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
               H  ++  +    +++      GC C+KS C K YCECF  N  CS +CRC DC N
Sbjct: 184 KQKLHRQKNDDDLFDRSKIW-----GCKCQKSQCQKNYCECFVRNQKCSSSCRCKDCAN 237


>gi|145496915|ref|XP_001434447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401573|emb|CAK67050.1| unnamed protein product [Paramecium tetraurelia]
          Length = 321

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 53/119 (44%), Gaps = 18/119 (15%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C  S CLKLYC+CF     C + C C  C N  +H   RQ A+           + KI
Sbjct: 201 CKCTKSNCLKLYCQCFHQNRQCTELCKCFACKNCDDHLPVRQTALE----------KIKI 250

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S      D   D            GC C+KS C K YCEC+  N  CS +CRC DC N
Sbjct: 251 KSQRQKHDDDLFDRSRVW-------GCKCQKSQCQKNYCECYIRNQKCSSSCRCKDCAN 302



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 2/86 (2%)

Query: 190 EHEVARQEAVGATLERNPNAFRP-KIASSPHGAQD-AREDAREAQLAAKHNKGCHCKKSG 247
           +H+    +    TLE+     R  K+  S +  QD  +E    +Q    +   C C KS 
Sbjct: 148 DHQKKADDPCSHTLEQRSRRIRKNKLNPSFNTTQDPNQESDLHSQQQEPNTTPCKCTKSN 207

Query: 248 CLKKYCECFQANILCSENCRCLDCKN 273
           CLK YC+CF  N  C+E C+C  CKN
Sbjct: 208 CLKLYCQCFHQNRQCTELCKCFACKN 233


>gi|145477875|ref|XP_001424960.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392027|emb|CAK57562.1| unnamed protein product [Paramecium tetraurelia]
          Length = 240

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 53/119 (44%), Gaps = 23/119 (19%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           CNC  S C K+YCEC A G  C   C C NC N      + ++ +    E + N    KI
Sbjct: 72  CNCSKSECRKMYCECLAKGRLCSSACRCENCQN----RTSNKKVLHVIKELDLNKNLKKI 127

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S                   +   GC CKKS CLKKYCECF +   C   C C +C+N
Sbjct: 128 KS------------------KRFKDGCTCKKSMCLKKYCECFHSGKSCGYGCNCENCQN 168



 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 3/63 (4%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+ S CLK YCECF +G  C  GCNC NC N +  E   Q+ +    ++  + F  K 
Sbjct: 136 CTCKKSMCLKKYCECFHSGKSCGYGCNCENCQNQLLDEA--QKNICTPNQKKTSKFSSKF 193

Query: 215 ASS 217
            S+
Sbjct: 194 ESA 196



 Score = 41.6 bits (96), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%)

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           R+  L    ++ C+C KS C K YCEC     LCS  CRC +C+N
Sbjct: 60  RDDSLLLSSDEPCNCSKSECRKMYCECLAKGRLCSSACRCENCQN 104


>gi|145520829|ref|XP_001446270.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413747|emb|CAK78873.1| unnamed protein product [Paramecium tetraurelia]
          Length = 330

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 58/127 (45%), Gaps = 30/127 (23%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
           P  ++ C C+ ++CLK YC+CF  G +C + C C++C NN E      E   + + RN N
Sbjct: 210 PLSKQGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNN-EQNKELIEQTRSQINRNLN 268

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
                                           C CKKS CLKKYCEC+ A   C  +C+C
Sbjct: 269 ----------------------------DEVKCFCKKSKCLKKYCECYNAGKKCGVDCKC 300

Query: 269 LDCKNFE 275
             C N+E
Sbjct: 301 EGCMNYE 307



 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 28/41 (68%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           +GC CKK+ CLK YC+CFQ    C E C CLDC N E ++E
Sbjct: 214 QGCICKKTQCLKFYCQCFQQGKFCGETCECLDCCNNEQNKE 254


>gi|224070845|ref|XP_002303261.1| predicted protein [Populus trichocarpa]
 gi|222840693|gb|EEE78240.1| predicted protein [Populus trichocarpa]
          Length = 163

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/62 (54%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           ++CNC+ SRCLKLYCECFAAG YC D C+C NC N  E+  A          RNP AF P
Sbjct: 81  RRCNCKRSRCLKLYCECFAAGNYCLDTCSCENCFNKSEYADAVLHMRQQIEARNPVAFAP 140

Query: 213 KI 214
           K+
Sbjct: 141 KV 142



 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           C+CK+S CLK YCECF A   C + C C +C  F  SE   A+ H
Sbjct: 83  CNCKRSRCLKLYCECFAAGNYCLDTCSCENC--FNKSEYADAVLH 125


>gi|312380815|gb|EFR26710.1| hypothetical protein AND_07034 [Anopheles darlingi]
          Length = 524

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 40/59 (67%), Gaps = 3/59 (5%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K++ CNC  S+CLKLYC+CFA G +C  CNC +CHN  + + ARQ+A+     RN + F
Sbjct: 325 KKRPCNCTKSQCLKLYCDCFANGEFCYNCNCKDCHNTQKDDYARQKAIRC---RNTDQF 380



 Score = 43.5 bits (101), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C+C KS CLK YC+CF     C  NC C DC N +  +  R
Sbjct: 329 CNCTKSQCLKLYCDCFANGEFCY-NCNCKDCHNTQKDDYAR 368


>gi|145494055|ref|XP_001433022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400138|emb|CAK65625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 188

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 61/149 (40%), Gaps = 40/149 (26%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPK 213
           +C C  S+C K YCECFAAG  C+  CNC  C+N +  +  ++   G             
Sbjct: 77  KCKCMKSKCQKSYCECFAAGKACNMDCNCQGCNNTICSQSIKKVQTG------------- 123

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                                     GC+CKK+GCLKKYCEC+     C+  C C+DC N
Sbjct: 124 --------------------------GCNCKKTGCLKKYCECYLKKQRCTFLCNCIDCCN 157

Query: 274 FEGSEERRALFHGDHNGIAFMQRAANAAI 302
            E SE  +         I   Q   N  I
Sbjct: 158 QEDSESEKENEQLQQQSIDCGQTITNQII 186


>gi|145515421|ref|XP_001443610.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410999|emb|CAK76213.1| unnamed protein product [Paramecium tetraurelia]
          Length = 275

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 66/150 (44%), Gaps = 16/150 (10%)

Query: 126 TLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
           T+   K   P+++ +  T        + + CNC  S CLK YC CF +G  C D C C +
Sbjct: 123 TIQQQKHSQPKNQIKIKTITNLYKASEIQPCNCSQSSCLKRYCTCFHSGRMCMDECQCKD 182

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
           C N       R+ A+    ++                +D    A E  L+ + + GC CK
Sbjct: 183 CKNCDLFSEEREAAINYVFKK--------------CNRDKNVPANEL-LSLQISYGCKCK 227

Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKNF 274
            +GC KKYCEC +    C + C C DC N 
Sbjct: 228 STGCQKKYCECLKRGQACGDLCSCEDCLNI 257



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE-GSEERRA 282
           C+C +S CLK+YC CF +  +C + C+C DCKN +  SEER A
Sbjct: 153 CNCSQSSCLKRYCTCFHSGRMCMDECQCKDCKNCDLFSEEREA 195


>gi|145521975|ref|XP_001446837.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414326|emb|CAK79440.1| unnamed protein product [Paramecium tetraurelia]
          Length = 285

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 23/127 (18%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RN 206
           +Q+ C CRNS CLK YC CF +G  C   C CV  C NN +H   R +A+    E   RN
Sbjct: 156 EQQPCFCRNSGCLKRYCRCFHSGRMCLKDCQCVEGCLNNHDHLEQRNDAIKHVNEKCHRN 215

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
            N                    ++A    K   GC+CKK+ C   YCECF     C+ +C
Sbjct: 216 KNV------------------PKDALFKLKDCFGCNCKKTRCQTGYCECFLRKSKCTMDC 257

Query: 267 RCLDCKN 273
           +C +C+N
Sbjct: 258 QCDNCEN 264


>gi|145476147|ref|XP_001424096.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391159|emb|CAK56698.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 63/133 (47%), Gaps = 20/133 (15%)

Query: 146 KDGTPK----KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG 200
           K+ +PK    K + C+C  + CLK+YC CF  G  C + C C +C N  E+   R EAV 
Sbjct: 68  KEESPKLESNKWQPCSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQRHEAVE 127

Query: 201 ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANI 260
                        I    H  +      +E     K   GC+CKK+ CLK+YCECF    
Sbjct: 128 Y------------IKKKAHRNKKV---TQEKLFETKEVWGCNCKKTRCLKRYCECFIRQK 172

Query: 261 LCSENCRCLDCKN 273
            C+ +C C  C+N
Sbjct: 173 TCTVDCNCNYCEN 185



 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 6/58 (10%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGDHNGIAFMQRAA 298
           C C K+ CLK YC CF     C E C+C DCKN +    +R      H  + ++++ A
Sbjct: 82  CSCTKTNCLKMYCSCFHNGQTCVELCKCEDCKNVDEYLNQR------HEAVEYIKKKA 133


>gi|145534021|ref|XP_001452755.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420454|emb|CAK85358.1| unnamed protein product [Paramecium tetraurelia]
          Length = 326

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/66 (50%), Positives = 44/66 (66%), Gaps = 7/66 (10%)

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL 269
           FRPK+ S  +      ED ++ +   +H KGC+CKKS CLKKYCEC+Q  + CSE C+C 
Sbjct: 244 FRPKVESKSNS-----EDEQDHK--PRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCD 296

Query: 270 DCKNFE 275
           +CKN E
Sbjct: 297 ECKNCE 302



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 135 PRSRPRANT-DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           P+   ++N+ D +D  P+  K CNC+ S CLK YCEC+  G+ C + C C  C N
Sbjct: 246 PKVESKSNSEDEQDHKPRHFKGCNCKKSNCLKKYCECYQMGVKCSELCKCDECKN 300


>gi|145524257|ref|XP_001447956.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415489|emb|CAK80559.1| unnamed protein product [Paramecium tetraurelia]
          Length = 341

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 54/119 (45%), Gaps = 18/119 (15%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C  S CLKLYC CF   I C + C C +CHN  ++   R +A+           + K+
Sbjct: 225 CKCSKSHCLKLYCACFHRNIECSELCQCHDCHNKSDYSQIRTQALE----------KVKV 274

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                   D   D            GC C+KS C K YCECF  N  CS  C+C +C+N
Sbjct: 275 KQQRRKHDDDLFDKDTVW-------GCQCRKSQCKKNYCECFIRNKKCSSLCKCNNCEN 326



 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 26/57 (45%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 217 SPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           + H  +   +D  E  L   +N  C C KS CLK YC CF  NI CSE C+C DC N
Sbjct: 204 TTHYREFFDKDLDEHNL---NNSPCKCSKSHCLKLYCACFHRNIECSELCQCHDCHN 257


>gi|219121911|ref|XP_002181300.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407286|gb|EEC47223.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 381

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 50/105 (47%), Gaps = 24/105 (22%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVA---RQEAVGATL 203
            PK  K C C+N+ CLKLYC CF  G +CD   C C++C+N  E       RQEAV   L
Sbjct: 272 VPKTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNLREFNETGGKRQEAVSEIL 331

Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGC 248
            R                   R DA E++   K  +GC CKK+ C
Sbjct: 332 LR-------------------RIDAFESRPKKKTGEGCACKKNRC 357



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNF 274
           K  K C CK + CLK YC CFQ    C  + C+C+DC N 
Sbjct: 274 KTVKPCKCKNTHCLKLYCTCFQKGSFCDPDICKCIDCYNL 313


>gi|145491063|ref|XP_001431531.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398636|emb|CAK64133.1| unnamed protein product [Paramecium tetraurelia]
          Length = 246

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 60/125 (48%), Gaps = 17/125 (13%)

Query: 151 KKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVN-CHNNVEHEVARQEAVGATLERNPN 208
           K Q+ CNC++S C+K YC CF +G  C   C C + C N   +   R EA+      N  
Sbjct: 131 KCQQPCNCKSSGCIKRYCRCFHSGKTCLPECQCQDGCLNKEHNNCERSEAIKHV---NEK 187

Query: 209 AFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 268
            +R             R+  +EA        GC C KS C K+YCECF  N  C+E C+C
Sbjct: 188 CYR------------NRKIPKEALFKLDVIYGCSCTKSKCRKRYCECFLRNQNCTEKCKC 235

Query: 269 LDCKN 273
            DC N
Sbjct: 236 FDCCN 240


>gi|403367645|gb|EJY83644.1| hypothetical protein OXYTRI_18624 [Oxytricha trifallax]
 gi|403370830|gb|EJY85281.1| hypothetical protein OXYTRI_16860 [Oxytricha trifallax]
          Length = 590

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 32/125 (25%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEHEVARQEAVGATLERNPNA 209
           K+ +C+C+ + C + YC+C   G  CD   C+C +C N +E++  R E            
Sbjct: 466 KKIKCSCQKNSCTQAYCDCLKNGQACDPSSCSCASCLNTIENQNLRLEI----------- 514

Query: 210 FRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS-ENCRC 268
                           ++ ++ Q  AK  +GC CK S C K+YCECFQ    C    C+C
Sbjct: 515 ----------------QEKKQKQGQAK--EGCSCKNSQCQKRYCECFQNGRQCDPSKCKC 556

Query: 269 LDCKN 273
            DCKN
Sbjct: 557 KDCKN 561



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 30/49 (61%), Gaps = 8/49 (16%)

Query: 151 KKQKQ------CNCRNSRCLKLYCECFAAGIYCD--GCNCVNCHNNVEH 191
           KKQKQ      C+C+NS+C K YCECF  G  CD   C C +C NN+ H
Sbjct: 517 KKQKQGQAKEGCSCKNSQCQKRYCECFQNGRQCDPSKCKCKDCKNNLPH 565


>gi|325187539|emb|CCA22077.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 319

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 45/81 (55%), Gaps = 11/81 (13%)

Query: 195 RQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCE 254
           R +A+ + LERNP AF+PK+ ++     +             H +GC+C+KS CLK YCE
Sbjct: 20  RSQAIASILERNPLAFQPKVVANGSETNNLH-----------HLRGCNCRKSNCLKNYCE 68

Query: 255 CFQANILCSENCRCLDCKNFE 275
           C QA + C+  C C  C N E
Sbjct: 69  CHQARVSCTNRCACHKCCNTE 89



 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAV 199
           + CNCR S CLK YCEC  A + C + C C  C N  E   A++  +
Sbjct: 53  RGCNCRKSNCLKNYCECHQARVSCTNRCACHKCCNTEEFHSAKKMLI 99


>gi|194696554|gb|ACF82361.1| unknown [Zea mays]
          Length = 372

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 65/136 (47%), Gaps = 19/136 (13%)

Query: 160 NSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSP 218
           NS   + YC C+A+G +C + C C  CHN            GA          P +   P
Sbjct: 62  NSDRKRSYCVCYASGSHCTELCGCEPCHNKPPQ--------GAPWTA------PVLPLKP 107

Query: 219 HGAQDAREDAREAQLAAKHN---KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
               +A ++  E  + +  +   + C CKKSGCLKKYC+C+Q    CS NC+C DC+N  
Sbjct: 108 VQTLEAGQEIVEQLIRSPMDLIRRKCTCKKSGCLKKYCDCYQGGAGCSINCKCDDCRNPF 167

Query: 276 GSEERRALFHGDHNGI 291
           G  +   +  G  +G+
Sbjct: 168 G-RKVGVILDGKSSGL 182


>gi|145541620|ref|XP_001456498.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424310|emb|CAK89101.1| unnamed protein product [Paramecium tetraurelia]
          Length = 262

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 38/175 (21%)

Query: 103 QARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQ--CNCRN 160
           Q   + + +VN++ +       P L   +    + +P+   D  D    K+K+  C+C  
Sbjct: 77  QQDEEDISLVNQM-YESQVFSQPILDISQISQIQLQPKTKLDLTDQPKYKKKKVGCSCTK 135

Query: 161 SRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
            +C   YC C   G  C   C CV+C NN +  +++Q  V                    
Sbjct: 136 RKCSSKYCFCAKVGQSCSNLCKCVDCSNN-KSFMSQQIQV-------------------- 174

Query: 220 GAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                          A+  KGC CKK+ CLK YC+CF   + CS NC+C+ CKN 
Sbjct: 175 -------------TNAQEQKGCKCKKNFCLKGYCDCFAKGMQCSSNCKCISCKNM 216


>gi|145512403|ref|XP_001442118.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409390|emb|CAK74721.1| unnamed protein product [Paramecium tetraurelia]
          Length = 278

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 59/127 (46%), Gaps = 23/127 (18%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN-CHNNVEHEVARQEAVGATLE---RN 206
           +Q+ C CRNS CLK YC CF +G  C   C CV  C NN +H   R  A+    E   RN
Sbjct: 149 EQQPCFCRNSGCLKRYCRCFHSGRMCLKECQCVEGCLNNEDHLEERNMAIKHVNEKCHRN 208

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENC 266
            N   PK                +A    K   GC CKKS C   YCECF     C+ +C
Sbjct: 209 KNI--PK----------------DALFKLKDCFGCSCKKSRCQTGYCECFLRKSKCTIDC 250

Query: 267 RCLDCKN 273
            C +C+N
Sbjct: 251 ECQNCEN 257


>gi|77415370|gb|AAI06127.1| Lin54 protein [Mus musculus]
          Length = 523

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 37/64 (57%), Positives = 48/64 (75%)

Query: 147 DGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERN 206
           + T + +K CNC  S CLKLYC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RN
Sbjct: 451 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 510

Query: 207 PNAF 210
           P AF
Sbjct: 511 PEAF 514


>gi|298711558|emb|CBJ32620.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 78

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 30/58 (51%), Positives = 38/58 (65%), Gaps = 1/58 (1%)

Query: 163 CLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPH 219
           CLKLYC+CFA  + C D C C NC+N+ EH V R+ AV   L R+P+AF  K  + PH
Sbjct: 2   CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRAAVRELLCRSPHAFDAKFKTEPH 59



 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 248 CLKKYCECFQANILCSENCRCLDCKN-FEGSEERRA 282
           CLK YC+CF   ++CS+ CRC +C N  E   ERRA
Sbjct: 2   CLKLYCQCFAGQLVCSDACRCENCYNSAEHGVERRA 37


>gi|145490209|ref|XP_001431105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398208|emb|CAK63707.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 16/119 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C+C+ + CLK+YC CF  G +C   C C  C N  E ++ R +AV    ++         
Sbjct: 89  CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKRMQAVDYVKKK--------- 139

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                 A   ++  +E         GC+C K+ C+KKYCECF     C+  C C  C N
Sbjct: 140 ------AHRNKKVPKEKIFETVEIWGCNCSKTRCVKKYCECFIRGKKCTVECNCDHCDN 192



 Score = 41.2 bits (95), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 26/41 (63%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C C+K+ CLK YC CF     C ++CRC +C+N E  + +R
Sbjct: 89  CSCQKTHCLKMYCSCFHNGRFCGKSCRCEECENTEEFKMKR 129


>gi|145517899|ref|XP_001444827.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412260|emb|CAK77430.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 16/119 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C+C  + CLK+YC CF  G  C + C C +C N   H   R EAV    ++         
Sbjct: 82  CSCTKTYCLKMYCSCFHNGQICGESCKCEDCKNTNMHFHQRDEAVEYIKKK--------- 132

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
                 A   ++  +E     K   GC+CKK+ CLK+YCEC+     C+  C C  C+N
Sbjct: 133 ------AHRNKKVPQEKLFETKDIWGCNCKKTRCLKRYCECYIRQKTCTVECNCNHCEN 185


>gi|145506933|ref|XP_001439427.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406611|emb|CAK72030.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 16/119 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C+C  + CLK+YC CF  G  C + C C +C N   +   R +AV    ++   A R K 
Sbjct: 82  CSCTKTNCLKMYCSCFHNGQTCVESCKCEDCKNTNNNVPQRDKAVEYIKKK---AHRNKK 138

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            S            +E     K   GC+CKK+ CLK+YCEC+     C+  C C  C+N
Sbjct: 139 VS------------QETLFETKDIWGCNCKKTRCLKRYCECYIRQKACTVECNCTHCEN 185



 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN----NVEHEVARQ 196
           CNC+ +RCLK YCEC+     C   CNC +C N    ++ +E+ RQ
Sbjct: 153 CNCKKTRCLKRYCECYIRQKACTVECNCTHCENGKDEDLYNEIRRQ 198


>gi|145477681|ref|XP_001424863.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391930|emb|CAK57465.1| unnamed protein product [Paramecium tetraurelia]
          Length = 259

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 70/149 (46%), Gaps = 19/149 (12%)

Query: 128 PPGKQESPRSRPRANTDGKDG--TPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVN 184
           P   Q + R   +AN +  D      + K C+C  + CL+LYC CF     C   C C  
Sbjct: 113 PIVNQRTRRKASKANYNVDDSPTMDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCSE 172

Query: 185 CHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCK 244
           C N+ +H     + V   +E+     + K   + H   D+  D R+         GC CK
Sbjct: 173 CFNDGQHS----DEVLKAIEQ----IKIKEQRASHHDLDSF-DTRQVW-------GCKCK 216

Query: 245 KSGCLKKYCECFQANILCSENCRCLDCKN 273
           K+ C+K YCECF  +  CS +C+C +C+N
Sbjct: 217 KTQCIKGYCECFIRHKKCSSHCQCSNCQN 245


>gi|291001253|ref|XP_002683193.1| predicted protein [Naegleria gruberi]
 gi|284096822|gb|EFC50449.1| predicted protein [Naegleria gruberi]
          Length = 459

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 33/46 (71%)

Query: 229 REAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
           ++ +    + KGC+CKK+GCLK+YCECFQ N  C+  CRC  CKN+
Sbjct: 186 QQPKFVDNNAKGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKNY 231



 Score = 39.3 bits (90), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           K CNC+ + CLK YCECF     C   C C  C N
Sbjct: 196 KGCNCKKTGCLKRYCECFQNNKRCTIKCRCQGCKN 230


>gi|145524820|ref|XP_001448232.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415776|emb|CAK80835.1| unnamed protein product [Paramecium tetraurelia]
          Length = 216

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 16/119 (13%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C+C  + CLK+YC CF  G  C + C C +C N  E+   R  AV    ++   A R K 
Sbjct: 82  CSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQRHNAVEYIRKK---AHRNKK 138

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            +            +E     K   GC+CKK+ C K+YCECF     C+ +C C  C+N
Sbjct: 139 VT------------QEKLFETKDVWGCNCKKTRCQKRYCECFIRQKTCTVDCNCNHCEN 185



 Score = 45.8 bits (107), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 7/77 (9%)

Query: 223 DAREDAREAQ-LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           D  E+  E+Q L     + C C K+ CLK YC CF     C E C+C DC+N +    +R
Sbjct: 63  DLDENKEESQILENTKWQPCSCSKTNCLKMYCSCFHNGQTCVEQCKCEDCQNTDEYLNQR 122

Query: 282 ALFHGDHNGIAFMQRAA 298
                 HN + ++++ A
Sbjct: 123 ------HNAVEYIRKKA 133


>gi|170073469|ref|XP_001870379.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870088|gb|EDS33471.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 103

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 45/71 (63%), Gaps = 12/71 (16%)

Query: 199 VGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQA 258
           + A L ++P   +PKI     GA  A EDA       +H KGC+CK+SGCLK YCEC++A
Sbjct: 35  ITAGLGQSP--VQPKI-----GATSAAEDATR-----RHTKGCNCKRSGCLKNYCECYEA 82

Query: 259 NILCSENCRCL 269
            I CS NC+C+
Sbjct: 83  KIACSGNCKCI 93



 Score = 46.2 bits (108), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 30/53 (56%), Gaps = 2/53 (3%)

Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVN-CHNNVEH 191
           A +  +D T +  K CNC+ S CLK YCEC+ A I C G C C+  CH   E 
Sbjct: 50  ATSAAEDATRRHTKGCNCKRSGCLKNYCECYEAKIACSGNCKCIGECHAEREE 102


>gi|145512279|ref|XP_001442056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409328|emb|CAK74659.1| unnamed protein product [Paramecium tetraurelia]
          Length = 139

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 43/136 (31%), Positives = 62/136 (45%), Gaps = 21/136 (15%)

Query: 139 PRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQE 197
           P+    G D    K+  C C+ ++CLKLYC+CF  G  C   C C  C N++   + R +
Sbjct: 6   PKLQIQGNDSEETKEG-CQCQKTKCLKLYCQCFHDGKCCGQSCGCNGCKNSLNDYLERNK 64

Query: 198 AVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQ 257
           A+G  + +  + ++ K                   L+    +GC CKKS C   YCECF 
Sbjct: 65  AIGKIIHKYKDDYKQK-------------------LSGDKGQGCCCKKSKCKLNYCECFI 105

Query: 258 ANILCSENCRCLDCKN 273
               C E C C  C+N
Sbjct: 106 KGRSCGEQCHCKKCEN 121


>gi|297722381|ref|NP_001173554.1| Os03g0638300 [Oryza sativa Japonica Group]
 gi|255674730|dbj|BAH92282.1| Os03g0638300, partial [Oryza sativa Japonica Group]
          Length = 113

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 43/78 (55%), Gaps = 5/78 (6%)

Query: 138 RPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQ 196
           R +   D  D      K+C+C+ S+CLKLYCECFA+ +YC + C+C  C N+  HE    
Sbjct: 7   RQKCQNDNVDSC----KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEETVL 62

Query: 197 EAVGATLERNPNAFRPKI 214
                   RNP AF PK+
Sbjct: 63  STRNRIESRNPLAFAPKV 80



 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K C CKKS CLK YCECF + + CSE+C C  C N    EE
Sbjct: 19  KRCSCKKSKCLKLYCECFASKVYCSESCSCRGCFNDHSHEE 59


>gi|145528842|ref|XP_001450215.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417815|emb|CAK82818.1| unnamed protein product [Paramecium tetraurelia]
          Length = 244

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 69/169 (40%), Gaps = 26/169 (15%)

Query: 149 TPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNP 207
           +PK Q +C C  S+C + YCECFA G  C   C C NC N     +   + +G   +R  
Sbjct: 90  SPKPQIKCQCSKSQCQQSYCECFARGKTCGKHCGCQNCQN-----MQMNKKLGKIQKRLV 144

Query: 208 NAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 267
              R K                  Q   ++ KGC C KS C   +C C Q    C+  C+
Sbjct: 145 KQKRIK------------------QRTPRYLKGCTCGKSKCQNNFCSCHQDGKYCNSECK 186

Query: 268 CLDCKNFEGSEERR--ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTL 314
           C+DCKN      +    + H + N +          IIG   SS +  L
Sbjct: 187 CVDCKNVYNFCLKIFPEIEHVESNEMESTNANVVLFIIGCENSSNFQNL 235


>gi|118376115|ref|XP_001021240.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|89303007|gb|EAS00995.1| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 379

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 69/132 (52%), Gaps = 19/132 (14%)

Query: 155 QCNCRNSRCLKLYCECFAAGIYC--DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 212
           +CNC+ ++C+K YC C+A   +C    C C +C N        Q++  +T ++     +P
Sbjct: 246 ECNCQTTQCIKKYCSCYANNNFCVPSICKCQSCMNTF------QQSYNSTSQK-----KP 294

Query: 213 KIASSPHGAQDAR----EDAREAQLAAKH-NKGCHCKKSGCLKKYCECFQANILCSENCR 267
           K+ SS            ++  + ++  ++    C C+KS C K+YC+C+++   CS  C+
Sbjct: 295 KLISSQKSLSQVILYQIQNINQLKIQKENATISCTCQKSFCKKQYCDCYKSGKKCSTLCK 354

Query: 268 CLDCKNFEGSEE 279
           C +C N +G +E
Sbjct: 355 CQNCFN-QGQKE 365


>gi|195155474|ref|XP_002018629.1| GL25902 [Drosophila persimilis]
 gi|194114782|gb|EDW36825.1| GL25902 [Drosophila persimilis]
          Length = 296

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 265
           P+A   ++ ++P      R   R  +   +   KGC CK++ C+K YC+C+Q+  +C E 
Sbjct: 5   PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64

Query: 266 CRCLDCKN 273
           C+C+DCKN
Sbjct: 65  CKCVDCKN 72



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           P K  + R +PR     K+ T    K C C+ + C+K YC+C+ +   C + C CV+C N
Sbjct: 16  PVKTAAGRRQPRTPKTPKEST---IKGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN 72

Query: 188 NVEHEVA 194
           +VE  + 
Sbjct: 73  SVERPLV 79


>gi|198476800|ref|XP_001357481.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
 gi|198137857|gb|EAL34551.2| GA12700 [Drosophila pseudoobscura pseudoobscura]
          Length = 296

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 207 PNAFRPKIASSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSEN 265
           P+A   ++ ++P      R   R  +   +   KGC CK++ C+K YC+C+Q+  +C E 
Sbjct: 5   PSALTTRVIAAPVKTAAGRRQPRTPKTPKESTIKGCACKRTSCIKNYCDCYQSMRICHEF 64

Query: 266 CRCLDCKN 273
           C+C+DCKN
Sbjct: 65  CKCVDCKN 72



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 4/67 (5%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           P K  + R +PR     K+ T    K C C+ + C+K YC+C+ +   C + C CV+C N
Sbjct: 16  PVKTAAGRRQPRTPKTPKEST---IKGCACKRTSCIKNYCDCYQSMRICHEFCKCVDCKN 72

Query: 188 NVEHEVA 194
           +VE  + 
Sbjct: 73  SVERPLV 79


>gi|145545532|ref|XP_001458450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426270|emb|CAK91053.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 58/134 (43%), Gaps = 22/134 (16%)

Query: 144 DGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVG-- 200
           +G++    +Q  C+C  S CL++YC CF  G  C   C C  C N  ++ + R++AV   
Sbjct: 77  EGQEIQQVEQFSCSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKREKAVNYA 136

Query: 201 -ATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQAN 259
               +RN      KI  +                      GC+C K+ C+K YCECF   
Sbjct: 137 KKKAQRNKKVPEGKIFETTEIW------------------GCNCSKTRCVKNYCECFIRG 178

Query: 260 ILCSENCRCLDCKN 273
             CS  C C  C N
Sbjct: 179 KKCSVECNCQHCDN 192



 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 5/55 (9%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN----NVEHEVARQEAVGATLER 205
           CNC  +RC+K YCECF  G  C   CNC +C N    ++ +E+ RQ  +   + +
Sbjct: 160 CNCSKTRCVKNYCECFIRGKKCSVECNCQHCDNGKDEDLINEIKRQNQIKKRIRK 214



 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 26/41 (63%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C C KS CL+ YC CF    +C ++C+C +C+N E +  +R
Sbjct: 89  CSCSKSHCLQMYCSCFHNGRVCGKSCKCQECENAEDNIIKR 129


>gi|145531914|ref|XP_001451718.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419384|emb|CAK84321.1| unnamed protein product [Paramecium tetraurelia]
          Length = 242

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 57/129 (44%), Gaps = 26/129 (20%)

Query: 146 KDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLE 204
           KD  P  Q +C C  S+C + YCECFA G  C   C C NC N         + +   LE
Sbjct: 89  KDLQP--QIKCQCNKSQCQQSYCECFARGKTCGKHCGCQNCQN---------KQMNKKLE 137

Query: 205 RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSE 264
           +                    +  R  Q  +++ KGC C KS C  K+C C Q    C+ 
Sbjct: 138 KIQKK--------------LIKQKRIKQRTSRYLKGCTCGKSKCQNKFCSCHQDGKYCNS 183

Query: 265 NCRCLDCKN 273
            CRC+DCKN
Sbjct: 184 GCRCVDCKN 192


>gi|145531469|ref|XP_001451501.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419156|emb|CAK84104.1| unnamed protein product [Paramecium tetraurelia]
          Length = 192

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 65/162 (40%), Gaps = 41/162 (25%)

Query: 116 PHPVHKLPLPTLPPGKQESPRSRP-RANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAG 174
           P P+ K   P   P    + +  P + N        K   +C C  S+C K YCECFA G
Sbjct: 33  PMPILKFFDPDESPNILSAKKPFPIQKNIVKPKEIEKSIVKCKCLKSKCQKSYCECFAVG 92

Query: 175 IYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQL 233
             C   CNC+ C+N+   +  ++   G                                 
Sbjct: 93  RSCSIDCNCLGCNNSNCSQKIKKSQCG--------------------------------- 119

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
                 GC+CKK+GC+KKYCEC+     C+  C C+DC N E
Sbjct: 120 ------GCNCKKTGCVKKYCECYLKKQRCTFLCNCIDCCNQE 155


>gi|224123378|ref|XP_002319064.1| predicted protein [Populus trichocarpa]
 gi|222857440|gb|EEE94987.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 48/114 (42%), Gaps = 33/114 (28%)

Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           L  YCECFAAG+YC + C+C++C NN  HE    E  G    RNP AF PK+  +   A 
Sbjct: 8   LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIESRNPLAFAPKVIRNLDSAS 67

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEG 276
           +                                F+  + CS  CRC  CKN  G
Sbjct: 68  E--------------------------------FRGGVGCSSYCRCEGCKNTFG 89



 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 32/72 (44%), Gaps = 10/72 (13%)

Query: 249 LKKYCECFQANILCSENCRCLDCKNFEGSE----ERRALFHGDHNGIAFMQRA-----AN 299
           L  YCECF A + C E C CLDC N  G E    E R       N +AF  +      + 
Sbjct: 8   LYSYCECFAAGLYCIEPCSCLDCFNNPGHEGTVLETRGQIES-RNPLAFAPKVIRNLDSA 66

Query: 300 AAIIGAVGSSGY 311
           +   G VG S Y
Sbjct: 67  SEFRGGVGCSSY 78


>gi|229594976|ref|XP_001032555.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila]
 gi|225566469|gb|EAR84892.3| Tesmin/TSO1-like CXC domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 622

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 3/83 (3%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQ--CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           K++ P ++ ++    K+G   +  +  C CR S CLKLYCECFA G  C   C C+ C N
Sbjct: 302 KKQQPNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKN 361

Query: 188 NVEHEVARQEAVGATLERNPNAF 210
           +  H   R EA+     +NP+AF
Sbjct: 362 SKNHLELRNEAIKVIEAKNPSAF 384



 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%)

Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGS 277
           P+    + + ++E ++       C C+KS CLK YCECF    +C   C CL+CKN +  
Sbjct: 306 PNNKLQSIDSSKEGEILRVSKTPCKCRKSLCLKLYCECFARGEICGHACVCLECKNSKNH 365

Query: 278 EERR 281
            E R
Sbjct: 366 LELR 369


>gi|145530936|ref|XP_001451240.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418884|emb|CAK83843.1| unnamed protein product [Paramecium tetraurelia]
          Length = 263

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 83/200 (41%), Gaps = 27/200 (13%)

Query: 80  LPPSSQPHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPG---KQESPR 136
           L  SSQ H+     +  +  P++Q+  + +R  +R  + + +   P    G   K++  R
Sbjct: 71  LTTSSQHHL-----NEDEWKPEYQSSQKQLRFSDRSQNHIEEENKPKQARGIKDKEQRQR 125

Query: 137 SRPRANTDGKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEV 193
           ++ R +    D        K C+C  ++CL+LYC CF     C   C C  C N+  H+V
Sbjct: 126 TKRRVDYGVGDSFSNDSDLKPCHCSKTKCLQLYCSCFHKNKACSSKCQCKGCFNDGNHKV 185

Query: 194 ARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYC 253
              E V A  +     +R  ++ S   + D R+            KG           YC
Sbjct: 186 ---EMVKALQKVKLKEYR--VSQSDLDSFDTRQVLGCKCKKTHCKKG-----------YC 229

Query: 254 ECFQANILCSENCRCLDCKN 273
           ECF     C+  C+C +C N
Sbjct: 230 ECFIRGRKCTSQCKCCECNN 249


>gi|145481073|ref|XP_001426559.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393634|emb|CAK59161.1| unnamed protein product [Paramecium tetraurelia]
          Length = 254

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 50/117 (42%), Gaps = 35/117 (29%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNN-VEHEVARQEAVGATLERNPNAFRPK 213
           C C+ ++C+K YCECF  G  C   C C  C N  V+    + + V  T+E         
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCNKQVKQSSGKMQLVSPTIEF-------- 195

Query: 214 IASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
                                    +GC C+K  C KKYCEC + N  C++ C+C+D
Sbjct: 196 -------------------------QGCKCQKQNCSKKYCECLKRNQRCTDLCKCID 227



 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           C CKK+ C+K YCECFQ    CS  C C  C N
Sbjct: 144 CRCKKTKCIKYYCECFQYGQECSFKCECTGCCN 176


>gi|195434044|ref|XP_002065013.1| GK14905 [Drosophila willistoni]
 gi|194161098|gb|EDW75999.1| GK14905 [Drosophila willistoni]
          Length = 250

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 45/69 (65%), Gaps = 8/69 (11%)

Query: 213 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 272
           K A S  G + AR  A++  +     KGC CK+S C+K YC+C+Q+  +CS++CRC+ C+
Sbjct: 3   KPAPSGTGKRSARGQAQQPTI-----KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCR 57

Query: 273 NFEGSEERR 281
           N   ++ER+
Sbjct: 58  N---TQERK 63



 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 154 KQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
           K C+C+ S+C+K YC+C+ +   C   C C+ C N  E +
Sbjct: 24  KGCSCKRSQCIKNYCDCYQSMAVCSKHCRCIGCRNTQERK 63


>gi|145521839|ref|XP_001446769.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414258|emb|CAK79372.1| unnamed protein product [Paramecium tetraurelia]
          Length = 151

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 20/133 (15%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C C+ ++CLKLYC+CF  G  C + C+C  C N++     R +A+   + +         
Sbjct: 27  CQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAIVKIIHK--------- 77

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
                      +D  + + +     GC CKKS C   YCECF     C + C C  C+N 
Sbjct: 78  ----------YKDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNG 127

Query: 275 EGSEERRALFHGD 287
           +   ++    H D
Sbjct: 128 KKQNQKICANHED 140



 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 12/113 (10%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKN--FEGSEERRALFHGDHNGIAFMQRA 297
           GC C+K+ CLK YC+CF     C E C C  CKN   +  E  +A+       +  + + 
Sbjct: 26  GCQCQKTKCLKLYCQCFHEGKCCGEQCSCSGCKNSLTDNFERNKAI-------VKIIHKY 78

Query: 298 ANAAIIGAVGSSGYGTLMTSNK---RKSEELLFGVAAKDQSVIRNPQSQQGNN 347
            + + +   G  G G     +K      E  + G +  DQ   +  Q+ +  N
Sbjct: 79  KDESKLKFSGDKGSGCCCKKSKCQLNYCECFIKGRSCGDQCHCKKCQNGKKQN 131


>gi|222628471|gb|EEE60603.1| hypothetical protein OsJ_14005 [Oryza sativa Japonica Group]
          Length = 97

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           + + C C  S C+KLYC CF+A  YC   C C NC N   +E   +E V     +NP AF
Sbjct: 5   QHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAF 64

Query: 211 RPKIASSPHGAQDARE 226
            PKI       QDA E
Sbjct: 65  DPKIVR----VQDASE 76



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 30/47 (63%)

Query: 233 LAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           ++   ++ C C KS C+K YC CF A   CS+NCRC +CKN E  E+
Sbjct: 1   MSNDQHRSCRCVKSNCVKLYCPCFSAYGYCSQNCRCTNCKNREYYED 47


>gi|395834329|ref|XP_003790159.1| PREDICTED: protein lin-54 homolog [Otolemur garnettii]
          Length = 677

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 234 AAKHNKGCHCKKSGCLKKY--CECFQANILCSENCRCLDCKNFEGSEERRALFH 285
           A++  K C+C KS CLK +       A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 518 ASRPRKPCNCTKSLCLKLWEVNNISMAKIMCSSICKCIGCKNFEESPERKTLMH 571


>gi|195473860|ref|XP_002089210.1| GE18995 [Drosophila yakuba]
 gi|194175311|gb|EDW88922.1| GE18995 [Drosophila yakuba]
          Length = 237

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 216 SSPHGAQDAREDAREAQLAAKHN-KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
           S+P      R + R+A+       KGC CK+S C+K YC+C+Q+  +C+  CRC+DC+N 
Sbjct: 2   STPKKKSVDRSEGRKAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRNT 61

Query: 275 E 275
           E
Sbjct: 62  E 62



 Score = 37.7 bits (86), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 4/61 (6%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           P K+   RS  R     K   P   K C C+ S+C+K YC+C+ +   C   C C++C N
Sbjct: 4   PKKKSVDRSEGR---KAKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTMFCRCIDCRN 60

Query: 188 N 188
            
Sbjct: 61  T 61


>gi|194759119|ref|XP_001961797.1| GF14749 [Drosophila ananassae]
 gi|190615494|gb|EDV31018.1| GF14749 [Drosophila ananassae]
          Length = 324

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 35/45 (77%), Gaps = 3/45 (6%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           KGC CK+S C+K YC+C+Q+ ++CS+ CRC+ C+N   +E R+A+
Sbjct: 103 KGCCCKRSQCIKNYCDCYQSMVICSKFCRCVGCRN---TEVRKAV 144


>gi|145529794|ref|XP_001450680.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418302|emb|CAK83283.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 23/123 (18%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K  +C C  S CLK YC+CFA G  C   C C  CHN  ++   R+EA+     +N +  
Sbjct: 54  KSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEEREEAINKLKMQNQSIE 113

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +      P G     + ++                   LK+YC+CFQ N  CSE+C+C +
Sbjct: 114 KE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCSESCQCNN 151

Query: 271 CKN 273
           C N
Sbjct: 152 CSN 154



 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 281
           C C KS CLK+YC+CF    +C+  C+C  C N E   E R
Sbjct: 58  CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYLEER 98


>gi|223993335|ref|XP_002286351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977666|gb|EED95992.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 957

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 14/81 (17%)

Query: 143 TDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG--CNCVNCHNNVEHEVARQEAVG 200
           T  K    K  K C C+ ++CLKLYC CF+  I CD   C+C  C  N + E+AR    G
Sbjct: 305 TASKSNKKKTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERC-ANTQDELARDGDNG 363

Query: 201 -----------ATLERNPNAF 210
                      + LERNP AF
Sbjct: 364 MAMSAVVTARKSILERNPRAF 384



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 225 REDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           R++A  ++   K  + C CKKS CLKKYCECF  ++ C+  C+C +C N
Sbjct: 643 RQEAINSKALTKQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691



 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 47/108 (43%), Gaps = 21/108 (19%)

Query: 99  MPQWQARPQHVRMVN--------------RVPHPVHKLPLPTL----PPGKQESPRSRPR 140
           MP W+ R  + R V               R  HP H+ P        PP +Q     R +
Sbjct: 586 MPSWEGRHYYPREVESRRSYDYYYDPYHPRSSHP-HEPPQHYHGYYPPPHRQYYDEPRRQ 644

Query: 141 ANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
              + K  T K+ + C C+ S+CLK YCECFA  + C   C C NC N
Sbjct: 645 EAINSKALT-KQTRICRCKKSQCLKKYCECFADSLKCTSACKCENCGN 691



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 32/59 (54%), Gaps = 6/59 (10%)

Query: 236 KHNKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEGSEERRALFHGDHNGIAF 293
           K +KGC CKK+ CLK YC CF  +I C  + C C  C N +    R     GD NG+A 
Sbjct: 313 KTHKGCTCKKTKCLKLYCACFSHSIACDPSVCSCERCANTQDELARD----GD-NGMAM 366


>gi|145490347|ref|XP_001431174.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398277|emb|CAK63776.1| unnamed protein product [Paramecium tetraurelia]
          Length = 280

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 73/159 (45%), Gaps = 20/159 (12%)

Query: 119 VHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQ---KQCNCRNSRCLKLYCECFAAGI 175
           + +  +  +P  +Q   R +        D +P +Q   K C+C  + CL+LYC CF    
Sbjct: 111 IRQFEIQEIPTIQQIQTRRKGNKANYNVDDSPVEQDEMKPCHCTKTHCLQLYCSCFHNRR 170

Query: 176 YC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLA 234
            C   C C +C+N+ +HE   Q+AV           + K   + H   D+          
Sbjct: 171 RCMSECRCNDCYNDGQHEEEVQKAV--------EQIKIKEQRASHHDLDS--------FD 214

Query: 235 AKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
            K   GC CKK+ C+K YCECF     C+ +C+C +C+N
Sbjct: 215 TKQVWGCKCKKTKCVKGYCECFIRGKKCTSHCQCTECEN 253



 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 22/41 (53%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           K CHC K+ CL+ YC CF     C   CRC DC N    EE
Sbjct: 149 KPCHCTKTHCLQLYCSCFHNRRRCMSECRCNDCYNDGQHEE 189


>gi|195388490|ref|XP_002052913.1| GJ17819 [Drosophila virilis]
 gi|194149370|gb|EDW65068.1| GJ17819 [Drosophila virilis]
          Length = 225

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   +EER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRN---TEERLAV 66



 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 37/77 (48%), Gaps = 7/77 (9%)

Query: 136 RSRPRANTDGKDGTPKKQ--KQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHE 192
           RS    N +     PK+Q  K C C+ S+C+K YC+C+ +   C   C C+ C N  E  
Sbjct: 5   RSGNYFNKEQSKRQPKQQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCIGCRNTEERL 64

Query: 193 VA----RQEAVGATLER 205
                 ++ A+ A  ER
Sbjct: 65  AVATPPKKTALAAKRER 81


>gi|194856723|ref|XP_001968813.1| GG24300 [Drosophila erecta]
 gi|190660680|gb|EDV57872.1| GG24300 [Drosophila erecta]
          Length = 239

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 29/37 (78%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 275
           KGC CK+S C+K YC+C+Q+  +C++ CRC+ C+N E
Sbjct: 26  KGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRNTE 62



 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 33/67 (49%), Gaps = 5/67 (7%)

Query: 129 PGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           P K+   R+  R    GK   P   K C C+ S+C+K YC+C+ +   C   C C+ C N
Sbjct: 4   PKKKNVERTESR---RGKGQGPGGVKGCCCKRSQCIKNYCDCYQSMAICTKFCRCIGCRN 60

Query: 188 N-VEHEV 193
             V H V
Sbjct: 61  TEVRHLV 67


>gi|147832526|emb|CAN77361.1| hypothetical protein VITISV_011036 [Vitis vinifera]
          Length = 514

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 46/121 (38%), Gaps = 45/121 (37%)

Query: 164 LKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 222
           L  YC+C AAG+YC D C C NC N  E+E   Q        R+P AF P+I        
Sbjct: 244 LSSYCDCLAAGVYCTDSCACSNCLNKSENEGVVQIIREKIESRDPLAFAPRI-------- 295

Query: 223 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN-FEGSEERR 281
                                              A I CS+ CRC DC+N F    +RR
Sbjct: 296 -----------------------------------AGIGCSDGCRCEDCRNSFGIKADRR 320

Query: 282 A 282
           A
Sbjct: 321 A 321


>gi|219114349|ref|XP_002176345.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217402591|gb|EEC42581.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 636

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 240 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
           GC C++S CLKKYCECFQ    C  NC+C +C+N 
Sbjct: 245 GCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQNL 279



 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHN 187
           Q  C CR S CLK YCECF     C   C C NC N
Sbjct: 243 QFGCKCRRSFCLKKYCECFQNASPCGLNCKCTNCQN 278


>gi|145520831|ref|XP_001446271.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413748|emb|CAK78874.1| unnamed protein product [Paramecium tetraurelia]
          Length = 229

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 23/123 (18%)

Query: 152 KQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHEVARQEAVGATLERNPNAF 210
           K  +C C  S CLK YC+CFA G  C   C C  CHN  +++  R EA+     +N +  
Sbjct: 54  KSLKCKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERGEAINKLKLQNQSIE 113

Query: 211 RPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
           +      P G     + ++                   LK+YC+CFQ N  C E+C+C +
Sbjct: 114 KE----VPVGISCKCKKSKC------------------LKRYCDCFQNNQKCHESCQCSN 151

Query: 271 CKN 273
           C N
Sbjct: 152 CSN 154



 Score = 44.7 bits (104), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 282
           C C KS CLK+YC+CF    +C+  C+C  C N E  +E R 
Sbjct: 58  CKCNKSMCLKQYCDCFANGNMCTTQCQCQGCHNTEDYKEERG 99


>gi|397568266|gb|EJK46048.1| hypothetical protein THAOC_35308 [Thalassiosira oceanica]
          Length = 754

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 24/37 (64%)

Query: 237 HNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           H  GC C+KS CLKKYCECF   + C  +C C +C N
Sbjct: 421 HRTGCKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457



 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 238 NKGCHCKKSGCLKKYCECFQANILCSEN-CRCLDCKNFEG-SEERRALF 284
           +KGC C K+ CLK YC CF  N+LC+++ C C DCKN E  S E  A+ 
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEAESTEDGAIL 279



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 153 QKQCNCRNSRCLKLYCECFAAGIYC--DGC---NCVNCHNNVEHEVARQEAVGATLERNP 207
            K C+C  ++CLKLYC CFA  + C  D C   +C NC      + A  +A  + L +NP
Sbjct: 231 DKGCSCSKTQCLKLYCSCFANNLLCNKDVCICSDCKNCEAESTEDGAILKARRSILRKNP 290

Query: 208 NAFRPKIAS 216
            AF+ K  S
Sbjct: 291 KAFKSKFQS 299



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/33 (60%), Positives = 23/33 (69%), Gaps = 1/33 (3%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHN 187
           C CR SRCLK YCECF+ G+ C   C+C NC N
Sbjct: 425 CKCRKSRCLKKYCECFSNGVKCGASCSCENCGN 457


>gi|195118094|ref|XP_002003575.1| GI21907 [Drosophila mojavensis]
 gi|193914150|gb|EDW13017.1| GI21907 [Drosophila mojavensis]
          Length = 234

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 33/45 (73%), Gaps = 3/45 (6%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           KGC CK+S C+K YC+C+Q+  +CS  C+C+ C+N   ++ER A+
Sbjct: 25  KGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRN---TQERLAV 66



 Score = 39.7 bits (91), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 14/79 (17%)

Query: 132 QESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVE 190
           +E P+ +P+         P+  K C C+ S+C+K YC+C+ +   C   C CV C N  E
Sbjct: 12  KEHPKRQPK---------PQAPKGCCCKRSQCIKNYCDCYQSMAICSIYCKCVGCRNTQE 62

Query: 191 HEVA----RQEAVGATLER 205
                   ++ A+ A  ER
Sbjct: 63  RLAVATPPKKTALAAKRER 81


>gi|195034702|ref|XP_001988959.1| GH11450 [Drosophila grimshawi]
 gi|193904959|gb|EDW03826.1| GH11450 [Drosophila grimshawi]
          Length = 225

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 34/45 (75%), Gaps = 3/45 (6%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           +GC CK+S C+K YC+C+Q+  +CS +C+C+ C+N   +E+R A+
Sbjct: 22  RGCCCKRSQCIKNYCDCYQSMAVCSIHCKCIGCRN---TEQRMAV 63


>gi|195342745|ref|XP_002037959.1| GM18017 [Drosophila sechellia]
 gi|194132809|gb|EDW54377.1| GM18017 [Drosophila sechellia]
          Length = 240

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 27/35 (77%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           KGC CK+S C+K YC+C+Q+  +C++ CRC  C+N
Sbjct: 23  KGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRN 57



 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 134 SPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDG-CNCVNCHNNVEHE 192
           SPR +      GK       K C+C+ S+C+K YC+C+ +   C   C C  C N V  E
Sbjct: 3   SPRKKTVQGRRGKGQGAGGVKGCSCKRSQCIKNYCDCYQSMAICTKFCRCTACRNTVVRE 62

Query: 193 VARQEAVGATLERNPNAFR 211
           +    +V     +N NA +
Sbjct: 63  LVDPNSVA----KNSNAVK 77


>gi|145484609|ref|XP_001428314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395399|emb|CAK60916.1| unnamed protein product [Paramecium tetraurelia]
          Length = 264

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 56/138 (40%), Gaps = 17/138 (12%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
           S+   N D       + K C+C  + CL+LYC CF     C   C C +C+N+ +H    
Sbjct: 129 SKANYNVDESPTLDSEIKPCHCSKTNCLQLYCSCFHNRRQCTQECKCCDCYNDGQHTDEV 188

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
            +AV     +   A    + S      D R+         K  KG           YCEC
Sbjct: 189 LKAVEQIKIKEQRASHHDLDSF-----DTRQVWGCKCKKTKCVKG-----------YCEC 232

Query: 256 FQANILCSENCRCLDCKN 273
           F  N  CS +C C DC+N
Sbjct: 233 FIRNKKCSSHCHCSDCQN 250


>gi|24581949|ref|NP_608936.1| tombola [Drosophila melanogaster]
 gi|7296988|gb|AAF52259.1| tombola [Drosophila melanogaster]
 gi|294862322|gb|ADE06681.2| RT06796p1 [Drosophila melanogaster]
          Length = 243

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 28/38 (73%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSE 278
           C CK+S C+K YC+C+Q+  +C++ CRC+ C+N E  E
Sbjct: 28  CCCKRSQCIKNYCDCYQSMAICTKFCRCVGCRNTEVRE 65


>gi|145508477|ref|XP_001440188.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407394|emb|CAK72791.1| unnamed protein product [Paramecium tetraurelia]
          Length = 235

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 37/116 (31%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C CR S+C++ YC C A    C   C C NC N                       +PK 
Sbjct: 138 CKCRRSKCIQNYCVCSANNQECKLNCECYNCSNK----------------------QPK- 174

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD 270
            S+P            +QL++    GC+C+K  C K+YCEC + NI C+  C C +
Sbjct: 175 -STP------------SQLSSIEFNGCNCRKQNCSKRYCECQKRNIKCTSKCNCCE 217


>gi|145545676|ref|XP_001458522.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426342|emb|CAK91125.1| unnamed protein product [Paramecium tetraurelia]
          Length = 289

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 17/138 (12%)

Query: 137 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVAR 195
           ++   N D       + K C+C  + CL+LYC CF     C   C C +C+N+ +HE   
Sbjct: 139 NKANYNVDNSPVEEDEMKPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYNDGQHE--- 195

Query: 196 QEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCEC 255
            E V   +E+     + K   + H   D+           K   GC CKK+ C+K YCEC
Sbjct: 196 -EDVLKAVEQ----IKMKEQRASHHDLDS--------FDTKQVWGCKCKKTKCVKGYCEC 242

Query: 256 FQANILCSENCRCLDCKN 273
           F     C+ +C+C +C+N
Sbjct: 243 FIRGKKCTSHCQCTECEN 260



 Score = 42.7 bits (99), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 239 KGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 283
           K CHC K+ CL+ YC CF     C   C+C DC N +G  E   L
Sbjct: 156 KPCHCTKTHCLQLYCSCFHNRRQCMSECKCNDCYN-DGQHEEDVL 199


>gi|145483717|ref|XP_001427881.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394964|emb|CAK60483.1| unnamed protein product [Paramecium tetraurelia]
          Length = 219

 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 50/122 (40%), Gaps = 38/122 (31%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C CR S+C++ YC C A    C   C C NC N                 ++P +  PK+
Sbjct: 122 CKCRKSKCIQNYCVCSANNQECKSNCECYNCSN-----------------KSPKSAPPKM 164

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCL-DCKN 273
           +S                       GC+C+K  C K+YCEC +    C+  C C  DC+N
Sbjct: 165 SSIEFN-------------------GCNCRKQNCSKRYCECQKRMTRCTSRCNCCEDCEN 205

Query: 274 FE 275
            E
Sbjct: 206 QE 207


>gi|145520699|ref|XP_001446205.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413682|emb|CAK78808.1| unnamed protein product [Paramecium tetraurelia]
          Length = 226

 Score = 46.2 bits (108), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 47/120 (39%), Gaps = 38/120 (31%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C CR S+C++ YC C A    C   C C NC N                       +PK 
Sbjct: 129 CKCRKSKCIQNYCVCSANNQECSFKCECYNCSNK----------------------QPKS 166

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 273
           A S     +                GC+C+K  C K YCEC +  I C+  CRC D CKN
Sbjct: 167 AISKQSGSEYL--------------GCNCRKQDCSKGYCECQKRKIKCTSKCRCCDECKN 212


>gi|428180254|gb|EKX49122.1| hypothetical protein GUITHDRAFT_151625 [Guillardia theta CCMP2712]
          Length = 289

 Score = 46.2 bits (108), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 23/33 (69%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           C CK+S C+KKYCECFQ +  C+  C C DC N
Sbjct: 158 CKCKRSRCMKKYCECFQNSSACTSRCTCNDCGN 190


>gi|145492971|ref|XP_001432482.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124399594|emb|CAK65085.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 44.7 bits (104), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 69/144 (47%), Gaps = 22/144 (15%)

Query: 136 RSRPRANTDGK--DGTPKKQ---KQCNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNV 189
           ++R +AN      D +P ++   K C+C  + CL+LYC CF     C + C C +C N+ 
Sbjct: 101 QTRRKANKTNYNVDDSPTEEDDFKACHCSKTHCLQLYCSCFHNRRPCTNECKCNDCFNDG 160

Query: 190 EHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCL 249
           +HE    E + A         + K   + H   D+           K   GC CKK+ C+
Sbjct: 161 KHE---DEVIKAV-----EQIKLKEQRASHHDLDS--------FDTKQVWGCKCKKTKCV 204

Query: 250 KKYCECFQANILCSENCRCLDCKN 273
           K YCECF     C+ +C+C +C+N
Sbjct: 205 KGYCECFIRGKKCTSHCQCTECEN 228


>gi|218194439|gb|EEC76866.1| hypothetical protein OsI_15056 [Oryza sativa Indica Group]
          Length = 177

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 167 YCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAR 225
           YC CF+A  YC   C C NC N   +E   +E V     +NP AF PKI       QDA 
Sbjct: 100 YCLCFSAYGYCSQNCRCTNCKNREYYEDFVEERVDMIKMKNPRAFDPKIVR----VQDAS 155

Query: 226 E 226
           E
Sbjct: 156 E 156



 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 241 CHCKKS--GCLKKYCECFQANILCSENCRCLDCKNFEGSEE 279
           CH KK   G  + YC CF A   CS+NCRC +CKN E  E+
Sbjct: 87  CHFKKRARGKQESYCLCFSAYGYCSQNCRCTNCKNREYYED 127


>gi|145495603|ref|XP_001433794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400914|emb|CAK66397.1| unnamed protein product [Paramecium tetraurelia]
          Length = 253

 Score = 44.3 bits (103), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 17/121 (14%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C+C  + CL+LYC CF     C + C C +C+N+ +HE    +AV           + K 
Sbjct: 149 CHCAKTHCLQLYCSCFHNRRPCTNECKCNDCYNDGKHEDEVLKAV--------EQIKLKE 200

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNF 274
             + H   D+           K   GC CKK+ C+K YCECF     C+ +C+C + K  
Sbjct: 201 QRASHHDLDS--------FDTKQVWGCKCKKTKCVKGYCECFIRGKKCTSHCQCTELKTI 252

Query: 275 E 275
           E
Sbjct: 253 E 253


>gi|16764321|ref|NP_459936.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
           Typhimurium str. LT2]
 gi|374981628|ref|ZP_09722952.1| Cell division protein FtsK [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|34395700|sp|Q8ZQD5.1|FTSK_SALTY RecName: Full=DNA translocase FtsK
 gi|16419472|gb|AAL19895.1| cell division protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. LT2]
 gi|321223284|gb|EFX48353.1| Cell division protein FtsK [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
          Length = 1351

 Score = 43.9 bits (102), Expect = 0.17,   Method: Composition-based stats.
 Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 11/110 (10%)

Query: 44  PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
           PQ+  P + P P   QP  Q   A P ++Q     Q +P +S P   A           +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421

Query: 95  PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTD 144
           P +  PQWQ  P H       P P+   P    PP  ++   + P  +T+
Sbjct: 422 PQETQPQWQPEPTHQPTPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTE 471


>gi|378983520|ref|YP_005246675.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
           Typhimurium str. T000240]
 gi|312911948|dbj|BAJ35922.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
           Typhimurium str. T000240]
          Length = 1370

 Score = 43.9 bits (102), Expect = 0.17,   Method: Composition-based stats.
 Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 11/110 (10%)

Query: 44  PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
           PQ+  P + P P   QP  Q   A P ++Q     Q +P +S P   A           +
Sbjct: 383 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 440

Query: 95  PSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTD 144
           P +  PQWQ  P H       P P+   P    PP  ++   + P  +T+
Sbjct: 441 PQETQPQWQPEPTHQPTPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTE 490


>gi|145529912|ref|XP_001450739.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418361|emb|CAK83342.1| unnamed protein product [Paramecium tetraurelia]
          Length = 226

 Score = 42.0 bits (97), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 45/120 (37%), Gaps = 38/120 (31%)

Query: 156 CNCRNSRCLKLYCECFAAGIYCD-GCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKI 214
           C CR S+C+K YC C A    C   C C NC N                       +PK 
Sbjct: 129 CRCRKSKCIKNYCVCSANNQECTFKCECYNCSNK----------------------QPKS 166

Query: 215 ASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD-CKN 273
           A S     +                GC+C++  C K YCEC +    C+  C C D CKN
Sbjct: 167 AVSKQSGSEYL--------------GCNCRRQDCSKGYCECQKRKTKCTARCNCCDECKN 212


>gi|145483579|ref|XP_001427812.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394895|emb|CAK60414.1| unnamed protein product [Paramecium tetraurelia]
          Length = 261

 Score = 41.6 bits (96), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 241 CHCKKSGCLKKYCECFQANILCSENCRCLDCKN 273
           C C KS CLK+YC+CF    +C+  C+C  C N
Sbjct: 90  CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122



 Score = 41.2 bits (95), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 156 CNCRNSRCLKLYCECFAAGIYC-DGCNCVNCHN 187
           C C  S CLK YC+CFA G  C + C C  CHN
Sbjct: 90  CKCNKSMCLKQYCDCFANGNMCTNQCQCQGCHN 122


>gi|123410061|ref|XP_001303597.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884989|gb|EAX90667.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 162

 Score = 41.6 bits (96), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 26/62 (41%), Positives = 29/62 (46%), Gaps = 2/62 (3%)

Query: 150 PKKQKQCNCRNSRCLKLYCECFAAGIY-CDGCNCVNCHNNVEHEVARQEAVGATLERNPN 208
           PK    CNC  + CL   C CF    Y CDGC C  C N  E+E  R  +    L  NP 
Sbjct: 6   PKCASCCNCEGN-CLSTTCPCFLHSKYCCDGCKCQKCRNKKEYEQERVASFEQHLLENPL 64

Query: 209 AF 210
           AF
Sbjct: 65  AF 66


>gi|76155324|gb|AAX26592.2| SJCHGC03965 protein [Schistosoma japonicum]
          Length = 176

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/22 (72%), Positives = 17/22 (77%)

Query: 153 QKQCNCRNSRCLKLYCECFAAG 174
           +K CNC  S CLKLYCECFA G
Sbjct: 144 RKPCNCTKSHCLKLYCECFAQG 165


>gi|345309828|ref|XP_001507154.2| PREDICTED: protein lin-54 homolog [Ornithorhynchus anatinus]
          Length = 182

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 258 ANILCSENCRCLDCKNFEGSEERRALFH 285
           A I+CS  C+C+ CKNFE S ER+ L H
Sbjct: 49  AKIMCSSICKCIGCKNFEESPERKTLMH 76


>gi|16759831|ref|NP_455448.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
           Typhi str. CT18]
 gi|29142396|ref|NP_805738.1| DNA translocase FtsK [Salmonella enterica subsp. enterica serovar
           Typhi str. Ty2]
 gi|34395698|sp|Q8Z814.1|FTSK_SALTI RecName: Full=DNA translocase FtsK
 gi|25512677|pir||AF0611 cell division protein FtsK [imported] - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16502124|emb|CAD05360.1| cell division protein FtsK [Salmonella enterica subsp. enterica
           serovar Typhi]
 gi|29138026|gb|AAO69587.1| cell division protein FtsK [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
          Length = 1343

 Score = 40.8 bits (94), Expect = 1.2,   Method: Composition-based stats.
 Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 20/114 (17%)

Query: 44  PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
           PQ+  P + P P   QP  Q   A P ++Q     Q +P +S P   A           +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421

Query: 95  PSQAMPQWQA-------RPQHVRM-VNRVPHPVHKLPLPTLP-PGKQESPRSRP 139
           P +  PQWQA       +P+ +    + +P PV + P+ T P PG +E+  +RP
Sbjct: 422 PQETQPQWQAPDAEQHWQPEPIAAEPSHMPPPVIEQPVATEPEPGIEETRPARP 475


>gi|378960139|ref|YP_005217625.1| DNA translocase ftsK [Salmonella enterica subsp. enterica serovar
           Typhi str. P-stx-12]
 gi|374354011|gb|AEZ45772.1| DNA translocase ftsK [Salmonella enterica subsp. enterica serovar
           Typhi str. P-stx-12]
          Length = 1317

 Score = 40.8 bits (94), Expect = 1.2,   Method: Composition-based stats.
 Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 20/114 (17%)

Query: 44  PQSHSP-LQPQPPKSQPQVQSKVASPSQSQPLVRLQLLPPSSQPHVQA--------HMRS 94
           PQ+  P + P P   QP  Q   A P ++Q     Q +P +S P   A           +
Sbjct: 364 PQTGEPVIAPAPEGYQPHPQ--YAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLA 421

Query: 95  PSQAMPQWQA-------RPQHVRM-VNRVPHPVHKLPLPTLP-PGKQESPRSRP 139
           P +  PQWQA       +P+ +    + +P PV + P+ T P PG +E+  +RP
Sbjct: 422 PQETQPQWQAPDAEQHWQPEPIAAEPSHMPPPVIEQPVATEPEPGIEETRPARP 475


>gi|115615245|ref|XP_001199031.1| PREDICTED: uncharacterized protein LOC763146 [Strongylocentrotus
            purpuratus]
          Length = 1617

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)

Query: 241  CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 273
            C CKK+ C       K  C C Q NI C++ CRC++CKN
Sbjct: 1560 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1598


>gi|384246535|gb|EIE20025.1| hypothetical protein COCSUDRAFT_58250 [Coccomyxa subellipsoidea
           C-169]
          Length = 556

 Score = 39.7 bits (91), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 8/70 (11%)

Query: 204 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 263
           +RNP AF+ KI    +G        + + ++  H +GCHCKKS C KKYCECFQA + C 
Sbjct: 162 QRNPQAFQEKIEKQANG--------QGSGVSGLHKRGCHCKKSHCKKKYCECFQAGVNCG 213

Query: 264 ENCRCLDCKN 273
            +C+C +C N
Sbjct: 214 VHCKCDECHN 223


>gi|285026357|dbj|BAI67897.1| liver stage-specific antigen-1 [Plasmodium vivax]
          Length = 2149

 Score = 39.7 bits (91), Expect = 3.1,   Method: Composition-based stats.
 Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 12/130 (9%)

Query: 34  EQSVKSQ-SSQPQSHSPLQPQPP-------KSQPQVQSKVASPSQSQPLVRLQLLPPSSQ 85
           +Q +K Q   QP+ H   + QP        + QP +Q ++ S +Q +P      L P  +
Sbjct: 698 KQQMKPQLRPQPRPHPQQELQPGAQPEEQLEEQPDLQLELQSEAQPKPQQPQMRLKPQLR 757

Query: 86  PHVQAHMRSPSQAMPQWQARPQHVRMVNRVPHPVHKLPLPTLPPGKQESPRSRPRANTDG 145
             VQ       +A PQ +  PQ        PHP  K P P + P  Q  P  + + N   
Sbjct: 758 SKVQMRPHPQMKAHPQIKPNPQMKPNPQMKPHPQMK-PNPQMKPNPQMKPNPQMKPNPQM 816

Query: 146 KDGTPKKQKQ 155
           K   P+ Q Q
Sbjct: 817 K---PQPQLQ 823


>gi|390344245|ref|XP_003726079.1| PREDICTED: uncharacterized protein LOC764337 [Strongylocentrotus
            purpuratus]
          Length = 1742

 Score = 39.3 bits (90), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 6/39 (15%)

Query: 241  CHCKKSGCL------KKYCECFQANILCSENCRCLDCKN 273
            C CKK+ C       K  C C Q NI C++ CRC++CKN
Sbjct: 1685 CSCKKTSCEEPKAAGKGRCSCRQHNITCTDLCRCINCKN 1723


>gi|195147164|ref|XP_002014550.1| GL18894 [Drosophila persimilis]
 gi|194106503|gb|EDW28546.1| GL18894 [Drosophila persimilis]
          Length = 1100

 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 9/57 (15%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
           K++ P+   R  T G    PK      CRN+RC      C+ +G  C  C+CV CHN
Sbjct: 817 KEKPPKPICRCGTSGVAVNPK----TTCRNTRC-----PCYKSGNTCTNCHCVGCHN 864


>gi|325191197|emb|CCA25983.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 511

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 39/88 (44%), Gaps = 8/88 (9%)

Query: 218 PHGAQDAREDAREAQLAAKHNKGCHCKKSG-CLKKYCECFQANILCSENCRCLDCKN-FE 275
           P G+ D REDA E       NK   C+ SG C    C C + N +C+  CRC  C+N F 
Sbjct: 299 PLGSSDRREDALEK------NKVVSCRCSGGCRNGRCACVRENGMCTPACRCTSCQNPFS 352

Query: 276 GSEERRALFHGDHNGIAFMQRAANAAII 303
             +   A      N   FMQ A    ++
Sbjct: 353 IVKAAGANMDILKNDSCFMQNACKGRVM 380


>gi|198473707|ref|XP_001356412.2| GA16882 [Drosophila pseudoobscura pseudoobscura]
 gi|198138072|gb|EAL33476.2| GA16882 [Drosophila pseudoobscura pseudoobscura]
          Length = 1102

 Score = 38.9 bits (89), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 9/57 (15%)

Query: 131 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLYCECFAAGIYCDGCNCVNCHN 187
           K++ P+   R  T G    PK      CRN+RC      C+ +G  C  C+CV CHN
Sbjct: 815 KEKPPKPICRCGTSGVAVNPK----TTCRNTRC-----PCYKSGNTCTNCHCVGCHN 862


>gi|345481297|ref|XP_003424335.1| PREDICTED: hypothetical protein LOC100679675 [Nasonia vitripennis]
          Length = 539

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 234 AAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLD--CKNFEGSEE 279
             K+NKGC CK   C  K C C +  I+C   CRC D  C N E  +E
Sbjct: 103 GTKNNKGCSCK-GNCSNKICGCVKKGIMCQVFCRCNDAQCMNQEPEKE 149


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.128    0.388 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,412,691,960
Number of Sequences: 23463169
Number of extensions: 267316611
Number of successful extensions: 1327024
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 779
Number of HSP's successfully gapped in prelim test: 5435
Number of HSP's that attempted gapping in prelim test: 1261156
Number of HSP's gapped (non-prelim): 45855
length of query: 425
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 280
effective length of database: 8,957,035,862
effective search space: 2507970041360
effective search space used: 2507970041360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)